# CS 110 Computer Architecture RISC-V Instruction Formats

#### Instructor:

Sören Schwertfeger

http://shtech.org/courses/ca/

School of Information Science and Technology SIST

ShanghaiTech University

Slides based on UC Berkley's CS61C

#### Stack

- Stack frame includes:
  - Return "instruction" address
  - Parameters
  - Space for other local variables
- Stack frames contiguous blocks of memory; stack pointer tells where bottom of stack frame is
- When procedure ends, stack frame is tossed off the stack; frees memory for future stack frames



#### Basic Structure of a Function

#### **Prologue**

```
entry_label:
addi sp,sp, -framesize
sw ra, framesize-4(sp) # save ra
save other regs if need be

Body ··· (call other functions...)
```

#### **Epilogue**

```
restore other regs if need be
lw ra, framesize-4(sp) # restore $ra
addi sp, sp, framesize
jr ra
```

# Where is the Stack in Memory?

- RV32 convention (RV64 and RV128 have different memory layouts)
- Stack starts in high memory and grows down
  - Hexadecimal: bfff\_fff0<sub>hex</sub>
  - Stack must be aligned on 16-byte boundary (not true in previous examples)
- RV32 programs (text segment) in low end
  - $-0001_0000_{hex}$
- static data segment (constants and other static variables) above text for static variables
  - RISC-V convention global pointer (gp) points to static
  - $RV32 gp = 1000_0000_{hex}$
- Heap above static for data structures that grow and shrink; grows up to high addresses

# **RV32 Memory Allocation**



### RISC-V ISA so far...

- Registers we know so far (All of them!)
  - a0-a7 for function arguments, a0-a1 for return values
  - sp, stack pointer, ra return address
  - s0-s11 saved registers
  - t0-t6 temporaries
  - zero
- Instructions we know:
  - Arithmetic: add, addi, sub
  - Logical: sll, srl, sla, slli, srli, slai, and, or, xor, andi, ori, xori
  - Decision: beq, bne, blt, bge
  - Unconditional branches (jumps): j, jr
  - Functions called with jal, return with jr ra.
- The stack is your friend: Use it to save anything you need. Just leave it the way you found it!

# Levels of Representation/Interpretation

```
temp = v[k];
        High Level Language
                                             v(k) = v(k+1);
         Program (e.g., C)
                                             v[k+1] = temp;
                   Compiler
                                                  xt0, 0(x2)
         Assembly Language
                                                  xt1, 4(x2)
                                                  xt1, 0(x2)
        Program (e.g., RISC-V)
                                             SW
                                                  xt0, 4(x2)
                                             SW
                   Assembler
         Machine Language
          Program (RISC-V)
                                             0101 1000 0000 1001 1100 0110 1010 1111
Machine
Interpretation
                                                 Register File
  Hardware Architecture Description
         (e.g., block diagrams)
                                                   ALU
Architecture
Implementation
         Logic Circuit Description
       (Circuit Schematic Diagrams)
```

# Big Idea: Stored-Program Computer

First Draft of a Report on the EDVAC
by
John von Neumann
Contract No. W-670-ORD-4926
Between the
United States Army Ordnance Department and the
University of Pennsylvania
Moore School of Electrical Engineering
University of Pennsylvania

June 30, 1945

- Instructions are represented as bit patterns can think of these as numbers
- Therefore, entire programs can be stored in memory to be read or written just like data
- Can reprogram quickly (seconds), don't have to rewire computer (days)
- Known as the "von Neumann" computers after widely distributed tech report on EDVAC project
  - Wrote-up discussions of Eckert and Mauchly
  - Anticipated earlier by Turing and Zuse

# Consequence #1: Everything Addressed

- Since all instructions and data are stored in memory, everything has a memory address: instructions, data words
  - both branches and jumps use these
- C pointers are just memory addresses: they can point to anything in memory
  - Unconstrained use of addresses can lead to nasty bugs; up to you in C; limited in Java by language design
- One register keeps address of instruction being executed: "Program Counter" (PC)
  - Basically a pointer to memory: Intel calls it Instruction Pointer (a better name)

### Consequence #2: Binary Compatibility

- Programs are distributed in binary form
  - Programs bound to specific instruction set
  - Different version for ARM (phone) and PCs
- New machines want to run old programs ("binaries")
  as well as programs compiled to new instructions
- Leads to "backward-compatible" instruction set evolving over time
- Selection of Intel 8086 in 1981 for 1<sup>st</sup> IBM PC is major reason latest PCs still use 80x86 instruction set; could still run program from 1981 PC today

# Instructions as Numbers (1/2)

- Currently most data we work with is in words (32bit chunks):
  - Each register is a word.
  - 1w and sw both access memory one word at a time.
- So how do we represent instructions?
  - Remember: Computer only understands 1s and 0s, so
     "add x10, x11, x0" is meaningless.
  - RISC-V seeks simplicity: since data is in words, make instructions be fixed-size 32-bit words, too
    - Same 32-bit instructions used for RV32, RV64, RV128

# Instructions as Numbers (2/2)

- One word is 32 bits, so divide instruction word into "fields".
- Each field tells processor something about instruction.
- We could define different fields for each instruction, but RISC-V seeks simplicity, so define 6 basic types of instruction formats:
  - R-format for register-register arithmetic operations
  - I-format for register-immediate arithmetic operations and loads
  - S-format for stores
  - B-format for branches (minor variant of S-format, called SB before)
  - U-format for 20-bit upper immediate instructions
  - J-format for jumps (minor variant of U-format, called UJ before)

# R-Format Instruction Layout



- 32-bit instruction word divided into six fields of varying numbers of bits each: 7+5+5+3+5+7 = 32
- Examples
  - opcode is a 7-bit field that lives in bits 6-0 of the instruction
  - rs2 is a 5-bit field that lives in bits 24-20 of the instruction

#### R-Format Instructions opcode/funct fields



- opcode: partially specifies what instruction it is
  - Note: This field is equal to 0110011<sub>two</sub> for all R-Format register-register arithmetic instructions
- funct7+funct3: combined with opcode,
   these two fields describe what operation to
   perform
- Question: You have been professing simplicity, so why aren't opcode and funct7 and funct3 a single 17-bit field?
  - We'll answer this later

#### R-Format Instructions register specifiers



- <u>rs1</u> (Source Register #1): specifies register containing first operand
- <u>rs2</u>: specifies second register operand
- <u>rd</u> (Destination Register): specifies register which will receive result of computation
- Each register field holds a 5-bit unsigned integer (0-31) corresponding to a register number (x0-x31)

# R-Format Example

RISC-V Assembly Instruction:
 add x18,x19,x10



#### All RV32 R-format instructions

| _               |         |    |     |     |     |         |
|-----------------|---------|----|-----|-----|-----|---------|
| ade             | 0110011 | rd | 000 | rs1 | rs2 | 0000000 |
| sul             | 0110011 | rd | 000 | rs1 | rs2 | 0100000 |
| sl              | 0110011 | rd | 001 | rs1 | rs2 | 0000000 |
| sl <sup>.</sup> | 0110011 | rd | 010 | rs1 | rs2 | 0000000 |
| slt             | 0110011 | rd | 011 | rs1 | rs2 | 0000000 |
| xo:             | 0110011 | rd | 100 | rs1 | rs2 | 0000000 |
| sr              | 0110011 | rd | 101 | rs1 | rs2 | 0000000 |
| sr              | 0110011 | rd | 101 | rs1 | rs2 | 0100000 |
| or              | 0110011 | rd | 110 | rs1 | rs2 | 0000000 |
| and             | 0110011 | rd | 111 | rs1 | 752 | 0000000 |
| -               |         |    |     |     |     |         |

Different encoding in funct7 + funct3 selects different operations

#### Peer Instruction

What is correct encoding of add x4, x3, x2?

A: 4021 8233<sub>hex</sub>

B: 0021 82b3<sub>hex</sub>

C: 4021 82b3<sub>hex</sub>

D: 0021 8233<sub>hex</sub>

E: 0021 8234<sub>hex</sub>

| 31 | 25     | 24 20 | 19 15 | 14 12 | 11 7 | 6 0     |
|----|--------|-------|-------|-------|------|---------|
| 00 | 00000  | rs2   | rs1   | 000   | rd   | 0110011 |
| 01 | .00000 | rs2   | rs1   | 000   | rd   | 0110011 |
| 00 | 00000  | rs2   | rs1   | 100   | rd   | 0110011 |
| 00 | 00000  | rs2   | rs1   | 110   | rd   | 0110011 |
| 00 | 00000  | rs2   | rs1   | 111   | rd   | 0110011 |

sub xor or

#### **I-Format Instructions**

- What about instructions with immediates?
  - 5-bit field only represents numbers up to the value
     31: immediates may be much larger than this
  - Ideally, RISC-V would have only one instruction format (for simplicity): unfortunately, we need to compromise
- Define new instruction format that is mostly consistent with R-format
  - Notice if instruction has immediate, then uses at most 2 registers (one source, one destination)

# **I-Format Instruction Layout**



- Only one field is different from R-format, rs2 and funct7 replaced by 12-bit signed immediate, imm[11:0]
- Remaining fields (rs1, funct3, rd, opcode) same as before
- imm[11:0] can hold values in range [-2048<sub>ten</sub>, +2047<sub>ten</sub>]
- Immediate is always sign-extended to 32-bits before use in an arithmetic operation
- We'll later see how to handle immediates > 12 bits

# **I-Format Example**

RISC-V Assembly Instruction:

| 31        | 20 19 | 15 | <u>14 12</u> | 11 | 7 6    | 0 |
|-----------|-------|----|--------------|----|--------|---|
| imm[11:0] | r     | s1 | funct3       | rd | opcode |   |
| 12        |       | 5  | 3            | 5  | 7      |   |

| 111111001110 | 00001 | 000 | 01111 | 0010011 |
|--------------|-------|-----|-------|---------|
| imm=-50      | rs1=1 | add | rd=15 | OP-Imm  |

# All RV32 I-format Arithmetic Instructions

| imm[1:    | imm[11:0] |     | 000 | rd | 0010011 |
|-----------|-----------|-----|-----|----|---------|
| imm[1:    | 1:0]      | rs1 | 010 | rd | 0010011 |
| imm[1:    | 1:0]      | rs1 | 011 | rd | 0010011 |
| imm[1:    | 1:0]      | rs1 | 100 | rd | 0010011 |
| imm[11:0] |           | rs1 | 110 | rd | 0010011 |
| imm[1:    | 1:0]      | rs1 | 111 | rd | 0010011 |
| 0000000   | shamt     | rs1 | 001 | rd | 0010011 |
| 900000    | shamt     | rs1 | 101 | rd | 0010011 |
| 01/00000  | shamt     | rs1 | 101 | rd | 0010011 |

addi
slti
sltiu
xori
ori
andi
slli
srli
srai

One of the higher-order immediate bits is used to distinguish "shift right logical" (SRLI) from "shift right arithmetic" (SRAI)

"Shift-by-immediate" instructions only use lower 5 bits of the immediate value for shift amount (can only shift by 0-31 bit positions)

# Load Instructions are also I-Type



- The 12-bit signed immediate is added to the base address in register rs1 to form the memory address
  - This is very similar to the add-immediate operation but used to create address not to create final result
- The value loaded from memory is stored in register rd

# I-Format Load Example

RISC-V Assembly Instruction:

lw x14, 8(x2)

| 31 |              | 20 | 19    | 15 14 | 12     | 11    | 76   | 0   |
|----|--------------|----|-------|-------|--------|-------|------|-----|
|    | imm[11:0]    |    | rs1   |       | funct3 | rd    | opco | de  |
|    | 12           |    | 5     |       | 3      | 5     | 7    |     |
|    | offset[11:0] |    | base  |       | width  | dest  | LOA  | AD  |
|    |              |    |       |       |        |       |      |     |
|    | 00000001000  |    | 00010 | )     | 010    | 01110 | 0000 | 011 |
|    | imm=+8       |    | rs1=2 | 2     | lw     | rd=14 | l LO | AD  |
|    |              |    |       | (loa  | ad wor | d)    |      |     |

#### All RV32 Load Instructions

|           |     |     |    |         | _  |
|-----------|-----|-----|----|---------|----|
| imm[11:0] | rs1 | 000 | rd | 0000011 | lb |
| imm[11:0] | rs1 | 010 | rd | 0000011 | lh |
| imm[11:0] | rs1 | 011 | rd | 0000011 | lw |
| imm[11:0] | rs1 | 100 | rd | 0000011 | 1b |
| imm[11:0] | rs1 | 110 | rd | 0000011 | lh |

ou nu

funct3 field encodes size and 'signedness' of load data

- LBU is "load unsigned byte"
- LH is "load halfword", which loads 16 bits (2 bytes) and sign-extends to fill destination 32bit register
- LHU is "load unsigned halfword", which zero-extends 16 bits to fill destination 32-bit register
- There is no LWU in RV32, because there is no sign/zero extension needed when copying 32 bits from a memory location into a 32-bit register

#### S-Format Used for Stores



- Store needs to read two registers, rs1 for base memory address, and rs2 for data to be stored, as well immediate offset!
- Can't have both rs2 and immediate in same place as other instructions!
- Note that stores don't write a value to the register file, no rd!
- RISC-V design decision is move low 5 bits of immediate to where rd field was in other instructions – keep rs1/rs2 fields in same place
  - register names more critical than immediate bits in hardware design

# S-Format Example

RISC-V Assembly Instruction:

sw x14, 8(x2)



#### All RV32 Store Instructions

| .1 \$ | 0100011 | imm[4:0] | 000 | rs1 | rs2 | Imm[11:5] |
|-------|---------|----------|-----|-----|-----|-----------|
| .1 5  | 0100011 | imm[4:0] | 001 | rs1 | rs2 | Imm[11:5] |
| .1    | 0100011 | imm[4:0] | 010 | rs1 | rs2 | Imm[11:5] |

width

• Store byte, halfword, word

#### RISC-V Conditional Branches

- E.g., BEQ x1, x2, Label
- Branches read two registers but don't write a register (similar to stores)
- How to encode label, i.e., where to branch to?

# **Branching Instruction Usage**

- Branches typically used for loops (if-else, while, for)
  - Loops are generally small (< 50 instructions)</li>
  - Function calls and unconditional jumps handled with jump instructions (J-Format)
- Recall: Instructions stored in a localized area of memory (Code/Text)
  - Largest branch distance limited by size of code
  - Address of current instruction stored in the program counter (PC)

# **PC-Relative Addressing**

- PC-Relative Addressing: Use the immediate field as a two's-complement offset to PC
  - Branches generally change the PC by a small amount
  - Can specify ± 2<sup>11</sup> 'unit' addresses from the PC
  - (We will see in a bit that we can encode 12-bit offsets as immediates)
- Why not use byte as a unit of offset from PC?
  - Because instructions are 32-bits (4-bytes)
  - We don't branch into middle of instruction

# Scaling Branch Offset

- One idea: To improve the reach of a single branch instruction, multiply the offset by four bytes before adding to PC
- This would allow one branch instruction to reach  $\pm 2^{11} \times 32$ -bit instructions either side of PC
  - Four times greater reach than using byte offset

#### **Branch Calculation**

If we don't take the branch:

```
PC = PC + 4 (i.e., next instruction)
```

If we do take the branch:

```
PC = PC + immediate*4
```

#### Observations:

 immediate is number of instructions to jump (remember, specifies words) either forward (+) or backwards (-)

#### RISC-V Feature, n×16-bit instructions

- Extensions to RISC-V base ISA support 16-bit compressed instructions and also variable-length instructions that are multiples of 16-bits in length
- To enable this, RISC-V scales the branch offset by 2 bytes even when there are no 16-bit instructions
- Reduces branch reach by half and means that ½ of possible targets will be errors on RISC-V processors that only support 32-bit instructions (as used in this class)
- RISC-V conditional branches can only reach  $\pm 2^{10} \times 32$ -bit instructions on either side of PC

#### RISC-V B-Format for Branches



- B-format is mostly same as S-Format, with two register sources (rs1/rs2) and a 12-bit immediate imm[12:1]
- But now immediate represents values -4096 to +4094 in 2-byte increments
- The 12 immediate bits encode even 13-bit signed byte offsets (lowest bit of offset is always zero, so no need to store it)

# Branch Example, Determine Offset

RISC-V Code:

```
Loop: beq x19,x10,End
add x18,x18,x10
addi x19,x19,-1
j Loop

End: # target instruction

1 Count instructions from branch
```

- Branch offset = 4×32-bit instructions = 16 bytes
- (Branch with offset of 0, branches to itself)

## Branch Example, Determine Offset

#### RISC-V Code:

```
Count
     Loop: beq x19,x10,End
            add x18, x18, x10
                                       from branch
                  x19, x19, -1
            addi
                  Loop
          01010
                  10011
                                            1100011
3333333
                            000
                                    33333
                            BEQ
         rs2=10
                  rs1=19
                                     imm
  imm
                                             BRANCH
```

# Branch Example, Encode Offset

RISC-V Code:

| 3333333 | 01010  | 10011  | 000 | 33333 | 1100011 |
|---------|--------|--------|-----|-------|---------|
| imm     | rs2=10 | rs1=19 | BEQ | imm   | BRANCH  |

# RISC-V Immediate Encoding

Instruction encodings, inst[31:0]

| 31 | 30        | 25   | 24   | 20 | 19  | 151 | .4 12  | 11       | 8 7 6    | 0   | _      |
|----|-----------|------|------|----|-----|-----|--------|----------|----------|-----|--------|
|    | funct7    |      | rs2  |    | rs1 |     | funct3 | rd       | opco     | ode | R-type |
|    | imn       | ı[11 | L:0] |    | rs1 |     | funct3 | rd       | opco     | ode | I-type |
| in | nm [11:5] | ]    | rs2  |    | rs1 |     | funct3 | imm[4:(  | opco     | ode | S-type |
| im | m[12 10   | :5]  | rs2  |    | rs1 |     | funct3 | imm [4:1 | 11] opco | ode | B-type |

32-bit immediates produced, imm[31:0]

# RISC-V Immediate Encoding



Upper bits sign-extended from inst[31] always

Only bit 7 of instruction changes role in immediate between S and B

### Branch Example, complete encoding



### All RISC-V Branch Instructions

| imm[12 10:5] | rs2 | rs1 | 000 | imm[4:1 11] | 1100011 |
|--------------|-----|-----|-----|-------------|---------|
| imm[12 10:5] | rs2 | rs1 | 001 | imm[4:1 11] | 1100011 |
| imm[12 10:5] | rs2 | rs1 | 100 | imm[4:1 11] | 1100011 |
| imm[12 10:5] | rs2 | rs1 | 101 | imm[4:1 11] | 1100011 |
| imm[12 10:5] | rs2 | rs1 | 110 | imm[4:1 11] | 1100011 |
| imm[12 10:5] | rs2 | rs1 | 111 | imm[4:1 11] | 1100011 |

BEQ BNE BLT BGE BLTU

# Questions on PC-addressing

- Does the value in branch immediate field change if we move the code?
  - If moving individual lines of code, then yes
  - If moving all of code, then no ('position-independent code')
- What do we do if destination is > 2<sup>10</sup> instructions away from branch?
  - Other instructions save us

# Questions on PC-addressing

- Does the value in branch immediate field change if we move the code?
  - If moving individual lines of code, then yes
  - If moving all of code, then no (because PC-relative offsets)
- What do we do if destination is > 2<sup>10</sup> instructions away from branch?
  - Other instructions save us

```
beq x10,x0, far bne x10,x0, next \rightarrow j far next: # next instr
```

# U-Format for "Upper Immediate" Instructions



- Has 20-bit immediate in upper 20 bits of 32-bit instruction word
- One destination register, rd
- Used for two instructions
  - LUI Load Upper Immediate
  - AUIPC Add Upper Immediate to PC

# LUI to Create Long Immediates

- LUI writes the upper 20 bits of the destination with the immediate value, and clears the lower 12 bits.
- Together with an ADDI to set low 12 bits, can create any 32-bit value in a register using two instructions (LUI/ADDI).

```
LUI x10, 0x87654 # x10 = 0x87654000
ADDI x10, x10, 0x321# x10 = 0x87654321
```

### One Corner Case

#### How to set 0xDEADBEEF?

```
LUI x10, 0xDEADB # x10 = 0xDEADB000
ADDI x10, x10, 0xEEF# x10 = 0xDEADAEEF
```

ADDI 12-bit immediate is always sign-extended, if top bit is set, will subtract 1 from upper 20 bits

### Solution

#### How to set 0xDEADBEEF?

```
LUI x10, 0xDEADC # x10 = 0xDEADC000
ADDI x10, x10, 0xEEF# x10 = 0xDEADBEEF
```

Pre-increment value placed in upper 20 bits, if sign bit will be set on immediate in lower 12 bits.

Assembler pseudo-op handles all of this:

```
li x10, 0xDEADBEEF # Creates two
instructions
```

### **AUIPC**

- Adds upper immediate value to PC and places result in destination register
- Used for PC-relative addressing

Label: AUIPC x10, 0 # Puts address of label in x10

# J-Format for Jump Instructions



- JAL saves PC+4 in register rd (the return address)
  - Assembler "j" jump is pseudo-instruction, uses JAL but sets rd=x0 to discard return address
- Set PC = PC + offset (PC-relative jump)
- Target somewhere within ±2<sup>19</sup> locations, 2 bytes apart
  - ±2<sup>18</sup> 32-bit instructions
- Immediate encoding optimized similarly to branch instruction to reduce hardware cost

### Uses of JAL

```
# j pseudo-instruction

j Label = jal \times 0, Label # Discard return address

# Call function within 2^{18} instructions of PC

jal ra, FuncName
```

## JALR Instruction (I-Format)



- JALR rd, rs, immediate
  - Writes PC+4 to rd (return address)
  - Sets PC = rs + immediate
  - Uses same immediates as arithmetic and loads
    - no multiplication by 2 bytes
    - In contrast to branches and JAL

### Uses of JALR

```
# ret and jr psuedo-instructions
ret = jr ra = jalr x0, ra, 0
# Call function at any 32-bit absolute address
lui x1, <hi20bits>
jalr ra, x1, <lo12bits>
# Jump PC-relative with 32-bit offset
auipc x1, <hi20bits>
jalr x0, x1, <lo12bits>
```

# Summary of RISC-V Instruction Formats

| <u>31</u> 30 <u>25</u> | 24 21 20 | 19 15  | 14 12  | 2 11 8 7    | 6 0    | <u>.                                    </u> |
|------------------------|----------|--------|--------|-------------|--------|----------------------------------------------|
| funct7                 | rs2      | rs1    | funct3 | rd          | opcode | R-type                                       |
| imm[11                 | .:0]     | rs1    | funct3 | rd          | opcode | l-type                                       |
| imm[11:5]              | rs2      | rs1    | funct3 | imm[4:0]    | opcode | S-type                                       |
| imm[12 10:5]           | rs2      | rs1    | funct3 | imm[4:1 11] | opcode | B-type                                       |
|                        | imm[3    | 31:12] |        | rd          | opcode | U-type                                       |
| imm[20 10:             | 1 11]]   | imm[   | 19:12] | rd          | opcode | J-type                                       |

# Complete RV32I ISA

|              |                 |     |          |             |         | 1    |
|--------------|-----------------|-----|----------|-------------|---------|------|
|              | imm[31:12]      | rd  | 0110111  | LUI         |         |      |
|              | imm[31:12]      | rd  | 0010111  | AUIPC       |         |      |
| im           | m[20 10:1 11 1: | rd  | 1101111  | JAL         |         |      |
| imm[11:      | [0]             | rs1 | . 000 rd |             | 1100111 | JALR |
| imm[12 10:5] | rs2             | rs1 | 000      | imm[4:1 11] | 1100011 | BEQ  |
| imm[12 10:5] | rs2             | rs1 | 001      | imm[4:1 11] | 1100011 | BNE  |
| imm[12 10:5] | rs2             | rs1 | 100      | imm[4:1 11] | 1100011 | BLT  |
| imm[12 10:5] | rs2             | rs1 | 101      | imm[4:1 11] | 1100011 | BGE  |
| imm[12 10:5] | rs2             | rs1 | 110      | imm[4:1 11] | 1100011 | BLTU |
| imm[12 10:5] | rs2             | rs1 | 111      | imm[4:1 11] | 1100011 | BGEU |
| imm[11:      | 0               | rs1 | 000      | rd 0000011  |         | LB   |
| imm[11:      | [0]             | rs1 | 001      | rd 0000011  |         | LH   |
| imm[11:      | imm[11:0]       |     |          | rd          | 0000011 | LW   |
| imm[11:      | imm[11:0]       |     | 100      | rd          | 0000011 | LBU  |
| imm[11:      | [0]             | rs1 | 101      | rd          | 0000011 | LHU  |
| imm[11:5]    | rs2             | rs1 | 000      | imm[4:0]    | 0100011 | SB   |
| imm[11:5]    | rs2             | rs1 | 001      | imm[4:0]    | 0100011 | SH   |
| imm[11:5]    | rs2             | rs1 | 010      | imm[4:0]    | 0100011 | SW   |
| imm[11:      | 0               | rs1 | 000      | rd          | 0010011 | ADDI |
| imm[11:      | rs1             | 010 | rd       | 0010011     | SLTI    |      |
| imm[11:      | rs1             | 011 | rd       | 0010011     | SLTIU   |      |
| imm[11:      | rs1             | 100 | rd       | 0010011     | XORI    |      |
| imm[11:0]    |                 | rs1 | 110      | rd          | 0010011 | ORI  |
| imm[11:      | [0]             | rs1 | 111      | rd          | 0010011 | ANDI |
|              |                 |     |          |             |         |      |

|         |               |        | 1     |      | 1     | 1       |
|---------|---------------|--------|-------|------|-------|---------|
| 0000000 | 0000000 shamt |        | rs1   | 001  | rd    | 0010011 |
| 0000000 | 0000000 shamt |        | rs1   | 101  | rd    | 0010011 |
| 0100000 | )             | shamt  | rs1   | 101  | rd    | 0010011 |
| 0000000 | )             | rs2    | rs1   | 000  | rd    | 0110011 |
| 0100000 | )             | rs2    | rs1   | 000  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 001  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 010  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 011  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 100  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 101  | rd    | 0110011 |
| 0100000 | )             | rs2    | rs1   | 101  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 110  | rd    | 0110011 |
| 0000000 | )             | rs2    | rs1   | 111  | rd    | 0110011 |
| 0000    | pre           | d succ | 00000 | 000  | 00000 | 0001111 |
| 0000    | 000           | 0 0000 | 00000 | 001  | 00000 | 0001111 |
| 000     | 000000        | 000    | 00000 | 000  | 00000 | 1110011 |
| 000     | 000000        | 001    | 00000 | 000  | 00000 | 1110011 |
|         | csr           |        |       | 001  | rd    | 1110011 |
|         | csr           | lot in |       | ecti | Ires  | 1110011 |
|         | csr           |        | rs1   | 011  | rd    | 1110011 |
|         | csr           |        | zimm  | 101  | rd    | 1110011 |
|         | csr           |        | zimm  | 110  | rd    | 1110011 |
|         | csr           |        | zimm  | 111  | rd    | 1110011 |

SLLISRLI SRAI ADD SUBSLLSLTSLTU XOR SRLSRAORAND FENCE FENCE.I **ECALL EBREAK** CSRRW CSRRS CSRRC **CSRRWI CSRRSI CSRRCI** 

# "And in Conclusion..."

- Simplification works for RISC-V: Instructions are same size as data word (one word) so that they can use the same memory.
- Computer actually stores programs as a series of these 32-bit numbers.
- We have covered all RISC-V instructions and registers
  - R-type, I-type, S-type, B-type, U-type and J-type instructions
  - Practice assembling and disassembling