

# Language of the Computer: Learning via RISC-V

Debiprasanna Sahoo

Assistant Professor

Department of Computer Science and Engineering  
School of Electrical and Computer Sciences  
Indian Institute of Technology Bhubaneswar



# Content

## Book

Computer Organization and Design: The Hardware/Software Interface- RISC-V Edition, 5th Edition, 2017

Chapter-2

Computer Architecture: A Quantitative Approach

Appendix-A

David A. Patterson and John L. Henessey

## Manual

The RISC-V Instruction Set Manual  
Volume I: User-Level ISA

Document Version 2.2

Andrew Waterman and Krste Asanovi

# Learning Outcomes

- Understanding concepts like registers
- Classify different means data addressing
- List different instructions of RISC-V instruction set
- Map the instruction sets to different category

# Components of Instruction



# General Purpose Registers



Storage is internal to the processor and is faster than memory



Registers are more efficiently allocated by compilers



How many registers are sufficient? Depends on the smartness of the compiler.



How many of the operands are memory addresses in an instruction?

# Control and Status Registers



These registers used for specific functionality of the systems



**Control registers** manage system level functions like memory management and process/thread scheduling  
**Status registers** like flag and interrupt registers are used to process dependent instructions,



Operating system and other system software controls it along with CPU itself



Such registers are for specific purpose. A role of operating system designer is critical for deciding the type of such registers

# Classification of Instruction Set



**Stack:** The first operand is removed from the stack, the result takes the place of the second operand, and TOS is updated to point to the result.

**Accumulator:** The accumulator stores the operand and the result.

**Register-Memory:** One input operand is a register, one is in memory, and the result goes to a register.

**Register-Register:** All operands are registers.

# Classification of Instruction Set (Cont.).

## How is $C = A + B$ Executed?

| Stack  | Accumulator | Register<br>(register-memory) | Register<br>(load-store) |
|--------|-------------|-------------------------------|--------------------------|
| Push A | Load A      | Load R1,A                     | Load R1,A                |
| Push B | Add B       | Add R3,R1,B                   | Load R2,B                |
| Add    | Store C     | Store R3,C                    | Add R3,R1,R2             |
| Pop C  |             |                               | Store R3,C               |

**Stack:** A data structure for spilling registers organized as a last-in-first-out queue.

**Stack Pointer:** A value denoting the most recently allocated address in a stack that shows where registers should be spilled or where old register values can be found.

Advanced  
RISC  
Machines

Million  
Instructions  
Per Second

Apple-IBM-  
Motorola  
(NXP)

Sun  
Microsystems  
(Oracle)

| Number of memory addresses | Maximum number of operands allowed | Type of architecture | Examples                                                |
|----------------------------|------------------------------------|----------------------|---------------------------------------------------------|
| 0                          | 3                                  | Load-store           | ARM, MIPS, PowerPC, SPARC, RISC-V                       |
| 1                          | 2                                  | Register-memory      | IBM 360/370, Intel 80x86, Motorola 68000, TI TMS320C54x |
| 2                          | 2                                  | Memory-memory        | VAX (also has three-operand formats)                    |
| 3                          | 3                                  | Memory-memory        | VAX (also has two-operand formats)                      |

## Combination for memory operands

Digital  
Equipment  
Corporations

## Advantages

- Simple, fixed-length instruction encoding
- Simple code generation model.
- Instructions take similar numbers of clocks to execute

## Disadvantages

- Higher instruction count than architectures with memory references in instructions.
- More instructions and lower instruction density lead to larger programs, which may have some instruction cache effects

What is  
Instruction  
Encoding?

ADD R1, R2, R3 → R1 = R2 + R3

What is  
Instruction  
Density?

# Register-Register Operands

## Advantages

- Data can be accessed without a separate load instruction first.
- Lower Instruction count
- Higher Instruction Density

## Disadvantages

- Clocks per instruction vary by operand location
- Complex Code Generation Process
- Variable Length Encoding

ADD R1, R2, 10[R3]

$R1 = R2 + M[10+R3]$

# Register-Memory Operands

Word: Integer, Float (4)

Double Word: Long Int, Double (8)

Half Word: Short Integer (2)

Quarter Word: Character (1)

## TYPES AND SIZE OF OPERANDS

# Interpreting Memory Address

## Little Endian

- Last byte of binary representation of multi-byte data-type is stored first
- 0x01234567 is stored in the memory as 67 45 23 01
- It's easy to cast the value to a smaller type like from int16\_t to int8\_t since int8\_t is the byte at the beginning of int16\_t.

## Big Endian

- First byte of binary representation of multi-byte data-type is stored first
- 0x01234567 is stored in the memory as 01 23 45 57
- More human understandable
- Effective for signed numbers because 1st bit is signed bit.



How to find if  
a system is  
little-endian or  
big-endian?

# Aligned and Misaligned addresses

X86  
supports  
misaligned  
addresses

RISC does  
not support  
misaligned  
addresses

| Width of object       | Value of three low-order bits of byte address |            |            |            |            |            |            |            |
|-----------------------|-----------------------------------------------|------------|------------|------------|------------|------------|------------|------------|
|                       | 0                                             | 1          | 2          | 3          | 4          | 5          | 6          | 7          |
| 1 byte (byte)         | Aligned                                       | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    |
| 2 bytes (half word)   | Aligned                                       | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    |
| 2 bytes (half word)   |                                               | Misaligned |
| 4 bytes (word)        | Aligned                                       | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    |
| 4 bytes (word)        |                                               | Misaligned |
| 4 bytes (word)        |                                               |            | Misaligned | Misaligned | Misaligned | Misaligned | Misaligned | Misaligned |
| 8 bytes (double word) | Aligned                                       | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    | Aligned    |
| 8 bytes (double word) |                                               | Misaligned |
| 8 bytes (double word) |                                               |            | Misaligned | Misaligned | Misaligned | Misaligned | Misaligned | Misaligned |
| 8 bytes (double word) |                                               |            |            | Misaligned | Misaligned | Misaligned | Misaligned | Misaligned |
| 8 bytes (double word) |                                               |            |            |            | Misaligned | Misaligned | Misaligned | Misaligned |
| 8 bytes (double word) |                                               |            |            |            |            | Misaligned | Misaligned | Misaligned |
| 8 bytes (double word) |                                               |            |            |            |            |            | Misaligned | Misaligned |

**Word:** A natural unit of access in a computer, usually a group of 32 bits.

# RISC-V General Purpose Register (GPR)

| register | ABI   | description                      | register | ABI | description         |
|----------|-------|----------------------------------|----------|-----|---------------------|
| x0       | zero  | Hardwired zero                   | x16      | a6  | Function argument 6 |
| x1       | ra    | Return address                   | x17      | a7  | Function argument 7 |
| x2       | sp    | Stack pointer                    | x18      | s2  | Saved register 2    |
| x3       | gp    | Global pointer                   | x19      | s3  | Saved register 3    |
| x4       | tp    | Thread pointer                   | x20      | s4  | Saved register 4    |
| x5       | t0    | Temporary 0                      | x21      | s5  | Saved register 5    |
| x6       | t1    | Temporary 1                      | x22      | s6  | Saved register 6    |
| x7       | t2    | Temporary 2                      | x23      | s7  | Saved register 7    |
| x8       | s0/fp | Saved register0/frame pointer    | x24      | s8  | Saved register 8    |
| x9       | s1    | Saved register 1                 | x25      | s9  | Saved register 9    |
| x10      | a0    | Function argument/return value 0 | x26      | s10 | Saved register 10   |
| x11      | a1    | Function argument/return value 1 | x27      | s11 | Saved register 11   |
| x12      | a2    | Function argument 2              | x28      | t3  | Temporary 3         |
| x13      | a3    | Function argument 3              | x29      | t4  | Temporary 4         |
| x14      | a4    | Function argument 4              | x30      | t5  | Temporary 5         |
| x15      | a5    | Function argument 5              | x31      | t6  | Temporary 6         |
| pc       | -     | Program counter                  |          |     |                     |

# X86 General Purpose Registers

| 64-bit register | Lower 32 bits | Lower 16 bits | Lower 8 bits |
|-----------------|---------------|---------------|--------------|
| <b>rax</b>      | <b>eax</b>    | <b>ax</b>     | <b>al</b>    |
| <b>rbx</b>      | <b>ebx</b>    | <b>bx</b>     | <b>bl</b>    |
| <b>rcx</b>      | <b>ecx</b>    | <b>cx</b>     | <b>cl</b>    |
| <b>rdx</b>      | <b>edx</b>    | <b>dx</b>     | <b>dl</b>    |
| <b>rsi</b>      | <b>esi</b>    | <b>si</b>     | <b>sil</b>   |
| <b>rdi</b>      | <b>edi</b>    | <b>di</b>     | <b>idl</b>   |
| <b>rbp</b>      | <b>ebp</b>    | <b>bp</b>     | <b>bpl</b>   |
| <b>rsp</b>      | <b>esp</b>    | <b>sp</b>     | <b>spl</b>   |
| <b>r8</b>       | <b>r8d</b>    | <b>r8w</b>    | <b>r8b</b>   |
| <b>r9</b>       | <b>r9d</b>    | <b>r9w</b>    | <b>r9b</b>   |
| <b>r10</b>      | <b>r10d</b>   | <b>r10w</b>   | <b>r10b</b>  |
| <b>r11</b>      | <b>r11d</b>   | <b>r11w</b>   | <b>r11b</b>  |
| <b>r12</b>      | <b>r12d</b>   | <b>r12w</b>   | <b>r12b</b>  |
| <b>r13</b>      | <b>r13d</b>   | <b>r13w</b>   | <b>r13b</b>  |
| <b>r14</b>      | <b>r14d</b>   | <b>r14w</b>   | <b>r14b</b>  |
| <b>r15</b>      | <b>r15d</b>   | <b>r15w</b>   | <b>r15b</b>  |

- **Accumulator Register (AX):** Stores output or one input
- **Base Registers (BX):** Stores offset of the bases used
- **Counter Registers (CX):** Looping and rotation
- **Data Registers (DX):** Other data along with all the numbered registers
- **Stack Pointer (SP):** Points to topmost element on stack
- **Base Pointer (BP):** Accessing Parameter Passing to Stack
- **Destination Index Register (DI):** Pointer addressing of data and as a destination in some string-related operations
- **Source Index Register (SI):** Pointer addressing of data and as a destination in some string-related operations

# Addressing Mode

**Definition:** One of several addressing regimes delimited by their varied use of operands and/or addresses.

**Meaning:** The way in which the operand of an instruction is specified

# RISC-V Instruction Set and Addressing Modes

Register Type (R-Type)

Upper Immediate (U-Type)

Immediate Type (I-Type)

Branch Type (B-Type)

Store Type (S-Type)

Jump Type (J-Type)

# Register Addressing Mode (R-Type)

All the operands are registers

$$c = a + b \rightarrow \text{ADDW dest, src1,src2} \rightarrow rd = rs1 + rs2$$



Standard Arithmetic and logic operations fall into this category. Mostly the operators and special function registers

# Register Type (R-Type) Instruction Format

| <b>func7[31-25]</b> | <b>rs2 [24-20]</b> | <b>rs1 [19-15]</b> | <b>func3 [14-12]</b>                | <b>rd [11-7]</b> | <b>Opcode [6-0]</b> |
|---------------------|--------------------|--------------------|-------------------------------------|------------------|---------------------|
| 0000000             | src2               | src1               | ADD/SLT/SLTU/AND<br>/OR/XOR/SLL/SRL | dest             | OP                  |
| 0100000             | src2               | src1               | SUB/SRA                             | dest             | OP                  |

- Operands are attempted to be placed in same location for easy decoding
- funct3 determines the actual operation to be executed
- funct7 are the extension bits for existing and new extensions
- 30th bit is used to distinguish SUB and Arithmetic Right Shift from other ALU operations

$c = a + b \longrightarrow \text{ADDW rd, rs1, rs2} \longrightarrow rd = rs1 + rs2$

# Immediate Addressing Mode (I-Type)

**Immediate:** One of the operand is a constant within the instruction itself. Other operands are present in registers.

$$c = a + 10 \longrightarrow \text{ADDIW dest, src1,imm} \longrightarrow rd = rs1 + 10$$



# Immediate Type (I-Type): Arithmetic Instructions

| immediate[31-20] | rs1 [19-15] | func3 [14-12] | rd [11-7] | Opcode [6-0] |
|------------------|-------------|---------------|-----------|--------------|
| immediate[11:0]  | src         | ADDI/SLTI[U]  | dest      | OP-IMM       |
| immediate[11:0]  | src         | ANDI/ORI/XORI | dest      | OP-IMM       |

func3 bits selects between the 6 immediate ALU operations: Add, And, Or, Xor, set less than immediate, set less than immediate unsigned

$$c = a + 10 \longrightarrow \text{ADDI dest, src, imm} \longrightarrow rd = rs + \text{imm12}$$

# Immediate Type (I-Type): Arithmetic Instructions

| immediate[31-25] | immediate[24-20] | rs1 [19-15] | func3 [14-12] | rd [11-7] | Opcode [6-0] |
|------------------|------------------|-------------|---------------|-----------|--------------|
| 0000000          | shamt[4:0]       | src         | SLLI/SRLI     | dest      | OP-IMM       |
| 0100000          | shamt[4:0]       | src         | SRAI          | dest      | OP-IMM       |

- func3 bits selects rest 2 ALU operations: Shift Left, Shift Right
- shamt is the short form for shift amount

All func3 bits are exhausted, so bit 30 will be set to have extensions of I-Type instructions

$a = b << 10;$  → SLLI dest, src, imm →  $rd = rs << \text{shamt}5$

# Displaced Addressing Mode

The operand is at the memory location whose address is the sum of a register and a constant in the instruction.



# Load Instructions

| immediate[31-20] | rs1 [19-15] | func3 [14-12]                    | rd[11-7] | Opcode [6-0] |
|------------------|-------------|----------------------------------|----------|--------------|
| offset[11-0]     | src         | width(LW/LD/LH /LB/LWU/LHU/ LBU) | dest     | LOAD         |

func3 bits selects between 7 Load operations: Load Word, Load Double Word, Load Half Word, Load Byte, Load Word Unsigned, Load Half Word Unsigned, Load Byte Unsigned

Access any location → LW rd, imm12[rs] → rd = Mem[ rs + imm12 ]

This is not scanf in C. This is accessing any variable in C.

# Store Type (S-Type)

|                  |            |             |                          |             |            |
|------------------|------------|-------------|--------------------------|-------------|------------|
| immediate[31-25] | rs2[24-20] | rs1 [19-15] | func3 [14-12]            | imm[11-7]   | Store[6-0] |
| Offset[11:5]     | data       | src         | width (SD/SW/<br>SHW/SB) | Offset[4:0] | STORE      |

func3 bits selects between 3 Store Operations: Store Double, Store Word, Store Half Word, Store Byte

store into any location → SW rs2, imm12[rs1] → Mem[ rs1 + imm12 ] = rs2

# NOP Instruction

| immediate[31-20] | rs1 [19-15] | func3 [14-12] | rd[11-7] | Opcode [6-0] |
|------------------|-------------|---------------|----------|--------------|
| 0                | 0           | ADDI          | 0        | OP-IMM       |

- The NOP instruction does not change any user-visible state, except for advancing the pc.
- NOP is encoded as ADDI x0, x0, 0.
- Remember! X0 is set to 0

# PC-relative Addressing Mode (U/J/B-Type)

The branch address is the sum of the PC and a constant in the instruction

BEQ src1, src2, imm12 → if  $\text{src1} == \text{src2}$  then  $\text{pc} = \text{pc} + \text{imm12}$



Most control statements gets converted to one of these. Ex: If-else, Switch, while, do-while. for, goto, etc.

# PC-Relative Example-1: Conditional Branch Type (B-Type)

| immediate[31-25] | rs2[24-20] | rs1 [19-15] | func3 [14-12]             | immediate[11-7] | Opcode [6-0] |
|------------------|------------|-------------|---------------------------|-----------------|--------------|
| Offset[12,10:5]  | src2       | src1        | BEQ/BNE/BLT[U]<br>/BGE[U] | Offset[11,4:1]  | BRANCH       |

func3 bits selects between 6 branch operations: branch equal, branch not equal, branch less than, branch less than unsigned, branch greater equal, branch greater equal unsigned

BEQ src1, src2, imm12 → if src1 == src2 then pc = pc + imm12

# PC-Relative Example-2:

## Unconditional Branch: Jump Type (J-Type)

| Immediate[31:12]         | rd [11-7] | Opcode [6-0] |
|--------------------------|-----------|--------------|
| Offset[20,10:1,11,19:12] | dest      | JAL          |

- The offset/imm is sign-extended and added to the pc to form the jump target address.
- JAL stores the address of the next instruction into register rd, i.e.,  $rd = pc + 4$
- Used in switch case and procedure call or return
- With JAL,  $pc = pc + imm20$
- With JALR,  $pc = rs + imm12$  (JALR is an I-Type Instruction)

| immediate[31-20] | rs1 [19-15] | func3 [14-12] | rd[11-7] | Opcode [6-0] |
|------------------|-------------|---------------|----------|--------------|
| offset[11-0]     | base        | 0             | dest     | JALR         |

# PC-Relative Example-3: Upper Immediate Type (U-Type)

| Immediate[31:12]   | rd [11-7] | Opcode [6-0] |
|--------------------|-----------|--------------|
| U-immediate[31-12] | dest      | LUI/AUIPC    |

- LUI places the U-immediate value in the top 20 bits of the destination register rd, filling in the lowest 12 bits with zeros.
- With LUI,  $rd = imm \ll 12$
- AUIPC forms a 32-bit offset from the 20-bit U-immediate, filling in the lowest 12 bits with zeros, adds this offset to the pc, then places the result in register rd
- With AUIPC,  $rd = pc + (imm \ll 12)$

# RV32I Instruction Set: Arithmetic

## Arithmetic Operation

| Mnemonic             | Instruction                      | Type | Description                         |
|----------------------|----------------------------------|------|-------------------------------------|
| ADD rd, rs1, rs2     | Add                              | R    | $rd \leftarrow rs1 + rs2$           |
| SUB rd, rs1, rs2     | Subtract                         | R    | $rd \leftarrow rs1 - rs2$           |
| ADDI rd, rs1, imm12  | Add immediate                    | I    | $rd \leftarrow rs1 + imm12$         |
| SLT rd, rs1, rs2     | Set less than                    | R    | $rd \leftarrow rs1 < rs2 ? 1 : 0$   |
| SLTI rd, rs1, imm12  | Set less than immediate          | I    | $rd \leftarrow rs1 < imm12 ? 1 : 0$ |
| SLTU rd, rs1, rs2    | Set less than unsigned           | R    | $rd \leftarrow rs1 < rs2 ? 1 : 0$   |
| SLTIU rd, rs1, imm12 | Set less than immediate unsigned | I    | $rd \leftarrow rs1 < imm12 ? 1 : 0$ |
| LUI rd, imm20        | Load upper immediate             | U    | $rd \leftarrow imm20 \ll 12$        |
| AUIP rd, imm20       | Add upper immediate to PC        | U    | $rd \leftarrow PC + imm20 \ll 12$   |

# RV32I Instruction Set: Arithmetic

## Logical Operations

| Mnemonic            | Instruction                      | Type | Description                    |
|---------------------|----------------------------------|------|--------------------------------|
| AND rd, rs1, rs2    | AND                              | R    | $rd \leftarrow rs1 \And rs2$   |
| OR rd, rs1, rs2     | OR                               | R    | $rd \leftarrow rs1 \Or rs2$    |
| XOR rd, rs1, rs2    | XOR                              | R    | $rd \leftarrow rs1 \Xor rs2$   |
| ANDI rd, rs1, imm12 | AND immediate                    | I    | $rd \leftarrow rs1 \And imm12$ |
| ORI rd, rs1, imm12  | OR immediate                     | I    | $rd \leftarrow rs1 \Or imm12$  |
| XORI rd, rs1, imm12 | XOR immediate                    | I    | $rd \leftarrow rs1 \Xor imm12$ |
| SLL rd, rs1, rs2    | Shift left logical               | R    | $rd \leftarrow rs1 \ll rs2$    |
| SRL rd, rs1, rs2    | Shift right logical              | R    | $rd \leftarrow rs1 \gg rs2$    |
| SRA rd, rs1, rs2    | Shift right arithmetic           | R    | $rd \leftarrow rs1 \gg rs2$    |
| SLLI rd, rs1, shamt | Shift left logical immediate     | I    | $rd \leftarrow rs1 \ll shamt$  |
| SRLI rd, rs1, shamt | Shift right logical imm.         | I    | $rd \leftarrow rs1 \gg shamt$  |
| SRAI rd, rs1, shamt | Shift right arithmetic immediate | I    | $rd \leftarrow rs1 \gg shamt$  |

# RV32I Instruction Set: Load Store

## Load / Store Operations

| Mnemonic           | Instruction            | Type | Description                                            |
|--------------------|------------------------|------|--------------------------------------------------------|
| LD rd, imm12(rs1)  | Load doubleword        | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LW rd, imm12(rs1)  | Load word              | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LH rd, imm12(rs1)  | Load halfword          | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LB rd, imm12(rs1)  | Load byte              | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LWU rd, imm12(rs1) | Load word unsigned     | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LHU rd, imm12(rs1) | Load halfword unsigned | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| LBU rd, imm12(rs1) | Load byte unsigned     | I    | $rd \leftarrow \text{mem}[rs1 + \text{imm12}]$         |
| SD rs2, imm12(rs1) | Store doubleword       | S    | $rs2 \rightarrow \text{mem}[rs1 + \text{imm12}]$       |
| SW rs2, imm12(rs1) | Store word             | S    | $rs2(31:0) \rightarrow \text{mem}[rs1 + \text{imm12}]$ |
| SH rs2, imm12(rs1) | Store halfword         | S    | $rs2(15:0) \rightarrow \text{mem}[rs1 + \text{imm12}]$ |
| SB rs2, imm12(rs1) | Store byte             | S    | $rs2(7:0) \rightarrow \text{mem}[rs1 + \text{imm12}]$  |

# RV32I Instruction Set: Branching

## Branching

| Mnemonic             | Instruction                           | Type | Description                                           |
|----------------------|---------------------------------------|------|-------------------------------------------------------|
| BEQ rs1, rs2, imm12  | Branch equal                          | SB   | if $rs1 == rs2$<br>$PC \leftarrow PC + imm12$         |
| BNE rs1, rs2, imm12  | Branch not equal                      | SB   | if $rs1 != rs2$<br>$PC \leftarrow PC + imm12$         |
| BGE rs1, rs2, imm12  | Branch greater than or equal          | SB   | if $rs1 \geq rs2$<br>$PC \leftarrow PC + imm12$       |
| BGEU rs1, rs2, imm12 | Branch greater than or equal unsigned | SB   | if $rs1 \geq rs2$<br>$PC \leftarrow PC + imm12$       |
| BLT rs1, rs2, imm12  | Branch less than                      | SB   | if $rs1 < rs2$<br>$PC \leftarrow PC + imm12$          |
| BLTU rs1, rs2, imm12 | Branch less than unsigned             | SB   | if $rs1 < rs2$<br>$PC \leftarrow PC + imm12 \ll 1$    |
| JAL rd, imm20        | Jump and link                         | UJ   | $rd \leftarrow PC + 4$<br>$PC \leftarrow PC + imm20$  |
| JALR rd, imm12(rs1)  | Jump and link register                | I    | $rd \leftarrow PC + 4$<br>$PC \leftarrow rs1 + imm12$ |

# RV32I Instruction Set: Pseudo- Instruction

| Mnemonic              | Instruction                       | Base instruction(s)                                  |
|-----------------------|-----------------------------------|------------------------------------------------------|
| LI rd, imm12          | Load immediate (near)             | ADDI rd, zero, imm12                                 |
| LI rd, imm            | Load immediate (far)              | LUI rd, imm[31:12]<br>ADDI rd, rd, imm[11:0]         |
| LA rd, sym            | Load address (far)                | AUIPC rd, sym[31:12]<br>ADDI rd, rd, sym[11:0]       |
| MV rd, rs             | Copy register                     | ADDI rd, rs, 0                                       |
| NOT rd, rs            | One's complement                  | XORI rd, rs, -1                                      |
| NEG rd, rs            | Two's complement                  | SUB rd, zero, rs                                     |
| BGT rs1, rs2, offset  | Branch if rs1 > rs2               | BLT rs2, rs1, offset                                 |
| BLE rs1, rs2, offset  | Branch if rs1 ≤ rs2               | BGE rs2, rs1, offset                                 |
| BGTU rs1, rs2, offset | Branch if rs1 > rs2<br>(unsigned) | BLTU rs2, rs1, offset                                |
| BLEU rs1, rs2, offset | Branch if rs1 ≤ rs2<br>(unsigned) | BGEU rs2, rs1, offset                                |
| BEQZ rs1, offset      | Branch if rs1 = 0                 | BEQ rs1, zero, offset                                |
| BNEZ rs1, offset      | Branch if rs1 ≠ 0                 | BNE rs1, zero, offset                                |
| BGEZ rs1, offset      | Branch if rs1 ≥ 0                 | BGE rs1, zero, offset                                |
| BLEZ rs1, offset      | Branch if rs1 ≤ 0                 | BGE zero, rs1, offset                                |
| BGTZ rs1, offset      | Branch if rs1 > 0                 | BLT zero, rs1, offset                                |
| J offset              | Unconditional jump                | JAL zero, offset                                     |
| CALL offset12         | Call subroutine (near)            | JALR ra, ra, offset12                                |
| CALL offset           | Call subroutine (far)             | AUIPC ra, offset[31:12]<br>JALR ra, ra, offset[11:0] |
| RET                   | Return from subroutine            | JALR zero, 0(ra)                                     |
| NOP                   | No operation                      | ADDI zero, zero, 0                                   |

# Encoding Instruction Set



(A) Variable (e.g., Intel 80x86, VAX)



(B) Fixed (e.g., RISC V, ARM, MIPS, PowerPC, SPARC)



(C) Hybrid (e.g., RISC V Compressed (RV32IC), IBM 360/370, microMIPS, Arm Thumb2)

## Factors influencing encoding instruction set

- Desire to have as many instruction set or addressing modes as possible
- Impact of size of registers and addressing mode fields on average instruction size and hence average program size
- A desire to have instructions encoded into lengths that will be easy to handle in a pipelined implementation.

*“Good Programmers understand what is  
the machine, they are Programming!”*

*“Good Architects understand for whom  
they are Building!”*

Which one are you?