



# Chapter 2 – L1

## Instructions: Language of the Computer

# Instruction Set

- The repertoire of instructions of a computer
- Different computers have different instruction sets
  - But with many aspects in common
- Early computers had very simple instruction sets
  - Simplified implementation
- Many modern computers also have simple instruction sets

# The MIPS Instruction Set

- Used as the example throughout the book
- Stanford MIPS commercialized by MIPS Technologies ([www.mips.com](http://www.mips.com))
- Typical of many modern ISAs
  - See MIPS Reference Data tear-out card, and Appendixes B and E
- Similar ISAs have a large share of embedded core market
  - Applications in consumer electronics, network/storage equipment, cameras, printers, ...

# Arithmetic Operations

- Add and subtract, three operands

- Two sources and one destination

add a, b, c # a gets b + c

- All arithmetic operations have this form

- *Design Principle 1: Simplicity favors regularity*

- Regularity makes implementation simpler
  - Simplicity enables higher performance at lower cost

# Arithmetic Example

- C code:

$f = (g + h) - (i + j);$

- Compiled MIPS code:

```
add t0, g, h    # temp t0 = g + h  
add t1, i, j    # temp t1 = i + j  
sub f, t0, t1  # f = t0 - t1
```

# Register Operands

- Arithmetic instructions use register operands
- MIPS has a  $32 \times 32$ -bit register file
  - Use for frequently accessed data
  - Numbered 0 to 31
  - 32-bit data called a “word”
- Assembler names
  - \$t0, \$t1, ..., \$t9 for temporary values
  - \$s0, \$s1, ..., \$s7 for saved variables
- *Design Principle 2: Smaller is faster*
  - c.f. main memory: millions of locations

# Register Operand Example

- C code:

$f = (g + h) - (i + j);$

- f, ..., j in \$s0, ..., \$s4

- Compiled MIPS code:

add \$t0, \$s1, \$s2

add \$t1, \$s3, \$s4

sub \$s0, \$t0, \$t1

# Memory Operands

- Main memory used for composite data
  - Arrays, structures, dynamic data
- To apply arithmetic operations
  - Load values from memory into registers
  - Store result from register to memory
- Memory is byte addressed
  - Each address identifies an 8-bit byte
- Words are aligned in memory
  - Address must be a multiple of 4
- MIPS is Big Endian
  - Most-significant byte at most address of a word
  - c.f. Little Endian: least-significant byte at least address

# Memory Operand Example 1

- C code:

```
g = h + A[8];
```

- g in \$s1, h in \$s2, base address of A in \$s3

- Compiled MIPS code:

- Index 8 requires offset of 32
  - 4 bytes per word

```
lw $t0, 32($s3)      # load word  
add $s1, $s2, $t0
```

offset

base register

# Memory Operand Example 2

- C code:

$A[12] = h + A[8];$

- $h$  in  $\$s2$ , base address of  $A$  in  $\$s3$

- Compiled MIPS code:

- Index 8 requires offset of 32

```
lw $t0, 32($s3)      # load word  
add $t0, $s2, $t0  
sw $t0, 48($s3)      # store word
```

# Registers vs. Memory

- Registers are faster to access than memory
- Operating on memory data requires loads and stores
  - More instructions to be executed
- Compiler must use registers for variables as much as possible
  - Only spill to memory for less frequently used variables
  - Register optimization is important!

# Immediate Operands

- Constant data specified in an instruction  
`addi $s3, $s3, 4`
- No subtract immediate instruction
  - Just use a negative constant  
`addi $s2, $s1, -1`
- *Design Principle 3: Make the common case fast*
  - Small constants are common
  - Immediate operand avoids a load instruction

# The Constant Zero

- MIPS register 0 (\$zero) is the constant 0
  - Cannot be overwritten
- Useful for common operations
  - E.g., move between registers  
`add $t2, $s1, $zero`

# Unsigned Binary Integers

- Given an n-bit number

$$x = x_{n-1}2^{n-1} + x_{n-2}2^{n-2} + \dots + x_12^1 + x_02^0$$

- Range: 0 to  $+2^n - 1$

- Example

- $0000\ 0000\ 0000\ 0000\ 0000\ 0000\ 0000\ 1011_2$   
 $= 0 + \dots + 1 \times 2^3 + 0 \times 2^2 + 1 \times 2^1 + 1 \times 2^0$   
 $= 0 + \dots + 8 + 0 + 2 + 1 = 11_{10}$

- Using 32 bits

- 0 to  $+4,294,967,295$

# 2s-Complement Signed Integers

- Given an n-bit number

$$x = -x_{n-1}2^{n-1} + x_{n-2}2^{n-2} + \dots + x_12^1 + x_02^0$$

- Range:  $-2^{n-1}$  to  $+2^{n-1} - 1$

- Example

- $$\begin{aligned} &1111\ 1111\ 1111\ 1111\ 1111\ 1111\ 1111\ 1100_2 \\ &= -1 \times 2^{31} + 1 \times 2^{30} + \dots + 1 \times 2^2 + 0 \times 2^1 + 0 \times 2^0 \\ &= -2,147,483,648 + 2,147,483,644 = -4_{10} \end{aligned}$$

- Using 32 bits

- $-2,147,483,648$  to  $+2,147,483,647$

# 2s-Complement Signed Integers

- Bit 31 is sign bit
  - 1 for negative numbers
  - 0 for non-negative numbers
- $-(-2^{n-1})$  can't be represented
- Non-negative numbers have the same unsigned and 2s-complement representation
- Some specific numbers
  - 0: 0000 0000 ... 0000
  - -1: 1111 1111 ... 1111
  - Most-negative: 1000 0000 ... 0000
  - Most-positive: 0111 1111 ... 1111

# Signed Negation

- Complement and add 1
  - Complement means  $1 \rightarrow 0, 0 \rightarrow 1$

$$x + \bar{x} = 1111\dots111_2 = -1$$

$$\bar{x} + 1 = -x$$

- Example: negate +2
  - $+2 = 0000\ 0000\dots0010_2$
  - $-2 = 1111\ 1111\dots1101_2 + 1$   
 $= 1111\ 1111\dots1110_2$

# Sign Extension

- Representing a number using more bits
  - Preserve the numeric value
- In MIPS instruction set
  - addi: extend immediate value
  - 1b, 1h: extend loaded byte/halfword
  - beq, bne: extend the displacement
- Replicate the sign bit to the left
  - c.f. unsigned values: extend with 0s
- Examples: 8-bit to 16-bit
  - +2: 0000 0010 => 0000 0000 0000 0010
  - -2: 1111 1110 => 1111 1111 1111 1110

# Representing Instructions

- Instructions are encoded in binary
  - Called machine code
- MIPS instructions
  - Encoded as 32-bit instruction words
  - Small number of formats encoding operation code (opcode), register numbers, ...
  - Regularity!
- Register numbers
  - \$t0 – \$t7 are reg's 8 – 15
  - \$t8 – \$t9 are reg's 24 – 25
  - \$s0 – \$s7 are reg's 16 – 23

# Hexadecimal

- Base 16
  - Compact representation of bit strings
  - 4 bits per hex digit

|   |      |   |      |   |      |   |      |
|---|------|---|------|---|------|---|------|
| 0 | 0000 | 4 | 0100 | 8 | 1000 | c | 1100 |
| 1 | 0001 | 5 | 0101 | 9 | 1001 | d | 1101 |
| 2 | 0010 | 6 | 0110 | a | 1010 | e | 1110 |
| 3 | 0011 | 7 | 0111 | b | 1011 | f | 1111 |

- Example: eca8 6420
  - 1110 1100 1010 1000 0110 0100 0010 0000

# MIPS Reference Data

MIPS Reference Data Card ("Green Card") 1. Pull along perforation to separate card 2. Fold bottom side (columns 3 and 4) together



## CORE INSTRUCTION SET

| NAME, MNEMONIC              | FOR-MAT | OPERATION (in Verilog)                                   | OPCODE / FUNCT (Hex)      |
|-----------------------------|---------|----------------------------------------------------------|---------------------------|
| Add                         | add     | R[rd] = R[rs] + R[rt]                                    | (1) 0 / 20 <sub>hex</sub> |
| Add Immediate               | addi    | I R[rt] = R[rs] + SignExtImm                             | (1,2) 8 <sub>hex</sub>    |
| Add Imm. Unsigned           | addiu   | I R[rt] = R[rs] + SignExtImm                             | (2) 9 <sub>hex</sub>      |
| Add Unsigned                | addu    | R[rd] = R[rs] + R[rt]                                    | 0 / 21 <sub>hex</sub>     |
| And                         | and     | R[rd] = R[rs] & R[rt]                                    | 0 / 24 <sub>hex</sub>     |
| And Immediate               | andi    | I R[rt] = R[rs] & ZeroExtImm                             | (3) c <sub>hex</sub>      |
| Branch On Equal             | beq     | I if(R[rs]==R[rt]) PC=PC+4+BranchAddr                    | (4) 4 <sub>hex</sub>      |
| Branch On Not Equal         | bne     | I if(R[rs]!=R[rt]) PC=PC+4+BranchAddr                    | (4) 5 <sub>hex</sub>      |
| Jump                        | j       | J PC=JumpAddr                                            | (5) 2 <sub>hex</sub>      |
| Jump And Link               | jal     | J R[31]=PC+8;PC=JumpAddr                                 | (5) 3 <sub>hex</sub>      |
| Jump Register               | jr      | R PC=R[rs]                                               | 0 / 08 <sub>hex</sub>     |
| Load Byte Unsigned          | lbu     | I R[rt]={24'b0,M[R[rs]]+SignExtImm}(7:0)                 | (2) 24 <sub>hex</sub>     |
| Load Halfword Unsigned      | lhu     | I R[rt]={16'b0,M[R[rs]]+SignExtImm}(15:0)                | (2) 25 <sub>hex</sub>     |
| Load Linked                 | ll      | I R[rt] = M[R[rs]]+SignExtImm                            | (2,7) 30 <sub>hex</sub>   |
| Load Upper Imm.             | lui     | I R[rt] = {imm, 16'b0}                                   | f <sub>hex</sub>          |
| Load Word                   | lw      | I R[rt] = M[R[rs]]+SignExtImm                            | (2) 23 <sub>hex</sub>     |
| Nor                         | nor     | R[rd] = ~ (R[rs]   R[rt])                                | 0 / 27 <sub>hex</sub>     |
| Or                          | or      | R[rd] = R[rs]   R[rt]                                    | 0 / 25 <sub>hex</sub>     |
| Or Immediate                | ori     | I R[rt] = R[rs]   ZeroExtImm                             | (3) d <sub>hex</sub>      |
| Set Less Than               | slt     | R[rd] = (R[rs] < R[rt]) ? 1 : 0                          | 0 / 2a <sub>hex</sub>     |
| Set Less Than Imm.          | slti    | I R[rt] = (R[rs] < SignExtImm) ? 1 : 0                   | (2) a <sub>hex</sub>      |
| Set Less Than Imm. Unsigned | sltiu   | I R[rt] = (R[rs] < SignExtImm) ? 1 : 0                   | (2,6) b <sub>hex</sub>    |
| Set Less Than Unsigned      | sltu    | R[rd] = (R[rs] < R[rt]) ? 1 : 0                          | (6) 0 / 2b <sub>hex</sub> |
| Shift Left Logical          | sll     | R[rd] = R[rt] << shamt                                   | 0 / 00 <sub>hex</sub>     |
| Shift Right Logical         | srl     | R[rd] = R[rt] >> shamt                                   | 0 / 02 <sub>hex</sub>     |
| Store Byte                  | sb      | I M[R[rs]]+SignExtImm](7:0) = R[rt](7:0)                 | (2) 28 <sub>hex</sub>     |
| Store Conditional           | sc      | I M[R[rs]]+SignExtImm] = R[rt]; R[rt] = (atomic) ? 1 : 0 | (2,7) 38 <sub>hex</sub>   |
| Store Halfword              | sh      | I M[R[rs]]+SignExtImm](15:0) = R[rt](15:0)               | (2) 29 <sub>hex</sub>     |
| Store Word                  | sw      | I M[R[rs]]+SignExtImm] = R[rt]                           | (2) 2b <sub>hex</sub>     |
| Subtract                    | sub     | R[rd] = R[rs] - R[rt]                                    | (1) 0 / 22 <sub>hex</sub> |
| Subtract Unsigned           | subu    | R[rd] = R[rs] - R[rt]                                    | 0 / 23 <sub>hex</sub>     |

- (1) May cause overflow exception
- (2) SignExtImm = { 16{immediate[15]}, immediate }
- (3) ZeroExtImm = { 16{1b'0}, immediate }
- (4) BranchAddr = { 14{immediate[15]}, immediate, 2'b0 }
- (5) JumpAddr = { PC+4[31:28], address, 2'b0 }
- (6) Operands considered unsigned numbers (vs. 2's comp.)
- (7) Atomic test&set pair: R[rt] = 1 if pair atomic, 0 if not atomic

## BASIC INSTRUCTION FORMATS

|   |          |       |       |         |       |           |  |
|---|----------|-------|-------|---------|-------|-----------|--|
| R | opcode   | rs    | rt    | rd      | shamt | funct     |  |
|   | 31 26 25 | 21 20 | 16 15 | 11 10   | 6 5   | 0         |  |
| I | opcode   | rs    | rt    |         |       | immediate |  |
|   | 31 26 25 | 21 20 | 16 15 |         |       | 0         |  |
| J | opcode   |       |       | address |       |           |  |
|   | 31 26 25 |       |       |         |       | 0         |  |



## ARITHMETIC CORE INSTRUCTION SET

| NAME, MNEMONIC     | FOR-MAT | OPCODE | OPERATION                                                      | OPCODE / FMT / FUNCT / FUNCT (Hex) |
|--------------------|---------|--------|----------------------------------------------------------------|------------------------------------|
| Branch On FP True  | beft    | FI     | if(FPcond)PC=PC+4+BranchAddr (4)                               | 11/8/1--                           |
| Branch On FP False | belt    | FI     | if(!FPcond)PC=PC+4+BranchAddr(4)                               | 11/8/0--                           |
| Divide             | div     | R      | Lo=R[rs]/R[rt]; Hi=R[rs] % R[rt]                               | 0/-/-/1a                           |
| Divide Unsigned    | divu    | R      | Lo=R[rs]/R[rt]; Hi=R[rs] % R[rt]                               | 0/-/-/1b                           |
| FP Add Single      | add.s   | FR     | F[fd] = F[fs] + F[ft]                                          | 11/10/-/-0                         |
| FP Add             | add.d   | FR     | {F[fd],F[fd+1]} = {F[fs],F[fs+1]} + {F[ft],F[ft+1]}            | 11/11/-/-0                         |
| Double             |         |        |                                                                |                                    |
| FP Compare Single  | c.x.s*  | FR     | FPcond = (F[fs] op F[ft]) ? 1 : 0                              | 11/10/-/-y                         |
| FP Compare         | c.x.d*  | FR     | FPcond = ({F[fs],F[fs+1]} op {F[ft],F[ft+1]}) ? 1 : 0          | 11/11/-/-y                         |
| Double             | *       |        | (x is eq, lt, or le) (op is ==, <, or <=) (y is 32, 3c, or 3e) |                                    |
| FP Divide Single   | div.s   | FR     | F[fd] = F[fs] / F[ft]                                          | 11/10/-/-3                         |
| FP Divide          | div.d   | FR     | {F[fd],F[fd+1]} = {F[fs],F[fs+1]} / {F[ft],F[ft+1]}            | 11/11/-/-3                         |
| Double             |         |        |                                                                |                                    |
| FP Multiply Single | mul.s   | FR     | F[fd] = F[fs] * F[ft]                                          | 11/10/-/-2                         |
| FP Multiply        | mul.d   | FR     | {F[fd],F[fd+1]} = {F[fs],F[fs+1]} * {F[ft],F[ft+1]}            | 11/11/-/-2                         |
| Double             |         |        |                                                                |                                    |
| FP Subtract Single | sub.s   | FR     | F[fd] = F[fs] - F[ft]                                          | 11/10/-/-1                         |
| FP Subtract        | sub.d   | FR     | {F[fd],F[fd+1]} = {F[fs],F[fs+1]} - {F[ft],F[ft+1]}            | 11/11/-/-1                         |
| Double             |         |        |                                                                |                                    |
| Load FP Single     | lwl     | I      | F[rt]=M[R[rs]]+SignExtImm                                      | (2) 31/-/-/-                       |
| Load FP Double     | ldl     | I      | F[rt]=M[R[rs]]+SignExtImm;                                     | (2) 35/-/-/-                       |
| Move From Hi       | mfhi    | R      | R[rd] = Hi                                                     | 0/-/-/10                           |
| Move From Lo       | mflo    | R      | R[rd] = Lo                                                     | 0/-/-/12                           |
| Move From Control  | mfco    | R      | R[rd] = CR[rs]                                                 | 10/0/-/0                           |
| Multiply           | mult    | R      | {Hi,Lo} = R[rs] * R[rt]                                        | 0/-/-/18                           |
| Multiply Unsigned  | multu   | R      | {Hi,Lo} = R[rs] * R[rt]                                        | (6) 0/-/-/19                       |
| Shift Right Arith. | sra     | R      | R[rd] = R[rt] >>> shamt                                        | 0/-/-/3                            |
| Store FP Single    | swl     | I      | M[R[rs]]+SignExtImm] = F[rt]                                   | (2) 39/-/-/-                       |
| Store FP Double    | ndl     | I      | M[R[rs]]+SignExtImm] = F[rt]; M[R[rs]]+SignExtImm+4] = F[rt+1] | 3d/-/-/-                           |

## FLOATING-POINT INSTRUCTION FORMATS

| FR | opcode   | fmt   | ft    | fs    | fd        | funct |
|----|----------|-------|-------|-------|-----------|-------|
|    | 31 26 25 | 21 20 | 16 15 | 11 10 | 6 5       | 0     |
| FI | opcode   | fmt   | ft    |       | immediate |       |
|    | 31 26 25 | 21 20 | 16 15 |       |           | 0     |

## PSEUDOINSTRUCTION SET

| NAME                         | MNEMONIC | OPERATION                   |
|------------------------------|----------|-----------------------------|
| Branch Less Than             | blt      | if(R[rs]<R[rt]) PC = Label  |
| Branch Greater Than          | bgt      | if(R[rs]>R[rt]) PC = Label  |
| Branch Less Than or Equal    | ble      | if(R[rs]<=R[rt]) PC = Label |
| Branch Greater Than or Equal | bge      | if(R[rs]>=R[rt]) PC = Label |
| Load Immediate               | li       | R[rd] = immediate           |
| Move                         | move     | R[rd] = R[rs]               |

## REGISTER NAME, NUMBER, USE, CALL CONVENTION

| NAME    | NUMBER | USE                                                   | PRESERVED ACROSS A CALL? |
|---------|--------|-------------------------------------------------------|--------------------------|
| \$zero  | 0      | The Constant Value 0                                  | N.A.                     |
| Sat     | 1      | Assembler Temporary                                   | No                       |
| Sv0-Sv1 | 2-3    | Values for Function Results and Expression Evaluation | No                       |
| Sa0-Sa3 | 4-7    | Arguments                                             | No                       |
| St0-St7 | 8-15   | Temporaries                                           | No                       |
| Ss0-Ss7 | 16-23  | Saved Temporaries                                     | Yes                      |
| St8-St9 | 24-25  | Temporaries                                           | No                       |
| Sk0-Sk1 | 26-27  | Reserved for OS Kernel                                | No                       |
| Sgp     | 28     | Global Pointer                                        | Yes                      |
| Ssp     | 29     | Stack Pointer                                         | Yes                      |
| Sfp     | 30     | Frame Pointer                                         | Yes                      |
| Sra     | 31     | Return Address                                        | Yes                      |

### OPCODES, BASE CONVERSION, ASCII SYMBOLS

| MIPS (1) MIPS<br>opcode | (1) MIPS<br>funct | (2) MIPS<br>funct | Binary  | Deci-<br>mal | Hexa-<br>ASCII | ASCIIf<br>Char-<br>acter | Deci-<br>mal | Hexa-<br>ASCII | ASCIIf<br>Char-<br>acter |
|-------------------------|-------------------|-------------------|---------|--------------|----------------|--------------------------|--------------|----------------|--------------------------|
| (1)                     | add               | add.f             | 00 0000 | 0            | 0              | NUL                      | 64           | 40             |                          |
|                         |                   | sub.f             | 00 0001 | 1            | 1              | SOH                      | 65           | 41             | A                        |
| j                       | st                | mult.f            | 00 0010 | 2            | 2              | STX                      | 66           | 42             | B                        |
| jal                     | str               | div.f             | 00 0011 | 3            | 3              | ETX                      | 67           | 43             | C                        |
| beq                     | sllv              | sqrt.f            | 00 0100 | 4            | 4              | EOT                      | 68           | 44             | D                        |
| bne                     |                   | abs.f             | 00 0101 | 5            | 5              | ENQ                      | 69           | 45             | E                        |
| blez                    | srly              | mov.f             | 00 0110 | 6            | 6              | ACK                      | 70           | 46             | F                        |
| bgtrz                   | srav              | neg.f             | 00 0111 | 7            | 7              | BEL                      | 71           | 47             | G                        |
| andi                    | jr                |                   | 00 1000 | 8            | 8              | BS                       | 72           | 48             | H                        |
| addiu                   | jalr              |                   | 00 1001 | 9            | 9              | HT                       | 73           | 49             | I                        |
| slti                    | move              |                   | 00 1010 | 10           | a              | LF                       | 74           | 4a             | J                        |
| sltu                    | move              |                   | 00 1011 | 11           | b              | VT                       | 75           | 4b             | K                        |
| andi                    | syscall           | round.wf          | 00 1100 | 12           | c              | FF                       | 76           | 4c             | L                        |
| ori                     | break             | trunc.wf          | 00 1101 | 13           | d              | CR                       | 77           | 4d             | M                        |
| xora                    |                   | ceil.wf           | 00 1110 | 14           | e              | SO                       | 78           | 4e             | N                        |
| lui                     | sync              | float.wf          | 00 1111 | 15           | f              | SI                       | 79           | 4f             | O                        |
| (2)                     | mtwi              |                   | 01 0000 | 16           | 10             | DLF                      | 80           | 50             | P                        |
|                         | mtwi              |                   | 01 0001 | 17           | 11             | DC1                      | 81           | 51             | Q                        |
| mtlo                    | move.f            |                   | 01 0010 | 18           | 12             | DC2                      | 82           | 52             | R                        |
| mtlo                    | move.f            |                   | 01 0011 | 19           | 13             | DC3                      | 83           | 53             | S                        |
|                         |                   |                   | 01 0100 | 20           | 14             | DC4                      | 84           | 54             | T                        |
|                         |                   |                   | 01 0101 | 21           | 15             | NAK                      | 85           | 55             | U                        |
|                         |                   |                   | 01 0110 | 22           | 16             | SYN                      | 86           | 56             | V                        |
|                         |                   |                   | 01 0111 | 23           | 17             | FTB                      | 87           | 57             | W                        |
|                         | muli              |                   | 01 1000 | 24           | 18             | CAN                      | 88           | 58             | X                        |
|                         | muli              |                   | 01 1001 | 25           | 19             | EM                       | 89           | 59             | Y                        |
|                         | div               |                   | 01 1010 | 26           | 1a             | SUB                      | 90           | 5a             | Z                        |
|                         | div               |                   | 01 1011 | 27           | 1b             | ESC                      | 91           | 5b             |                          |
|                         |                   |                   | 01 1100 | 28           | 1c             | FS                       | 92           | 5c             |                          |
|                         |                   |                   | 01 1101 | 29           | 1d             | GS                       | 93           | 5d             |                          |
|                         |                   |                   | 01 1110 | 30           | 1e             | RS                       | 94           | 5e             |                          |
|                         |                   |                   | 01 1111 | 31           | 1f             | US                       | 95           | 5f             |                          |
| lb                      | add               | cvt.wf            | 10 0000 | 32           | 20             | Space                    | 96           | 60             |                          |
| lh                      | addi              | cvt.wf            | 10 0001 | 33           | 21             | !                        | 97           | 61             | a                        |
| lw                      | sub               |                   | 10 0010 | 34           | 22             | "                        | 98           | 62             | b                        |
| lw                      | sub               |                   | 10 0011 | 35           | 23             | #                        | 99           | 63             | c                        |
| lbu                     | and               | cvt.wf            | 10 0100 | 36           | 24             | \$                       | 100          | 64             | d                        |
| lbu                     | or                |                   | 10 0101 | 37           | 25             | %                        | 101          | 65             | e                        |
| lwz                     | xor               |                   | 10 0110 | 38           | 26             | &                        | 102          | 66             | f                        |
| lwz                     | nor               |                   | 10 0111 | 39           | 27             | '                        | 103          | 67             | g                        |
| sb                      |                   |                   | 10 1000 | 40           | 28             | (                        | 104          | 68             | h                        |
| sh                      |                   |                   | 10 1001 | 41           | 29             | )                        | 105          | 69             | i                        |
| sw                      | slt               |                   | 10 1010 | 42           | 2a             | *                        | 106          | 6a             | j                        |
| sw                      | slt               |                   | 10 1011 | 43           | 2b             | +                        | 107          | 6b             | k                        |
|                         |                   |                   | 10 1100 | 44           | 2c             | -                        | 108          | 6c             | l                        |
|                         |                   |                   | 10 1101 | 45           | 2d             | *                        | 109          | 6d             | m                        |
|                         |                   |                   | 10 1110 | 46           | 2e             | -                        | 110          | 6e             | n                        |
|                         |                   |                   | 10 1111 | 47           | 2f             | /                        | 111          | 6f             | o                        |
| ll                      | lge               | c.vtf             | 11 0000 | 48           | 30             | 0                        | 112          | 70             | p                        |
| lqc1                    | lqed              | c.unf             | 11 0001 | 49           | 31             | 1                        | 113          | 71             | q                        |
| lqc2                    | lit               | c.enf             | 11 0010 | 50           | 32             | 2                        | 114          | 72             | r                        |
| pref                    | lit               | c.eeqf            | 11 0011 | 51           | 33             | 3                        | 115          | 73             | s                        |
| lqc1                    | req               | c.vlf             | 11 0100 | 52           | 34             | 4                        | 116          | 74             | t                        |
| lqc1                    | req               | c.vlf             | 11 0101 | 53           | 35             | 5                        | 117          | 75             | u                        |
| lqc2                    | lne               | c.vnef            | 11 0110 | 54           | 36             | 6                        | 118          | 76             | v                        |
| lqc2                    | lne               | c.vnef            | 11 0111 | 55           | 37             | 7                        | 119          | 77             | w                        |
| sc                      | c.vtf             |                   | 11 1000 | 56           | 38             | 8                        | 120          | 78             | x                        |
| smc1                    | c.vneaf           |                   | 11 1001 | 57           | 39             | 9                        | 121          | 79             | y                        |
| smc2                    | c.vneaf           |                   | 11 1010 | 58           | 3a             | -                        | 122          | 7a             | z                        |
| smc2                    | c.vneaf           |                   | 11 1011 | 59           | 3b             | -                        | 123          | 7b             |                          |
|                         | c.vtf             |                   | 11 1100 | 60           | 3c             | <                        | 124          | 7c             |                          |
| sdc1                    | c.ngnf            |                   | 11 1101 | 61           | 3d             | -                        | 125          | 7d             |                          |
| sdc2                    | c.ngnf            |                   | 11 1110 | 62           | 3e             | >                        | 126          | 7e             |                          |
|                         | c.ngnf            |                   | 11 1111 | 63           | 3f             | ?                        | 127          | 7f             | DEL                      |

(1) opcode(31:26) == 0

(2) opcode(31:26) == 17<sub>10</sub> (11<sub>hex</sub>); if fmt(25:21)==16<sub>10</sub> (10<sub>hex</sub>) f=a (single); if fmt(25:21)==17<sub>10</sub> (11<sub>hex</sub>) f=d (double)

③

### IEEE 754 FLOATING POINT STANDARD

$$(-1)^S \times (1 + \text{Fraction}) \times 2^{(\text{Exponent} - \text{Bias})}$$

where Single Precision Bias = 127,  
Double Precision Bias = 1023.

### IEEE Single Precision and Double Precision Formats:

|   |          |          |
|---|----------|----------|
| S | Exponent | Fraction |
| S | Exponent | Fraction |
| S | Exponent | Fraction |

### MEMORY ALLOCATION



### DATA ALIGNMENT

| Double Word                                                        |      |           |      |           |      |           |      |
|--------------------------------------------------------------------|------|-----------|------|-----------|------|-----------|------|
| Word                                                               |      |           |      | Word      |      |           |      |
| Half Word                                                          |      | Half Word |      | Half Word |      | Half Word |      |
| Byte                                                               | Byte | Byte      | Byte | Byte      | Byte | Byte      | Byte |
| Value of three least significant bits of byte address (Big Endian) |      |           |      |           |      |           |      |

### EXCEPTION CONTROL REGISTERS: CAUSE AND STATUS



BD = Branch Delay, UM = User Mode, EL = Exception Level, IE = Interrupt Enable

### EXCEPTION CODES

| Num<br>ber | Name | Cause of Exception              | Num<br>ber | Name | Cause of Exception             |
|------------|------|---------------------------------|------------|------|--------------------------------|
| 0          | Int  | Interrupt (hardware)            | 9          | Bp   | Breakpoint Exception           |
| 4          | AdF  | Address Error Exception         | 10         | RI   | Reserved Instruction Exception |
| 5          | AdES | Address Error Exception (store) | 11         | CplI | Coprocessor Unimplemented      |
| 6          | IBF  | Bus Error on Instruction Fetch  | 12         | Ov   | Arithmetic Overflow Exception  |
| 7          | DBE  | Bus Error on Load or Store      | 13         | Tr   | Trap                           |
| 8          | Sys  | Syscall Exception               | 15         | FPE  | Floating Point Exception       |

### SIZE PREFIXES (10<sup>3</sup> for Disk, Communication; 2<sup>10</sup> for Memory)

| PRE-<br>SIZE                       | FIX   | PRE-<br>SIZE                       | FIX    | PRE-<br>SIZE      | FIX                      |
|------------------------------------|-------|------------------------------------|--------|-------------------|--------------------------|
| 10 <sup>3</sup> , 2 <sup>10</sup>  | Kilo- | 10 <sup>15</sup> , 2 <sup>30</sup> | Peta-  | 10 <sup>-3</sup>  | milli-                   |
| 10 <sup>6</sup> , 2 <sup>20</sup>  | Mega- | 10 <sup>18</sup> , 2 <sup>30</sup> | Exa-   | 10 <sup>-6</sup>  | micro-                   |
| 10 <sup>9</sup> , 2 <sup>30</sup>  | Giga- | 10 <sup>21</sup> , 2 <sup>30</sup> | Zetta- | 10 <sup>-9</sup>  | nano-                    |
| 10 <sup>12</sup> , 2 <sup>40</sup> | Tera- | 10 <sup>24</sup> , 2 <sup>30</sup> | Yotta- | 10 <sup>-12</sup> | pico-                    |
|                                    |       |                                    |        |                   | 10 <sup>-24</sup> yocto- |

The symbol for each prefix is just its first letter, except µ is used for micro.



# Chapter 2 - L2

## Instructions: Language of the Computer

# MIPS R-format Instructions

| op     | rs     | rt     | rd     | shamt  | funct  |
|--------|--------|--------|--------|--------|--------|
| 6 bits | 5 bits | 5 bits | 5 bits | 5 bits | 6 bits |

## Instruction fields

- op: operation code (opcode)
- rs: first source register number
- rt: second source register number
- rd: destination register number
- shamt: shift amount (00000 for now)
- funct: function code (extends opcode)

# R-format Example

| op     | rs     | rt     | rd     | shamt  | funct  |
|--------|--------|--------|--------|--------|--------|
| 6 bits | 5 bits | 5 bits | 5 bits | 5 bits | 6 bits |

add \$t0, \$s1, \$s2

|         |       |       |       |       |        |
|---------|-------|-------|-------|-------|--------|
| special | \$s1  | \$s2  | \$t0  | 0     | add    |
| 0       | 17    | 18    | 8     | 0     | 32     |
| 000000  | 10001 | 10010 | 01000 | 00000 | 100000 |

$0000001000110010010000000100000_2 = 02324020_{16}$

# MIPS I-format Instructions

| op     | rs     | rt     | constant or address |
|--------|--------|--------|---------------------|
| 6 bits | 5 bits | 5 bits | 16 bits             |

- Immediate arithmetic and load/store instructions
  - rt: destination or source register number
  - Constant:  $-2^{15}$  to  $+2^{15} - 1$
  - Address: offset added to base address in rs
- *Design Principle 4: Good design demands good compromises*
  - Different formats complicate decoding, but allow 32-bit instructions uniformly
  - Keep formats as similar as possible



# Stored Program Computers

## The BIG Picture



- Instructions represented in binary, just like data
- Instructions and data stored in memory
- Programs can operate on programs
  - e.g., compilers, linkers, ...
- Binary compatibility allows compiled programs to work on different computers
  - Standardized ISAs

# Logical Operations

- Instructions for bitwise manipulation

| Operation   | C  | Java | MIPS      |
|-------------|----|------|-----------|
| Shift left  | << | <<   | sll       |
| Shift right | >> | >>>  | srl       |
| Bitwise AND | &  | &    | and, andi |
| Bitwise OR  |    |      | or, ori   |
| Bitwise NOT | ~  | ~    | nor       |

- Useful for extracting and inserting groups of bits in a word

# Shift Operations

| op     | rs     | rt     | rd     | shamt  | funct  |
|--------|--------|--------|--------|--------|--------|
| 6 bits | 5 bits | 5 bits | 5 bits | 5 bits | 6 bits |

- shamt: how many positions to shift
- Shift left logical
  - Shift left and fill with 0 bits
  - $sll$  by  $i$  bits multiplies by  $2^i$
- Shift right logical
  - Shift right and fill with 0 bits
  - $srl$  by  $i$  bits divides by  $2^i$  (unsigned only)

# AND Operations

- Useful to mask bits in a word
  - Select some bits, clear others to 0

and \$t0, \$t1, \$t2

|      |                                         |
|------|-----------------------------------------|
| \$t2 | 0000 0000 0000 0000 0000 1101 1100 0000 |
|------|-----------------------------------------|

|      |                                         |
|------|-----------------------------------------|
| \$t1 | 0000 0000 0000 0000 0011 1100 0000 0000 |
|------|-----------------------------------------|

|      |                                         |
|------|-----------------------------------------|
| \$t0 | 0000 0000 0000 0000 0000 1100 0000 0000 |
|------|-----------------------------------------|

# OR Operations

- Useful to include bits in a word
    - Set some bits to 1, leave others unchanged
- or \$t0, \$t1, \$t2

\$t2 0000 0000 0000 0000 0000 1101 1100 0000

\$t1 0000 0000 0000 0000 0011 1100 0000 0000

\$t0 0000 0000 0000 0000 0011 1101 1100 0000

# NOT Operations

- Useful to invert bits in a word
  - Change 0 to 1, and 1 to 0
- MIPS has NOR 3-operand instruction
  - $a \text{ NOR } b == \text{NOT} (a \text{ OR } b)$

nor \$t0, \$t1, \$zero ←

Register 0: always  
read as zero

\$t1 0000 0000 0000 0000 0011 1100 0000 0000

\$t0 1111 1111 1111 1111 1100 0011 1111 1111

# Conditional Operations

- Branch to a labeled instruction if a condition is true
  - Otherwise, continue sequentially
- `beq rs, rt, L1`
  - if ( $rs == rt$ ) branch to instruction labeled L1;
- `bne rs, rt, L1`
  - if ( $rs != rt$ ) branch to instruction labeled L1;
- `j L1`
  - unconditional jump to instruction labeled L1

# Compiling If Statements

## C code:

```
if (i==j) f = g+h;  
else f = g-h;
```

- f, g, ... in \$s0, \$s1, ...

## Compiled MIPS code:

```
bne $s3, $s4, Else  
add $s0, $s1, $s2  
j Exit  
Else: sub $s0, $s1, $s2  
Exit: ...
```



Assembler calculates addresses

# Compiling Loop Statements

- C code:

```
while (save[i] == k) i += 1;
```

- i in \$s3, k in \$s5, address of save in \$s6

- Compiled MIPS code:

```
Loop: sll    $t1, $s3, 2
      add   $t1, $t1, $s6
      lw     $t0, 0($t1)
      bne   $t0, $s5, Exit
      addi  $s3, $s3, 1
      j     Loop
```

```
Exit: ...
```



# Basic Blocks

- A basic block is a sequence of instructions with
  - No embedded branches (except at end)
  - No branch targets (except at beginning)



- A compiler identifies basic blocks for optimization
- An advanced processor can accelerate execution of basic blocks

# More Conditional Operations

- Set result to 1 if a condition is true
  - Otherwise, set to 0
- `slt rd, rs, rt`
  - `if (rs < rt) rd = 1; else rd = 0;`
- `slti rt, rs, constant`
  - `if (rs < constant) rt = 1; else rt = 0;`
- Use in combination with `beq`, `bne`

```
slt $t0, $s1, $s2    # if ($s1 < $s2)
bne $t0, $zero, L    # branch to L
```

# Branch Instruction Design

- Why not `blt`, `bge`, etc?
- Hardware for  $<$ ,  $\geq$ , ... slower than  $=$ ,  $\neq$ 
  - Combining with branch involves more work per instruction, requiring a slower clock
  - All instructions penalized!
- `beq` and `bne` are the common case
- This is a good design compromise

# Signed vs. Unsigned

- Signed comparison: s1t, s1ti
- Unsigned comparison: s1tu, s1tui
- Example
  - \$s0 = 1111 1111 1111 1111 1111 1111 1111 1111 1111 1111
  - \$s1 = 0000 0000 0000 0000 0000 0000 0000 0000 0001
  - s1t \$t0, \$s0, \$s1 # signed
    - $-1 < +1 \Rightarrow \$t0 = 1$
  - s1tu \$t0, \$s0, \$s1 # unsigned
    - $+4,294,967,295 > +1 \Rightarrow \$t0 = 0$

# Procedure Calling

- Steps required
  1. Place parameters in registers
  2. Transfer control to procedure
  3. Acquire storage for procedure
  4. Perform procedure's operations
  5. Place result in register for caller
  6. Return to place of call

# Register Usage

- \$a0 – \$a3: arguments (reg's 4 – 7)
- \$v0, \$v1: result values (reg's 2 and 3)
- \$t0 – \$t9: temporaries
  - Can be overwritten by callee
- \$s0 – \$s7: saved
  - Must be saved/restored by callee
- \$gp: global pointer for static data (reg 28)
- \$sp: stack pointer (reg 29)
- \$fp: frame pointer (reg 30)
- \$ra: return address (reg 31)

# Procedure Call Instructions

- Procedure call: jump and link

`jal ProcedureLabel`

- Address of following instruction put in \$ra
- Jumps to target address

- Procedure return: jump register

`jr $ra`

- Copies \$ra to program counter
- Can also be used for computed jumps
  - e.g., for case/switch statements

# Leaf Procedure Example

- C code:

```
int leaf_example (int g, h, i, j)
{ int f;
  f = (g + h) - (i + j);
  return f;
}
```

- Arguments g, ..., j in \$a0, ..., \$a3
- f in \$s0 (hence, need to save \$s0 on stack)
- Result in \$v0

# Leaf Procedure Example

- MIPS code:

|                        |                    |
|------------------------|--------------------|
| leaf_example:          |                    |
| addi \$sp, \$sp, -4    | Save \$s0 on stack |
| sw \$s0, 0(\$sp)       |                    |
| add \$t0, \$a0, \$a1   | Procedure body     |
| add \$t1, \$a2, \$a3   |                    |
| sub \$s0, \$t0, \$t1   |                    |
| add \$v0, \$s0, \$zero | Result             |
| lw \$s0, 0(\$sp)       |                    |
| addi \$sp, \$sp, 4     | Restore \$s0       |
| jr \$ra                | Return             |

# Non-Leaf Procedures

- Procedures that call other procedures
- For nested call, caller needs to save on the stack:
  - Its return address
  - Any arguments and temporaries needed after the call
- Restore from the stack after the call

# Non-Leaf Procedure Example

- C code:

```
int fact (int n)
{
    if (n < 1) return f;
    else return n * fact(n - 1);
}
```

- Argument n in \$a0
- Result in \$v0

# Non-Leaf Procedure Example

## MIPS code:

```
fact:
    addi $sp, $sp, -8      # adjust stack for 2 items
    sw   $ra, 4($sp)       # save return address
    sw   $a0, 0($sp)       # save argument
    slti $t0, $a0, 1        # test for n < 1
    beq $t0, $zero, L1
    addi $v0, $zero, 1      # if so, result is 1
    addi $sp, $sp, 8        # pop 2 items from stack
    jr   $ra                # and return
L1: addi $a0, $a0, -1      # else decrement n
    jal fact               # recursive call
    lw   $a0, 0($sp)       # restore original n
    lw   $ra, 4($sp)       # and return address
    addi $sp, $sp, 8        # pop 2 items from stack
    mul $v0, $a0, $v0       # multiply to get result
    jr   $ra                # and return
```

# Local Data on the Stack



- Local data allocated by callee
  - e.g., C automatic variables
- Procedure frame (activation record)
  - Used by some compilers to manage stack storage

# Memory Layout

- Text: program code
- Static data: global variables
  - e.g., static variables in C, constant arrays and strings
  - \$gp initialized to address allowing  $\pm$ offsets into this segment
- Dynamic data: heap
  - E.g., malloc in C, new in Java
- Stack: automatic storage

