

# ECE 550D

## Fundamentals of Computer Systems and Engineering

Fall 2025

### Instruction Set Architecture (ISA)

Yiran Chen and Hai “Helen” Li  
Duke University

Slides are derived from work by  
Rabih Younes, John Board, Andrew Hilton, and Tyler Bletsch (Duke)

# Running Software



# Running Software

- **Software** runs on general-purpose hardware (compared to application-specific hardware)
  - The Central Processing Unit (CPU)
    - Also called “processor”
- High-level software is compiled/interpreted into **assembly** (low-level software/code)
- Assembler then converts assembly into **machine code** (1s and 0s)
- → CPU design and machine code instructions should be designed to work together
  - CPU should be able to accommodate every machine code instruction
  - Machine code instructions should be able to run on the CPU hardware
    - ~1 instruction per clock cycle (more on this later)
  - The set of instructions we’re able to run on a CPU is called the “**instruction set architecture**” (ISA)
    - It is the interface between software (code) and hardware (CPU)

# First, Let's See What Assembly Is

- Assembly programming:
  - 1 machine instruction at a time
  - Still in “human readable form”, e.g.:
    - add \$1, \$2, \$3

How to say “\$1” out loud:



- Much “lower-level” than any other programming language
  - Uses registers in regfile (vs unlimited variables in higher languages)

# Registers

- There are 2 places where processors can store data:
  - Registers (saw these -- sort of):
    - In processor
    - Few of them (e.g., 32)
    - Very fast
  - Main memory (later):
    - I.e., RAM
    - Outside of processor
    - Huge (e.g., 8 GB)
    - Much slower than registers
- For now: think of registers like “variables”
  - But we have a limited number of them
  - E.g.,  $\$1 = \$2 + \$3$ 
    - much like  $s = a + b$

# Simple Assembly Code Example

```
// silly C code  
  
int sum, temp, x, y;  
while (true) {  
    temp = x + y;  
    sum = sum + temp;  
}
```

```
// equivalent assembly code (for MIPS*)  
  
loop:   lw $1, Memory[1004]  
        lw $2, Memory[1008]  
        add $3, $1, $2  
        add $4, $4, $3  
        j loop
```

*Memory references  
don't quite work  
like this...we'll  
correct this later.*

OK, so what does this assembly code mean?  
Let's dig into each line ...

# Simple Assembly Code Example

```
loop:    lw $1, Memory[1004]
          lw $2, Memory[1008]
          add $3, $1, $2
          add $4, $4, $3
          j loop
```

```
// silly C code

int sum, temp, x, y;
while (true) {
    temp = x + y;
    sum = sum + temp;
}
```

“loop:” = line label (in case we need to refer to this instruction later)

lw = “load word” = read a word (32 bits) from memory

\$1 = “register 1” → put result read from memory into register 1

Memory[1004] = address in memory to read from (where x lives)

*Note: almost all MIPS instructions put destination (where result gets written) first (in this case, \$1)*

# Simple Assembly Code Example

```
loop:    lw $1, Memory[1004]
          lw $2, Memory[1008]
          add $3, $1, $2
          add $4, $4, $3
          j loop
```

```
// silly C code

int sum, temp, x, y;
while (true) {
    temp = x + y;
    sum = sum + temp;
}
```

lw = “load word” = read a word (32 bits) from memory

\$2 = “register 2” → put result read from memory into register 2

Memory[1008] = address in memory to read from (where y lives)

# Simple Assembly Code Example

```
loop:    lw $1, Memory[1004]
          lw $2, Memory[1008]
          add $3, $1, $2
          add $4, $4, $3
          j loop
```

```
// silly C code

int sum, temp, x, y;
while (true) {
    temp = x + y;
    sum = sum + temp;
}
```

add \$3, \$1, \$2 = add what's in \$1 to what's in \$2 and put result in \$3

# Simple Assembly Code Example

```
loop:    lw $1, Memory[1004]
          lw $2, Memory[1008]
          add $3, $1, $2
          add $4, $4, $3
          j loop
```

```
// silly C code

int sum, temp, x, y;
while (true) {
    temp = x + y;
    sum = sum + temp;
}
```

**add \$4, \$4, \$3 = add what's in \$4 to what's in \$3 and put result in \$4**

*Note: this instruction overwrites previous value in \$4*

# Simple Assembly Code Example

```
loop:    lw $1, Memory[1004]
          lw $2, Memory[1008]
          add $3, $1, $2
          add $4, $4, $3
          j loop
```

```
// silly C code

int sum, temp, x, y;
while (true) {
    temp = x + y;
    sum = sum + temp;
}
```

j = "jump"

loop = address of instruction at label "loop" (the first lw instruction above)  
sets next PC (Program Counter) to the address labeled by "loop"

*Note: all other instructions in this code set next PC = PC++ (remember, this code is stored somewhere in memory and that's why we need a PC to go through the lines of code in memory)*

# Assembly to Machine Code

- Human readable is not (easily) machine executable
  - add \$1, \$2, \$3
- Instructions are numbers too!
  - Bit fields (like FP numbers)
- Instruction Format
  - Establishes a mapping from “instruction” to binary values
  - Which bit positions correspond to which parts of the instruction (operation, operands, etc.)
- In MIPS (a 32-bit CPU), each assembly instruction has a unique 32-bit representation:
  - add \$3, \$2, \$7       $\leftrightarrow$  000000001000111000110000100000
  - lw \$8, Mem[1004]     $\leftrightarrow$  1000110000001000000000111101100
- Assembler does this translation

# What Must Be Specified in an Instruction?

- Instruction “**opcode**”
  - What should this **operation** do? (add, subtract,...)
- Location of operands and result
  - Registers (which ones?)
  - Memory (which address?)
  - Immediates/constants (what value?)
- Data type and size
  - Usually included in opcode
  - E.g., signed vs. unsigned int (if it matters)
- What instruction comes next?
  - Sequentially, the next instruction, or jump elsewhere (if/else, loops)

# The ISA

- Instruction Set Architecture (ISA)
  - **Contract between hardware and software**
  - Specifies everything hardware and software need to agree on
    - Instruction encoding and effects
    - (lots of other things that won't make sense yet)
- Many different ISAs for different processors
  - x86 and x86\_64 (Intel and AMD)
  - POWER (IBM)
  - **MIPS** (*easier to learn; good starting point; we'll use this*)
  - ARM
  - SPARC (Oracle)

# But I don't have a MIPS computer?

- You can use SPIM (MIPS simulator)
  - Command-line version: spim
  - Graphical version: qtspim



# ISAs: RISC vs CISC

- Two broad categories of ISAs:
  - Complex Instruction Set Computing
    - Came first, days when people always directly wrote assembly
    - Big complex instructions
  - Reduced Instruction Set Computing
    - Goal: make hardware simple and fast
    - Write in high level language, let compiler do the dirty work
      - Rely on compiler to optimize for you
- Note:
  - Sometimes fuzzy: ISAs may have some features of each
  - Common misconception: not about how many different insns!

instructions

# ISAs: RISC vs CISC

## Reduced Instruction Set Computing

- Simple, fixed length instruction encoding
- Few **memory addressing modes**
- Instructions have one effect
- “Many” registers (e.g., 32)
- Three-operand arithmetic ( $\text{dest} = \text{src1 op src2}$ )
- **Load-store** ISA

operation

# ISAs: RISC vs CISC

- **Complex Instruction Set Computing**

- Variable length instruction encoding (sometimes quite complex)
- Many addressing modes, some quite complex
- Side-effecting and/or complex instructions
- Few registers (e.g., 8)
- Various operand models
  - Stack
  - Two-operand ( $\text{dest} = \text{src op dest}$ )
  - Implicit operands
- Can operate directly on memory
  - Register = Memory op Register
  - Memory = Memory op Register
  - Memory = Memory op Memory

# Load-Store ISA

- Load-store ISA:
  - Specific instructions (loads/stores) to access memory
    - Loads read memory (and **only** read memory)
    - Stores write memory (and **only** write memory)
    - Everything else happens in registers
- In contrast with
  - General memory operands ( $\$4 = \text{mem}[\$5] + \$3$ )
  - Memory/memory operations:  $\text{mem}[\$4] = \text{mem}[\$5] + \$3$

# Process/Program Memory

Instructions and data  
are stored in memory



# How Is Code Executed?



- **Instruction Fetch:**  
Read instruction bits from memory
- **Decode:**  
Figure out what those bits mean
- **Operand Fetch:**  
Read registers (+ mem to get sources)
- **Execute:**  
Do the actual operation (e.g., add the numbers)
- **Result Store:**  
Write result to register or memory
- **Next Instruction:**  
Figure out mem addr of next insn, repeat

Called Von Neumann model

# MIPS Registers

- Recall: registers (in regfile)
  - Fast
  - In CPU
  - Directly compute on them
- In MIPS:
  - 32 x 32-bit general-purpose int regs
    - With R0 = 0
  - + floating-point registers
  - + special purpose registers, such as
    - PC = Address of next insn



# Assembly Code Execution Example

- What is the simplest computation we might do?
  - Add two numbers:

$$x = a + b;$$

**add \$1, \$2, \$3**

“Add \$2 + \$3, and store it in \$1”

*Note: when writing assembly, basically, pick reg for a, reg for b, reg for x*

Not enough regs for all variables? We'll talk about that later...

# Executing Add



| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |

Memory

PC tells us where to execute next

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | 1234 5678 |
| \$2      | C001 D00D |
| \$3      | 1BAD F00D |
| \$4      | 9999 9999 |
| \$5      | ABAB ABAB |
| \$6      | 0000 0001 |
| \$7      | 0000 0002 |
| \$8      | 0000 0000 |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1000 |

RegFile

# Executing Add



| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |

Add reads its source registers, and uses their values directly (“register direct”)

9999 9999  
0000 0001  
9999 999A

| Register   | Value            |
|------------|------------------|
| \$0        | 0000 0000        |
| \$1        | 1234 5678        |
| \$2        | C001 D00D        |
| \$3        | 1BAD F00D        |
| <b>\$4</b> | <b>9999 9999</b> |
| \$5        | ABAB ABAB        |
| <b>\$6</b> | <b>0000 0001</b> |
| \$7        | 0000 0002        |
| \$8        | 0000 0000        |
| \$9        | 0000 0000        |
| \$10       | 0000 0000        |
| \$11       | 0000 0000        |
| \$12       | 0000 0000        |
| ...        | ...              |
| PC         | 0000 1000        |

# Executing Add

| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |

Add writes its result to its destination register

| Register   | Value            |
|------------|------------------|
| \$0        | 0000 0000        |
| \$1        | 1234 5678        |
| \$2        | C001 D00D        |
| \$3        | 1BAD F00D        |
| \$4        | 9999 9999        |
| \$5        | ABAB ABAB        |
| \$6        | 0000 0001        |
| \$7        | 0000 0002        |
| <b>\$8</b> | <b>9999 999A</b> |
| \$9        | 0000 0000        |
| \$10       | 0000 0000        |
| \$11       | 0000 0000        |
| \$12       | 0000 0000        |
| ...        | ...              |
| PC         | 0000 1000        |

# Executing Add

| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |

And goes to the sequentially next instruction  
 $(PC = PC + 4)$

Why 4? Counts in bytes (8 bits)  
 32-bit system  $\rightarrow 32/8 = 4$  bytes per insn

| Register  | Value            |
|-----------|------------------|
| \$0       | 0000 0000        |
| \$1       | 1234 5678        |
| \$2       | C001 D00D        |
| \$3       | 1BAD F00D        |
| \$4       | 9999 9999        |
| \$5       | ABAB ABAB        |
| \$6       | 0000 0001        |
| \$7       | 0000 0002        |
| \$8       | 9999 999A        |
| \$9       | 0000 0000        |
| \$10      | 0000 0000        |
| \$11      | 0000 0000        |
| \$12      | 0000 0000        |
| ...       | ...              |
| <b>PC</b> | <b>0000 1004</b> |

# Executing Add

| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |



We set \$5 equal to  
 $\$8\ (9999\ 999A) + \$7\ (2) = 9999\ 999C$

and PC = PC +4

| Register | Value            |
|----------|------------------|
| \$0      | 0000 0000        |
| \$1      | 1234 5678        |
| \$2      | C001 D00D        |
| \$3      | 1BAD F00D        |
| \$4      | 9999 9999        |
| \$5      | <b>9999 999C</b> |
| \$6      | 0000 0001        |
| \$7      | 0000 0002        |
| \$8      | 9999 999A        |
| \$9      | 0000 0000        |
| \$10     | 0000 0000        |
| \$11     | 0000 0000        |
| \$12     | 0000 0000        |
| ...      | ...              |
| PC       | <b>0000 1008</b> |

# Executing Add

| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |



It's perfectly fine to have \$6 as a src and a dst  
This is just like  $x = x + x$ ; in C, Java, etc:

$$1 + 1 = 2$$

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | 1234 5678 |
| \$2      | C001 D00D |
| \$3      | 1BAD F00D |
| \$4      | 9999 9999 |
| \$5      | 9999 999C |
| \$6      | 0000 0001 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1008 |

# Executing Add

| Address | Instruction       |
|---------|-------------------|
| 1000    | add \$8, \$4, \$6 |
| 1004    | add \$5, \$8, \$7 |
| 1008    | add \$6, \$6, \$6 |
| 100C    | ...               |
| 1010    | ...               |

Its perfectly fine to have \$6 as a src and a dst  
 This is just like  $x = x + x$ ; in C, Java, etc:

$$1 + 1 = 2$$

| Register   | Value            |
|------------|------------------|
| \$0        | 0000 0000        |
| \$1        | 1234 5678        |
| \$2        | C001 D00D        |
| \$3        | 1BAD F00D        |
| \$4        | 9999 9999        |
| \$5        | 9999 999C        |
| <b>\$6</b> | <b>0000 0002</b> |
| \$7        | 0000 0002        |
| \$8        | 9999 999A        |
| \$9        | 0000 0000        |
| \$10       | 0000 0000        |
| \$11       | 0000 0000        |
| \$12       | 0000 0000        |
| ...        | ...              |
| PC         | <b>0000 100C</b> |

# **MIPS Instruction Formats**

# MIPS Instruction Formats

R-type: Register-Register



I-type: Register-Immediate



J-type: Jump / Call



Terminology

Op = opcode

Rs, Rt, Rd = register specifier

# R Type: <OP> rd, rs, rt

R-type: Register-Register



|       |                                                                                         |
|-------|-----------------------------------------------------------------------------------------|
| op    | a 6-bit operation code.                                                                 |
| rs    | a 5-bit source register.                                                                |
| rt    | a 5-bit target (source) register.                                                       |
| rd    | a 5-bit destination register.                                                           |
| shamt | a 5-bit shift amount.                                                                   |
| func  | a 6-bit function field<br>(to identify more specific operations that have the same op). |

Example: add \$1, \$2, \$3



# Executing add (with insns values)

| Address | Instruction (assembly) | Machine Code | Register | Value     |
|---------|------------------------|--------------|----------|-----------|
| 1000    | add \$8, \$4, \$6      | 0086 4020    | \$0      | 0000 0000 |
| 1004    | add \$5, \$8, \$7      | 0107 2820    | \$1      | 1234 5678 |
| 1008    | add \$6, \$6, \$6      | 00C6 3020    | \$2      | C001 D00D |
| 100C    | ...                    | ...          | \$3      | 1BAD F00D |
| 1010    | ...                    | ...          | \$4      | 9999 9999 |
|         |                        |              | \$5      | 9999 999C |
|         |                        |              | \$6      | 0000 0002 |
|         |                        |              | \$7      | 0000 0002 |
|         |                        |              | \$8      | 9999 999A |
|         |                        |              | \$9      | 0000 0000 |
|         |                        |              | \$10     | 0000 0000 |
|         |                        |              | ...      | ...       |
|         |                        |              | PC       | 0000 100C |

# qtspim: Shows you similar state

Data memory is in this tab

Your program is in the “text” region of memory

The screenshot shows the QtSpim debugger interface. On the left, there is a 'Registers' pane titled 'Int Regs [16]' containing a list of integer registers (R0-R27) with their current values. On the right, there is a 'Text' pane titled 'User Text Segment [00400000]..[00440000]' which displays assembly code for the user program. The assembly code includes instructions like lw, addiu, sll, addu, jal, and syscall. Annotations in blue text and arrows point to specific parts of the interface: 'Registers' points to the 'Registers' tab in the top menu and the 'Registers' pane; 'Code' points to the 'Text' tab in the top menu and the 'Text' pane.

Registers

Code

```
User Text Segment [00400000]..[00440000]
[00400000] 8fa40000 lw $4, 0($29) ; 183: lv $a0 0($sp) # argc
[00400004] 27a50004 addiu $5, $29, 4 ; 184: addiu $a1 $sp 4 # argv
[00400008] 24a60004 addiu $6, $5, 4 ; 185: addiu $a2 $a1 4 # envp
[0040000c] 00041080 sll $2, $4, 2 ; 186: sll $v0 $a0 2
[00400010] 00c23021 addu $6, $6, $2 ; 187: addu $a2 $a2 $v0
[00400014] 0c100009 jal 0x00400024 [main] ; 188: jal main
[00400018] 00000000 nop ; 189: nop
[0040001c] 3402000a ori $2, $0, 10 ; 191: li $v0 10
[00400020] 0000000c syscall ; 192: syscall # syscall 10 (exit)
[00400024] 34020004 ori $2, $0, 4 ; 40: li $v0, 4 # syscall 4 (print_str)
[00400028] 3c041001 lui $4, 4097 [msg] ; 41: la $a0, msg # argument: string
[0040002c] 0000000c syscall ; 42: syscall # print the string
[00400030] 8f898000 lw $9, -32768($28) ; 43: lv $t1, foobar
[00400034] 03e00008 jr $31 ; 45: jr $ra # retn to caller

Kernel Text Segment [80000000]..[80010000]
[80000180] 0001d821 addu $27, $0, $1 ; 90: move $k1 $at # Save $at
[80000184] 3c019000 lui $1, -28672 ; 92: sv $v0 $1 # Not re-enter and we can't
trust $sp
[80000188] ac220200 sw $2, 512($1)
[8000018c] 3c019000 lui $1, -28672 ; 93: sv $a0 $2 # But we need to use these
registers
[80000190] ac240204 sw $4, 516($1)
[80000194] 401a6800 mfc0 $26, $13 ; 95: mfc0 $k0 $13 # Cause register
[80000198] 001a2082 srl $4, $26, 2 ; 96: srl $a0 $k0 2 # Extract ExcCode Field
[8000019c] 3084001f andi $4, $4, 31 ; 97: andi $a0 $a0 0x1f
[800001a0] 34020004 ori $2, $0, 4 ; 101: li $v0 4 # syscall 4 (print_str)
[800001a4] 3c049000 lui $4, -28672 [_m1_] ; 102: la $a0 _m1_
[800001a8] 0000000c syscall ; 103: syscall
[800001ac] 34020001 ori $2, $0, 1 ; 105: li $v0 1 # syscall 1 (print_int)
[800001b0] 001a2082 srl $4, $26, 2 ; 106: srl $a0 $k0 2 # Extract ExcCode Field
[800001b4] 3084001f andi $4, $4, 31 ; 107: andi $a0 $a0 0x1f
[800001b8] 0000000c syscall ; 108: syscall
[800001bc] 34020004 ori $2, $0, 4 ; 110: li $v0 4 # syscall 4 (print_str)

SPIM is distributed under a BSD license.
See the file README for a full copyright notice.
QtSPIM is linked to the Qt library, which is distributed under the GNU Lesser General Public License version 3 and version 2.1.
Read from unused memory-mapped IO address (0xfffff8000)
Read from unused memory-mapped IO address (0xfffff8000)
```

# Other Similar Instructions

- `sub $rDest, $rSrc1, $rSrc2`
- `mul $rDest, $rSrc1, $rSrc2` (pseudo-instr)
- `div $rDest, $rSrc1, $rSrc2` (pseudo-instr)
- `and $rDest, $rSrc1, $rSrc2`
- `or $rDest, $rSrc1, $rSrc2`
- `xor $rDest, $rSrc1, $rSrc2`
- ...

# Pseudo Instructions

- Some “instructions” are pseudo-instructions
  - Actually, assemble into 2 instructions:
- `mul $1, $2, $3` is really
  - `mul $2, $3`
  - `mflo $1`
- `mul` takes two srcs, writes to special regs `lo` and `hi`
- `mflo` moves from `lo` into dst reg

# I-Type <op> rt, rs, immediate

- Immediate: 16-bit value

I-type: Register-Immediate



Add Immediate Example

addi      \$1, \$2, 100



# Using addi to Put a Constant in a Register

- Can use addi to put a constant into a register:
  - $x = 42;$
  - Can be done with
  - addi \$7, \$0, 42
  - Because \$0 is always 0.
- Common enough it has its own pseudo-instr:
  - li \$7, 42
  - Stands for load immediate, works for 16-bit immediate

# Many Instructions Have Immediate Forms

- Add is not the only one with an immediate form
  - andi, ori, xori, sll, sr, sra, ...
- No subi in MIPS
  - Why not?
    - Can always use addi with negation
- No muli or divi in MIPS
  - Though some ISAs have them

# Assembly Programming Exercise

- Consider the following C fragment that converts temperature:

```
int tempF = 87;  
int a = tempF - 32;  
a = a * 5;  
int tempC = a / 9;
```

- Let's write assembly for it

- First, need registers for our variables:
  - tempF = \$3
  - a = \$4
  - tempC = \$5
- Now, give it a try (use \$6, \$7,... as temps if you need)...

# Assembly Programming Exercise

- Consider the following C fragment that converts temperature:

```
int tempF = 87;           li $3, 87
int a = tempF - 32;       addi $4, $3, -32
a = a * 5;                li $6, 5
                           mul $4, $4, $6
int tempC = a / 9;        li $6, 9
                           div $5, $4, $6
```

- Let's write assembly for it

- First, need registers for our variables:
  - tempF = \$3
  - a = \$4
  - tempC = \$5
- Now, give it a try (use \$6, \$7,... as temps if you need)...

# MIPS' ISA Is a “Load-Store” ISA

```
loop:  lw $1, Memory[1004] → lw $1, 0($6) # put val of x in $1  
lw $2, Memory[1008] → lw $2, 4($6) # put val of y in $2  
add $3, $1, $2  
add $4, $4, $3  
j loop
```

More on the meaning of the `lw` insn in the next slide

# More I-Type Operations/Instructions

- Load Word Example
- $lw \$1, 100(\$2)$  #  $\$1 = \text{Mem}[\$2+100]$

I-type: Register-Immediate



# More I-Type Operations/Instructions

- Store Word Example
- sw \$1, 100(\$2) # Mem[\$2+100] = \$1

I-type: Register-Immediate



| op     | rs    | rt    | immediate           |
|--------|-------|-------|---------------------|
| 101011 | 00010 | 00001 | 0000 0000 0110 0100 |

# Data Sizes/Types

- Loads and Stores come in multiple sizes
  - Reflect different data types
- The 'w' in lw/sw stands for "word" (= 32 bits)
  - Can also load bytes (8 bits) and half words (16 bits)
    - Smaller sizes have signed/unsigned forms

# C to Assembly Exercise: Loads/Stores

- int x # x in \$1
- int \* p # p in \$2
- int \*\* q # q in \$3
- ... ...
- x = \*p;
- \*\*q = x;
- p = \*q;
- p[4] = x;

# C to Assembly Exercise: Loads/Stores

- int x # x in \$1
- int \* p # p in \$2
- int \*\* q # q in \$3
- ... ...
- x = \*p; lw \$1, 0(\$2)
- \*\*q = x; lw \$4, 0(\$3)  
sw \$1, 0(\$4)
- p = \*q; lw \$2, 0(\$3)
- p[4] = x; sw \$1, 16(\$2)

# Executing Memory Ops Example

| Address | Value          |
|---------|----------------|
| 1000    | lw \$1, 0(\$2) |
| 1004    | lw \$4, 4(\$3) |
| 1008    | sw \$1, 8(\$4) |
| 100C    | lw \$2, 0(\$3) |
| 1010    | ...            |
| ...     | ...            |
| 8000    | F00D F00D      |
| 8004    | C001 D00D      |
| 8008    | 1234 4321      |
| 800C    | 4242 4242      |
| 8010    | 0000 8000      |

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | 1234 5678 |
| \$2      | 0000 8004 |
| \$3      | 0000 800C |
| \$4      | 9999 9999 |
| \$5      | 9999 999C |
| \$6      | 0000 0002 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1000 |

# Executing Memory Ops Example

| Address | Value          |
|---------|----------------|
| 1000    | lw \$1, 0(\$2) |
| 1004    | lw \$4, 4(\$3) |
| 1008    | sw \$1, 8(\$4) |
| 100C    | lw \$2, 0(\$3) |
| 1010    | ...            |
| ...     | ...            |
| 8000    | F00D F00D      |
| 8004    | C001 D00D      |
| 8008    | 1234 4321      |
| 800C    | 4242 4242      |
| 8010    | 0000 8000      |

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | C001 D00D |
| \$2      | 0000 8004 |
| \$3      | 0000 800C |
| \$4      | 9999 9999 |
| \$5      | 9999 999C |
| \$6      | 0000 0002 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1004 |



# Executing Memory Ops Example

| Address | Value          |
|---------|----------------|
| 1000    | lw \$1, 0(\$2) |
| 1004    | lw \$4, 4(\$3) |
| 1008    | sw \$1, 8(\$4) |
| 100C    | lw \$2, 0(\$3) |
| 1010    | ...            |
| ...     | ...            |
| 8000    | F00D F00D      |
| 8004    | C001 D00D      |
| 8008    | 1234 4321      |
| 800C    | 4242 4242      |
| 8010    | 0000 8000      |

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | C001 D00D |
| \$2      | 0000 8004 |
| \$3      | 0000 800C |
| \$4      | 0000 8000 |
| \$5      | 9999 999C |
| \$6      | 0000 0002 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1008 |



# Executing Memory Ops Example

| Address | Value          |
|---------|----------------|
| 1000    | lw \$1, 0(\$2) |
| 1004    | lw \$4, 4(\$3) |
| 1008    | sw \$1, 8(\$4) |
| 100C    | lw \$2, 0(\$3) |
| 1010    | ...            |
| ...     | ...            |
| 8000    | F00D F00D      |
| 8004    | C001 D00D      |
| 8008    | C001 D00D      |
| 800C    | 4242 4242      |
| 8010    | 0000 8000      |

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | C001 D00D |
| \$2      | 0000 8004 |
| \$3      | 0000 800C |
| \$4      | 0000 8000 |
| \$5      | 9999 999C |
| \$6      | 0000 0002 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 100C |



# Executing Memory Ops Example

| Address | Value          |
|---------|----------------|
| 1000    | lw \$1, 0(\$2) |
| 1004    | lw \$4, 4(\$3) |
| 1008    | sw \$1, 8(\$4) |
| 100C    | lw \$2, 0(\$3) |
| 1010    | ...            |
| ...     | ...            |
| 8000    | F00D F00D      |
| 8004    | C001 D00D      |
| 8008    | C001 D00D      |
| 800C    | 4242 4242      |
| 8010    | 0000 8000      |

| Register | Value     |
|----------|-----------|
| \$0      | 0000 0000 |
| \$1      | C001 D00D |
| \$2      | 4242 4242 |
| \$3      | 0000 800C |
| \$4      | 0000 8000 |
| \$5      | 9999 999C |
| \$6      | 0000 0002 |
| \$7      | 0000 0002 |
| \$8      | 9999 999A |
| \$9      | 0000 0000 |
| \$10     | 0000 0000 |
| \$11     | 0000 0000 |
| \$12     | 0000 0000 |
| ...      | ...       |
| PC       | 0000 1010 |



# Making Control Decisions

- Control constructs—decide what to do next:

```
if  (x == y)  {  
...  
}  
else  {  
...  
}  
...  
while (z < q)  {  
...  
}
```



# The Program Counter (PC)

- Special register (PC) that points to instructions
- Contains memory address (like a pointer)
- During instruction fetch:
  - $\text{insn} = \text{mem}[\text{PC}]$
- So far, we have fetched sequentially:  $\text{PC} = \text{PC} + 4$ 
  - Because memory is **byte-addressable** while MIPS insns are **32 bits each**  
Has unique address per byte
  - May want to specify non-sequential fetch
  - Change PC in other ways

# I-Type <op> rt, rs, immediate

- PC relative addressing
- Branch if Not Equal Example:
  - bne \$1, \$2, 100 # If (\$1!= \$2) goto [PC+4+400]  
(not 100)

I-type: Register-Immediate



# I-Type <op> rt, rs, immediate

- PC relative addressing
- Branch if Not Equal Example:
  - bne \$1, \$2, **100** # If (\$1!= \$2) goto [PC+4+**400**]  
(not 100)
- Why **400**, not **100**?
  - Examine the last 2 bits of PC
  - They're always "00"!
    - Since we always increment by 4
  - → So, we always save space when providing the immediate address by omitting the "00" at the end
    - But we need to bring the "00" back when computing PC!
    - → Shift immediate value twice left (i.e., value\*4) to bring back "00"

# MIPS Compare and Branch

- Compare and Branch
  - beq        rs, rt, offset
  - bne        rs, rt, offset
- Compare to zero and Branch
  - blez        rs, offset
  - bgtz        rs, offset
  - bltz        rs, offset
  - bgez        rs, offset
- Also pseudo-insns for unconditional branch (b)
- What do all these instructions mean/do??
  - Check the posted document “MIPS ISA.pdf” on Canvas
  - Also:
    - [https://en.wikipedia.org/wiki/MIPS\\_architecture](https://en.wikipedia.org/wiki/MIPS_architecture)
    - [https://en.wikibooks.org/wiki/MIPS\\_Assembly/Instruction\\_Formats](https://en.wikibooks.org/wiki/MIPS_Assembly/Instruction_Formats)

# MIPS Jump, Branch, Compare Instructions

- Inequality to something other than 0: requires 2 insns
  - Conditionally set reg, branch if not zero or if zero
- `slt $1, $2, $3`                            $\$1 = (\$2 < \$3) ? 1 : 0$ 
  - Compare less than; signed 2's comp.
- `slti $1, $2, 100`                           $\$1 = (\$2 < 100) ? 1 : 0$ 
  - Compare < constant; signed 2's comp.
- `sltu $1, $2, $3`                           $\$1 = (\$2 < \$3) ? 1 : 0$ 
  - Compare less than; unsigned
- `sltiu $1, $2, 100`                          $\$1 = (\$2 < 100) ? 1 : 0$   $\$1=0$ 
  - Compare < constant; unsigned
- `beqz $1, 100`                              if ( $\$1 == 0$ ) go to PC+4+400
  - Branch if equal to 0
- `bnez $1, 100`                              if ( $\$1 != 0$ ) go to PC+4+400
  - Branch if not equal to 0

# Signed vs. Unsigned Comparison

- \$1= 0...00 0000 0000 0000 0001
- \$2= 0...00 0000 0000 0000 0010
- \$3= 1...11 1111 1111 1111 1111
- After executing these instructions:

```
slt    $4,$2,$1
```

```
slt    $5,$3,$1
```

```
sltu   $6,$2,$1
```

```
sltu   $7,$3,$1
```

- What are values of registers \$4 - \$7?

\$4 = 0 ; \$5 = 1 ; \$6 = 0 ; \$7 = 0

# C to Assembly with Branches Exercise

```
int x;                                //assume x in $1
int y;                                //assume y in $2
int z;                                //assume z in $3
...
if (x != y) {
    z = z + 2;
}
else {
    z = z - 4;
}
```

# C to Assembly with Branches Exercise

```
int x;                                //assume x in $1
int y;                                //assume y in $2
int z;                                //assume z in $3
...
if (x != y) {
    z = z + 2;
}
else {
    z = z - 4;
}
```

...

beq \$1, \$2, **L\_else**

addi \$3, \$3, 2

b **L\_end**

**L\_else:** addi \$3, \$3, -4

**L\_end:**

# J-Type <op> immediate

- 16-bit imm limits us to +/- 32K insns
- Usually fine, but sometimes we need more...
- J-type insns provide long range, unconditional jump:

J-type: Jump / Call



- Specifies lowest 28 bits of PC (26 bits followed by "00")
  - Upper 4 bits of PC are unchanged
  - Range: 64 Million instruction (256 MB)
- Can also jump anywhere using `jr $reg#` (jump register)

# Calling Functions

# Back to the Fahrenheit to Celsius Program...

- Consider the following C fragment:

```
int tempF = 87;
```

```
    li      $3, 87
```

```
int a = tempF - 32;
```

```
    addi   $4, $3, -32
```

```
a = a * 5;
```

```
    li      $6, 5
```

```
    mul    $4, $4, $6
```

```
int tempC = a / 9;
```

```
    li      $6, 9
```

```
    div    $5, $4, $6
```

- If we were really doing this...

We would abstract it into a **function** and call it

# More Likely: A Function

- Like this:

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}
```

...

...

```
int tempC = f2c(87);
```

# We Need a Way to Call f2c and Return

- Call: Jump... but also remember where to go back
  - There may be many calls to f2c() in the program
  - We need some way to know where to return
  - Instruction for this **jal**
    - `jal label`
    - Store PC+4 into \$31
    - Jump to label (memory address/pointer)
- Return: Jump... back to wherever we were
  - Instruction for this **jr**
    - `jr $31`
    - Jump back to address stored by jal in \$31

# f2c

- Like this:

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
} //jr $31  
...  
...  
int tempC = f2c(87); //jal f2c
```

- But that's not all...

# f2c

- Like this:

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
} //jr $31  
...  
...  
int tempC = f2c(87); //jal f2c
```

- Still need to pass 87 as argument to f2c

# f2c

- Like this:

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
return tempC;                                //jr $31  
}  
...  
...  
int tempC = f2c(87);                         //jal f2c
```

- Need to return tempC

# f2c

- Like this:

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}                                            //jr $31  
...  
...  
int tempC = f2c(87);                      //jal f2c
```

- Also, may want to reuse the same registers later
  - What if f2c called something? That would re-use \$31

# Calling Convention

- All of these are reasons for a calling convention
  - Agreement of how registers are used
  - Where arguments are passed, results returned
  - Who must save what if they want to use it
  - Etc.

# MIPS Register Usage/Naming Conventions

|     |      |                                |  |
|-----|------|--------------------------------|--|
| 0   | zero | constant                       |  |
| 1   | at   | reserved for assembler         |  |
| 2   | v0   | expression evaluation &        |  |
| 3   | v1   | function results               |  |
| 4   | a0   | arguments                      |  |
| 5   | a1   |                                |  |
| 6   | a2   |                                |  |
| 7   | a3   |                                |  |
| 8   | t0   | temporary (caller saves)       |  |
| ... |      |                                |  |
| 15  | t7   |                                |  |
| 16  | s0   | saved for later (callee saves) |  |
| ... |      |                                |  |
| 23  | s7   |                                |  |
| 24  | t8   | temporary (cont'd)             |  |
| 25  | t9   |                                |  |
| 26  | k0   | reserved for OS kernel         |  |
| 27  | k1   |                                |  |
| 28  | gp   | pointer to global area         |  |
| 29  | sp   | stack pointer                  |  |
| 30  | fp   | frame pointer                  |  |
| 31  | ra   | return address                 |  |

Also 32 floating-point registers: \$f0 .. \$f31

**Important:** The only general-purpose registers are the \$s and \$t registers.  
Everything else has a specific usage:

\$a = arguments, \$v = return values, \$ra = return address, etc.

# ‘t’ vs. ‘s’ Registers

- Caller saves (t) registers
  - If some code is about to call another function...
  - And it needs the value in a caller saves register (\$t0,\$t1...)
  - Then it has to save it on the **stack** before the call
  - And restore it after the call
- Callee saves (s) registers
  - If some code wants to use a callee saves register (at all)
  - It has to save it to the stack before it uses it
  - And restore it before it returns to its caller
  - But, it can assume any function it calls will not change the register
    - Either won't use it, or will save/restore it
- →
  - **t: temporary variable for the current function**
  - **s: variable to be saved for later use (after other function calls)**

# f2c

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}
```

...

...

```
int tempC = f2c(87)
```

Try to convert this into assembly given the conventions we just learned

# f2c

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}
```

f2c:

```
addi $t0, $a0, -32
```

tempF is in \$a0 by calling convention

## f2c

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}
```

f2c:

```
addi $t0, $a0, -32
```

We can use \$t0 for a temp (like a) without saving it

# f2c

```
int f2c (int tempF) {      f2c:  
    int a = tempF - 32;      addi $t0, $a0, -32  
    a = a * 5;              li $t1, 5  
    int tempC = a / 9;      mul $t0, $t0, $t1  
    return tempC;           li $t1, 9  
}                           div $t2, $t0, $t1
```

## f2c

```
int f2c (int tempF) {      f2c:  
    int a = tempF - 32;      addi $t0, $a0, -32  
    a = a * 5;              li $t1, 5  
    int tempC = a / 9;      mul $t0, $t0, $t1  
    return tempC;           li $t1, 9  
}                           div $t2, $t0, $t1  
                           addi $v0, $t2, 0  
                           jr $ra
```

# f2c

```
int f2c (int tempF) {  
    int a = tempF - 32;  
    a = a * 5;  
    int tempC = a / 9;  
    return tempC;  
}
```

```
f2c:  
    addi $t0, $a0, -32  
    li $t1, 5  
    mul $t0, $t0, $t1  
    li $t1, 9  
    div $t2, $t0, $t1  
    addi $v0, $t2, 0  
    jr $ra
```

A smart compiler would just do  
div \$v0, \$t0, \$t1

# f2c

```
int f2c (int tempF) {      f2c:  
    int a = tempF - 32;      addi $t0, $a0, -32  
    a = a * 5;              li $t1, 5  
    int tempC = a / 9;      mul $t0, $t0, $t1  
    return tempC;           li $t1, 9  
}                           div $t2, $t0, $t1  
...                          addi $v0, $t2, 0  
...                          jr $ra  
int tempC = f2c(87)       ...  
                           addi $a0, $0, 87
```

# f2c

```
int f2c (int tempF) {      f2c:  
    int a = tempF - 32;      addi $t0, $a0, -32  
    a = a * 5;              li $t1, 5  
    int tempC = a / 9;      mul $t0, $t0, $t1  
    return tempC;           li $t1, 9  
}                           div $t2, $t0, $t1  
...                          addi $v0, $t2, 0  
...                          jr $ra  
int tempC = f2c(87)       ...  
                           addi $a0, $0, 87  
                           jal f2c
```

## f2c

```
int f2c (int tempF) {      f2c:  
    int a = tempF - 32;      addi $t0, $a0, -32  
    a = a * 5;              li $t1, 5  
    int tempC = a / 9;      mul $t0, $t0, $t1  
    return tempC;           li $t1, 9  
}                           div $t2, $t0, $t1  
...                          addi $v0, $t2, 0  
...                          jr $ra  
int tempC = f2c(87)        ...  
                           addi $a0, $0, 87  
                           jal f2c  
                           addi $t0, $v0, 0
```

# What It Would Take To Make SPIM Happy

```
.globl f2c      # f2c can be called from any file
.text           # goes in "text" region

f2c:           #   (remember memory picture?)
addi $t0, $a0, -32
li $t1, 5
mul $t0, $t0, $t1
li $t1, 9
div $t2, $t0, $t1
addi $v0, $t2, 0
jr $ra

.end f2c       # end of this function
```

# SPIM Assembly Language Sugar

- Directives: tell the assembler what to do...
- Format ".<string> [arg1], [arg2] ...

- Examples

|                            |                                            |
|----------------------------|--------------------------------------------|
| • .data                    | # start a data segment.                    |
| • .text                    | # start a code segment.                    |
| • .align n                 | # align segment on $2^n$ byte boundary.    |
| • .ascii <string>          | # store a string in memory.                |
| • .asciiz <string>         | # store a null terminated string in memory |
| • .word w1, w2, . . . , wn | # store n words in memory.                 |
| • .space n                 | # reserve n bytes of space                 |

# Function Calls? → Let's Check Out The Stack

- May need to use the stack for...
  - Saving registers
    - Across calls
    - Spilling variables (not enough registers)
  - Passing more than 4 arguments
  - Other variables...



# Stack Layout

- Stack is in the memory of every executing process:
  - Loads and stores are used to access it
  - But what address to load/store?
- Two registers for referencing stack:
  - Stack pointer (\$sp): Points to end (bottom) of stack
  - Frame pointer (\$fp): Points to top of current stack frame

# Procedures Use the Stack

- In general, procedure calls obey stack discipline
  - Local procedure state contained in stack frame
  - Where we can save registers
  - When a procedure is called, a new frame opens (frame is “pushed”)
  - When a procedure returns, the frame collapses (frame is “popped”)
- Procedure stack is in memory
  - Starts is at the “top” of the memory and grows down



# MIPS/GCC Procedure Calling Conventions

## **Calling procedure (before jumping to called function):**

- Step-1: Setup the arguments:
  - The first four arguments (arg0-arg3) are passed in registers \$a0-\$a3
  - Remaining arguments are pushed onto the stack  
(in reverse order, arg5 is at the top of the stack)
- Step-2: Save needed temporary registers
  - Save \$t regs to \$s regs if they contain values needed after function call
- Step-3: Execute jal
- Step-4: Clean up stack (if more than 4 args)

# MIPS/GCC Procedure Calling Conventions

## Called routine (before executing function's body):

- Step-1: Establish stack frame
  - Subtract the frame size from the stack pointer
  - Typically, minimum frame size is 32 bytes (8 words)
- Step-2: Save needed regs in the frame
  - \$fp is always saved
  - \$ra is saved if routine makes a call
  - \$s0-\$s7 are saved if they are used
- Step-3: Establish frame pointer
  - Add the stack <frame size> - 4 to the address in \$sp
  - *Note that frame pointer isn't strictly necessary, but it helps with debugging (when using gdb, for example)*

# MIPS/GCC Procedure Calling Conventions

## On return from a call (after executing function's body):

- Step-1: Put returned values in registers \$v0, [\$v1] (if values are returned)
- Step-2: Restore registers
  - Restore \$fp and other saved registers [\$ra, \$s0 - \$s7]
- Step-3: Pop the stack
  - Add the frame size to \$sp
- Step-4: Return
  - Jump to the address in \$ra  
jr \$ra

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Just did **jal 1000**

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFE0 |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 1000 |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 |           |
| FFD4 |           |
| FFD0 |           |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFE0 |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 1000 |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 |           |
| FFD4 |           |
| FFD0 |           |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

\$sp, \$fp still describe caller's frame

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Allocate space on stack

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | 0000 0000        |
| \$v0 | 4242 4242        |
| \$v1 | 0000 8010        |
| \$a0 | 0000 1234        |
| \$a1 | 5678 0001        |
| \$a2 | 0000 0002        |
| \$a3 | 0000 0007        |
| \$t0 | 9999 999A        |
| \$t1 | 0000 0000        |
| \$s0 | 0042 0420        |
| \$sp | <b>0000 FFDO</b> |
| \$fp | 0000 FFF0        |
| \$ra | 0000 2348        |
| PC   | 0000 1004        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 |           |
| FFD4 |           |
| FFD0 |           |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Save \$fp

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFDO |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 1008 |

| Addr | Value            |
|------|------------------|
| FFF0 | 0001 0070        |
| FFEC | 1234 5678        |
| FFE8 | 9999 9999        |
| FFE4 | 0000 2568        |
| FEF0 | 0001 0040        |
| FFDC |                  |
| FFD8 |                  |
| FFD4 |                  |
| FFD0 | <b>0000 FFF0</b> |
| FFCC |                  |
| FFC8 |                  |
| FFC4 |                  |
| FFC0 |                  |
| FFBC |                  |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Save \$ra

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFDO |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 100C |

| Addr | Value            |
|------|------------------|
| FFF0 | 0001 0070        |
| FFEC | 1234 5678        |
| FFE8 | 9999 9999        |
| FFE4 | 0000 2568        |
| FFE0 | 0001 0040        |
| FFDC |                  |
| FFD8 |                  |
| FFD4 | <b>0000 2348</b> |
| FFD0 | 0000 FFF0        |
| FFCC |                  |
| FFC8 |                  |
| FFC4 |                  |
| FFC0 |                  |
| FFBC |                  |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Save \$s0

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFDO |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 1010 |

| Addr | Value            |
|------|------------------|
| FFF0 | 0001 0070        |
| FFEC | 1234 5678        |
| FFE8 | 9999 9999        |
| FFE4 | 0000 2568        |
| FFE0 | 0001 0040        |
| FFDC |                  |
| FFD8 | <b>0042 0420</b> |
| FFD4 | 0000 2348        |
| FFD0 | 0000 FFF0        |
| FFCC |                  |
| FFC8 |                  |
| FFC4 |                  |
| FFC0 |                  |
| FFBC |                  |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Setup \$fp

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | 0000 0000        |
| \$v0 | 4242 4242        |
| \$v1 | 0000 8010        |
| \$a0 | 0000 1234        |
| \$a1 | 5678 0001        |
| \$a2 | 0000 0002        |
| \$a3 | 0000 0007        |
| \$t0 | 9999 999A        |
| \$t1 | 0000 0000        |
| \$s0 | 0042 0420        |
| \$sp | 0000 FFDO        |
| \$fp | <b>0000 FFDC</b> |
| \$ra | 0000 2348        |
| PC   | 0000 1014        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

\$sp, \$fp now describe  
new frame, ready to start

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | 0000 0000 |
| \$v0 | 4242 4242 |
| \$v1 | 0000 8010 |
| \$a0 | 0000 1234 |
| \$a1 | 5678 0001 |
| \$a2 | 0000 0002 |
| \$a3 | 0000 0007 |
| \$t0 | 9999 999A |
| \$t1 | 0000 0000 |
| \$s0 | 0042 0420 |
| \$sp | 0000 FF00 |
| \$fp | 0000 FFDC |
| \$ra | 0000 2348 |
| PC   | 0000 1014 |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | 0000 0000        |
| \$v0 | 4242 4242        |
| \$v1 | 0000 8010        |
| \$a0 | 0000 1234        |
| \$a1 | 5678 0001        |
| \$a2 | 0000 0002        |
| \$a3 | 0000 0007        |
| \$t0 | 9999 999A        |
| \$t1 | 0000 0000        |
| \$s0 | <b>5678 1235</b> |
| \$sp | 0000 FFDO        |
| \$fp | 0000 FFDC        |
| \$ra | 0000 2348        |
| PC   | 0000 1018        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Do some computation..



# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Call another function  
(not pictured, takes no args)

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | 0000 0000        |
| \$v0 | 4242 4242        |
| \$v1 | 0000 8010        |
| \$a0 | 0000 1234        |
| \$a1 | 5678 0001        |
| \$a2 | 0000 0002        |
| \$a3 | 0000 0007        |
| \$t0 | 9999 999A        |
| \$t1 | 0000 0000        |
| \$s0 | 5678 1235        |
| \$sp | 0000 FFDO        |
| \$fp | 0000 FFDC        |
| \$ra | <b>0000 101C</b> |
| PC   | 0000 4200        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

jal sets \$ra, PC

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Other function can do what  
It wants to the regs as it computes

| Reg  | Value      |
|------|------------|
| \$0  | 0000 0000  |
| \$at | ???? ????? |
| \$v0 | ???? ????? |
| \$v1 | ???? ????? |
| \$a0 | ???? ????? |
| \$a1 | ???? ????? |
| \$a2 | ???? ????? |
| \$a3 | ???? ????? |
| \$t0 | ???? ????? |
| \$t1 | ???? ????? |
| \$s0 | ???? ????? |
| \$sp | ???? ????? |
| \$fp | ???? ????? |
| \$ra | ???? ????? |
| PC   | ???? ????? |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

And make a stack frame  
Of its own

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

But before it returns, it is responsible for restoring certain registers

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ????        |
| \$v0 | <b>8675 3090</b> |
| \$v1 | ???? ????        |
| \$a0 | ???? ????        |
| \$a1 | ???? ????        |
| \$a2 | ???? ????        |
| \$a3 | ???? ????        |
| \$t0 | ???? ????        |
| \$t1 | ???? ????        |
| \$s0 | <b>5678 1235</b> |
| \$sp | <b>0000 FFD0</b> |
| \$fp | <b>0000 FFDC</b> |
| \$ra | <b>0000 101C</b> |
| PC   | <b>0000 101C</b> |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Including \$sp and \$fp,  
and \$s0  
Value returned in \$v0

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ?????       |
| \$v0 | 8675 3090        |
| \$v1 | ???? ?????       |
| \$a0 | ???? ?????       |
| \$a1 | ???? ?????       |
| \$a2 | ???? ?????       |
| \$a3 | ???? ?????       |
| \$t0 | <b>DCED 42C5</b> |
| \$t1 | ???? ?????       |
| \$s0 | 5678 1235        |
| \$sp | 0000 FFDO        |
| \$fp | 0000 FFDC        |
| \$ra | 0000 101C        |
| PC   | 0000 1020        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Do some more computation

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



Do some more computation  
(load addr not pictured)

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ????        |
| \$v0 | <b>0001 0002</b> |
| \$v1 | ???? ????        |
| \$a0 | ???? ????        |
| \$a1 | ???? ????        |
| \$a2 | ???? ????        |
| \$a3 | ???? ????        |
| \$t0 | DCED 42C5        |
| \$t1 | ???? ????        |
| \$s0 | 5678 1235        |
| \$sp | 0000 FFDO        |
| \$fp | 0000 FFDC        |
| \$ra | 0000 101C        |
| PC   | 0000 1024        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



| Reg  | Value      |
|------|------------|
| \$0  | 0000 0000  |
| \$at | ???? ????? |
| \$v0 | 0001 0002  |
| \$v1 | ???? ????? |
| \$a0 | ???? ????? |
| \$a1 | ???? ????? |
| \$a2 | ???? ????? |
| \$a3 | ???? ????? |
| \$t0 | DCED 42C5  |
| \$t1 | ???? ????? |
| \$s0 | 0042 0420  |
| \$sp | 0000 FFDO  |
| \$fp | 0000 FFDC  |
| \$ra | 0000 101C  |
| PC   | 0000 1028  |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Restore registers to return

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ????        |
| \$v0 | 0001 0002        |
| \$v1 | ???? ????        |
| \$a0 | ???? ????        |
| \$a1 | ???? ????        |
| \$a2 | ???? ????        |
| \$a3 | ???? ????        |
| \$t0 | DCED 42C5        |
| \$t1 | ???? ????        |
| \$s0 | 0042 0420        |
| \$sp | 0000 FFDO        |
| \$fp | 0000 FFDC        |
| \$ra | <b>0000 2348</b> |
| PC   | 0000 102C        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Restore registers to return

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



Restore registers to return

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ????        |
| \$v0 | 0001 0002        |
| \$v1 | ???? ????        |
| \$a0 | ???? ????        |
| \$a1 | ???? ????        |
| \$a2 | ???? ????        |
| \$a3 | ???? ????        |
| \$t0 | DCED 42C5        |
| \$t1 | ???? ????        |
| \$s0 | 0042 0420        |
| \$sp | 0000 FFDO        |
| \$fp | <b>0000 FFF0</b> |
| \$ra | 0000 2348        |
| PC   | 0000 1030        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



Restore registers to return

| Reg  | Value            |
|------|------------------|
| \$0  | 0000 0000        |
| \$at | ???? ?????       |
| \$v0 | 0001 0002        |
| \$v1 | ???? ?????       |
| \$a0 | ???? ?????       |
| \$a1 | ???? ?????       |
| \$a2 | ???? ?????       |
| \$a3 | ???? ?????       |
| \$t0 | DCED 42C5        |
| \$t1 | ???? ?????       |
| \$s0 | 0042 0420        |
| \$sp | <b>0000 FFE0</b> |
| \$fp | 0000 FFF0        |
| \$ra | 0000 2348        |
| PC   | 0000 1034        |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |



Restore registers to return

| Reg  | Value     |
|------|-----------|
| \$0  | 0000 0000 |
| \$at | ???? ???? |
| \$v0 | 0001 0002 |
| \$v1 | ???? ???? |
| \$a0 | ???? ???? |
| \$a1 | ???? ???? |
| \$a2 | ???? ???? |
| \$a3 | ???? ???? |
| \$t0 | DCED 42C5 |
| \$t1 | ???? ???? |
| \$s0 | 0042 0420 |
| \$sp | 0000 FFE0 |
| \$fp | 0000 FFF0 |
| \$ra | 0000 2348 |
| PC   | 0000 1034 |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

Now \$sp, \$fp describe  
caller's frame

# Execution Example: Calling With Frames

| Addr | Instruction          |
|------|----------------------|
| 1000 | addi \$sp, \$sp, -16 |
| 1004 | sw \$fp, 0(\$sp)     |
| 1008 | sw \$ra, 4(\$sp)     |
| 100C | sw \$s0, 8(\$sp)     |
| 1010 | addi \$fp, \$sp, 12  |
| 1014 | add \$s0, \$a0, \$a1 |
| 1018 | jal 4200             |
| 101C | add \$t0, \$v0, \$s0 |
| 1020 | lw \$v0, 4(\$t0)     |
| 1024 | lw \$s0, -4(\$fp)    |
| 1028 | lw \$ra, -8(\$fp)    |
| 102C | lw \$fp, -12(\$fp)   |
| 1030 | addi \$sp, \$sp, 16  |
| 1034 | jr \$ra              |

Return to caller  
(code not pictured)

| Reg  | Value      |
|------|------------|
| \$0  | 0000 0000  |
| \$at | ???? ????? |
| \$v0 | 0001 0002  |
| \$v1 | ???? ????? |
| \$a0 | ???? ????? |
| \$a1 | ???? ????? |
| \$a2 | ???? ????? |
| \$a3 | ???? ????? |
| \$t0 | DCED 42C5  |
| \$t1 | ???? ????? |
| \$s0 | 0042 0420  |
| \$sp | 0000 FFE0  |
| \$fp | 0000 FFF0  |
| \$ra | 0000 2348  |
| PC   | 0000 2348  |

| Addr | Value     |
|------|-----------|
| FFF0 | 0001 0070 |
| FFEC | 1234 5678 |
| FFE8 | 9999 9999 |
| FFE4 | 0000 2568 |
| FFE0 | 0001 0040 |
| FFDC |           |
| FFD8 | 0042 0420 |
| FFD4 | 0000 2348 |
| FFD0 | 0000 FFF0 |
| FFCC |           |
| FFC8 |           |
| FFC4 |           |
| FFC0 |           |
| FFBC |           |

# Assembly Writing Tips and Advice

- Write C first (steps!), then translate C → Assembly
  - One function at a time
  - Pick registers for each variable
    - Must be in memory? Give it a stack slot (refer to by \$fp+num)
    - Must live across a procedure call? Use an \$s register
      - Otherwise, use a \$t
  - Write prolog (beginning)
    - Save ra/fp (if needed)
    - Save any \$s registers you use
  - Translate line by line
  - Write epilog (end)
    - Kind of the opposite actions of the prolog
    - Store returns (if needed)
    - Then jr

# Why Do We Need FP?

- The frame pointer is not always required
  - Can often get away without it
- When/why do we need it?
  - Debugging tools (like gdb) use it to find frames
  - If you have variable length arrays
    - Stack pointer changes by amount not known at compile time
    - But variables would still be at constant offset from frame pointer
- How to reference stuff without it?
  - Everything would be offset from the stack pointer: 4(\$sp), 8(\$sp), etc.
- Good practice for this class to use it
  - Don't prematurely optimize

# System Call Instruction

- System call is used to communicate with the operating system and request services (memory allocation, I/O)
  - **syscall** instruction in MIPS
  - Sort of like a procedure call, but call to ask OS for help
- SPIM supports “system calls lite”
  1. Load system call code into register \$v0
    - Example: if \$v0==1, then syscall will print an integer
  2. Load arguments (if any) into registers \$a0, \$a1, or \$f12 (for floating point)
  3. **syscall**
  4. Results returned in registers \$v0 or \$f0

# SPIM System Call Support

| <b>code</b> | <b>service</b> | <b>ArgType</b> | <b>Arg/Result</b>        |
|-------------|----------------|----------------|--------------------------|
| 1           | print          | int            | \$a0                     |
| 2           | print          | float          | \$f12                    |
| 3           | print          | double         | \$f12                    |
| 4           | print          | string         | \$a0 (string address)    |
| 5           | read           | integer        | integer in \$v0          |
| 6           | read           | float          | float in \$f0            |
| 7           | read           | double         | double in \$f0 & \$f1    |
| 8           | read           | string         | \$a0=buffer, \$a1=length |
| 9           | sbrk           | \$a0=amount    | address in \$v0          |
| 10          | exit           |                |                          |

Plus a few more for general file IO which we shouldn't need.

# Echo Number and String

```
.text
main:
    li      $v0, 5          # code to read an integer
    syscall
    move   $a0, $v0         # copy result from $v0 to $a0

    li      $v0, 1          # code to print an integer
    syscall
    # print the integer

    li      $v0, 4          # code to print string
    la     $a0, nlن          # address of string (newline)
    syscall

    li      $v0, 8          # code to read a string
    la     $a0, name        # address of buffer (name)
    li      $a1, 8           # size of buffer (8 bytes)
    syscall

    la     $a0, name        # address of string to print
    li      $v0, 4           # code to print a string
    syscall

    jr      $31             # return

.data
.align 2
name:   .word 0,0
nlن:    .asciiz "\n"
```

# More MIPS Assembly Rules

- One instruction per line.
- Numbers are base-10 integers or Hex w/ leading 0x.
- Identifiers: alphanumeric, '\_', '.' string starting in a letter or \_
- Labels: identifiers starting at the beginning of a line followed by ":"
- Comments: everything following # till end-of-line.
- Instruction format: Space and "," separated fields.
  - [Label:] <op> reg1, [reg2], [reg3]    [# comment]
  - [Label:] <op> reg1, offset(reg2)    [# comment]
  - .Directive [arg1], [arg2], ...

# MIPS in One Page

- Check the quick reference posted on Canvas under Files → Reference Documents

| MIPS reference card             |                                 |                                                                    |                |                     |                |
|---------------------------------|---------------------------------|--------------------------------------------------------------------|----------------|---------------------|----------------|
| <b>add</b> <i>rd, rs, rt</i>    | Add                             | <i>rd = rs + rt</i>                                                | <b>R 0/20</b>  | <b>registers</b>    |                |
| <b>sub</b> <i>rd, rs, rt</i>    | Subtract                        | <i>rd = rs - rt</i>                                                | <b>R 0/22</b>  | \$0 \$zero          |                |
| <b>addi</b> <i>rt, rs, imm</i>  | Add Imm.                        | <i>rt = rs + imm<sub>±</sub></i>                                   | <b>1 8</b>     | \$1 \$at            |                |
| <b>addu</b> <i>rs, rs, rt</i>   | Add Unsigned                    | <i>rd = rs + rt</i>                                                | <b>R 0/21</b>  | \$2-\$3 \$v0-\$v1   |                |
| <b>subu</b> <i>rd, rs, rt</i>   | Subtract Unsigned               | <i>rd = rs - rt</i>                                                | <b>R 0/23</b>  | \$4-\$7 \$a0-\$a3   |                |
| <b>addiu</b> <i>rt, rs, imm</i> | Add Imm. Unsigned               | <i>rt = rs + imm<sub>±</sub></i>                                   | <b>1 9</b>     | \$8-\$15 \$t0-\$t7  |                |
| <b>mult</b> <i>rs, rt</i>       | Multiply                        | <i>(hi, lo) = rs * rt</i>                                          | <b>R 0/18</b>  | \$16-\$23 \$s0-\$s7 |                |
| <b>div</b> <i>rs, rt</i>        | Divide                          | <i>lo = rs / rt; hi = rs % rt</i>                                  | <b>R 0/1a</b>  | \$24-\$25 \$t8-\$t9 |                |
| <b>multu</b> <i>rs, rt</i>      | Multiply Unsigned               | <i>(hi, lo) = rs * rt</i>                                          | <b>R 0/19</b>  | \$26-\$27 \$k0-\$k1 |                |
| <b>divu</b> <i>rs, rt</i>       | Divide Unsigned                 | <i>lo = rs / rt; hi = rs % rt</i>                                  | <b>R 0/1b</b>  | \$28 \$gp           |                |
| <b>mfhi</b> <i>rs</i>           | Move From Hi                    | <i>rd = hi</i>                                                     | <b>R 0/10</b>  | \$29 \$sp           |                |
| <b>mflo</b> <i>rd</i>           | Move From Lo                    | <i>rd = lo</i>                                                     | <b>R 0/12</b>  | \$30 \$fp           |                |
| <b>and</b> <i>rd, rs, rt</i>    | And                             | <i>rd = rs &amp; rt</i>                                            | <b>R 0/24</b>  | \$31 \$ra           |                |
| <b>or</b> <i>rd, rs, rt</i>     | Or                              | <i>rd = rs   rt</i>                                                | <b>R 0/25</b>  | hi —                |                |
| <b>nor</b> <i>rd, rs, rt</i>    | Not                             | <i>rd = ~ (rs   rt)</i>                                            | <b>R 0/27</b>  | lo —                |                |
| <b>xor</b> <i>rd, rs, rt</i>    | eExclusive Or                   | <i>rd = rs ^ rt</i>                                                | <b>R 0/26</b>  | PC —                |                |
| <b>andi</b> <i>rt, rs, imm</i>  | And Imm.                        | <i>rt = rs &amp; imm</i>                                           | <b>1 c</b>     | c0 \$13 c0_cause    |                |
| <b>ori</b> <i>rt, rs, imm</i>   | Or Imm.                         | <i>rt = rs   imm</i>                                               | <b>1 d</b>     | c0 \$14 c0_epc      |                |
| <b>xori</b> <i>rt, rs, imm</i>  | eExclusive Or Imm.              | <i>rt = rs ^ imm</i>                                               | <b>1 e</b>     |                     |                |
| <b>sll</b> <i>rd, rs, sh</i>    | Shift Left Logical              | <i>rd = rt &lt;&lt; sh</i>                                         | <b>R 0/0</b>   |                     |                |
| <b>srl</b> <i>rd, rs, sh</i>    | Shift Right Logical             | <i>rd = rt &gt;&gt; sh</i>                                         | <b>R 0/2</b>   |                     |                |
| <b>sra</b> <i>rd, rs, sh</i>    | Shift Right Arithmetic          | <i>rd = rt &gt;&gt; sh</i>                                         | <b>R 0/3</b>   |                     |                |
| <b>sllv</b> <i>rd, rt, rs</i>   | Shift Left Logical Variable     | <i>rd = rt &gt;&gt; rs</i>                                         | <b>R 0/4</b>   |                     |                |
| <b>sriv</b> <i>rd, rt, rs</i>   | Shift Right Logical Variable    | <i>rd = rt &gt;&gt; rs</i>                                         | <b>R 0/6</b>   |                     |                |
| <b>sraw</b> <i>rd, rt, rs</i>   | Shift Right Arithmetic Variable | <i>rd = rt &gt;&gt; rs</i>                                         | <b>R 0/7</b>   |                     |                |
| <b>slt</b> <i>rd, rs, rt</i>    | Set if Less Than                | <i>rd = rs &lt; rt ? 1 : 0</i>                                     | <b>R 0/2a</b>  |                     |                |
| <b>sltu</b> <i>rd, rs, rt</i>   | Set if Less Than Unsigned       | <i>rd = rs &lt; rt ? 1 : 0</i>                                     | <b>R 0/2b</b>  |                     |                |
| <b>slti</b> <i>rd, rs, imm</i>  | Set if Less Than Imm.           | <i>rd = rs &lt; imm<sub>±</sub>? 1 : 0</i>                         | <b>1 a</b>     |                     |                |
| <b>sltiu</b> <i>rd, rs, imm</i> | Set if Less Than Imm. Unsigned  | <i>rd = rs &lt; imm<sub>±</sub>? 1 : 0</i>                         | <b>1 b</b>     |                     |                |
| <b>j</b> <i>addr</i>            | Jump                            | <i>PC = PC40x000000   (addr &lt;&lt; 2)</i>                        | <b>J 2</b>     |                     |                |
| <b>jal</b> <i>addr</i>          | Jump And Link                   | <i>src = PC - 8; PC = PC40x000000   (addr &lt;&lt; 2)</i>          | <b>J 3</b>     |                     |                |
| <b>jr</b> <i>rs</i>             | Jump Register                   | <i>PC = rs</i>                                                     | <b>R 0/8</b>   |                     |                |
| <b>jalr</b> <i>rs</i>           | Jump And Link Register          | <i>src = PC + 8; PC = rs</i>                                       | <b>R 0/9</b>   |                     |                |
| <b>beq</b> <i>rt, rs, imm</i>   | Branch If Equal                 | <i>if (rs == rt) PC += 4 + (imm&lt;&lt; 2)</i>                     | <b>1 4</b>     |                     |                |
| <b>bne</b> <i>rt, rs, imm</i>   | Branch If Not Equal             | <i>if (rs != rt) PC += 4 + (imm&lt;&lt; 2)</i>                     | <b>1 5</b>     |                     |                |
| <b>syscall</b>                  | System Call                     | <i>cf_cause = 8 &lt;&lt; c0_ape - pc; PC = 0x80000000</i>          | <b>R 0/c</b>   |                     |                |
| <b>lui</b> <i>rt, imm</i>       | Load Upper Imm.                 | <i>rt = imm &lt;&lt; 16</i>                                        | <b>1 f</b>     |                     |                |
| <b>lb</b> <i>rt, imm(rs)</i>    | Load Byte                       | <i>rt = SignExt(M1[rs + imm<sub>±</sub>])</i>                      | <b>1 20</b>    |                     |                |
| <b>lbu</b> <i>rt, imm(rs)</i>   | Load Byte Unsigned              | <i>rt = M1[rs + imm<sub>±</sub>] &amp; 0xFF</i>                    | <b>1 24</b>    |                     |                |
| <b>lh</b> <i>rt, imm(rs)</i>    | Load Half                       | <i>rt = SignExt(M2[rs + imm<sub>±</sub>])</i>                      | <b>1 21</b>    |                     |                |
| <b>lhu</b> <i>rt, imm(rs)</i>   | Load Half Unsigned              | <i>rt = M2[rs + imm<sub>±</sub>] &amp; 0xFFFF</i>                  | <b>1 25</b>    |                     |                |
| <b>lw</b> <i>rt, imm(rs)</i>    | Load Word                       | <i>rt = M4[rs + imm<sub>±</sub>]</i>                               | <b>1 23</b>    |                     |                |
| <b>sb</b> <i>rt, imm(rs)</i>    | Store Byte                      | <i>M1[rs + imm<sub>±</sub>] = rt</i>                               | <b>1 28</b>    |                     |                |
| <b>sh</b> <i>rt, imm(rs)</i>    | Store Half                      | <i>M2[rs + imm<sub>±</sub>] = rt</i>                               | <b>1 29</b>    |                     |                |
| <b>sw</b> <i>rt, imm(rs)</i>    | Store Word                      | <i>M4[rs + imm<sub>±</sub>] = rt</i>                               | <b>1 2b</b>    |                     |                |
| <b>ll</b> <i>rt, imm(rs)</i>    | Load Linked                     | <i>rt = M4[rs + imm<sub>±</sub>] &amp; rt; rt = atomic ? 1 : 0</i> | <b>1 30</b>    |                     |                |
| <b>sc</b> <i>rt, imm(rs)</i>    | Store Conditional               | <i>M4[rs + imm<sub>±</sub>] = rt; rt = atomic ? 1 : 0</i>          | <b>1 38</b>    |                     |                |
| <b>pseudo-instructions</b>      |                                 |                                                                    |                |                     |                |
| <b>bge</b> <i>rs, ry, imm</i>   | Branch if Greater or Equal      | <b>R 6 bits</b>                                                    | <b>5 bits</b>  | <b>5 bits</b>       | <b>5 bits</b>  |
| <b>bgt</b> <i>rs, ry, imm</i>   | Branch if Greater Than          | <b>I 6 bits</b>                                                    | <b>5 bits</b>  | <b>5 bits</b>       | <b>16 bits</b> |
| <b>ble</b> <i>rs, ry, imm</i>   | Branch if Less or Equal         | <b>I op</b>                                                        | <b>rs</b>      | <b>rt</b>           | <b>imm</b>     |
| <b>blt</b> <i>rs, ry, imm</i>   | Branch if Less Than             | <b>J 6 bits</b>                                                    | <b>26 bits</b> |                     |                |
| <b>la</b> <i>rs, label</i>      | Load Address                    | <b>J op</b>                                                        |                |                     |                |
| <b>li</b> <i>rs, imm</i>        | Load Immediate                  |                                                                    |                |                     |                |
| <b>move</b> <i>rs, ry</i>       | Move register                   |                                                                    |                |                     |                |
| <b>nop</b>                      | No Operation                    |                                                                    |                |                     |                |
| <b>exception causes</b>         |                                 |                                                                    |                |                     |                |
| <b>0</b>                        | interrupt                       |                                                                    |                |                     |                |
| <b>1</b>                        | TLB protection                  |                                                                    |                |                     |                |
| <b>2</b>                        | TLB miss L/F                    |                                                                    |                |                     |                |
| <b>3</b>                        | TLB miss S                      |                                                                    |                |                     |                |
| <b>4</b>                        | bad address L/F                 |                                                                    |                |                     |                |
| <b>5</b>                        | bad address S                   |                                                                    |                |                     |                |
| <b>6</b>                        | bus error F                     |                                                                    |                |                     |                |
| <b>7</b>                        | bus error L/S                   |                                                                    |                |                     |                |
| <b>8</b>                        | syscall                         |                                                                    |                |                     |                |
| <b>9</b>                        | break                           |                                                                    |                |                     |                |
| <b>a</b>                        | a reserved instr.               |                                                                    |                |                     |                |
| <b>b</b>                        | coproc. unusable                |                                                                    |                |                     |                |
| <b>c</b>                        | arith. overflow                 |                                                                    |                |                     |                |
| <b>F</b>                        | fetch instr.                    |                                                                    |                |                     |                |
| <b>L</b>                        | load data                       |                                                                    |                |                     |                |
| <b>S</b>                        | store data                      |                                                                    |                |                     |                |

# Mapping C Variables to Memory

# Memory Layout

- Memory is array of bytes, but there are conventions as to what goes where in this array
- Text: instructions (the program to execute)
- Data: global variables
- Stack: local variables and other per-function state; starts at top & grows downward
- Heap: dynamically allocated variables; grows upward
- What if stack and heap overlap????



# Memory Layout: Example

```
int anumber = 3;

int factorial (int x) {
    if (x == 0) {
        return 1;
    }
    else {
        return x * factorial (x - 1);
    }
}

int main (void) {
    int z = factorial (anumber);
    printf ("%d\n", z);
    return 0;
}
```



# Memory Layout: Example

Global int anumber = 3;

.data

anumber: .word 3

Param int factorial (int x) {  
 if (x == 0) {  
 return 1;  
 }  
 else {  
 return x \* factorial (x - 1);  
 }  
}

.text

.globl factorial

factorial:

; find x in \$a0.

; if more args than 4,

; find them in stack

...

int main (void) {

.globl main

Local    int z = factorial (anumber);  
 printf("%d\n", z);  
 return 0;  
}

main:

; make local z exist by

; subtracting 4 from \$sp

addiu \$sp, \$sp, -4

...



# What Is an Array?

- Array is nothing but a contiguous chunk of memory.
- The name of the array is a pointer to the beginning of the chunk of memory array has.
- So `int array[100]` is a contiguous chunk of memory which is of size  $100*4 = 400$  bytes.
- so `array` is almost of type `int*`  
(not exactly `int*` to be very exact but good enough for our purposes)

# So...

- C pointer notation vs. array notation:
  - $\text{array}[0]$  is equal to  $*(\text{array} + 0)$  or  $*(\text{array})$
  - $\text{array}[1]$  is equal to  $*(\text{array} + 1)$
  - $\text{array}[2]$  is equal to  $*(\text{array} + 2)$
  - ...
- In C pointer math, the units are *elements*
- In MIPS memory accesses, the units are *bytes*
- If array lives at 0x1000, then:
  - $\text{array}[0]$  is  $*(\text{array}+0)$ , and it lives at  $0x1000+0*4 = 0x1000$
  - $\text{array}[1]$  is  $*(\text{array}+1)$ , and it lives at  $0x1000+1*4 = 0x1004$
  - $\text{array}[2]$  is  $*(\text{array}+2)$ , and it lives at  $0x1000+2*4 = 0x1008$
  - ...

# Pointers vs. Arrays

- How they're similar:
  - Both are represented by a memory address which has one or more bytes of content at it
- How they differ:
  - Pointers store a memory address in memory. Only allocated one word (4 bytes on MIPS)
    - Global:

```
char* name=NULL; → .data
name: .word 0
```
    - Local:

```
char* name=NULL; → addiu $sp, $sp, -16 ; assuming 3 other words needed
sw $0, 12($sp)
```
  - Arrays allocated enough space for the array itself. Array declaration by itself doesn't store a memory *address* into memory, compiler just knows where the array lives and makes references to it use that memory region.
    - Global:

```
char name[40]=""; → .data
name: .space 40 ; allocates 40 bytes, all null
```
    - Local:

```
char name[40]=""; → addiu $sp, $sp, -52 ; assuming 3 other words needed
sb $0, 12($sp) ; only need to set first byte to null
```

# What About String Literals?

- Pointers to a string literal: put string in read-only data region, store address to it.

- Global:

char\* name="hello"; →

```
.data  
name: .word str_hello  
.rdata  
str_hello: .asciiz "hello"
```

- Local:

char\* name="hello"; →

```
addiu $sp, $sp, -16 ; assuming 3 other words needed  
la $t0, str_hello  
sw $t0, 12($sp)
```

la is "Load address": a pseudo-instruction to put an address into a register.

- Arrays set to string literal: Allocate space for the string and initialize it.

- Global:

char name[40] = "hello"; →

```
.data  
name: .asciiz "hello"  
.space 34
```

- Local:

char name[40] = "hello"; →

```
addiu $sp, $sp, -52 ; assuming 3 other words needed  
add $a0, $sp, 12 ; destination for copy  
la $a1, str_hello ; source for copy  
jal memset ; std function to copy memory
```

# Pointers vs. Arrays

- When you pass an array to a function or store a reference to an array, then you're using pointers.

```
int func(int* ar) { ... }
```

```
int my_array[50];  
func(my_array);
```

- ar gets the address of the start of my\_array
- Still true for this syntax:

```
int func(int ar[]) { ... }  
int func(int ar[40]) { ... }
```

# Multidimensional Arrays

- Again, contiguous chunk of data.
- Behaves like
  - array of arrays in the case of 2 dimensions.
  - array of array of array in 3 dimension and so on.
- All are different interpretations and syntaxes on single chunk of contiguous memory

# Multidimensional Arrays Syntax

```
int mytable[10][20];
```

- We have 10 arrays of array of 20 integers.
- So total  $10 \times 20 \times 4 = 800$  bytes of contiguous memory.
- As before, the array name acts as a memory address to the beginning of the array.
- Type of the pointer is sort of like `int**`, but the row size needs to be known, so it's actually `int (*mytable) [20]`
- **How to access?** Pointer arithmetic.

`myarray[x][y]`  
*is equal to*  
`* (myarray + x * 20 + y)`

# 2D arrays in MIPS

- **Global:**

```
int a[10][20];
int b[10][20] = {{1,2,3,...,20},{101,102,...,120},...}
```



```
.data
a: .space 800
b: .word 1,2,3,...,20,101,...
```

- **Local:**

```
int a[10][20];
int b[10][20] = {{1,2,3,...,20},{101,102,...,120},...}
```



```
addiu $sp, $sp, -1612 ; assuming 3 other words needed
; no initialization for a, so it has whatever trash was on the stack already
li $t0,1
sw $t0, 12($sp)
li $t0,2
sw $t0, 16($sp)
li $t0,3
sw $t0, 20($sp)
    ... ; lots of initialization code
```

# What about `sizeof`?

- `sizeof` tells you how big something is, in bytes
- For variables declared as **arrays**, this is the size of the array:

```
int array[10]; // sizeof(array) is 10*4 = 40 bytes  
int array2d[10][20] // sizeof(array2d) is 10*20*4=800
```

- For pointers, this is the size of one pointer:
- This is true even if the pointer is used to refer to an array:

```
p = array; // p gets the address of the array.  
           // sizeof(p) is still 4 bytes
```