

# **Processor Architecture II: SEQ: Sequential Implementation**

Introduction to Computer Systems  
10<sup>th</sup> Lecture, Oct 20, 2025

**Instructors:**

**Class 1: Chen Xiangqun, Liu Xianhua**

**Class 2: Guan Xuetao**

**Class 3: Lu Junlin**

# Y86-64 Instruction Set #1

| Byte             | 0 | 1  | 2  | 3  | 4    | 5 | 6 | 7 | 8 | 9 |
|------------------|---|----|----|----|------|---|---|---|---|---|
| halt             | 0 | 0  |    |    |      |   |   |   |   |   |
| nop              | 1 | 0  |    |    |      |   |   |   |   |   |
| cmoveXX rA, rB   | 2 | fn | rA | rB |      |   |   |   |   |   |
| irmovq V, rB     | 3 | 0  | F  | rB | V    |   |   |   |   |   |
| rmmovq rA, D(rB) | 4 | 0  | rA | rB | D    |   |   |   |   |   |
| mrmovq D(rB), rA | 5 | 0  | rA | rB | D    |   |   |   |   |   |
| OPq rA, rB       | 6 | fn | rA | rB |      |   |   |   |   |   |
| jXX Dest         | 7 | fn |    |    | Dest |   |   |   |   |   |
| call Dest        | 8 | 0  |    |    | Dest |   |   |   |   |   |
| ret              | 9 | 0  |    |    |      |   |   |   |   |   |
| pushq rA         | A | 0  | rA | F  |      |   |   |   |   |   |
| popq rA          | B | 0  | rA | F  |      |   |   |   |   |   |

# Y86-64 Instruction Set #2

**Byte**

halt



nop



cmoveXX rA, rB



irmovq V, rB



rmmovq rA, D(rB)



mrmovq D(rB), rA



OPq rA, rB



jXX Dest



call Dest



ret



pushq rA



popq rA



# Y86-64 Instruction Set #3

| Byte             | 0 | 1  | 2  | 3  | 4    | 5 | 6 | 7 | 8 | 9 |
|------------------|---|----|----|----|------|---|---|---|---|---|
| halt             | 0 | 0  |    |    |      |   |   |   |   |   |
| nop              | 1 | 0  |    |    |      |   |   |   |   |   |
| cmoveXX rA, rB   | 2 | fn | rA | rB |      |   |   |   |   |   |
| irmovq V, rB     | 3 | 0  | F  | rB | V    |   |   |   |   |   |
| rmmovq rA, D(rB) | 4 | 0  | rA | rB | D    |   |   |   |   |   |
| mrmovq D(rB), rA | 5 | 0  | rA | rB | D    |   |   |   |   |   |
| OPq rA, rB       | 6 | fn | rA | rB |      |   |   |   |   |   |
| jXX Dest         | 7 | fn |    |    | Dest |   |   |   |   |   |
| call Dest        | 8 | 0  |    |    | Dest |   |   |   |   |   |
| ret              | 9 | 0  |    |    |      |   |   |   |   |   |
| pushq rA         | A | 0  | rA | F  |      |   |   |   |   |   |
| popq rA          | B | 0  | rA | F  |      |   |   |   |   |   |

# Y86-64 Instruction Set #4

**Byte**

halt

|   |   |
|---|---|
| 0 | 0 |
|---|---|

nop

|   |   |
|---|---|
| 1 | 0 |
|---|---|

cmoveXX rA, rB

|   |    |    |    |
|---|----|----|----|
| 2 | fn | rA | rB |
|---|----|----|----|

irmovq V, rB

|   |   |   |    |   |
|---|---|---|----|---|
| 3 | 0 | F | rB | V |
|---|---|---|----|---|

rmmovq rA, D(rB)

|   |   |    |    |   |
|---|---|----|----|---|
| 4 | 0 | rA | rB | D |
|---|---|----|----|---|

mrmovq D(rB), rA

|   |   |    |    |   |
|---|---|----|----|---|
| 5 | 0 | rA | rB | D |
|---|---|----|----|---|

OPq rA, rB

|   |    |    |    |
|---|----|----|----|
| 6 | fn | rA | rB |
|---|----|----|----|

jXX Dest

|   |    |      |
|---|----|------|
| 7 | fn | Dest |
|---|----|------|

call Dest

|   |   |      |
|---|---|------|
| 8 | 0 | Dest |
|---|---|------|

ret

|   |   |
|---|---|
| 9 | 0 |
|---|---|

pushq rA

|   |   |    |   |
|---|---|----|---|
| A | 0 | rA | F |
|---|---|----|---|

popq rA

|   |   |    |   |
|---|---|----|---|
| B | 0 | rA | F |
|---|---|----|---|

|     |   |   |
|-----|---|---|
| jmp | 7 | 0 |
| jle | 7 | 1 |
| jl  | 7 | 2 |
| je  | 7 | 3 |
| jne | 7 | 4 |
| jge | 7 | 5 |
| jg  | 7 | 6 |

# Building Blocks

## ■ Combinational Logic

- Compute Boolean functions of inputs
- Continuously respond to input changes
- Operate on data and implement control



## ■ Storage Elements

- Store bits
- Addressable memories
- Non-addressable registers
- Loaded only as clock rises



# Hardware Control Language

- Very simple hardware description language
- Can only express limited aspects of hardware operation
  - Parts we want to explore and modify

## ■ Data Types

- **bool: Boolean**
  - a, b, c, ...
- **int: words**
  - A, B, C, ...
  - Does not specify word size---bytes, 32-bit words, ...

## ■ Statements

- **bool a = *bool-expr* ;**
- **int A = *int-expr* ;**

# HCL Operations

- Classify by type of value returned

## ■ Boolean Expressions

- Logic Operations

- `a && b, a || b, !a`

- Word Comparisons

- `A == B, A != B, A < B, A <= B, A >= B, A > B`

- Set Membership

- `A in { B, C, D }`

- » Same as `A == B || A == C || A == D`

## ■ Word Expressions

- Case expressions

- `[ a : A; b : B; c : C ]`
  - Evaluate test expressions `a, b, c, ...` in sequence
  - Return word expression `A, B, C, ...` for first successful test

# SEQ Hardware Structure

## ■ State

- Program counter register (PC)
- Condition code register (CC)
- Register File
- Memories
  - Access same memory space
  - Data: for reading/writing program data
  - Instruction: for reading instructions

## ■ Instruction Flow

- Read instruction at address specified by PC
- Process through stages
- Update program counter



# SEQ Stages

- Fetch
  - Read instruction from instruction memory
- Decode
  - Read program registers
- Execute
  - Compute value or address
- Memory
  - Read or write data
- Write Back
  - Write program registers
- PC
  - Update program counter



# Instruction Decoding



## ■ Instruction Format

- **Instruction byte**      **icode:ifun**
- **Optional register byte**    **rA:rB**
- **Optional constant word**   **valC**

# Executing Arith./Logical Operation



## ■ Fetch

- Read 2 bytes

## ■ Decode

- Read operand registers

## ■ Execute

- Perform operation
- Set condition codes

## ■ Memory

- Do nothing

## ■ Write back

- Update register

## ■ PC Update

- Increment PC by 2

# Stage Computation: Arith/Log. Ops

|            | $OPq\ rA, rB$                                                                             |                                                                |
|------------|-------------------------------------------------------------------------------------------|----------------------------------------------------------------|
| Fetch      | $icode:ifun \leftarrow M_1[PC]$<br>$rA:rB \leftarrow M_1[PC+1]$<br>$valP \leftarrow PC+2$ | Read instruction byte<br>Read register byte<br>Compute next PC |
| Decode     | $valA \leftarrow R[rA]$<br>$valB \leftarrow R[rB]$                                        | Read operand A<br>Read operand B                               |
| Execute    | $valE \leftarrow valB\ OP\ valA$<br>Set CC                                                | Perform ALU operation<br>Set condition code register           |
| Memory     |                                                                                           |                                                                |
| Write back | $R[rB] \leftarrow valE$                                                                   | Write back result                                              |
| PC update  | $PC \leftarrow valP$                                                                      | Update PC                                                      |

- Formulate instruction execution as sequence of simple steps
- Use same general form for all instructions

# Executing `rmmovq`

`rmmovq rA, D(rB)`



## Fetch

- Read 10 bytes

## Decode

- Read operand registers

## Execute

- Compute effective address

## Memory

- Write to memory

## Write back

- Do nothing

## PC Update

- Increment PC by 10

# Stage Computation: `rmmovq`

|            |                                                                                                                                                                                   |                                                                                       |
|------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------|
|            | <code>rmmovq rA, D(rB)</code>                                                                                                                                                     |                                                                                       |
| Fetch      | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$<br>$\text{rA:rB} \leftarrow M_1[\text{PC+1}]$<br>$\text{valC} \leftarrow M_8[\text{PC+2}]$<br>$\text{valP} \leftarrow \text{PC+10}$ | Read instruction byte<br>Read register byte<br>Read displacement D<br>Compute next PC |
| Decode     | $\text{valA} \leftarrow R[\text{rA}]$<br>$\text{valB} \leftarrow R[\text{rB}]$                                                                                                    | Read operand A<br>Read operand B                                                      |
| Execute    | $\text{valE} \leftarrow \text{valB} + \text{valC}$                                                                                                                                | Compute effective address                                                             |
| Memory     | $M_8[\text{valE}] \leftarrow \text{valA}$                                                                                                                                         | Write value to memory                                                                 |
| Write back |                                                                                                                                                                                   |                                                                                       |
| PC update  | $\text{PC} \leftarrow \text{valP}$                                                                                                                                                | Update PC                                                                             |

- Use ALU for address computation

# Executing popq



## ■ Fetch

- Read 2 bytes

## ■ Decode

- Read stack pointer

## ■ Execute

- Increment stack pointer by 8

## ■ Memory

- Read from old stack pointer

## ■ Write back

- Update stack pointer
- Write result to register

## ■ PC Update

- Increment PC by 2

# Stage Computation: popq

| <code>popq rA</code> |                                                                                      |                                             |
|----------------------|--------------------------------------------------------------------------------------|---------------------------------------------|
| Fetch                | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$<br>$rA:rB \leftarrow M_1[\text{PC+1}]$ | Read instruction byte<br>Read register byte |
|                      | $\text{valP} \leftarrow \text{PC+2}$                                                 | Compute next PC                             |
| Decode               | $\text{valA} \leftarrow R[\%rsp]$<br>$\text{valB} \leftarrow R[\%rsp]$               | Read stack pointer<br>Read stack pointer    |
| Execute              | $\text{valE} \leftarrow \text{valB} + 8$                                             | Increment stack pointer                     |
| Memory               | $\text{valM} \leftarrow M_8[\text{valA}]$                                            | Read from stack                             |
| Write back           | $R[\%rsp] \leftarrow \text{valE}$<br>$R[rA] \leftarrow \text{valM}$                  | Update stack pointer<br>Write back result   |
| PC update            | $\text{PC} \leftarrow \text{valP}$                                                   | Update PC                                   |

- Use ALU to increment stack pointer
- Must update two registers
  - Popped value
  - New stack pointer

# Executing Conditional Moves

cmovxx rA, rB    2 fn rA rB

## ■ Fetch

- Read 2 bytes

## ■ Decode

- Read operand registers

## ■ Execute

- If !cnd, then set destination register to 0xF

## ■ Memory

- Do nothing

## ■ Write back

- Update register (or not)

## ■ PC Update

- Increment PC by 2

# Stage Computation: Cond. Move

|            |                                                                                              |                                                    |
|------------|----------------------------------------------------------------------------------------------|----------------------------------------------------|
|            | <code>cmoveXX rA, rB</code>                                                                  |                                                    |
| Fetch      | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$<br>$rA:rB \leftarrow M_1[\text{PC+1}]$         | Read instruction byte<br>Read register byte        |
|            | $\text{valP} \leftarrow \text{PC+2}$                                                         | Compute next PC                                    |
| Decode     | $\text{valA} \leftarrow R[rA]$<br>$\text{valB} \leftarrow 0$                                 | Read operand A                                     |
| Execute    | $\text{valE} \leftarrow \text{valB} + \text{valA}$<br>If ! Cond(CC,ifun) $rB \leftarrow 0xF$ | Pass valA through ALU<br>(Disable register update) |
| Memory     |                                                                                              |                                                    |
| Write back | $R[rB] \leftarrow \text{valE}$                                                               | Write back result                                  |
| PC update  | $\text{PC} \leftarrow \text{valP}$                                                           | Update PC                                          |

- Read register rA and pass through ALU
- Cancel move by setting destination register to 0xF
  - If condition codes & move condition indicate no move

# Executing Jumps



## Fetch

- Read 9 bytes
- Increment PC by 9

## Decode

- Do nothing

## Execute

- Determine whether to take branch based on jump condition and condition codes

## Memory

- Do nothing

## Write back

- Do nothing

## PC Update

- Set PC to Dest if branch taken or to incremented PC if not branch

# Stage Computation: Jumps

|            | jXX Dest                                                                                                                           |                                                                                                |
|------------|------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------|
| Fetch      | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$<br>$\text{valC} \leftarrow M_8[\text{PC}+1]$<br>$\text{valP} \leftarrow \text{PC}+9$ | <b>Read instruction byte</b><br><b>Read destination address</b><br><b>Fall through address</b> |
| Decode     |                                                                                                                                    |                                                                                                |
| Execute    | $\text{Cnd} \leftarrow \text{Cond}(\text{CC}, \text{ifun})$                                                                        | <b>Take branch?</b>                                                                            |
| Memory     |                                                                                                                                    |                                                                                                |
| Write back |                                                                                                                                    |                                                                                                |
| PC update  | $\text{PC} \leftarrow \text{Cnd} ? \text{valC} : \text{valP}$                                                                      | <b>Update PC</b>                                                                               |

- Compute both addresses
- Choose based on setting of condition codes and branch condition

# Executing call



## ■ Fetch

- Read 9 bytes
- Increment PC by 9

## ■ Decode

- Read stack pointer

## ■ Execute

- Decrement stack pointer by 8

## ■ Memory

- Write incremented PC to new value of stack pointer

## ■ Write back

- Update stack pointer

## ■ PC Update

- Set PC to Dest

# Stage Computation: call

|            |                                                                                                                                    |                                                                                                |
|------------|------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------|
|            | call Dest                                                                                                                          |                                                                                                |
| Fetch      | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$<br>$\text{valC} \leftarrow M_8[\text{PC}+1]$<br>$\text{valP} \leftarrow \text{PC}+9$ | <b>Read instruction byte</b><br><b>Read destination address</b><br><b>Compute return point</b> |
| Decode     | $\text{valB} \leftarrow R[\%rsp]$                                                                                                  | <b>Read stack pointer</b>                                                                      |
| Execute    | $\text{valE} \leftarrow \text{valB} + -8$                                                                                          | <b>Decrement stack pointer</b>                                                                 |
| Memory     | $M_8[\text{valE}] \leftarrow \text{valP}$                                                                                          | <b>Write return value on stack</b>                                                             |
| Write back | $R[\%rsp] \leftarrow \text{valE}$                                                                                                  | <b>Update stack pointer</b>                                                                    |
| PC update  | $\text{PC} \leftarrow \text{valC}$                                                                                                 | <b>Set PC to destination</b>                                                                   |

- Use ALU to decrement stack pointer
- Store incremented PC

# Executing ret



## Fetch

- Read 1 byte

## Decode

- Read stack pointer

## Execute

- Increment stack pointer by 8

## Memory

- Read return address from old stack pointer

## Write back

- Update stack pointer

## PC Update

- Set PC to return address

# Stage Computation: `ret`

|            |                                                                        |                                                          |
|------------|------------------------------------------------------------------------|----------------------------------------------------------|
|            | <code>ret</code>                                                       |                                                          |
| Fetch      | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$                          | Read instruction byte                                    |
| Decode     | $\text{valA} \leftarrow R[\%rsp]$<br>$\text{valB} \leftarrow R[\%rsp]$ | Read operand stack pointer<br>Read operand stack pointer |
| Execute    | $\text{valE} \leftarrow \text{valB} + 8$                               | Increment stack pointer                                  |
| Memory     | $\text{valM} \leftarrow M_8[\text{valA}]$                              | Read return address                                      |
| Write back | $R[\%rsp] \leftarrow \text{valE}$                                      | Update stack pointer                                     |
| PC update  | $\text{PC} \leftarrow \text{valM}$                                     | Set PC to return address                                 |

- Use ALU to increment stack pointer
- Read return address from memory

# Computation Steps

|            |            | OPq rA, rB                              |                            |
|------------|------------|-----------------------------------------|----------------------------|
| Fetch      | icode,ifun | icode:ifun $\leftarrow M_1[PC]$         | Read instruction byte      |
|            | rA,rB      | rA:rB $\leftarrow M_1[PC+1]$            | Read register byte         |
|            | valC       |                                         | [Read constant word]       |
|            | valP       | valP $\leftarrow PC+2$                  | Compute next PC            |
| Decode     | valA, srcA | valA $\leftarrow R[rA]$                 | Read operand A             |
|            | valB, srcB | valB $\leftarrow R[rB]$                 | Read operand B             |
| Execute    | valE       | valE $\leftarrow valB \text{ OP } valA$ | Perform ALU operation      |
|            | Cond code  | Set CC                                  | Set/use cond. code reg     |
| Memory     | valM       |                                         | [Memory read/write]        |
| Write back | dstE       | R[rB] $\leftarrow valE$                 | Write back ALU result      |
|            | dstM       |                                         | [Write back memory result] |
| PC update  | PC         | PC $\leftarrow valP$                    | Update PC                  |

- All instructions follow same general pattern
- Differ in what gets computed on each step

# Computation Steps

|            |            |                                               |                            |
|------------|------------|-----------------------------------------------|----------------------------|
|            |            | call Dest                                     |                            |
| Fetch      | icode,ifun | $\text{icode:ifun} \leftarrow M_1[\text{PC}]$ | Read instruction byte      |
|            | rA,rB      |                                               | [Read register byte]       |
|            | valC       | $\text{valC} \leftarrow M_8[\text{PC+1}]$     | Read constant word         |
|            | valP       | $\text{valP} \leftarrow \text{PC+9}$          | Compute next PC            |
| Decode     | valA, srcA |                                               | [Read operand A]           |
|            | valB, srcB | $\text{valB} \leftarrow R[\%rsp]$             | Read operand B             |
| Execute    | valE       | $\text{valE} \leftarrow \text{valB} + -8$     | Perform ALU operation      |
|            | Cond code  |                                               | [Set /use cond. code reg]  |
| Memory     | valM       | $M_8[\text{valE}] \leftarrow \text{valP}$     | Memory read/write          |
| Write back | dstE       | $R[\%rsp] \leftarrow \text{valE}$             | Write back ALU result      |
|            | dstM       |                                               | [Write back memory result] |
| PC update  | PC         | $\text{PC} \leftarrow \text{valC}$            | Update PC                  |

- All instructions follow same general pattern
- Differ in what gets computed on each step

# Computed Values

## ■ Fetch

|       |                      |
|-------|----------------------|
| icode | Instruction code     |
| ifun  | Instruction function |
| rA    | Instr. Register A    |
| rB    | Instr. Register B    |
| valC  | Instruction constant |
| valP  | Incremented PC       |

## ■ Execute

- valE ALU result
- Cnd Branch/move flag

## ■ Memory

- valM Value from memory

## ■ Decode

|      |                        |
|------|------------------------|
| srcA | Register ID A          |
| srcB | Register ID B          |
| dstE | Destination Register E |
| dstM | Destination Register M |
| valA | Register value A       |
| valB | Register value B       |

# SEQ Hardware

## ■ Key

- Blue boxes: predefined hardware blocks
  - E.g., memories, ALU
- Gray boxes: control logic
  - Describe in HCL
- White ovals: labels for signals
- Thick lines: 64-bit word values
- Thin lines: 4-8 bit values
- Dotted lines: 1-bit values



# Fetch Logic



## ■ Predefined Blocks

- **PC:** Register containing PC
- **Instruction memory:** Read 10 bytes (PC to PC+9)
  - Signal invalid address
- **Split:** Divide instruction byte into icode and ifun
- **Align:** Get fields for rA, rB, and valC

# Fetch Logic



## ■ Control Logic

- **Instr. Valid:** Is this instruction valid?
- **icode, ifun:** Generate no-op if invalid address
- **Need regids:** Does this instruction have a register byte?
- **Need valC:** Does this instruction have a constant word?

# Fetch Control Logic in HCL

```
# Determine instruction code
int icode = [
    imem_error: INOP;
    1: imem_icode;
];

# Determine instruction function
int ifun = [
    imem_error: FNONE;
    1: imem_ifun;
];
```



# Fetch Control Logic in HCL



```
bool need_regs =
    icode in { IRRMOVQ, IOPQ, IPUSHQ, IPOPQ,
               IIRMOVQ, IRMMOVQ, IMRMOVQ };
```

```
bool instr_valid = icode in
{ INOP, IHALT, IRRMOVQ, IIRMOVQ, IRMMOVQ, IMRMOVQ,
  IOPQ, IJXX, ICALL, IRET, IPUSHQ, IPOPQ };
```

# Decode Logic

## ■ Register File

- Read ports A, B
- Write ports E, M
- Addresses are register IDs or 15 (0xF) (no access)

## Control Logic

- srcA, srcB: read port addresses
- dstE, dstM: write port addresses

## Signals

- Cnd: Indicate whether or not to perform conditional move
  - Computed in Execute stage



# A Source

```
int srcA = [
    icode in { IRRMOVQ, IRMMOVQ, IOPOQ, IPUSHQ } : rA;
    icode in { IPOPOQ, IRET } : RRSP;
    1 : RNONE; # Don't need register
];
```

|        |                           |                    |
|--------|---------------------------|--------------------|
|        | OPq rA, rB                |                    |
| Decode | valA $\leftarrow$ R[rA]   | Read operand A     |
|        | cmoveXX rA, rB            |                    |
| Decode | valA $\leftarrow$ R[rA]   | Read operand A     |
|        | rmmovq rA, D(rB)          |                    |
| Decode | valA $\leftarrow$ R[rA]   | Read operand A     |
|        | popq rA                   |                    |
| Decode | valA $\leftarrow$ R[%rsp] | Read stack pointer |
|        | jXX Dest                  |                    |
| Decode |                           | No operand         |
|        | call Dest                 |                    |
| Decode |                           | No operand         |
|        | ret                       |                    |
| Decode | valA $\leftarrow$ R[%rsp] | Read stack pointer |

# E Destination

|                   |                         |                                        |
|-------------------|-------------------------|----------------------------------------|
|                   | <b>OPq rA, rB</b>       |                                        |
| <b>Write-back</b> | <b>R[rB] ← valE</b>     | <b>Write back result</b>               |
|                   | <b>cmovXX rA, rB</b>    |                                        |
| <b>Write-back</b> | <b>R[rB] ← valE</b>     | <b>Conditionally write back result</b> |
|                   | <b>rmmovq rA, D(rB)</b> |                                        |
| <b>Write-back</b> |                         | <b>None</b>                            |
|                   | <b>popq rA</b>          |                                        |
| <b>Write-back</b> | <b>R[%rsp] ← valE</b>   | <b>Update stack pointer</b>            |
|                   | <b>jXX Dest</b>         |                                        |
| <b>Write-back</b> |                         | <b>None</b>                            |
|                   | <b>call Dest</b>        |                                        |
| <b>Write-back</b> | <b>R[%rsp] ← valE</b>   | <b>Update stack pointer</b>            |
|                   | <b>ret</b>              |                                        |
| <b>Write-back</b> | <b>R[%rsp] ← valE</b>   | <b>Update stack pointer</b>            |

```

int dstE = [
    icode in { IRRMOVQ } && Cnd : rB;
    icode in { IIRMOVQ, IOPQ} : rB;
    icode in { IPUSHQ, IPOPQ, ICALL, IRET } : RRSP;
    1 : RNONE; # Don't write any register
];

```

# Execute Logic

## ■ Units

### ■ ALU

- Implements 4 required functions
- Generates condition code values

### ■ CC

- Register with 3 condition code bits

### ■ cond

- Computes conditional jump/move flag

## ■ Control Logic

### ■ Set CC: Should condition code register be loaded?

### ■ ALU A: Input A to ALU

### ■ ALU B: Input B to ALU

### ■ ALU fun: What function should ALU compute?



# ALU A Input

|         |                                |                           |
|---------|--------------------------------|---------------------------|
|         | OPq rA, rB                     |                           |
| Execute | valE $\leftarrow$ valB OP valA | Perform ALU operation     |
|         | cmoveXX rA, rB                 |                           |
| Execute | valE $\leftarrow$ 0 + valA     | Pass valA through ALU     |
|         | rmmovq rA, D(rB)               |                           |
| Execute | valE $\leftarrow$ valB + valC  | Compute effective address |
|         | popq rA                        |                           |
| Execute | valE $\leftarrow$ valB + 8     | Increment stack pointer   |
|         | jXX Dest                       |                           |
| Execute |                                | No operation              |
|         | call Dest                      |                           |
| Execute | valE $\leftarrow$ valB + -8    | Decrement stack pointer   |
|         | ret                            |                           |
| Execute | valE $\leftarrow$ valB + 8     | Increment stack pointer   |

```
int aluA = [
    icode in { IRRMOVQ, IOPQ } : valA,
    icode in { IIRMOVQ, IRMMOVQ, IMRMOVQ } : valC,
    icode in { ICALL, IPUSHQ } : -8,
    icode in { IRET, IPOPQ } : 8,
    # Other instructions don't need ALU
];
```

# ALU Operation

|         |                                                    |                           |
|---------|----------------------------------------------------|---------------------------|
|         | OPI rA, rB                                         |                           |
| Execute | $\text{valE} \leftarrow \text{valB OP valA}$       | Perform ALU operation     |
|         | cmovXX rA, rB                                      |                           |
| Execute | $\text{valE} \leftarrow 0 + \text{valA}$           | Pass valA through ALU     |
|         | rmmovl rA, D(rB)                                   |                           |
| Execute | $\text{valE} \leftarrow \text{valB} + \text{valC}$ | Compute effective address |
|         | popq rA                                            |                           |
| Execute | $\text{valE} \leftarrow \text{valB} + 8$           | Increment stack pointer   |
|         | jXX Dest                                           |                           |
| Execute |                                                    | No operation              |
|         | call Dest                                          |                           |
| Execute | $\text{valE} \leftarrow \text{valB} + -8$          | Decrement stack pointer   |
|         | ret                                                |                           |
| Execute | $\text{valE} \leftarrow \text{valB} + 8$           | Increment stack pointer   |

```
int alufun = [
    icode == IOPQ : ifun;
    1 : ALUADD;
];
```

# Memory Logic

## ■ Memory

- Reads or writes memory word

## ■ Control Logic

- stat: What is instruction status?
- Mem. read: should word be read?
- Mem. write: should word be written?
- Mem. addr.: Select address
- Mem. data.: Select data



# Instruction Status

## ■ Control Logic

- stat: What is instruction status?



```
## Determine instruction status
int Stat = [
    imem_error || dmem_error : SADR;
    !instr_valid: SINS;
    icode == IHALT : SHLT;
    1 : SAOK;
];
```

# Memory Address

|        |                                           |                             |
|--------|-------------------------------------------|-----------------------------|
|        | OPq rA, rB                                |                             |
| Memory |                                           | No operation                |
|        | rmmovq rA, D(rB)                          |                             |
| Memory | $M_8[\text{valE}] \leftarrow \text{valA}$ | Write value to memory       |
|        | popq rA                                   |                             |
| Memory | $\text{valM} \leftarrow M_8[\text{valA}]$ | Read from stack             |
|        | jXX Dest                                  |                             |
| Memory |                                           | No operation                |
|        | call Dest                                 |                             |
| Memory | $M_8[\text{valE}] \leftarrow \text{valP}$ | Write return value on stack |
|        | ret                                       |                             |
| Memory | $\text{valM} \leftarrow M_8[\text{valA}]$ | Read return address         |

```

int mem_addr = [
    icode in { IRMMOVQ, IPUSHQ, ICALL, IMRMOVQ } : valE;
    icode in { IPOPQ, IRET } : valA;
    # Other instructions don't need address
];

```

# Memory Read

|               |                                           |                                    |
|---------------|-------------------------------------------|------------------------------------|
|               | <b>OPq rA, rB</b>                         |                                    |
| <b>Memory</b> |                                           | <b>No operation</b>                |
|               | <b>rmmovq rA, D(rB)</b>                   |                                    |
| <b>Memory</b> | $M_8[\text{valE}] \leftarrow \text{valA}$ | <b>Write value to memory</b>       |
|               | <b>popq rA</b>                            |                                    |
| <b>Memory</b> | $\text{valM} \leftarrow M_8[\text{valA}]$ | <b>Read from stack</b>             |
|               | <b>jXX Dest</b>                           |                                    |
| <b>Memory</b> |                                           | <b>No operation</b>                |
|               | <b>call Dest</b>                          |                                    |
| <b>Memory</b> | $M_8[\text{valE}] \leftarrow \text{valP}$ | <b>Write return value on stack</b> |
|               | <b>ret</b>                                |                                    |
| <b>Memory</b> | $\text{valM} \leftarrow M_8[\text{valA}]$ | <b>Read return address</b>         |

```
bool mem_read = icode in { IMRMOVQ, IPOPQ, IRET };
```

# PC Update Logic

## ■ New PC

- Select next value of PC



# PC Update

|           |                                   |                          |
|-----------|-----------------------------------|--------------------------|
|           | OPq rA, rB                        |                          |
| PC update | PC $\leftarrow$ valP              | Update PC                |
|           | rmmovq rA, D(rB)                  |                          |
| PC update | PC $\leftarrow$ valP              | Update PC                |
|           | popq rA                           |                          |
| PC update | PC $\leftarrow$ valP              | Update PC                |
|           | jXX Dest                          |                          |
| PC update | PC $\leftarrow$ Cnd ? valC : valP | Update PC                |
|           | call Dest                         |                          |
| PC update | PC $\leftarrow$ valC              | Set PC to destination    |
|           | ret                               |                          |
| PC update | PC $\leftarrow$ valM              | Set PC to return address |

```
int new_pc = [
    icode == ICALL : valC;
    icode == IJXX && Cnd : valC;
    icode == IRET : valM;
    1 : valP;
];
```

# SEQ Operation



## ■ State

- PC register
- Cond. Code register
- Data memory
- Register file

*All updated as clock rises*

## ■ Combinational Logic

- ALU
- Control logic
- Memory reads
  - Instruction memory
  - Register file
  - Data memory

# SEQ Operation #2



- **state set according to second `irmovq` instruction**
- **combinational logic starting to react to state changes**

# SEQ Operation #3



| Clock    | Cycle 1                                           | Cycle 2 | Cycle 3 | Cycle 4 |
|----------|---------------------------------------------------|---------|---------|---------|
|          | ①                                                 |         | ②       | ③       |
|          |                                                   |         | ④       |         |
| Cycle 1: | 0x000: irmovq \$0x100,%rbx # %rbx <-- 0x100       |         |         |         |
| Cycle 2: | 0x00a: irmovq \$0x200,%rdx # %rdx <-- 0x200       |         |         |         |
| Cycle 3: | 0x014: addq %rdx,%rbx # %rbx <-- 0x300 CC <-- 000 |         |         |         |
| Cycle 4: | 0x016: je dest # Not taken                        |         |         |         |
| Cycle 5: | 0x01f: rmmovq %rbx,0(%rdx) # M[0x200] <-- 0x300   |         |         |         |



- state set according to second `irmovq` instruction
- combinational logic generates results for `addq` instruction

# SEQ Operation #4



- **state set according to addq instruction**
- **combinational logic starting to react to state changes**

# SEQ Operation #5



- state set according to addq instruction
- combinational logic generates results for je instruction

# SEQ Summary

## ■ Implementation

- Express every instruction as series of simple steps
- Follow same general flow for each instruction type
- Assemble registers, memories, predesigned combinational blocks
- Connect with control logic

## ■ Limitations

- Too slow to be practical
- In one cycle, must propagate through instruction memory, register file, ALU, and data memory
- Would need to run clock very slowly
- Hardware units only active for fraction of clock cycle