

# CS 110

## Computer Architecture

### Lecture 11:

### *Single-Cycle CPU*

### *Control*

Instructors:

Sören Schwertfeger & Chundong Wang

<https://robotics.shanghaitech.edu.cn/courses/ca/20s/>

School of Information Science and Technology SIST

ShanghaiTech University

Slides based on UC Berkley's CS61C

# Agenda

- Review
- Control
- Hardware Description Languages (Video 2)
- Timing (Video 2)

# Agenda

- Review
- Control
- Hardware Description Languages (Video 2)
- Timing (Video 2)

# One-Instruction-Per-Cycle RISC-V Machine



- One clock tick => one instruction
- Current state outputs => inputs to combinational logic => outputs settle at the values of state before next clock edge
- Rising clock edge:
  - all state elements are updated with combinational logic outputs
  - execution moves to next clock cycle

# Stages of Execution on Datapath



# Datapath Components: Combinational

- Combinational Elements



Adder



Multiplexer



ALU

- Storage Elements + Clocking Methodology
- Building Blocks

# Datapath Elements: State and Sequencing (1/3)

- Register



- Write Enable:
  - Negated (or deasserted) (0): Data Out will not change
  - Asserted (1): Data Out will become Data In on positive edge of clock

# Datapath Elements: State and Sequencing (2/3)

- Register file (regfile, RF) consists of 32 registers
  - Two 32-bit output busses: busA and busB
  - One 32-bit input bus: busW
  - In one clock cycle can read two registers and write another!



- Register is selected by:
  - RA (number) selects the register to put on busA (data)
  - RB (number) selects the register to put on busB (data)
  - RW (number) selects the register to be written via busW (data) when Write Enable is 1
- Clock input (clk)
  - Clk input is a factor ONLY during write operation
  - During read operation, behaves as a combinational logic block:
    - RA or RB valid  $\Rightarrow$  busA or busB valid after “access time.”

**Memory Size of Register File?**

# Datapath Elements: State and Sequencing (3/3)

- “Magic” Memory
  - One input bus: Data In
  - One output bus: Data Out
- Memory word is found by:
  - For Read: Address selects the word to put on Data Out
  - For Write: Set Write Enable = 1: address selects the memory word to be written via the Data In bus
- Clock input (CLK)
  - CLK input is a factor ONLY during write operation
  - During read operation, behaves as a combinational logic block: Address valid  $\Rightarrow$  Data Out valid after “access time”



# State Required by RV32I ISA

Each instruction reads and updates this state during execution:

- Registers (**x0 . . x31**)
  - Register file (*regfile*) **Reg** holds 32 registers x 32 bits/register: **Reg [0] . . Reg [31]**
  - First register read specified by *rs1* field in instruction
  - Second register read specified by *rs2* field in instruction
  - Write register (destination) specified by *rd* field in instruction
  - **x0** is always 0 (writes to **Reg [0]** are ignored)
- Program Counter (**PC**)
  - Holds address of current instruction
- Memory (**MEM**)
  - Holds both instructions & data, in one 32-bit byte-addressed memory space
  - We'll use separate memories for instructions (**IMEM**) and data (**DMEM**)
    - *These are placeholders for instruction and data caches*
  - Instructions are read (*fetched*) from instruction memory (assume **IMEM** read-only)
  - Load/store instructions access data memory

# Complete Single-Cycle RV32I Datapath!



# Agenda

- Review
- Control
- Hardware Description Languages (Video 2)
- Timing (Video 2)

# Our Single-Core Computer (without FPU, SIMD)



# Example: add



# add Execution



# Example: SW



# Example: beq



# Control Logic Truth Table

| Inst[31:0]          | BrEq | BrLT | PCSel | ImmSel | BrUn | ASel | BSel | ALUSel | MemRW | RegWEn | WBSel             |
|---------------------|------|------|-------|--------|------|------|------|--------|-------|--------|-------------------|
| <b>add</b>          | *    | *    | +4    | *      | *    | Reg  | Reg  | Add    | Read  | 1      | ALU               |
| <b>sub</b>          | *    | *    | +4    | *      | *    | Reg  | Reg  | Sub    | Read  | 1      | ALU               |
| <b>(R-R<br/>Op)</b> | *    | *    | +4    | *      | *    | Reg  | Reg  | (Op)   | Read  | 1      | ALU               |
| <b>addi</b>         | *    | *    | +4    | I      | *    | Reg  | Imm  | Add    | Read  | 1      | ALU               |
| <b>lw</b>           | *    | *    | +4    | I      | *    | Reg  | Imm  | Add    | Read  | 1      | Mem               |
| <b>sw</b>           | *    | *    | +4    | S      | *    | Reg  | Imm  | Add    | Write | 0      | *                 |
| <b>beq</b>          | 0    | *    | +4    | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>beq</b>          | 1    | *    | ALU   | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bne</b>          | 0    | *    | ALU   | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bne</b>          | 1    | *    | +4    | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>blt</b>          | *    | 1    | ALU   | B      | 0    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bltu</b>         | *    | 1    | ALU   | B      | 1    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>jalr</b>         | *    | *    | ALU   | I      | *    | Reg  | Imm  | Add    | Read  | 1      | PC+4              |
| <b>jal</b>          | *    | *    | ALU   | J      | *    | PC   | Imm  | Add    | Read  | 1      | PC+4              |
| <b>auiopc</b>       | *    | *    | +4    | U      | *    | PC   | Imm  | Add    | Read  | 1      | ALU <sup>18</sup> |

# Control Realization Options

- ROM
  - “Read-Only Memory”
    - During “normal” operation it is only being read
  - Regular structure
  - Can be easily reprogrammed
    - fix errors
    - add instructions
  - Popular when designing control logic manually
- Combinatorial Logic
  - Today, chip designers use logic synthesis tools to convert truth tables to networks of gates

# RV32I, a nine-bit ISA!

|                       |     |     |     |             |
|-----------------------|-----|-----|-----|-------------|
| imm[31:12]            |     |     | rd  | 0110111     |
| imm[31:12]            |     |     | rd  | 0010111     |
| imm[20:10:1 11 19:12] |     |     | rd  | 1101111     |
| imm[11:0]             | rs1 | 000 | rd  | 1100111     |
| imm[12:10:5]          | rs2 | rs1 | 000 | imm[4:1 11] |
| imm[12:10:5]          | rs2 | rs1 | 001 | imm[4:1 11] |
| imm[12:10:5]          | rs2 | rs1 | 100 | imm[4:1 11] |
| imm[12:10:5]          | rs2 | rs1 | 101 | imm[4:1 11] |
| imm[12:10:5]          | rs2 | rs1 | 110 | imm[4:1 11] |
| imm[12:10:5]          | rs2 | rs1 | 111 | imm[4:1 11] |
| imm[11:0]             | rs1 | 000 | rd  | 0000011     |
| imm[11:0]             | rs1 | 001 | rd  | 0000011     |
| imm[11:0]             | rs1 | 010 | rd  | 0000011     |
| imm[11:0]             | rs1 | 100 | rd  | 0000011     |
| imm[11:0]             | rs1 | 101 | rd  | 0000011     |
| imm[11:5]             | rs2 | rs1 | 000 | imm[4:0]    |
| imm[11:5]             | rs2 | rs1 | 001 | imm[4:0]    |
| imm[11:5]             | rs2 | rs1 | 010 | imm[4:0]    |
| imm[11:0]             | rs1 | 000 | rd  | 0010011     |
| imm[11:0]             | rs1 | 010 | rd  | 0010011     |
| imm[11:0]             | rs1 | 011 | rd  | 0010011     |
| imm[11:0]             | rs1 | 100 | rd  | 0010011     |
| imm[11:0]             | rs1 | 110 | rd  | 0010011     |
| imm[11:0]             | rs1 | 111 | rd  | 0010011     |

LUI  
AUIPC  
JAL  
JALR  
BEQ  
BNE  
BLT  
BGE  
BLTU  
BGEU  
LB  
LH  
LW  
LBU  
LHU  
SB  
SH  
SW  
ADDI  
SLTI  
SLTIU  
XORI  
ORI  
ANDI

| inst[30]         |       |      | inst[14:12] inst[6:2] |         |           |
|------------------|-------|------|-----------------------|---------|-----------|
| 0000000          | shamt | rs1  | 001                   | rd      | 0010011   |
| 0000000          | shamt | rs1  | 101                   | rd      | 0010011   |
| 0000000          | shamt | rs1  | 101                   | rd      | 0010011   |
| 0000000          | rs2   | rs1  | 000                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 000                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 001                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 010                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 011                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 100                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 101                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 101                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 110                   | rd      | 0110011   |
| 0000000          | rs2   | rs1  | 111                   | rd      | 0110011   |
| 0000000          | pred  | succ | 00000                 | 000     | 0001111   |
| 00000            | 0000  | 0000 | 00000                 | 001     | 0001111   |
| 0000000000000000 |       |      | 00000                 | 000     | 000000011 |
| 0000000000000001 |       |      | 00000                 | 000     | 000000011 |
| csr              | rs1   | 001  | rd                    | 1110011 |           |
| csr              | rs1   | 010  | rd                    | 1110011 |           |
| csr              | rs1   | 011  | rd                    | 1110011 |           |
| csr              | zimm  | 101  | rd                    | 1110011 |           |
| csr              | zimm  | 110  | rd                    | 1110011 |           |
| csr              | zimm  | 111  | rd                    | 1110011 |           |

Not in CS61C

Instruction type encoded using only 9 bits  
**inst[30],inst[14:12], inst[6:2]**

# Combinational Logic Control

- Simplest example: BrUn

|              |     |     | inst[14:12] |             | inst[6:2] = Branch |      |
|--------------|-----|-----|-------------|-------------|--------------------|------|
| imm[12 10:5] | rs2 | rs1 | 000         | imm[4:1 11] | 1100011            | BEQ  |
| imm[12 10:5] | rs2 | rs1 | 001         | imm[4:1 11] | 1100011            | BNE  |
| imm[12 10:5] | rs2 | rs1 | 100         | imm[4:1 11] | 1100011            | BLT  |
| imm[12 10:5] | rs2 | rs1 | 101         | imm[4:1 11] | 1100011            | BGE  |
| imm[12 10:5] | rs2 | rs1 | 110         | imm[4:1 11] | 1100011            | BLTU |
| imm[12 10:5] | rs2 | rs1 | 111         | imm[4:1 11] | 1100011            | BGEU |

- How to decode whether BrUn is 1?

- BrUn = Inst [13] • Branch

# Control Block Design



# ROM Controller Implementation



# Controller: AND logic (add, sw)



- $\text{add} = \overline{\text{inst}[2]} \cdot \overline{\text{inst}[3]} \cdot \overline{\text{inst}[4]} \cdot \overline{\text{inst}[5]} \cdot \overline{\text{inst}[6]} \cdot \overline{\text{inst}[12]} \cdot \overline{\text{inst}[13]} \cdot \overline{\text{inst}[14]} \cdot \overline{\text{inst}[30]}$
- $\text{sw} = \overline{\text{inst}[2]} \cdot \overline{\text{inst}[3]} \cdot \overline{\text{inst}[4]} \cdot \overline{\text{inst}[5]} \cdot \overline{\text{inst}[6]} \cdot \overline{\text{inst}[12]} \cdot \overline{\text{inst}[13]} \cdot \overline{\text{inst}[14]}$

AND Controller Logic

|     | 2                                                                                                                                                        | 3 | 4 | 5 | 6 | 12 | 13 | 14 | 30 | unused |
|-----|----------------------------------------------------------------------------------------------------------------------------------------------------------|---|---|---|---|----|----|----|----|--------|
| add | xor(i[2], 1 ) • xor(i[3], 1 ) • xor(i[4], 0 ) • xor(i[5], 0 ) • xor(i[6], 1 ) • xor(i[12], 1 ) • xor(i[13], 1 ) • xor(i[14], 1 ) • (xor(i[30], 1 ) + 0 ) |   |   |   |   |    |    |    |    |        |
| sw  | xor(i[2], 1 ) • xor(i[3], 1 ) • xor(i[4], 1 ) • xor(i[5], 0 ) • xor(i[6], 1 ) • xor(i[12], 1 ) • xor(i[13], 1 ) • xor(i[14], 1 ) • (xor(i[30], 1 ) + 1 ) |   |   |   |   |    |    |    |    |        |
| ... |                                                                                                                                                          |   |   |   |   |    |    |    |    |        |

- $\text{bne\_t} = \text{bne} \cdot \overline{\text{BrEQ}}$
- $\text{bne\_f} = \text{bne} \cdot \text{BrEQ}$

# Controller: OR logic



- $\text{MemRW} = \text{sw} + \text{sh} + \text{sb}$
- $\text{WBSel}[0] = \text{add} + (\text{all reg. to reg.}) + \text{AUIPC} + \dots$
- $\text{WBSel}[1] = \text{JALR} + \text{JAL}$
- $\text{PCSel} = \text{beq\_t} + \text{bne\_t} + \text{blt\_t} + \text{bltu\_t} + \text{jal} + \text{jalr}$
- ...



# Control Logic Truth Table

| Inst[31:0]          | BrEq | BrLT | PCSel | ImmSel | BrUn | ASel | BSel | ALUSel | MemRW | RegWEn | WBSel             |
|---------------------|------|------|-------|--------|------|------|------|--------|-------|--------|-------------------|
| <b>add</b>          | *    | *    | +4    | *      | *    | Reg  | Reg  | Add    | Read  | 1      | ALU               |
| <b>sub</b>          | *    | *    | +4    | *      | *    | Reg  | Reg  | Sub    | Read  | 1      | ALU               |
| <b>(R-R<br/>Op)</b> | *    | *    | +4    | *      | *    | Reg  | Reg  | (Op)   | Read  | 1      | ALU               |
| <b>addi</b>         | *    | *    | +4    | I      | *    | Reg  | Imm  | Add    | Read  | 1      | ALU               |
| <b>lw</b>           | *    | *    | +4    | I      | *    | Reg  | Imm  | Add    | Read  | 1      | Mem               |
| <b>sw</b>           | *    | *    | +4    | S      | *    | Reg  | Imm  | Add    | Write | 0      | *                 |
| <b>beq</b>          | 0    | *    | +4    | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>beq</b>          | 1    | *    | ALU   | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bne</b>          | 0    | *    | ALU   | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bne</b>          | 1    | *    | +4    | B      | *    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>blt</b>          | *    | 1    | ALU   | B      | 0    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>bltu</b>         | *    | 1    | ALU   | B      | 1    | PC   | Imm  | Add    | Read  | 0      | *                 |
| <b>jalr</b>         | *    | *    | ALU   | I      | *    | Reg  | Imm  | Add    | Read  | 1      | PC+4              |
| <b>jal</b>          | *    | *    | ALU   | J      | *    | PC   | Imm  | Add    | Read  | 1      | PC+4              |
| <b>auiopc</b>       | *    | *    | +4    | U      | *    | PC   | Imm  | Add    | Read  | 1      | ALU <sup>26</sup> |

# ROM Controller Implementation





# TA Discussion

Yanjie Song



# Q & A



# Quiz



# Quiz

Piazza: "Online Lecture 11 Control Poll"

- Select the statements that are TRUE:
  - A. We should design the control logic as fast as possible because it adds to the critical path in the CPU.
  - B. The control needs all 32 bits of the instruction to decide on the control signals.
  - C. The value of all control signals is known from the instruction bits.
  - D. It is possible to add certain new instructions to the ISA by just modifying the control (e.g. subi).

# CS 110

# Computer Architecture

## Lecture 11:

### *Single-Cycle CPU*

### *Control*

### *Video 2: Timing*

Instructors:  
**Sören Schwertfeger & Chundong Wang**

<https://robotics.shanghaitech.edu.cn/courses/ca/20s/>

**School of Information Science and Technology SIST**

**ShanghaiTech University**

**Slides based on UC Berkley's CS61C**

# Agenda

- Review
- Control
- Hardware Description Languages (Video 2)
- Timing (Video 2)

# Register Transfer Level (RTL)

- RTL describes instructions in figures or text
- Can use C (or Verilog) to describe RTL
- RISC-V green card show RTL for all instructions in Verilog
- RTL is a subset of a Hardware Description Language

## Inst Register Transfers

```
add    R[rd] ← R[rs1] + R[rs2]; PC ← PC + 4
sub    R[rd] ← R[rs1] - R[rs2]; PC ← PC + 4
ori    R[rd] ← R[rs1] | Imm; PC ← PC + 4
jal    R[rd] ← PC + 4; PC ← R[rs1] + Imm
beq    if ( R[rs1] == R[rs2] )
            PC ← PC + Imm
        else PC ← PC + 4
```

# Hardware Description Languages

- Example HDL's : ABEL, VERILOG, VHDL
- Advantages:
  - Documentation
  - Flexibility (easier to make design changes or mods)
  - Portability (if HDL is standard)
  - One language for modeling, simulation (test benches), and synthesis
  - Let synthesis worry about gate generation
    - Engineer productivity
- However: A different way of approaching design
  - engineers are used to thinking and designing using graphics (schematics) instead of text.

# Simulate Circuits (e.g. for FPGA)



# Synthesis

- Use a HDL to create:
  - VLSI (Very Large Scale Integration): create an IC (Integrated Circuit) – a chip – for example a CMOS CPU, Memory
  - ASIC (Application-specific integrated circuit): CMOS chip for a specific application in a specific device
  - a program for an FPGA (Field-programmable gate array): Programmable logic blocks that can perform combinatorial logic
    - Include state elements (memory)
    - E.g. Xilinx VU19P: 9million logic gates -> can simulate 16 Arm Cortex A9 cores at the same time!

# Verilog and VHDL

- VHDL - like Pascal and Ada programming languages
  - VHDL (VHSIC-HDL) (Very High Speed Integrated Circuit Hardware Description Language)
- Verilog - more like ‘C’ programming language
- VHDL is strongly typed; Verilog weakly typed
- Verilog is easier to learn compared to VHDL
- VHDL with library management, more complex datatypes

# Verilog Example:

```
module mux_4_to_1 (Out, In0, In1, In2, In3, Sel1, Sel0);
    output Out;
    input In0, In1, In2, In3, Sel0, Sel1;

    wire NotSel0, NotSel1;
    wire Y0, Y1, Y2, Y3;

    not (NotSel0, Sel0);
    not (NotSel1, Sel1);
    and (Y0, In0, NotSel1, NotSel0);
    and (Y1, In1, NotSel1, Sel0);
    and (Y2, In2, Sel1, NotSel0);
    and (Y3, In3, Sel1, Sel0);
    or (Out, Y0, Y1, Y2, Y3);

endmodule
```

Source: Anil Kumar  
Ram Rakhyani

# Agenda

- Review
- Control
- Hardware Description Languages (Video 2)
- Timing (Video 2)

# Q

## Question: Critical Path

Critical path for xor

$R[rd] = R[rs1] \text{ XOR imm}$



- A.  $t_{clk-q} + t_{IMEM} + \max\{t_{Reg}, t_{Imm}\} + t_{ALU} + 2t_{mux} + t_{Setup}$
- B.  $t_{clk-q} + t_{Add} + t_{IMEM} + t_{Reg} + t_{BComp} + t_{ALU} + t_{DMEM} + t_{mux} + t_{Setup}$
- C.  $t_{clk-q} + t_{IMEM} + \max\{t_{Reg}, t_{Imm}\} + t_{ALU} + 3t_{mux} + t_{DMEM} + t_{Setup}$
- D. None of the above

# Peer Instruction: Critical Path

Critical path for xor

$$R[rd] = R[rs1] + imm$$



- A.  $t_{clk-q} + t_{IMEM} + \max\{t_{Reg}, t_{Imm}\} + t_{ALU} + 2*t_{mux} + t_{Setup}$
- B.  $t_{clk-q} + t_{Add} + t_{IMEM} + t_{Reg} + t_{BComp} + t_{ALU} + t_{DMEM} + t_{mux} + t_{Setup}$
- C.  $t_{clk-q} + t_{IMEM} + \max\{t_{Reg}, t_{Imm}\} + t_{ALU} + 3*t_{mux} + t_{DMEM} + t_{Setup}$
- D. None of the above

# Instruction Timing



| IF     | ID       | EX     | MEM    | WB     | Total         |
|--------|----------|--------|--------|--------|---------------|
| I-MEM  | Reg Read | ALU    | D-MEM  | Reg W  |               |
| 200 ps | 100 ps   | 200 ps | 200 ps | 100 ps | <b>800 ps</b> |

# Instruction Timing

| Instr | IF = 200ps | ID = 100ps | ALU = 200ps | MEM=200ps | WB = 100ps | Total |
|-------|------------|------------|-------------|-----------|------------|-------|
| add   | X          | X          | X           |           | X          | 600ps |
| beq   | X          | X          | X           |           |            | 500ps |
| jal   | X          | X          | X           |           | X          | 600ps |
| lw    | X          | X          | X           | X         | X          | 800ps |
| sw    | X          | X          | X           | X         |            | 700ps |

- Maximum clock frequency
  - $f_{\max} = 1/800\text{ps} = 1.25 \text{ GHz}$
- Most blocks idle most of the time
  - E.g.  $f_{\max, \text{ALU}} = 1/200\text{ps} = 5 \text{ GHz!}$

# Performance

- “Our” RISC-V executes instructions at 1.25 GHz
  - 1 instruction every 800 ps
- Can we improve its performance?
  - What do we mean with this statement?
  - Not so obvious:
    - Quicker response time, so one job finishes faster?
    - More jobs per unit time (e.g. web server returning pages)?
    - Longer battery life?

# Transportation Analogy



|                    | Sports Car | Bus        |
|--------------------|------------|------------|
| Passenger Capacity | 2          | 50         |
| Travel Speed       | 250 km/h   | 100 km/h   |
| Fuel consumption   | 20 l/100km | 20 l/100km |

**Schwerin => Berlin trip: 200 km**



=>



# Transportation Analogy



|                    | Sports Car | Bus        |
|--------------------|------------|------------|
| Passenger Capacity | 2          | 50         |
| Travel Speed       | 250 km/h   | 100 km/h   |
| Fuel consumption   | 20 l/100km | 20 l/100km |

**Schwerin => Berlin trip: 200 km**

|                         | Sports Car | Bus     |
|-------------------------|------------|---------|
| Travel Time             | 48 min     | 120 min |
| Time for 100 passengers | 40 h       | 4 h     |
| Fuel per passenger      | 2000 l     | 80 l    |

# Computer Analogy

| Transportation          | Computer                                                                                                                                                           |
|-------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Travel Time             | Program execution time ( <b>latency</b> )<br>e.g. time to update display                                                                                           |
| Time for 100 passengers | Throughput:<br>e.g. number of server requests handled per hour                                                                                                     |
| Fuel per passenger      | Energy per task*:<br>e.g.: <ul style="list-style-type: none"><li>- how many movies can you watch per battery charge</li><li>- energy bill for datacenter</li></ul> |

\* Note: power is not a good measure, since low-power CPU might run for a long time to complete one task consuming more energy than faster computer running at higher power for a shorter time

# Next Lecture:

- Improve performance through pipelining:
    - One stage per clock cycle!
    - Need 5 clock cycles to complete one instruction
    - Clock cycles much shorter now
- => Higher throughput      ☺
- => Higher latency           ☹

# Call home, we've made HW/SW contact!



# “And In conclusion...”

- We have built a processor!
  - Capable of executing all RISC-V instructions in one cycle each
  - Not all units (hardware) used by all instructions
  - Critical path changes
- 5 Phases of execution
  - IF, ID, EX, MEM, WB
  - Not all instructions are active in all phases
- Controller specifies how to execute instructions
  - Implemented as ROM or logic

# Question

Piazza: "Lecture 11 Video Timing Poll"

- Select the statements that are TRUE:
  - A higher clock frequency will improve throughput and latency.
  - Prof. Schwertfeger likes to drive FAST on the German highways.
  - Schwerin is a beautiful city.
  - Our CPU cycle time depends on the setup time of only the PC.