

# Integer Addition

- Example:  $7 + 6$



- Ripple carry adder







a.



b.





| ALU Control line | Function         |
|------------------|------------------|
| 000              | and              |
| ALU Control line | Function         |
| 0000             | and              |
| 0001             | or               |
| 0010             | add              |
| 0110             | subtract         |
| 0111             | Set on less than |
| 1100             | nor              |



# Introduction

- CPU performance factors
  - Instruction count
    - Determined by ISA and compiler
  - CPI and Cycle time
    - Determined by CPU hardware
- We will examine two MIPS implementations
  - A simplified version
  - A more realistic pipelined version
- Simple subset, shows most aspects
  - Memory reference: l w, sw
  - Arithmetic/logical: add, sub, and, or, sll t
  - Control transfer: beq, j

# Instruction Execution

- PC → instruction memory, fetch instruction
- Register numbers → register file, read registers
- Depending on instruction class
  - Use ALU to calculate
    - Arithmetic result
    - Memory address for load/store
    - Branch target address
  - Access data memory for load/store
  - $\text{PC} \leftarrow \text{target address or } \text{PC} + 4$

# CPU Overview



# Multiplexers



- Can't just join wires together
  - Use multiplexers

# Control



# Logic Design Basics

- Information encoded in binary
  - Low voltage = 0, High voltage = 1
  - One wire per bit
  - Multi-bit data encoded on multi-wire buses
- Combinational element
  - Operate on data
  - Output is a function of input
- State (sequential) elements
  - Store information

# Combinational Elements

- AND-gate
  - $Y = A \& B$



- Multiplexer
  - $Y = S ? I_1 : I_0$



- Adder
  - $Y = A + B$



- Arithmetic/Logic Unit
  - $Y = F(A, B)$



# Sequential Elements

- Register: stores data in a circuit
  - Uses a clock signal to determine when to update the stored value
  - Edge-triggered: update when Clk changes from 0 to 1



# Sequential Elements

- Register with write control
  - Only updates on clock edge when write control input is 1
  - Used when stored value is required later



# Clocking Methodology

- Combinational logic transforms data during clock cycles
  - Between clock edges
  - Input from state elements, output to state element
  - Longest delay determines clock period



# Building a Datapath

- Datapath
  - Elements that process data and addresses in the CPU
    - Registers, ALUs, mux's, memories, ...
- We will build a MIPS datapath incrementally
  - Refining the overview design

# Instruction Fetch



# R-Format Instructions

- Read two register operands
- Perform arithmetic/logical operation
- Write register result



# Register-Register Timing



# Load/Store Instructions

- Read register operands
- Calculate address using 16-bit offset
  - Use ALU, but sign-extend offset
- Load: Read memory and update register
- Store: Write register value to memory





# Branch Instructions

- Read register operands
- Compare operands
  - Use ALU, subtract and check Zero output
- Calculate target address
  - Sign-extend displacement
  - Shift left 2 places (word displacement)
  - Add to PC + 4
    - Already calculated by instruction fetch

# Branch Instructions



# Composing the Elements

- First-cut data path does an instruction in one clock cycle
  - Each datapath element can only do one function at a time
  - Hence, we need separate instruction and data memories
- Use multiplexers where alternate data sources are used for different instructions

# R-Type/Load/Store Datapath



# Full Datapath



## Step 4: Given Datapath: RTL -> Control





**FIGURE 4.15** The datapath of Figure 4.12 with all necessary multiplexors and all control lines identified. The control lines are shown in color. The ALU control block has also been added. The PC does not require a write control, since it is written once at the end of every clock cycle; the branch control logic determines whether it is written with the incremented PC or the branch target address. Copyright © 2009 Elsevier, Inc. All rights reserved.

| Signal name | Effect when deasserted                                                                       | Effect when asserted                                                                                    |
|-------------|----------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------|
| RegDst      | The register destination number for the Write register comes from the rt field (bits 20:16). | The register destination number for the Write register comes from the rd field (bits 15:11).            |
| RegWrite    | None.                                                                                        | The register on the Write register input is written with the value on the Write data input.             |
| ALUSrc      | The second ALU operand comes from the second register file output (Read data 2).             | The second ALU operand is the sign-extended, lower 16 bits of the instruction.                          |
| PCSrc       | The PC is replaced by the output of the adder that computes the value of PC + 4.             | The PC is replaced by the output of the adder that computes the branch target.                          |
| MemRead     | None.                                                                                        | Data memory contents designated by the address input are put on the Read data output.                   |
| MemWrite    | None.                                                                                        | Data memory contents designated by the address input are replaced by the value on the Write data input. |
| MemtoReg    | The value fed to the register Write data input comes from the ALU.                           | The value fed to the register Write data input comes from the data memory.                              |

**FIGURE 5.16 The effect of each of the seven control signals.** When the 1-bit control to a two-way multiplexor is asserted, the multiplexor selects the input corresponding to 1. Otherwise, if the control is deasserted, the multiplexor selects the 0 input. Remember that the state elements all have the clock as an implicit input and that the clock is used in controlling writes. The clock is never gated externally to a state element, since this can create timing problems. (See [Appendix B](#) for further discussion of this problem.)

# ALU Control

- ALU used for
  - Load/Store: F = add
  - Branch: F = subtract
  - R-type: F depends on funct field

| ALU control | Function         |
|-------------|------------------|
| 0000        | AND              |
| 0001        | OR               |
| 0010        | add              |
| 0110        | subtract         |
| 0111        | set-on-less-than |
| 1100        | NOR              |

# ALU Control

- Assume 2-bit ALUOp derived from opcode
  - Combinational logic derives ALU control

| opcode | ALUOp | Operation        | funct  | ALU function     | ALU control |
|--------|-------|------------------|--------|------------------|-------------|
| lw     | 00    | load word        | XXXXXX | add              | 0010        |
| sw     | 00    | store word       | XXXXXX | add              | 0010        |
| beq    | 01    | branch equal     | XXXXXX | subtract         | 0110        |
| R-type | 10    | add              | 100000 | add              | 0010        |
|        |       | subtract         | 100010 | subtract         | 0110        |
|        |       | AND              | 100100 | AND              | 0000        |
|        |       | OR               | 100101 | OR               | 0001        |
|        |       | set-on-less-than | 101010 | set-on-less-than | 0111        |

| ALUOp  |        | Funct field |    |    |    |    |    |      | Operation |
|--------|--------|-------------|----|----|----|----|----|------|-----------|
| ALUOp1 | ALUOp0 | F5          | F4 | F3 | F2 | F1 | F0 |      |           |
| 0      | 0      | X           | X  | X  | X  | X  | X  | 0010 |           |
| 0      | 1      | X           | X  | X  | X  | X  | X  | 0110 |           |
| 1      | 0      | X           | X  | 0  | 0  | 0  | 0  | 0010 |           |
| 1      | X      | X           | X  | 0  | 0  | 1  | 0  | 0110 |           |
| 1      | 0      | X           | X  | 0  | 1  | 0  | 0  | 0000 |           |
| 1      | 0      | X           | X  | 0  | 1  | 0  | 1  | 0001 |           |
| 1      | X      | X           | X  | 1  | 0  | 1  | 0  | 0111 |           |

**FIGURE 4.13 The truth table for the 4 ALU control bits (called Operation).** The inputs are the ALUOp and function code field. Only the entries for which the ALU control is asserted are shown. Some don't-care entries have been added. For example, the ALUOp does not use the encoding 11, so the truth table can contain entries 1X and X1, rather than 10 and 01. Note that when the function field is used, the first 2 bits (F5 and F4) of these instructions are always 10, so they are don't-care terms and are replaced with XX in the truth table. Copyright © 2009 Elsevier, Inc. All rights reserved.

# The Main Control Unit

- Control signals derived from instruction



# Datapath With Control



# R-Type Instruction



# Load Instruction



# Branch-on-Equal Instruction



# Implementing Jumps



- Jump uses word address
- Update PC with concatenation of
  - Top 4 bits of old PC
  - 26-bit jump address
  - 00
- Need an extra control signal decoded from opcode

# Datapath With Jumps Added



# Performance Issues

- Longest delay determines clock period
  - Critical path: load instruction
  - Instruction memory → register file → ALU → data memory → register file
- Not feasible to vary period for different instructions
- Violates design principle
  - Making the common case fast
- We will improve performance by multicycle or pipelining

| Instruction | RegDst | ALUSrc | Memto-Reg | Reg-Write | Mem-Read | Mem-Write | Branch | ALUOp1 | ALUOp0 |
|-------------|--------|--------|-----------|-----------|----------|-----------|--------|--------|--------|
| R-format    | 1      | 0      | 0         | 1         | 0        | 0         | 0      | 1      | 0      |
| lw          | 0      | 1      | 1         | 1         | 1        | 0         | 0      | 0      | 0      |
| sw          | X      | 1      | X         | 0         | 0        | 1         | 0      | 0      | 0      |
| beq         | X      | 0      | X         | 0         | 0        | 0         | 1      | 0      | 1      |

**FIGURE 4.18 The setting of the control lines is completely determined by the opcode fields of the instruction.** The first row of the table corresponds to the R-format instructions (add, sub, AND, OR, and slt). For all these instructions, the source register fields are rs and rt, and the destination register field is rd; this defines how the signals ALUSrc and RegDst are set. Furthermore, an R-type instruction writes a register (RegWrite = 1), but neither reads nor writes data memory. When the Branch control signal is 0, the PC is unconditionally replaced with PC + 4; otherwise, the PC is replaced by the branch target if the Zero output of the ALU is also high. The ALUOp field for R-type instructions is set to 10 to indicate that the ALU control should be generated from the funct field. The second and third rows of this table give the control signal settings for lw and sw. These ALUSrc and ALUOp fields are set to perform the address calculation. The MemRead and MemWrite are set to perform the memory access. Finally, RegDst and RegWrite are set for a load to cause the result to be stored into the rt register. The branch instruction is similar to an R-format operation, since it sends the rs and rt registers to the ALU. The ALUOp field for branch is set for a subtract (ALU control = 01), which is used to test for equality. Notice that the MemtoReg field is irrelevant when the RegWrite signal is 0: since the register is not being written, the value of the data on the register data write port is not used. Thus, the entry MemtoReg in the last two rows of the table is replaced with X for don't care. Don't cares can also be added to RegDst when RegWrite is 0. This type of don't care must be added by the designer, since it depends on knowledge of how the datapath works. Copyright © 2009 Elsevier, Inc. All rights reserved.

# Multicycle datapath



**FIGURE 5.25 The high-level view of the multicycle datapath.** This picture shows the key elements of the datapath: a shared memory unit, a single ALU shared among instructions, and the connections among these shared units. The use of shared functional units requires the addition or widening of multiplexors as well as new temporary registers that hold data between clock cycles of the same instruction. The additional registers are the Instruction register (IR), the Memory data register (MDR), A, B, and ALUOut.



**FIGURE 5.26 Multicycle datapath for MIPS handles the basic instructions.** Although this datapath supports normal incrementing of the PC, a few more connections and a multiplexor will be needed for branches and jumps; we will add these shortly. The additions versus the single-clock datapath include several registers (IR, MDR, A, B, ALUOut), a multiplexor for the memory address, a multiplexor for the top ALU input, and expanding the multiplexor on the bottom ALU input into a four-way selector. These small additions allow us to remove two adders and a memory unit.



**FIGURE 5.27 The multicycle datapath from Figure 5.26 with the control lines shown.** The signals **ALUOp** and **ALUSrcB** are 2-bit control signals, while all the other control lines are 1-bit signals. Neither register A nor B requires a write signal, since their contents are only read on the cycle immediately after it is written. The memory data register has been added to hold the data from a load when the data returns from memory. Data from a load returning from memory cannot be written directly into the register file since the clock cycle cannot accommodate the time required for both the memory access and the register file write. The **MemRead** signal has been moved to the top of the memory unit to simplify the figures. The full set of datapaths and control lines for branches will be added shortly.



**FIGURE 5.28 The complete datapath for the multicycle implementation together with the necessary control lines.** The control lines of Figure 5.27 are attached to the control unit, and the control and datapath elements needed to effect changes to the PC are included. The major additions from Figure 5.27 include the multiplexor used to select the source of a new PC value; gates used to combine the PC write signals; and the control signals PCSource, PCWrite, and PCWriteCond. The PCWriteCond signal is used to decide whether a conditional branch should be taken. Support for jumps is included.

### Actions of the 1-bit control signals

| Signal name | Effect when deasserted                                                               | Effect when asserted                                                                                                  |
|-------------|--------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------|
| RegDst      | The register file destination number for the Write register comes from the rt field. | The register file destination number for the Write register comes from the rd field.                                  |
| RegWrite    | None.                                                                                | The general-purpose register selected by the Write register number is written with the value of the Write data input. |
| ALUSrcA     | The first ALU operand is the PC.                                                     | The first ALU operand comes from the A register.                                                                      |
| MemRead     | None.                                                                                | Content of memory at the location specified by the Address input is put on Memory data output.                        |
| MemWrite    | None.                                                                                | Memory contents at the location specified by the Address input is replaced by value on Write data input.              |
| MemtoReg    | The value fed to the register file Write data input comes from ALUOut.               | The value fed to the register file Write data input comes from the MDR.                                               |
| lorD        | The PC is used to supply the address to the memory unit.                             | ALUOut is used to supply the address to the memory unit.                                                              |
| IRWrite     | None.                                                                                | The output of the memory is written into the IR.                                                                      |
| PCWrite     | None.                                                                                | The PC is written; the source is controlled by PCSource.                                                              |
| PCWriteCond | None.                                                                                | The PC is written if the Zero output from the ALU is also active.                                                     |

### Actions of the 2-bit control signals

| Signal name | Value (binary) | Effect                                                                                                                          |
|-------------|----------------|---------------------------------------------------------------------------------------------------------------------------------|
| ALUOp       | 00             | The ALU performs an add operation.                                                                                              |
|             | 01             | The ALU performs a subtract operation.                                                                                          |
|             | 10             | The funct field of the instruction determines the ALU operation.                                                                |
| ALUSrcB     | 00             | The second input to the ALU comes from the B register.                                                                          |
|             | 01             | The second input to the ALU is the constant 4.                                                                                  |
|             | 10             | The second input to the ALU is the sign-extended, lower 16 bits of the IR.                                                      |
|             | 11             | The second input to the ALU is the sign-extended, lower 16 bits of the IR shifted left 2 bits.                                  |
| PCSource    | 00             | Output of the ALU ( $PC + 4$ ) is sent to the PC for writing.                                                                   |
|             | 01             | The contents of ALUOut (the branch target address) are sent to the PC for writing.                                              |
|             | 10             | The jump target address ( $IR[25:0]$ shifted left 2 bits and concatenated with $PC + 4[31:28]$ ) is sent to the PC for writing. |

**FIGURE 5.29 The action caused by the setting of each control signal in Figure 5.28 on page 323.** The top table describes the 1-bit control signals, while the bottom table describes the 2-bit signals. Only those control lines that affect multiplexors have an action when they are deasserted. This information is similar to that in Figure 5.16 on page 306 for the single-cycle datapath, but adds several new control lines (IRWrite, PCWrite, PCWriteCond, ALUSrcB, and PCSource) and removes control lines that are no longer used or have been replaced (PCSrc, Branch, and Jump).

| Step name                                              | Action for R-type instructions              | Action for memory-reference instructions                                                                                                            | Action for branches                       | Action for jumps                                          |
|--------------------------------------------------------|---------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------|-----------------------------------------------------------|
| Instruction fetch                                      |                                             | $IR \leftarrow \text{Memory}[PC]$<br>$PC \leftarrow PC + 4$                                                                                         |                                           |                                                           |
| Instruction decode/register fetch                      |                                             | $A \leftarrow \text{Reg } [IR[25:21]]$<br>$B \leftarrow \text{Reg } [IR[20:16]]$<br>$ALUOut \leftarrow PC + (\text{sign-extend } (IR[15:0]) \ll 2)$ |                                           |                                                           |
| Execution, address computation, branch/jump completion | $ALUOut \leftarrow A \text{ op } B$         | $ALUOut \leftarrow A + \text{sign-extend } (IR[15:0])$                                                                                              | if ( $A == B$ )<br>$PC \leftarrow ALUOut$ | $PC \leftarrow \{\text{PC } [31:28], (IR[25:0]), 2'b00\}$ |
| Memory access or R-type completion                     | $\text{Reg } [IR[15:11]] \leftarrow ALUOut$ | Load: $MDR \leftarrow \text{Memory}[ALUOut]$<br>or<br>Store: $\text{Memory } [ALUOut] \leftarrow B$                                                 |                                           |                                                           |
| Memory read completion                                 |                                             | Load: $\text{Reg}[IR[20:16]] \leftarrow MDR$                                                                                                        |                                           |                                                           |

**FIGURE 5.30 Summary of the steps taken to execute any instruction class.** Instructions take from three to five execution steps. The first two steps are independent of the instruction class. After these steps, an instruction takes from one to three more cycles to complete, depending on the instruction class. The empty entries for the Memory access step or the Memory read completion step indicate that the particular instruction class takes fewer cycles. In a multicycle implementation, a new instruction will be started as soon as the current instruction completes, so these cycles are not idle or wasted. As mentioned earlier, the register file actually reads every cycle, but as long as the IR does not change, the values read from the register file are identical. In particular, the value read into register B during the Instruction decode stage, for a branch or R-type instruction, is the same as the value stored into B during the Execution stage and then used in the Memory access stage for a store word instruction.



**FIGURE 5.31 The high-level view of the finite state machine control.** The first steps are independent of the instruction class; then a series of sequences that depend on the instruction opcode are used to complete each instruction class. After completing the actions needed for that instruction class, the control returns to fetch a new instruction. Each box in this figure may represent one to several states. The arc labeled *Start* marks the state in which to begin when the first instruction is to be fetched.



**FIGURE 5.32 The instruction fetch and decode portion of every instruction is identical.** These states correspond to the top box in the abstract finite state machine in Figure 5.31. In the first state we assert two signals to cause the memory to read an instruction and write it into the Instruction register (MemRead and IRWrite), and we set IorD to 0 to choose the PC as the address source. The signals ALUSrcA, ALUSrcB, ALUOp, PCWrite, and PCSource are set to compute PC + 4 and store it into the PC. (It will also be stored into ALUOut, but never used from there.) In the next state, we compute the branch target address by setting ALUSrcB to 11 (causing the shifted and sign-extended lower 16 bits of the IR to be sent to the ALU), setting ALUSrcA to 0 and ALUOp to 00; we store the result in the ALUOut register, which is written on every cycle. There are four next states that depend on the class of the instruction, which is known during this state. The control unit input, called Op, is used to determine which of these arcs to follow. Remember that all signals not explicitly asserted are deasserted; this is particularly important for signals that control writes. For multiplexors controls, lack of a specific setting indicates that we do not care about the setting of the multiplexor.



**FIGURE 5.33 The finite state machine for controlling memory-reference instructions has four states.** These states correspond to the box labeled “Memory access instructions” in Figure 5.31. After performing a memory address calculation, a separate sequence is needed for load and for store. The setting of the control signals ALUSrcA, ALUSrcB, and ALUOp is used to cause the memory address computation in state 2. Loads require an extra state to write the result from the MDR (where the result is written in state 3) into the register file.



**FIGURE 5.34 R-type instructions can be implemented with a simple two-state finite state machine.** These states correspond to the box labeled “R-type instructions” in Figure 5.31. The first state causes the ALU operation to occur, while the second state causes the ALU result (which is in ALUOut) to be written in the register file. The three signals asserted during state 7 cause the contents of ALUOut to be written into the register file in the entry specified by the rd field of the Instruction register.



**FIGURE 5.35 The branch instruction requires a single state.** The first three outputs that are asserted cause the ALU to compare the registers (ALUSrcA, ALUSrcB, and ALUOp), while the signals PCSOURCE and PCWriteCond perform the conditional write if the branch condition is true. Notice that we do not use the value written into ALUOut; instead, we use only the Zero output of the ALU. The branch target address is read from ALUOut, where it was saved at the end of state 1.



---

**FIGURE 5.36** The jump instruction requires a single state that asserts two control signals to write the PC with the lower 26 bits of the Instruction register shifted left 2 bits and concatenated to the upper 4 bits of the PC of this instruction.



**FIGURE 5.38 The complete finite state machine control for the datapath shown in Figure 5.28.** The labels on the arcs are conditions that are tested to determine which state is the next state; when the next state is unconditional, no label is given. The labels inside the nodes indicate the output signals asserted during that state; we always specify the setting of a multiplexor control signal if the correct operation requires it. Hence, in some states a multiplexor control will be set to 0.

Initial Representation

Finite state diagram

Microprogram

Sequencing Ctrl

Explicit Next state  
Function

Microprogram Counter  
+ Dispatch ROMs

Logic Representation

Logic Equations

Truth table

Implementation  
Technique

PLA

ROM

Hardwired control

Microprogrammed  
control





**FIGURE 5.37 Finite state machine controllers are typically implemented using a block of combinational logic and a register to hold the current state.** The outputs of the combinational logic are the next-state number and the control signals to be asserted for the current state. The inputs to the combinational logic are the current state and any inputs used to determine the next state. In this case, the inputs are the instruction register opcode bits. Notice that in the finite state machine used in this chapter, the outputs depend only on the current state, not on the inputs. The Elaboration above explains this in more detail.



**FIGURE C.4.1 The control unit using an explicit counter to compute the next state.** In this control unit, the next state is computed using a counter (at least in some states). By comparison, Figure C.3.2 on page C-10 encodes the next state in the control logic for every state. In this control unit, the signals labeled *AddrCtl* control how the next state is determined.

| <b>AddrCtl value</b> | <b>Action</b>             |
|----------------------|---------------------------|
| 0                    | Set state to 0            |
| 1                    | Dispatch with ROM 1       |
| 2                    | Dispatch with ROM 2       |
| 3                    | Use the incremented state |



**FIGURE C.4.2** This is the address select logic for the control unit of Figure C.4.1.

| Dispatch ROM 1 |             |       | Dispatch ROM 2 |             |       |
|----------------|-------------|-------|----------------|-------------|-------|
| Op             | Opcode name | Value | Op             | Opcode name | Value |
| 000000         | R-format    | 0110  | 100011         | lw          | 0011  |
| 000010         | jmp         | 1001  | 101011         | sw          | 0101  |
| 000100         | beq         | 1000  |                |             |       |
| 100011         | lw          | 0010  |                |             |       |
| 101011         | sw          | 0010  |                |             |       |

**FIGURE C.4.3 The dispatch ROMs each have  $2^6 = 64$  entries that are 4 bits wide, since that is the number of bits in the state encoding.** This figure only shows the entries in the ROM that are of interest for this subset. The first column in each table indicates the value of Op, which is the address used to access the dispatch ROM. The second column shows the symbolic name of the opcode. The third column indicates the value at that address in the ROM.

| <b>State number</b> | <b>Address-control action</b> | <b>Value of AddrCtl</b> |
|---------------------|-------------------------------|-------------------------|
| 0                   | Use incremented state         | 3                       |
| 1                   | Use dispatch ROM 1            | 1                       |
| 2                   | Use dispatch ROM 2            | 2                       |
| 3                   | Use incremented state         | 3                       |
| 4                   | Replace state number by 0     | 0                       |
| 5                   | Replace state number by 0     | 0                       |
| 6                   | Use incremented state         | 3                       |
| 7                   | Replace state number by 0     | 0                       |
| 8                   | Replace state number by 0     | 0                       |
| 9                   | Replace state number by 0     | 0                       |

**FIGURE C.4.4** The values of the address-control lines are set in the control word that corresponds to each state.

| <b>State number</b> | <b>Control word bits 17–2</b> | <b>Control word bits 1–0</b> |
|---------------------|-------------------------------|------------------------------|
| 0                   | 1001010000001000              | 11                           |
| 1                   | 0000000000011000              | 01                           |
| 2                   | 0000000000010100              | 10                           |
| 3                   | 0011000000000000              | 11                           |
| 4                   | 0000001000000010              | 00                           |
| 5                   | 0010100000000000              | 00                           |
| 6                   | 0000000001000100              | 11                           |
| 7                   | 0000000000000011              | 00                           |
| 8                   | 0100000010100100              | 00                           |
| 9                   | 1000000100000000              | 00                           |

**FIGURE C.4.5 The contents of the control memory for an implementation using an explicit counter.** The first column shows the state, while the second shows the datapath control bits, and the last column shows the address-control bits in each control word. Bits 17–2 are identical to those in Figure C.3.7.



**FIGURE C.4.6 The control unit as a microcode.** The use of the word “micro” serves to distinguish between the program counter in the datapath and the microprogram counter, and between the microcode memory and the instruction memory.

- Design a Microinstruction set
  - Start with list of control signals
  - Group signal together that make sense (vs. random) → called field
  - Places fields in some logical order (e.g., ALUOperation and ALU operands first and microinstruction sequencing last)
  - Create a symbolic legend for the microinstruction format, showing name of field values and how they set the control signals
    - Use computer to design computer
  - To minimize the width, encode operations that will never be used at the same time

| Field name       | Value        | Signals active                           | Comment                                                                                                                      |
|------------------|--------------|------------------------------------------|------------------------------------------------------------------------------------------------------------------------------|
| ALU control      | Add          | ALUOp = 00                               | Cause the ALU to add.                                                                                                        |
|                  | Subt         | ALUOp = 01                               | Cause the ALU to subtract; this implements the compare for branches.                                                         |
|                  | Func code    | ALUOp = 10                               | Use the instruction's function code to determine ALU control.                                                                |
| SRC1             | PC           | ALUSrcA = 0                              | Use the PC as the first ALU input.                                                                                           |
|                  | A            | ALUSrcA = 1                              | Register A is the first ALU input.                                                                                           |
| SRC2             | B            | ALUSrcB = 00                             | Register B is the second ALU input.                                                                                          |
|                  | 4            | ALUSrcB = 01                             | Use 4 as the second ALU input.                                                                                               |
|                  | Extend       | ALUSrcB = 10                             | Use output of the sign extension unit as the second ALU input.                                                               |
|                  | Extshft      | ALUSrcB = 11                             | Use the output of the shift-by-two unit as the second ALU input.                                                             |
| Register control | Read         |                                          | Read two registers using the rs and rt fields of the IR as the register numbers and putting the data into registers A and B. |
|                  | Write ALU    | RegWrite,<br>RegDst = 1, MemtoReg = 0    | Write a register using the rd field of the IR as the register number and the contents of ALUOut as the data.                 |
|                  | Write MDR    | RegWrite,<br>RegDst = 0,<br>MemtoReg = 1 | Write a register using the rt field of the IR as the register number and the contents of the MDR as the data.                |
| Memory           | Read PC      | MemRead,<br>lOrD = 0,<br>IRWrite         | Read memory using the PC as address; write result into IR (and the MDR).                                                     |
|                  | Read ALU     | MemRead,<br>lOrD = 1                     | Read memory using ALUOut as address; write result into MDR.                                                                  |
|                  | Write ALU    | MemWrite,<br>lOrD = 1                    | Write memory using the ALUOut as address, contents of B as the data.                                                         |
| PC write control | ALU          | PCSource = 00,<br>PCWrite                | Write the output of the ALU into the PC.                                                                                     |
|                  | ALUOut-cond  | PCSource = 01,<br>PCWriteCond            | If the Zero output of the ALU is active, write the PC with the contents of the register ALUOut.                              |
|                  | jump address | PCSource = 10, PCWrite                   | Write the PC with the jump address from the instruction.                                                                     |
| Sequencing       | Seq          | AddrCtl = 11                             | Choose the next microinstruction sequentially.                                                                               |
|                  | Fetch        | AddrCtl = 00                             | Go to the first microinstruction to begin a new instruction.                                                                 |
|                  | Dispatch 1   | AddrCtl = 01                             | Dispatch using the ROM 1.                                                                                                    |
|                  | Dispatch 2   | AddrCtl = 10                             | Dispatch using the ROM 2.                                                                                                    |

**FIGURE C.5.1 Each microcode field translates to a set of control signals to be set.** This table specifies a value for each of the fields that were originally specified in Figure 5.7.3 on page 5.7-10. These 22 different values of the fields specify all the required combinations of the 18 control lines. Control lines that are not set which correspond to actions are 0 by default. Multiplexor control lines are set to 0 if the output matters. If a multiplexor control line is not explicitly set, its output is a don't care and is not used.

| Label    | ALU control | SRC1 | SRC2    | Register control | Memory    | PCWrite control | Sequencing |
|----------|-------------|------|---------|------------------|-----------|-----------------|------------|
| Fetch    | Add         | PC   | 4       |                  | Read PC   | ALU             | Seq        |
|          | Add         | PC   | Extshft | Read             |           |                 | Dispatch 1 |
| Mem1     | Add         | A    | Extend  |                  |           |                 | Dispatch 2 |
| LW2      |             |      |         |                  | Read ALU  |                 | Seq        |
|          |             |      |         | Write MDR        |           |                 | Fetch      |
| SW2      |             |      |         |                  | Write ALU |                 | Fetch      |
| Rformat1 | Func code   | A    | B       |                  |           |                 | Seq        |
|          |             |      |         | Write ALU        |           |                 | Fetch      |
| BEQ1     | Subt        | A    | B       |                  |           | ALUOut-cond     | Fetch      |
| JUMP1    |             |      |         |                  |           | Jump address    | Fetch      |

**Figure 5.46 The microprogram for the control unit.** Recall that the labels are used to determine the targets for the dispatch operations. Dispatch 1 does a jump based on teh IR to a label ending with a 1, while Diapatch 2 does a jump based on the IR to a label ending with 2.

| Microcode dispatch table 1 |             |          | Microcode dispatch table 2 |             |       |
|----------------------------|-------------|----------|----------------------------|-------------|-------|
| Opcode field               | Opcode name | Value    | Opcode field               | Opcode name | Value |
| 000000                     | R-format    | Rformat1 | 100011                     | LW          | LW2   |
| 000010                     | jmp         | JUMP1    | 101011                     | SW          | SW2   |
| 000100                     | beq         | BEQ1     |                            |             |       |
| 100011                     | LW          | Mem1     |                            |             |       |
| 101011                     | SW          | Mem1     |                            |             |       |

**FIGURE C.5.2** The two microcode dispatch ROMs showing the contents in symbolic form and using the labels in the microprogram.

- Microprogramming Pros and Cons
  - Ease of design
  - Flexibility
    - Easy to adapt to changes in organization, timing, technology
    - Can make change later in design cycle, or even in the field
  - Can implement very powerful instruction sets
  - Generality
    - Can implement multiple instruction sets on same machine
    - Can tailor instruction set to application
  - Compatibility
    - Many organization, same instruction set
  - Costly to implement
  - slow

# Exception

- 2 types of exception
  - Interrupts
    - Caused by external event
    - Asynchronous to program execution
    - May be handled between instruction
    - Simply suspend and resume user program
  - Trap
  - Caused by internal events
    - Exceptional condition (overflow)
    - Errors (parity)
    - Faults(non-resident page)
  - Synchronous to program execution
  - Condition must be remedied by the handler
  - Instruction may be retried or simulated and program continued or program may be aborted
- MIPS → unexpected change in control flow without distinguishing internal or external

# Exception

- MIPS → support exception
  - EPC → 32 bit register used to hold the address of the affected instruction → PC not PC+4
  - Cause → register used to record the cause the exception



**FIGURE 5.39 The multicycle datapath with the addition needed to implement exceptions.** The specific additions include the Cause and EPC registers, a multiplexor to control the value sent to the Cause register, an expansion of the multiplexor controlling the value written into the PC, and control lines for the added multiplexor and registers. For simplicity, this figure does not show the ALU overflow signal, which would need to be stored in a one-bit register and delivered as an additional input to the control unit (see Figure 5.40 to see how it is used).



**FIGURE 5.40 This shows the finite state machine with the additions to handle exception detection.** States 10 and 11 are the new states that generate the appropriate control for exceptions. The branch out of state 1 labeled  $(Op = \text{other})$  indicates the next state when the input does not match the opcode of any of  $\text{lw}$ ,  $\text{sw}$ ,  $0$  (R-type),  $j$ , or  $\text{beq}$ . The branch out of state 7 labeled *Overflow* indicates the action to be taken when the ALU signals an overflow.