

This document is a PDF version of old exam questions by topic, ordered from least difficulty to greatest difficulty.

**Questions:**

- Fall 2019 Midterm Q6
- Summer 2018 Midterm 2 Q3
- Summer 2019 Midterm 2 Q4
- Fall 2019 Final Q6
- Fall 2017 Final Q6
- Spring 2018 Final Q8
- Spring 2018 Midterm 2 Q3
- Fall 2018 Final F3A+F3B

**Q6) comp a0, RISC-V, <3 (18 pts = 5\*1 + 7\*1 + 4 + 2)**

You add a new R-Type *signed* compare instruction called **comp**, into the RISC-V single-cycle datapath, to compare  $R[rs1]$  and  $R[rs2]$  and set  $R[rd]$  appropriately. The RTL for it is shown on the right.

**comp rd, rs1, rs2**

```
if R[rs1] > R[rs2]: R[rd] = 1
elif R[rs1] == R[rs2]: R[rd] = 0
else: do nothing
```



- a) You want to change the datapath to make this work. You start by adding two more inputs (0x00000000 and 0x00000001) to the rightmost WBSel MUX. What else is **required** to make this instruction work?

- True  False Modify Branch Comp
- True  False Modify Imm. Gen.
- True  False Modify the ALU and ALUSel control signals
- True  False Modify the control logic for RegWEn
- True  False Modify the control logic for MemWEn

The only change necessary on top of adding new inputs to the WBSel MUX is to change the logic for RegWEn so that for COMP instructions, it is 1 only if  $R[rs1] \geq R[rs2]$ .

- b) You realize you can also implement this with **NO** changes to the datapath! **From this point until the end of the page**, let's assume that's what we're going to do. Fill in the control signals for it. We did the first one, COMP, which is a new boolean variable within the control logic that is only set to 1 when we have a **comp** instruction.

|                 | COMP                                                          | PCSel                                                              | BrUn                                                          | BSel                                                          | ASel                                                          | ALUSel                                                                                           | MemRW                                                                | WBSel                                                                                           |
|-----------------|---------------------------------------------------------------|--------------------------------------------------------------------|---------------------------------------------------------------|---------------------------------------------------------------|---------------------------------------------------------------|--------------------------------------------------------------------------------------------------|----------------------------------------------------------------------|-------------------------------------------------------------------------------------------------|
| comp x1, x2, x3 | <input checked="" type="radio"/> 1<br><input type="radio"/> 0 | <input type="radio"/> ALU<br><input checked="" type="radio"/> PC+4 | <input type="radio"/> 1<br><input checked="" type="radio"/> 0 | <input type="radio"/> 1<br><input checked="" type="radio"/> 0 | <input type="radio"/> 1<br><input checked="" type="radio"/> 0 | <input type="radio"/> ADD<br><input type="radio"/> SUB<br><input checked="" type="radio"/> OTHER | <input checked="" type="radio"/> Read<br><input type="radio"/> Write | <input type="radio"/> PC+4<br><input checked="" type="radio"/> ALU<br><input type="radio"/> MEM |

- comp is not a branch or jump, so **PCSel** should be PC + 4 -- after comp, we always want to execute the next instruction in sequence

- **BrUn** should be 0, because in the case if  $R[rs1] > R[rs2]$ , we want to do a signed comparison. If we were only looking at  $R[rs1] == R[rs2]$ , it wouldn't matter what **BrUn** was
- We'll be doing the comparison in the ALU, so **ASel** and **BSel** should be 0, so that both inputs are registers
- **ALUSel** should be other, since we're adding a new operation to the ALU, which will output 0, 1, or don't care (when **comp** doesn't write) for the input  $R[rs1]$  and  $R[rs2]$
- **MemRW** should be read -- only operations that modify memory (e.g. store word) should have this set to 1 or Write
- **WBSel** should be ALU, since we're getting our modified output of 0 / 1 from the ALU Note: We are fine with outputting a 0 / 1 into the Regfile since we would disable the RegWEn if the comparison was less.

c) The control signal **RegWEn** can be represented by the Boolean expression “add+addi+sub+...” (where add is only 1 for add instructions, addi is only 1 for addi instructions, etc.). What new Boolean expression should we add (i.e., Boolean logic “or”) to the original **RegWEn** expression to handle the **comp** instruction? Select ONE.

COMP       COMP\*BrLT       COMP\*BrEq       COMP\*!BrLT       COMP\*!BrEq  
 COMP\*(!BrLT+!BrEq)       COMP\*(BrLT+!BrEq)       COMP\*(!BrLT+BrEq)       COMP\*(BrLT+BrEq)

A **comp** instruction writes back to the RegFile in two cases: (1) if  $R[rs1] > R[rs2]$  and (2) if  $R[rs1] == R[rs2]$ . This is essentially equivalent to when  $R[rs1]$  is NOT less than  $R[rs2]$ . This is where we get the answer **COMP\*!BrLT**.

For **comp** in particular, it's also the case that we always write when **BrEq** is true -- this means there's an additional (if less efficient) correct answer: **COMP\*(!BrLT+BrEq)**.

d) Select all of the stages of the datapath this instruction will use. Select all that apply.

Instruction fetch (IF)     Instruction decode (ID)     Execute (EX)     Memory (MEM)     Writeback (WB)

Every instruction uses the IF, ID, and EX stages -- every instruction must be fetched and decoded, and every instruction uses the ALU for some purpose, either to perform an arithmetic/logical operation or to calculate an address.

Only load and store instructions use the MEM stage; **comp** does not modify memory or retrieve data from memory, so this stage isn't needed.

Some instructions (specifically store and branch instructions) do not modify the RegFile; they have no destination register (**rd**). **comp** does have a destination register, and should use the WB stage.

### Question 3: Maddi-o and Lui-p (20 pts)

As discussed in lecture, the standard RISC-V approach for loading in a 32-bit immediate into a register is by using the following **lui-addi** pair of instructions:

```
lui    rd HIGH_IMM
addi   rd rd LOW_IMM
```

We will explore an alternative method of storing a 32-bit immediate into a register by implementing a new **U-type** instruction named **lui<sub>p</sub>**, which will load a 20-bit immediate into the upper 20 bits of the rd register and **preserve** the lower 12 bits of the rd register. This way, we can use an addi-lui<sub>p</sub> pair to load a 32-bit immediate into a register, as shown on the left. On the right is a description of what **lui<sub>p</sub>** does:

```
addi   rd x0 LOW_IMM
luip  rd HIGH_IMM
```

$$R[rd] = R[rd] [11:0] + \{imm, 12b'0\}$$

Below is the existing RISC-V datapath as presented in lecture, except there are two boxes labeled "A" and "B" which are areas in the datapath that you will fill in with the necessary hardware to implement **lui<sub>p</sub>**.



For Parts A and B, please **clearly circle** the correct hardware that would fill their respective dashed boxes above. For Part C, **circle** all the ALUSel operations that could be used by the **lui<sub>p</sub>** instruction in the ALU unit.



**C.**

|     |      |
|-----|------|
| AND |      |
| ADD | SUB  |
| XOR | BSEL |

D. Fill out the following control signals that must be generated by the control unit in order to correctly implement **lui**. Fill in the bubble for “X” if the exact value of the control signal does not matter (e.g. a value of ① or ② is incorrect when the signal could instead be X).

- |                                  |                                 |                              |
|----------------------------------|---------------------------------|------------------------------|
| ① Signal = 0<br>② Write Disabled | ① Signal = 1<br>② Write Enabled | ② Signal = 2<br>X Don't Care |
|----------------------------------|---------------------------------|------------------------------|

| lui | PCSel | RegWEn | BrUn  | ASel  | BSel  | MemRW | WBSel   |
|-----|-------|--------|-------|-------|-------|-------|---------|
| 1   | ① ② X | ② X X  | ① ② X | ① ② X | ① ② X | ② X X | ① ② ③ X |

E. Now that we've added **lui** to our instruction set architecture, we need to give it an opcode. We now have 3 U-type instructions: **auipc**, **lui**, and **lui**. The opcode for auipc is 0b001 0111 and the opcode for lui is 0b011 0111. For this question, we refer to the most significant (leftmost) opcode bit as bit 6 and the least significant (rightmost) opcode bit as bit 0.

You are given the following truth table whose two inputs are two of the opcode bits, and the output is which U-type instruction is being identified by the two opcode bits. Fill out all possible opcode bits that could be the input to the truth table in order to generate the specified outputs (**an output of X means there is no defined instruction with that opcode in our ISA**).

(Hint: the other five bits of the opcode are compared to a **constant** value which identifies the opcode as being a U-type opcode—your task is figuring out which bits can identify which U-type instruction it is.)

| Opcode bit(s):<br>⑥ ⑤ ④ ③ ② ① ② | Opcode bit(s):<br>⑥ ⑤ ④ ③ ② ① ② | Instruction |
|---------------------------------|---------------------------------|-------------|
| 0                               | 0                               | X           |
| 0                               | 1                               | lui         |
| 1                               | 0                               | auipc       |
| 1                               | 1                               | lui         |

## Question 4: I'm afraid of datapaths, so iarrn away - 29 pts

Morgan notices much of the assembly code she writes involves iterating through arrays of integers. Instead of using several instructions to calculate the address of the next element, she proposes a new instruction,

iarrn rd rs1 rs2

which places into rd the address of the rs2-th element of the array pointed to by rs1. This instruction does not do bounds checking and it assumes the size of an integer is 4B (32 bits). Do *not* assume this instruction belongs to a specific type.

In verilog, the instruction is described as follows:

$$R[rd] = R[rs1] + (4 * R[rs2])$$

Morgan is interested in modifying our RISC-V datapath to support this instruction. Assume we have introduced a new control bit "IArrN" which is 1 when the current instruction is iarrn and 0 otherwise. Using the datapath below, fill in the following table with the rest of the control bits for this instruction. If the control bit can be set to "\*\*", please draw an X in the table below.



| IArrN | PCSel | RegWEn | MemRW | WBSel | BrUn | ALUSel |
|-------|-------|--------|-------|-------|------|--------|
| 1     | 0     | 1      | 0     | 1     | X    | ADD    |

Morgan notices this instruction involves changing a few hardware pieces on the datapath in addition to changing control bits above. She proposes modifying the ASel and BSel muxes, and their associated control bits (circled below).



(Questions on next page)

How should we change BSel to allow our new instruction, and **all other RISC-V instructions**, to execute correctly?

**(A) Option A**

**(B) Option B**

**(C) Option C**

**(D) Option D**

**(E) Option E**

NOTE: some options showcase original hardware from the datapath. If you believe no changes are necessary, you should select this option. Assume our ALU, RegFile, and memory units remain unchanged internally.



How should we change ASel to allow our new instruction, and all other RISC-V instructions, to execute correctly?

(A) Option A

(B) Option B

(C) Option C

(D) Option D

NOTE: some options showcase original hardware from the datapath. If you believe no changes are necessary, you should select this option. Assume our ALU, RegFile, and memory units remain unchanged internally.



**Q6) RISCV Exam-isim Debug – Single Cycle (12 pts = 2 x 6)**

For your CPU project, you followed the datapath diagram we gave you exactly and built a single-cycle CPU. However, something is not working correctly. *All instructions besides some of the I-types and SB-types are working.* You start by testing with an **addi a0 a0 -3** instruction. The **a0** register initially holds a value of **7** and all other registers initially hold **0**. This instruction is stored in IMEM at address **0x00000004**. DMEM reflects the initial IMEM. Undefined ImmSel outputs an I-type imm. You put a probe at the data read from IMEM and find the instruction is correct. You next put a probe at **wb**, and see the output is **0b0000\_0000\_0000\_0001\_0000\_0000\_0100 (0x00001004)**.

**a)** Since the output is incorrect, what errors alone would cause the erroneous behavior? (select all that apply)

- |                                                                                    |                                                                 |
|------------------------------------------------------------------------------------|-----------------------------------------------------------------|
| <input type="checkbox"/> The RegWEn is set to false.                               | <input type="checkbox"/> PCSel is set to PC + 4.                |
| <input checked="" type="checkbox"/> The Immediate Generator is not sign extending. | <input type="checkbox"/> The writeback MUX is selecting PC + 4. |
| <input type="checkbox"/> Read Reg1 and Read Reg2 are flipped.                      | <input type="checkbox"/> MemRW is set to write.                 |
| <input type="checkbox"/> The writeback MUX is selecting DMEM.                      | <input type="checkbox"/> BSel is selecting rs2 and not imm.     |

a) RegWEn does not change the state of wb since this is the only inst.

b) This is the issue!

c) If the two registers were flipped, we would get a different value since we would find we would be using a register with 0 in it thus would be getting back 0xFFC thus these are actually correct.

d) This is not correct since we would be getting the machine code of the current instruction since the correct address would be 0x4 which is this instruction. Thus this is not a root issue.

e) We would not be able to detect an issue here by looking at wb since this does not affect the wb of the current program.

f) If the write back mux was set to PC + 4, we would receive an output of 0x0000000c thus this is not correct either.

g) MemRW will not change the output of wb. Even if we were outputting the read of the address, it would be incorrect.

h) If BSel selected RS2, we would have selected t6 which we know has a value of 0 and would have had 0 + 5 which is not the result we got.

You fix that issue. You then test **beq x0 a0 label** but something is still not working. This instruction is at address **0xbfffff00** and **label** is at address **0xbfffff40**. The register **a0** holds **0** and all other registers hold **1**. Assume that we get the correct instruction machine code for **beq x0 a0 label** when we probe it.

You put a probe before the PC register and see this incorrect output: **0xbfffff20**.

*Note: All other instruction types are working.*

**b)** What errors alone would cause the erroneous behavior? (select all that apply)

- |                                                                        |                                                                                     |
|------------------------------------------------------------------------|-------------------------------------------------------------------------------------|
| <input type="checkbox"/> WBSel is incorrect.                           | <input checked="" type="checkbox"/> The ImmGen is not correctly padding w/ extra 0. |
| <input type="checkbox"/> ImmSel is Incorrect.                          | <input type="checkbox"/> The inputs to the ASel MUX are flipped.                    |
| <input type="checkbox"/> PCSel is set to PC + 4.                       | <input type="checkbox"/> The inputs to the BSel MUX are flipped.                    |
| <input type="checkbox"/> There is an error in the Read Data1/rs1 wire. | <input type="checkbox"/> Read Reg1 and Read Reg2 are flipped.                       |

a) We do not care about the WBSel since it does not change the input to the pc.

b) The immediate select seems to be correct as the other incorrect immediates are not equal to 0x20. (If you use the same bits of this instruction and interpret an immediate of other types, you will see none of them are 0x20).

c) We see the PC is not set to 0xbfffff04 thus PC + 4 was not taken.

d) The branch seems to be taken as we are not getting 0xbfffff04

e) Seems like the address is shifted left by one so this is an error.

f) Since we know all registers are 1, the output of the ALU is not possible if these are flipped.

g) Since we know the output of the ALU and that all registers are 1, we know BSel is correct.

h) Even if these were flipped, the branch should not be affected as we are doing an equality of the values.

## Q6: Game of Signals

The following canonical single-cycle datapath for a RISC-V architecture supports the base RISC-V ISA (RV32I).



For the following questions, we want to expand our instruction set to support some new instructions. For each of the proposed instructions below, choose ONE of the options below.

- A. Can be implemented without changing datapath wiring, only changes in control signals are needed.(i.e. change existing control signals to recognize the new instruction)
- B. Can be implemented, but needs changes in datapath wiring, only additional wiring, logical gates and muxes are needed.
- C. Can be implemented, but needs change in datapath wiring, and additional arithmetic units are needed (e.g. comparators, adders, shifters etc.).
- D. Cannot be implemented.

Note that the options from A to D gradually add complexity; thus, selecting B implies that A is not sufficient. You should select the option that changes the datapath the least (e.g. do not select C if B is sufficient). You can assume that necessary changes in the control signals will be made if the datapath wiring is changed.

- 1) Allowing software to deal with 2's complement is very prone to error. Instead, we want to implement the negate instruction, neg rd,rs1, which puts  $-R[rs1]$  in  $R[rd]$ .

**Ans: A. This is a tricky question! Notice neg doesn't use all available bits, so we could make neg rd, rs1 into a special R-type instruction neg rd, x0, rs1, such that the instruction does  $R[rd] = x0 - R[rs1]$ . Notice that subtraction is supported by our default datapath. So we only need to add the new control signal neg, which will produce the same ALUSel, Asel, Bsel, ... signals as sub does.**

- 2) Sometimes, it is necessary to allow a program to self-destruct. Implement segfault rs1, offset(rs2). This instruction compares the value in  $R[rs1]$  and the value in  $MEM[R[rs2]+offset]$ . If the two values are equal, write 0 into the PC; otherwise treat this instruction as a NOP.

**Ans: C. Need to 1) Add a comparator after memory unit and wire the output to PCSel. 2) Add a zero wired to mux before PC. 3) Change corresponding PCSel signal width.**

**Problem 8 [M2-4] Datapath (10 points)**

Recall the standard 5-stage, single cycle datapath contains stages for Instruction Fetch, Decode, Execute (ALU), Memory, and Write-back. Datapath designers are interested in reducing the phases necessary for execution such that instead of accessing both the Execute (ALU) phase and the Memory phase, instructions access either one or the other, but not both. This would create a 4-stage, single cycle datapath with the following stages: Instruction Fetch, Decode, Execute OR Memory, and Write-back.

| Instr Fetch | Instr Decode | Execute (ALU) | Memory | Write-back |
|-------------|--------------|---------------|--------|------------|
| 100ps       | 150ps        | 200ps         | 350ps  | 150ps      |

- (a) Given the table above and the described datapath above, what is the time it takes for a single instruction that utilizes all stages to execute on the typical 5-stage, single cycle datapath?

**Solution:**  $100 + 150 + 200 + 350 + 150 = 950\text{ps}$

- (b) What is the time it takes for a single instruction that utilizes all stages to execute on the new 4-stage, single cycle datapath?

**Solution:**  $100 + 150 + \text{MAX}(200, 350) + 150 = 750\text{ps}$

- (c) If the designers go ahead with this modification, which instructions will NOT function correctly? Why? Please limit your answer to two sentences or less.

**Solution:** Load and store instructions which use a non-zero offset will not function correctly because they require BOTH the ALU and Memory phase. The address must first be calculated before memory can be accessed.

- (d) Propose a program-level modification that will fix the issue. Do NOT propose a modification to the datapath. Please describe your modification in two sentences or less.

**Solution:** Instead of allowing load/store instructions with non-zero offsets to appear in code, the compiler (or the programmer) can expand these instructions into a load/store + addi pair. This way, the address is calculated by a separate instruction before the memory access takes place.

**Problem 3 Set ... If Zero**

(19 points)

We wish to introduce a new instruction into our single-cycle datapath. The instruction **SIZ** (set if zero) works as follows:

```
if (R[rs2] == 0):
    R[rd] = R[rs1]
```

Given the single cycle datapath below, select the correct modifications in parts (a) - (d) such that the datapath executes correctly for this new instruction (and all core instructions!). You can make the following assumptions:

- the **SIZ** signal is 1 if and only if the instruction is **SIZ**
- ALUSel is **ADD** when we have **SIZ** instruction.
- the immediate generator outputs **ZERO** when we have a **SIZ** instruction.



- (a) Consider the following modifications to the branch comparator inputs. Which configuration will allow this instruction to execute correctly without breaking the execution of other instructions in our instruction set?



**Solution:** Note that for this question we have two requirements: the modification we pick must support our new instruction AND it must make it so all other instructions (those in our core instruction set) continue to execute correctly as if no changes were made. If we look at the logic describing the instruction behaviour in the first part of the question, we notice we must compare the value in register rs2 to zero. Unlike normal branch instructions, we are not comparing to another register value; we are comparing to a constant. This eliminates choice A. Noting the conditions of our modification (that it must be able to support core instructions, too) we can eliminate D because it removes the ability to compare DataA and DataB which we need for regular branch instructions. We are left with options B and C. Again, if we revisit the instruction logic, we see the item we're comparing to zero is the value in rs2; this is equal to DataB. We therefore pick option B which adds a MUX on DataA to allow us to select 0 as our second operand in the case of a SIZ instruction.

- (b) Consider the following modifications to the ALU inputs. Which configuration will allow this instruction to execute correctly without breaking the execution of other instructions in our instruction set? Select the configuration that requires **minimum** modifications to the original datapath. Notice in the bottom left choice BSel is unused.



**Solution:** This modification must also support our new instruction while allowing other instructions to continue executing as normal. This section concerns inputs to our ALU—the output of which we will write. Looking again at the instruction logic, we can see the write we need to make is  $R[rd] = R[rs1]$ ; the value in register  $rs1$  should be written to the  $rd$  register. Therefore, the output of our ALU should be the value in register  $rs1$ . If we look at our datapath, this is already possible by manipulating existing controls ( $ASel$ ,  $BSel$ ), and so we do not need to make any modifications; A is the correct answer.

- (c) Consider the following modifications to the WB mux inputs. Which configuration will allow this instruction to execute correctly without breaking the execution of other instructions in our instruction set? Select the configuration that requires **minimum** modifications to the original datapath.



**Solution:** Similar to the previous question, we do not need to make a modification to the datapath and should therefore select option A. We know our ALU is emitting the value stored in register rs2. Because the standard datapath already allows us to write back the output of our ALU (and because write-back by default writes to our specified destination register rd), the value in rs2 can be written back to rd by setting our existing control bit WBSel to ALU. Again, no modifications are required.

- (d) Consider the following modifications to the RegWEn inputs. Which configuration will allow this instruction to execute correctly without breaking the execution of other instructions in our instruction set?



**Solution:** To answer this question we should look at the instruction logic to find out under what conditions the write occurs. Note that, in the SIZ instruction, we should only set  $R[rd] = R[rs1]$  in the case that  $R[rs2] == 0$  is true. In order to check that condition, we need to make sure of two things: (a) the instruction we're writing back for should be a SIZ instruction and (b) the result of the branch equality comparison should be true (BrEq). Because we want both of these to be true before we write, we use an AND gate. To preserve existing functionality, we also want to keep our RegWEn control bit around. Because the additional logic we added for the SIZ instruction will be false for all other instructions (add, load, etc.) we use an OR gate to support write-back functionality for our core instructions.

- (e) Given your selections above, decide the rest of the control signals for this instruction based on the diagram given at the beginning of the problem. Select X when a signal's value doesn't matter. You can assume:

- the SIZ signal is 1 if and only if the instruction is SIZ
- ALUSel is ADD when we have a SIZ instruction.
- the immediate generator outputs ZERO when we have a SIZ instruction.

1. PCSel:

1       0       X

**Solution:** Though this instruction is similar to other branch instructions in that it uses the branch comparator to check equality, it does not alter our control flow as a result, therefore we should select PC as PC+4 like we do normally.

2. RegWEn:

1 (Enable)       0 (Disable)       X

**Solution:** Our write to rd should only happen in the case that our if case is true. We added logic to support this in the previous question and, in order for that check to happen, we have to set RegWEn to false. If it were true, we would write to rd on every SIZ instruction, not just those where R[rs2] == 0.

3. BrUn:

1 (Signed)       0 (Unsigned)       X

**Solution:** We are doing a comparison to zero. Whether the branch comparison is signed or unsigned has no effect on the outcome.

4. BSel:

1       0       X

**Solution:** We want our ALU to produce rs1 as its output. We do not care what the value of our second operand is (because regardless, it isn't the output we want) and therefore it doesn't matter if we pass in the immediate or DataB.

5. ASel:

- 1       0       X

**Solution:** Though we don't care about BSel, we do care about ASel because this is where we have the option of passing in rs1 to the ALU. If we want our write to contain the correct information, we must select DataA as our input here.

6. MemRW:

- 1 (Enable)       0 (Disable)       X

**Solution:** This instruction makes no modifications to memory and therefore shouldn't write to memory. The reason we can't leave this control bit as an X (don't care) is because modifying memory is a state change; if we "don't care" and modify memory on some SIZ instructions and not on others, we could end up overwriting, deleting, or forcing garbage data into our memory and messing up execution of later instructions.

7. WBSel

- |                                         |                                                    |
|-----------------------------------------|----------------------------------------------------|
| <input checked="" type="radio"/> ALUOut | <input type="radio"/> MemOut                       |
| <input type="radio"/> PC + 4            | <input type="radio"/> Other: Please specify: _____ |

**Solution:** See solution to part C above

**F3) Datopathology [this is a 2-page question] (20 points = 4+10+6, 30 minutes)**

The datapath below implements the RV32I instruction set. We'd like to implement sign extension for loaded data, but our loaded data can come in different sizes (recall: **1b**, **1h**, **1w**) and different intended signs (**1bu** vs. **1b** and **1hu** vs **1h**). Each load instruction will retrieve the data from the memory and "right-aligns" the LSB of the byte or the half-word with the LSB of the word to form **mem[31:0]**.



- a) To correctly load the data into the registers, we've created two control signals **SelH** and **SelW** that perform sign extension of **mem[31:0]** to **memx[31:0]** (see below). **SelH** controls the half-word sign extension, while **SelW** controls sign extension in the two most significant bytes. What are the Boolean logic expressions for the four (0, 1, 2, 3) **SelW** cases in terms of **Inst[14:12]** bits to handle these five instructions (**1b**, **1h**, **1w**, **1bu** and **1hu**)? **SelH** has been done for you. In writing your answers, use the shorthands "I14" for **Inst[14]**, "I13" for **Inst[13]** and "I12" for **Inst[12]**. You don't have to reduce the Boolean expressions to simplest form. (Hint: green card!) [Answers](#) (and simplified form)



|               |                                                   |
|---------------|---------------------------------------------------|
| <b>SelW=3</b> | <b>I14 • ~I13 • ~I12 + I14 • ~I13 • I12 = I14</b> |
| <b>SelW=2</b> | <b>~I14 • ~I13 • I12 = ~I14 • I12</b>             |
| <b>SelW=1</b> | <b>~I14 • ~I13 • ~I12</b>                         |
| <b>SelW=0</b> | <b>~I14 • I13 • ~I12 = I13</b>                    |
| <b>SelH=2</b> | <b>I14 • ~I13 • ~I12</b>                          |
| <b>SelH=1</b> | <b>~I14 • ~I13 • ~I12</b>                         |
| <b>SelH=0</b> | <b>I13 + I12</b>                                  |

(Single-bit values **mem[7]** and **mem[15]** are wired to 8 or 16 outputs)

Here's how we figured it out -- we made a table:

## INST

**111**

**432**

| Inst | funct3 | byte3+byte2 |        | byte1     |        | byte0    |
|------|--------|-------------|--------|-----------|--------|----------|
| lb   | 000    | mem[7]      | SelW=1 | mem[7]    | SelH=1 | mem[7:0] |
| lh   | 001    | mem[15]     | SelW=2 | mem[15:8] | SelH=0 | mem[7:0] |
| lw   | 010    | mem[31:16]  | SelW=0 | mem[15:8] | SelH=0 | mem[7:0] |
| lbu  | 100    | 0           | SelW=3 | 0         | SelH=2 | mem[7:0] |
| lhu  | 101    | 0           | SelW=3 | mem[15:8] | SelH=0 | mem[7:0] |

... and then reversing the table for the cases based on the INST funct3 bits above yields the values above.

### F3) Datopathology, continued (20 points = 4+10+6, 30 minutes)

(this is the same diagram as on the previous page, with five stages of execution annotated)



b) In the RISC-V datapath above, mark what is used by a `jal` instruction. (See green card for its effect...)

|                       |                        |                                                  |                                               |                                                   |
|-----------------------|------------------------|--------------------------------------------------|-----------------------------------------------|---------------------------------------------------|
| Select one per row    | <b>PCSel Mux:</b>      | <input type="radio"/> "pc + 4" branch            | <input checked="" type="radio"/> "alu" branch | <input type="radio"/> * (don't care)              |
|                       | <b>ASel Mux:</b>       | <input checked="" type="radio"/> "pc" branch     | <input type="radio"/> Reg[rs1] branch         | <input type="radio"/> * (don't care)              |
|                       | <b>BSel Mux:</b>       | <input checked="" type="radio"/> "imm" branch    | <input type="radio"/> Reg[rs2] branch         | <input type="radio"/> * (don't care)              |
|                       | <b>WBSel Mux:</b>      | <input checked="" type="radio"/> "pc + 4" branch | <input type="radio"/> "alu" branch            | <input type="radio"/> "mem" branch                |
| Select all that apply | <b>Datapath units:</b> | <input type="checkbox"/> Branch Comp             | <input checked="" type="checkbox"/> Imm. Gen  | <input type="checkbox"/> Load Extend              |
|                       | <b>RegFile:</b>        | <input type="checkbox"/> Read Reg[rs1]           | <input type="checkbox"/> Read Reg[rs2]        | <input checked="" type="checkbox"/> Write Reg[rd] |