

# Multicycle RISC testbench Documentation

---

## Link to testbenches

Random: <https://www.edaplayground.com/x/hXSs>

Target: <https://www.edaplayground.com/x/7KcW>

## Block Diagrams

Important aspects of every stage.

### Fetch Stage Diagram:

Fetch stage keeps the instruction memory inside and distribute it throughout the pipeline. It has a *pc\_bias* port which makes it possible to load a program counter value, and it also counts with an *pc\_en*, which allows the following instruction when rising clock edge comes. In the diagram it is detailed that *flow control* instructions, which are capable of taking control of the following instruction, are detected and resolved in Execute stage.



### Decode Stage Diagram:



## Execute Stage Diagram:

Execute stage mostly computes operations whenever its needed. Usually its inputs are driven from decode stage, but in *movement* or *flow control* operations, *Imm*, *Imm \**, zeros, and forwarding inputs need to be included.



## Memory Stage Diagram:

Memory stage is simple. When data memory needs to be written its input can either come from C register (*store* instruction) or from the *Imm\** extension register (*storei* instruction). So, input MUX decides whether to choose `regC` which goes throughout the pipeline, `Imm*` which can be selected from the current instruction, or in the special case (forwarding) where the previous instruction intended to write C register, then it can come from the WriteBack Stage. On the other hand, `mem_address` which is used for writing and reading

instructions, and it always comes from alu output.



### WriteBack Stage Diagram:

Write back output changes in the following way:

- $wb\_output = mem\_output$  in *load* instruction.
- $wb\_output = Imm^*$  in *storei* instruction.
- $wb\_output = pc + 8'd1$  in case of *jal* or *jr.al* instructions.
- $wb\_output = alu$  in all the other cases.



## Final Pipeline



## Project Structure

```

└── multicycle
    ├── top_multicycle.v
    │   └── defines.v
    │   └── alu_multicycle.v
    └── memory.v
        └── mem.hex
    └── program_counter_multi.v
        └── pc_mem_RANDOM.hex
        └── pc_mem_TARGET.hex
        └── pc_mem_target_multi.hex
    └── register_file_multi.v
        └── register_mem.hex
        └── register_mem_aux.hex

```

### Top Module:

```

module top_multicycle #(
    parameter MEMORY_FILE    = "mem.hex",
    parameter PC_FILE         = "pc_mem_target_multi.hex",
    parameter REG_FILE        = "register_mem.hex"
)
(
    input    wire                               clk,
    input    wire                               rst_n,
    //Debugging signals!
    //.
    //.
);

```

### Alu Module:

```

module alu_multicycle
#(
    parameter NB_REGISTERS = 34,
    parameter DATA_WIDTH    = 32
)
(
    input  wire [2:0]           i_alu_ctrl, //Controls the operation
of alu
    input  wire [NB_REGISTERS-1:0] i_data_A, //ALU in A
    input  wire [NB_REGISTERS-1:0] i_data_B, //ALU in B

    output wire [NB_REGISTERS-1:0] o_data      //ALU out
);

```

the possible operations that ALU can do are:

```

parameter ADD      = 3'd0;
parameter SUB      = 3'd1;
parameter ADDC     = 3'd2;
parameter SUBC     = 3'd3;
parameter ALU_OR   = 3'd4;
parameter ALU_AND  = 3'd5;
parameter ALU_INV  = 3'd6;
parameter EXCEPTION = 3'd7;

```

## Memory Module:

```

module memory
#(
    parameter MEM_WIDTH  = 8,
    parameter DATA_WIDTH = 32,
    parameter ADDR_WIDTH = 8,
    parameter MEM_INIT_FILE = "mem.hex"
)
(
    input  wire          clk,
    input  wire          rst_n, //disabled
    //Input writing data
    input  wire          i_wenable, //write enable
    input  wire [0:ADDR_WIDTH-1] i_address, //write address
    input  wire [0:DATA_WIDTH-1] i_data, //write data
    //Outputs
    output wire [0:DATA_WIDTH-1] o_data //mem output
);

```

## Register File Module:

```

module register_file_multi
#(
    parameter REG_WIDTH      = 34,
    parameter ADDR_WIDTH     = 5,
    parameter MEM_INIT_FILE = "register_mem.hex"
)
(
    input  wire                  clk,
    input  wire                  rst_n,
    //Inputs:
    //REGISTER A
    input  wire [ADDR_WIDTH-1:0]   i_address_reg_a,    //address A
    input  wire                   i_wenable_reg_a,    //write enable A
    input  wire [REG_WIDTH-1:0]    i_writedata_reg_a,  //reg A write
    data
    //REGISTER B AND C
    input  wire [ADDR_WIDTH-1:0]   i_address_reg_b,    //address B
    input  wire [ADDR_WIDTH-1:0]   i_address_reg_c,    //address C
    //OUTPUTS
    output wire [REG_WIDTH-1:0]   o_data_regb,       //register B
    output wire [REG_WIDTH-1:0]   o_data_recg       //register C
);

```

### Program Counter Module:

```

module program_counter_multi #(
    parameter MEM_INIT_FILE = "pc_mem_target_multi.hex"
)
(
    input  wire                  clk,
    input  wire                  pc_en,           //pc enable
    input  wire                  bias_en,         //bias enable
    input  wire [7:0]             bias,            //load pc
    output wire [31:0]            instruction_output, //instruction
    output wire [7:0]             program_counter //pc
);

```

---

## ALU operations by instruction

---

| Operation       | ALU operation | Operand 2 | Operand 3 | Note     |
|-----------------|---------------|-----------|-----------|----------|
| <b>MOVEMENT</b> |               |           |           |          |
| <b>load</b>     | 0(+)          | Rb        | Imm*      | Mem_addr |

| Operation           | ALU operation | Operand 2 | Operand 3 | Note      |
|---------------------|---------------|-----------|-----------|-----------|
| <b>store</b>        | 0(+)          | Rb        | Imm       | Mem_addr  |
| <b>loadi</b>        | 0(+)          | Imm*      |           | WB_output |
| <b>storei</b>       | 0(+)          | Rb        |           | Mem_addr  |
| <b>mov</b>          | 0(+)          | Rb        |           | WB_output |
| <b>LOGIC</b>        |               |           |           |           |
| <b>or</b>           | 4(or)         | Rb        | Rc        | WB_output |
| <b>inv</b>          | 6(~)          | Rb        | Rc        | WB_output |
| <b>and</b>          | 5(&)          | Rb        | Rc        | WB_output |
| <b>ARITHMETIC</b>   |               |           |           |           |
| <b>add</b>          | 0(+)          | Rb        | Rc        | WB_output |
| <b>sub</b>          | 1(-)          | Rb        | Rc        | WB_output |
| <b>addC</b>         | 2(+c)         | Rb        | Rc        | WB_output |
| <b>subC</b>         | 3(-c)         | Rb        | Rc        | WB_output |
| <b>FLOW CONTROL</b> |               |           |           |           |
| <b>jump</b>         | 0(+)          | Rb        | 0         |           |
| <b>bz</b>           | 0(+)          | Rb        | Imm       |           |
| <b>bnz</b>          | 0(+)          | Rb        | Imm       |           |
| <b>bc</b>           | 0(+)          | Rb        | Imm       |           |
| <b>bv</b>           | 0(+)          | Rb        | Imm       |           |
| <b>jal</b>          | 0(+)          | Rb        | Imm*      |           |
| <b>jral</b>         | 0(+)          | Rb        | Imm*      |           |
| <b>ret</b>          | 0(+)          | Rb        | 0         |           |

## Target Testbench

---

The following documentation corresponds to the work done in the target testbench.

Instructions executed

| Operation  | Operand 1 | Operand 2 | Operand 3 |
|------------|-----------|-----------|-----------|
| <b>or</b>  | 31        | 0         | 1         |
| <b>inv</b> | 31        | 2         |           |

| <b>Operation</b> | <b>Operand 1</b> | <b>Operand 2</b> | <b>Operand 3</b> |
|------------------|------------------|------------------|------------------|
| <b>and</b>       | 31               | 4                | 5                |
| <b>add</b>       | 31               | 31               | 30               |
| <b>sub</b>       | 31               | 28               | 29               |
| <b>addc</b>      | 31               | 27               | 26               |
| <b>subc</b>      | 31               | 25               | 24               |
| <b>load</b>      | 31               | 23               | 22               |
| <b>store</b>     | 21               | 20               | 19               |
| <b>loadi</b>     | 31               | 63               |                  |
| <b>storei</b>    | 31               | 127              |                  |
| <b>mov</b>       | 31               | 15               |                  |
| <b>load</b>      | 31               | 1                | 2                |
| <b>add</b>       | 31               | 31               | 2                |
| <b>loadi</b>     | 30               | 30               |                  |
| <b>jump</b>      | 2                |                  |                  |
| <b>or</b>        | 31               | 0                | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>bz</b>        | 1                | 0                | 2                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>bnz</b>       | 2                | 1                | 2                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>bc</b>        | 1                | 30               | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>bv</b>        | 1                | 30               | 0                |
| <b>jal</b>       | 31               | 3                |                  |
| <b>or</b>        | 31               | 0                | 1                |
| <b>or</b>        | 31               | 0                | 1                |
| <b>jral</b>      | 31               | 0                | 2                |

| Operation | Operand 1 | Operand 2 | Operand 3 |
|-----------|-----------|-----------|-----------|
| or        | 31        | 0         | 1         |
| ret       | 18        |           |           |

Observe the following:

- In *ADD* operation (PCF=5 in Fetch Stage, PCE=3 in Execute Stage) a Forwarding Hazard should arise, since register 31 is intended to use, which has been previously written.
- In *LOADI* operation (PCE=9) a Forwarding Hazard should arise, since Imm\*=63 which makes Rc field be 5'b11111. This goofs Hazard unit into thinking that Rc field is 31st register and 2 operations before, in *LOAD* operation, 31st register was intended to be written (detected in writeback stage). This won't make any change, since ALU control will handle it, ignoring hazard.
- In *STOREI* operation (PCE=10) a Forwarding Hazard should arise, since its intended to use Rb=31 register to access memory, but previous operation *LOADI* tries to write register 31. A 63 should be passed.
- Second call to *LOAD* operation (PCE=12) was introduced to force a Stall Hazard. *LOAD* intends to write 31st register from memory but in Decode Stage is detected that next operation will need that 31st register to do an ALU computation. This will force the pipeline to stop for a while until memory is addressed, and in next clock a Forwarding is made from WriteBack Stage up to Execute Stage.
- Last *RET* instruction goes back to *BV* instruction, thus RISC enters a loop from 19th instruction and on.

## Hazards

How hazards are detected?

1. **FORWARDING:** forwarding is asserted high whenever the Rb or Rc field of instruction in Execute Stage is the same as the Ra field in Memory or WriteBack Stage. This means that whenever Imm\* use is intended, encompassing Rc field, a hazard may arise. These special cases are addressed in control unit.
2. **STALL:** stall hazard is arised when in Memory Stage, a *LOAD* instruction is detected and the destination address is the same as an Rb or Rc address in Execute Stage. This will flush Decode Stage Register.
3. **BRANCH:** branch control hazard is arised whenever a FLOW CONTROL instruction is detected in the Execute Stage. This will flush Decode and Execute registers, and stall Decode Register.



Target testbench executed

These are the instructions tested in target\_testbench. The whole system, signals and waveforms, instructions, and operations, were followed and checked. The following diagram may help as a road map:

- In green, instructions that shall be executed.
- In yellow, instructions that arise hazards. When a hazard occur, the stages involved are highlighted, indicating for example the origin and destination of a forwarding hazard.
- In blue, instructions, flushes, or actions executed in order to solve hazards.
- In red, garbage instructions which shouldnt be executed

| n_clock | n_test | FETCH  | DECODE | EXECUTE | MEMORY | WRITEBACK | HAZARDS!     |
|---------|--------|--------|--------|---------|--------|-----------|--------------|
| 0       |        | or     | X      | X       | X      | X         |              |
| 1       |        | inv    | or     | X       | X      | X         |              |
| 2       |        | and    | inv    | or      | X      | X         |              |
| 3       |        | addc   | and    | inv     | or     | X         |              |
| 4       | 0      | sub    | add    | and     | inv    | or        |              |
| 5       |        | addc   | sub    | add     | and    | inv       | FORWARD!     |
| 6       |        | subc   | addc   | sub     | add    | and       |              |
| 7       |        | load   | subc   | addc    | sub    | add       |              |
| 8       | 1      | store  | load   | subc    | addc   | sub       |              |
| 9       |        | loadi  | store  | load    | subc   | addc      |              |
| 10      |        | storei | loadi  | store   | load   | subc      |              |
| 11      |        | mov    | storei | loadi   | store  | load      | IGNORED (FW) |
| 12      |        | load   | mov    | storei  | loadi  | store     | FORWARD!     |
| 13      | 2      | add    | load   | mov     | storei | loadi     |              |
| 14      |        | loadi  | add    | load    | mov    | storei    | STALL!       |
| 15      |        | loadi  | add    | NOP     | load   | mov       |              |
| 16      |        | jump   | add    | add     | NOP    | load      | FORWARD!     |
| 17      | 3      | or     | jump   | loadi   | add    | NOP       | FORWARD!     |
| 18      |        | or     | or     | jump    | loadi  | add       | BRANCHCTRL   |
| 19      | 4      | bz     | NOP    | NOP     | jump   | loadi     |              |
| 20      |        | or     | bz     | NOP     | NOP    | jump      |              |
| 21      |        | or     | or     | bz      | NOP    | NOP       | BRANCHCTRL   |
| 22      | 5      | bnz    | NOP    | NOP     | bz     | NOP       |              |
| 23      |        | or     | bnz    | NOP     | NOP    | bz        |              |
| 24      |        | or     | or     | bnz     | NOP    | NOP       | BRANCHCTRL   |
| 25      | 6      | bc     | NOP    | NOP     | bnz    | NOP       |              |
| 26      |        | or     | bc     | NOP     | NOP    | bnz       |              |
| 27      |        | bv     | or     | bc      | NOP    | NOP       | BRANCHCTRL   |
| 28      | 7      | bv     | NOP    | NOP     | bc     | NOP       |              |
| 29      |        | jal    | bv     | NOP     | NOP    | bc        |              |
| 30      |        | or     | jal    | bv      | NOP    | NOP       | BRANCHCTRL   |
| 31      | 8      | jal    | NOP    | NOP     | bv     | NOP       |              |
| 32      |        | or     | jal    | NOP     | NOP    | bv        |              |
| 33      |        | or     | or     | jal     | NOP    | NOP       | BRANCHCTRL   |
| 34      | 9      | jral   | NOP    | NOP     | jal    | NOP       |              |
| 35      |        | or     | jral   | NOP     | NOP    | jal       |              |
| 36      |        | ret    | or     | jral    | NOP    | NOP       | BRANCHCTRL   |
| 37      | 10     | ret    | NOP    | NOP     | jral   | NOP       |              |
| 38      |        | or     | ret    | NOP     | NOP    | jral      |              |
| 39      |        | or     | or     | ret     | NOP    | NOP       | BRANCHCTRL   |
| 40      | 11     | bv     | NOP    | NOP     | ret    | NOP       |              |

Some remarks about the comparisons executed:

### 1. TEST N0 - 4th clock:

- Memory: input should be 32'hffffffff since it comes from regC, addc is in decode stage addressing regC -> reg[30] = 32'hffffffff
- Decode: regA input data is 1 as or operation is on WriteBack stage.

**2. TEST N1 - 8th clock:**

- Memory: input should be 32'h16 since it comes from decode stage *regC*. *load* 31,23,22 is in decode stage addressing *regC* -> *regC*[22] = 32'd22 = 32h16
- Decode: *regA* input data is 34'h30000001c as *subc* operation is on WriteBack stage. *subc* operation makes 32'h00000001c - 32'hfffffff - 32'd1, so the result is a 1c with underflow, in the MSB.

**3. TEST N2 - 13th clock:**

- Memory: memory addr is 42 since two clocks before *load* instruction loaded 31st register from memory, acquiring 32'h3d155142 value. Then, since memory addr uses the 8 LSbits, its value is 42.

**Random testbench**

Random testbench uses a randomized program counter memory, which is product of a python script. The main idea of the testbench is to compare the output of the different stages between the top module and the stages recreated in a model made in System Verilog (multicycle\_model.sv).