

# IPA PROJECT

## TEAM MEMBERS DETAILS

Eswar Kumar Anantha (2023102011)

K.Roshan Lal (2023102010)

M.Radheshyam(2023102032)

## RISC-V Processor Architecture Design Report

### Introduction

This report presents the design and implementation of a **RISC-V processor architecture** using Verilog. The design follows the **Reduced Instruction Set Computing (RISC)** principles and supports a subset of the **RISC-V ISA (Instruction Set Architecture)**. The processor is verified through simulation to ensure compliance with the specified requirements.

### Processor Architecture Overview

The processor is implemented in **Verilog** and follows a **sequential-based** and **pipeline-based architecture**.

#### Sequential-based architecture

The **sequential-based processor architecture** follows a simple **single-cycle execution model**, where each instruction is **fetched, decoded, executed, and written back** before moving to the next instruction. This design follows **positive-edge clock updating**, meaning the processor updates the **program counter (PC)** and registers at the rising edge of the clock signal.



Unlike pipelined architectures, where multiple instructions execute in parallel across different stages, a **sequential processor completes one instruction per clock cycle** before starting the next instruction.

- At each **positive edge** of the clock, the processor fetches the next instruction from memory.

- The instruction is fully processed (fetch → decode → execute → memory → write-back).
- Only after completing the current instruction does the processor move to the next instruction in the following clock cycle.

This means that there are **no overlaps** between instructions, and each instruction takes a full clock cycle to complete.

## **Step-by-Step Execution and Design details in Each Clock Cycle**

### **▼ Instruction Fetch (IF)**

#### **Design Details**

The **Instruction Fetch (IF) module** is responsible for fetching instructions from the instruction memory based on the **Program Counter (PC)** value. The fetched instruction is then passed to the next stage of the processor for decoding and execution and so on .

This module is a fundamental part of the **sequential processor** and ensures that the correct instruction is read for every pc .

#### **Module Overview**

The `instruction_fetch` module contains the following key components:

- **Instruction Memory ( `inst_mem` )**: A register array storing 256 instructions, each 32 bits wide.
- **Instruction Fetch Logic**: Reads the instruction based on the current **PC value**.
- **Initial Memory Setup**: The memory is preloaded with a program performing conditional branching and arithmetic operations etc .
- **Display Statements**: Provides debug information during simulation.

```

Fetching instruction at Address:          0
Instruction: 0000000001000001100001101100011
Time: 10000 | PC: 0 | Instruction: 0000000001000001100001101100011

Fetching instruction at Address:          1
Instruction: 0000000001000001100001010110011
Time: 20000 | PC: 4 | Instruction: 0000000001000001100001010110011

Fetching instruction at Address:          2
Instruction: 00000000000000000000000000000001001100011
Time: 30000 | PC: 8 | Instruction: 00000000000000000000000000000001001100011

Fetching instruction at Address:          3
Instruction: 01000000001000011000001010110011
Time: 40000 | PC: c | Instruction: 01000000001000011000001010110011

Fetching instruction at Address:          4
Instruction: 00000000001100010001001101100011
Time: 50000 | PC: 10 | Instruction: 00000000001100010001001101100011

Fetching instruction at Address:          5
Instruction: 00000000010000011110001100110011
Time: 60000 | PC: 14 | Instruction: 00000000010000011110001100110011

Fetching instruction at Address:          6
Instruction: 00000000000000000000000000000001001100011
Time: 70000 | PC: 18 | Instruction: 00000000000000000000000000000001001100011

Fetching instruction at Address:          7
Instruction: 00000000001000011110011001100110011
Time: 80000 | PC: 1c | Instruction: 0000000000100001111001100110011
instruction_fetch_tb_1.v:23: $finish called at 80000 (1ps)

```

To finish the instructions we declared last instruction opcode is 111111 then the program will be finished

#### **CHALLENGES ENCOUNTERED :**

while the branch is taken where it must go back to the previous instruction we didn't consider the negative sign so the pc is not updated correctly and instruction is not fetched correctly

#### **INSTRUCTION DECODE**

The **Instruction Decode Unit** extracts fields from a 32-bit instruction, generates control signals, and determines operand registers.

#### **Inputs and Outputs**

##### **Inputs:**

- **instruction [31:0]** : The **32-bit instruction** fetched from memory.

##### **Outputs:**

- **Instruction Fields:**

- `rs1 [4:0]` : First source register.
- `rs2 [4:0]` : Second source register (only for R-type and B-type).
- `rd [4:0]` : Destination register.
- `funct3 [2:0]` : Function code for ALU operations and branching.
- `funct7 [6:0]` : Additional function code for R-type operations.
- `opcode [6:0]` : The opcode that determines the instruction type.

- **Control Signals:**

- `branch` : 1 if it is a branch instruction.
- `alu_op [3:0]` : Operation code for ALU.
- `alu_src` : Determines if one operand comes from immediate.
- `imm [63:0]` : Sign-extended immediate value.
- `memread` : 1 if data needs to be read from memory.
- `memwrite` : 1 if data needs to be written to memory.
- `memtoreg` : 1 if memory output is written to a register.
- `regwrite` : 1 if the destination register should be written.

## Design Description

### 1. Extract Fields

- `opcode = instruction[6:0]`
- `rd = instruction[11:7]`
- `funct3 = instruction[14:12]`
- `funct7 = instruction[31:25]`
- `rs1 = instruction[19:15]`
- `rs2 = instruction[24:20]`

### 2. Control Logic Based on `opcode`

- **R-type (Register-Register)** (`opcode = 0110011`):
  - Uses `rs1`, `rs2`, and `rd`.
  - ALU operation depends on `funct3` and `funct7`.
  - `regwrite = 1`, `alu_src = 0`, `memread = 0`, `memwrite = 0`.
- **I-type (Immediate Instructions)** (`opcode = 0010011` and `0000011`):
  - Uses `rs1` and `rd`, with immediate value extracted.
  - `alu_src = 1`, `regwrite = 1`, and different `alu_op` values.
- **S-type (Store Instructions)** (`opcode = 0100011`):
  - Uses `rs1`, `rs2`, and an immediate value.
  - `memwrite = 1`, `alu_src = 1`, and `regwrite = 0`.
- **B-type (Branch Instructions)** (`opcode = 1100011`):
  - Uses `rs1`, `rs2`, and immediate.
  - `branch = 1` and different ALU operations (`BEQ`, `BNE`, `BGE`, etc.).

### 3. Immediate Generation

- **I-type immediate** (for `ADDI`, `LW`, etc.):  
 $\text{imm} = \text{SignExtend}(\text{instruction}[31:20])$   
 $\text{imm} = \text{SignExtend}(\text{instruction}[31:20]) \setminus \text{text}\{\text{imm}\} = \setminus \text{text}\{\text{SignExtend}\}(\text{instruction}[31:20])$
- **S-type immediate** (for `SW`, `SH`, etc.):  
 $\text{imm} = \text{SignExtend}(\{\text{instruction}[31:25], \text{instruction}[11:7]\})$   
 $\text{imm} = \text{SignExtend}(\{\text{instruction}[31:25], \text{instruction}[11:7]\}) \setminus \text{text}\{\text{imm}\} = \setminus \text{text}\{\text{SignExtend}\}(\setminus \{\text{instruction}[31:25], \text{instruction}[11:7]\})$
- **B-type immediate** (for `BEQ`, `BNE`, etc.):  
 $\text{imm} = \text{SignExtend}(\{\text{instruction}[31], \text{instruction}[7], \text{instruction}[30:25], \text{instruction}[11:8], 1'b0\})$   
 $\text{imm} = \text{SignExtend}(\{\text{instruction}[31], \text{instruction}[7], \text{instruction}[30:25], \text{instruction}[11:8], 1'b0\}) \setminus \text{text}\{\text{imm}\} = \setminus \text{text}\{\text{SignExtend}\}(\{\text{instruction}[31], \text{instruction}[7], \text{instruction}[30:25], \text{instruction}[11:8], 1'b0\})$

### 4. Debugging Information

- `$display` prints the decoded instruction fields and control signals.

```
[Running] decode tb.v
Instruction: 001101b3 | Opcode: 0110011 | RD: 3 | Funct3: 000 | RS1: 2 | RS2: 1 | Funct7: 0000000 | Immediate: 0 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 001101b3 | Opcode: 0110011 | RD: 3 | Funct3: 000 | RS1: 2 | RS2: 1 | Funct7: 0100000 | Immediate: 0 | Control Signals: Branch=0, ALU_OP=0110, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 001101b3 | Opcode: 0110011 | RD: 3 | Funct3: 110 | RS1: 2 | RS2: 1 | Funct7: 0000000 | Immediate: 0 | Control Signals: Branch=0, ALU_OP=0001, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 00508013 | Opcode: 0010011 | RD: 2 | Funct3: 000 | RS1: 1 | RS2: 5 | Funct7: 0000000 | Immediate: 5 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 00508013 | Opcode: 0010011 | RD: 2 | Funct3: 110 | RS1: 1 | RS2: 5 | Funct7: 0000000 | Immediate: 5 | Control Signals: Branch=0, ALU_OP=0001, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 0050c113 | Opcode: 0110011 | RD: 2 | Funct3: 100 | RS1: 1 | RS2: 5 | Funct7: 0000000 | Immediate: 5 | Control Signals: Branch=0, ALU_OP=0001, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 0050c113 | Opcode: 0110011 | RD: 2 | Funct3: 110 | RS1: 1 | RS2: 5 | Funct7: 0000000 | Immediate: 5 | Control Signals: Branch=0, ALU_OP=1001, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 0021b223 | Opcode: 0100011 | RD: 4 | Funct3: 011 | RS1: 3 | RS2: 2 | Funct7: 0000000 | Immediate: 4 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=1
Instruction: 0041b223 | Opcode: 0100011 | RD: 4 | Funct3: 011 | RS1: 3 | RS2: 4 | Funct7: 0000000 | Immediate: 4 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Instruction: 0041b103 | Opcode: 0000011 | RD: 2 | Funct3: 011 | RS1: 3 | RS2: 4 | Funct7: 0000000 | Immediate: 4 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=1, MemWrite=0, MemToReg=1, RegWrite=1
Instruction: 0041b103 | Opcode: 0000011 | RD: 2 | Funct3: 011 | RS1: 3 | RS2: 4 | Funct7: 0000000 | Immediate: 4 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=1, MemWrite=0, MemToReg=1, RegWrite=1
Instruction: 0041b203 | Opcode: 0000011 | RD: 4 | Funct3: 011 | RS1: 3 | RS2: 4 | Funct7: 0000000 | Immediate: 4 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=1, MemWrite=0, MemToReg=1, RegWrite=1
Instruction: 00110163 | Opcode: 1100011 | RD: 2 | Funct3: 000 | RS1: 2 | RS2: 1 | Funct7: 0000000 | Immediate: 2 | Control Signals: Branch=1, ALU_OP=1010, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Instruction: 00110163 | Opcode: 1100011 | RD: 2 | Funct3: 001 | RS1: 2 | RS2: 1 | Funct7: 0000000 | Immediate: 2 | Control Signals: Branch=1, ALU_OP=1011, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Instruction: 00110163 | Opcode: 1100011 | RD: 2 | Funct3: 101 | RS1: 1 | RS2: 2 | Funct7: 0000000 | Immediate: 2 | Control Signals: Branch=1, ALU_OP=1010, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Instruction: ffffffff | Opcode: 1111111 | RD: 31 | Funct3: 111 | RS1: 31 | RS2: 31 | Funct7: 1111111 | Immediate: 0 | Control Signals: Branch=0, ALU_OP=0000, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
```

### CHALLENGES ENCOUNTERED :

For the immediate we didnot consider the sign bit so error occurred in the imm gen

### EX\_STAGE

This module is responsible for executing arithmetic, logical, and shift operations based on the **ALU control signals** (`alu_op`). It receives operand values from the register file and an immediate value from the decode stage. The module supports different instruction types, including arithmetic, logical, shift, and comparison operations.

#### ◆ Inputs:

| Signal Name          | Width  | Description                                                                                     |
|----------------------|--------|-------------------------------------------------------------------------------------------------|
| <code>decoder</code> | 1-bit  | Control signal indicating if the instruction is decoded correctly (not used in current design). |
| <code>rs1_val</code> | 64-bit | Value of source register 1 (signed).                                                            |

|                      |        |                                                                                                   |
|----------------------|--------|---------------------------------------------------------------------------------------------------|
| <code>rs2_val</code> | 64-bit | Value of source register 2 (signed).                                                              |
| <code>imm</code>     | 64-bit | Immediate value (signed).                                                                         |
| <code>alu_src</code> | 1-bit  | Control signal to select between <code>rs2_val</code> and <code>imm</code> as the second operand. |
| <code>alu_op</code>  | 4-bit  | Control signal specifying the ALU operation.                                                      |
| <code>branch</code>  | 1-bit  | Control signal for branch operations (not used in current design).                                |

## ◆ Outputs:

| Signal Name             | Width  | Description                                                   |
|-------------------------|--------|---------------------------------------------------------------|
| <code>alu_result</code> | 64-bit | The computed result of the ALU operation.                     |
| <code>overflow</code>   | 1-bit  | Indicates if an overflow occurred in addition or subtraction. |

## ◆ Internal Signals:

| Signal Name                           | Width  | Description                                                                                |
|---------------------------------------|--------|--------------------------------------------------------------------------------------------|
| <code>operand_b</code>                | 64-bit | Selected operand (either <code>rs2_val</code> or <code>imm</code> ).                       |
| <code>add_result</code>               | 64-bit | Result of addition ( <code>rs1_val + operand_b</code> ).                                   |
| <code>sub_result</code>               | 64-bit | Result of subtraction ( <code>rs1_val - operand_b</code> ).                                |
| <code>and_result</code>               | 64-bit | Result of bitwise AND ( <code>rs1_val &amp; operand_b</code> ).                            |
| <code>or_result</code>                | 64-bit | Result of bitwise OR ( <code>rs1_val   operand_b</code> ).                                 |
| <code>xor_result</code>               | 64-bit | Result of bitwise XOR ( <code>rs1_val ^ operand_b</code> ).                                |
| <code>shifted_left</code>             | 64-bit | Result of left shift ( <code>rs1_val &lt;&lt; operand_b</code> ).                          |
| <code>shifted_right_logical</code>    | 64-bit | Result of logical right shift ( <code>rs1_val &gt;&gt; operand_b</code> ).                 |
| <code>shifted_right_arithmetic</code> | 64-bit | Result of arithmetic right shift ( <code>rs1_val &gt;&gt;&gt; operand_b</code> ).          |
| <code>lt_signed</code>                | 1-bit  | Result of signed comparison ( <code>rs1_val &lt; operand_b</code> ).                       |
| <code>lt_unsigned</code>              | 1-bit  | Result of unsigned comparison ( <code>rs1_val &lt; operand_b</code> using unsigned logic). |
| <code>comparator</code>               | 1-bit  | Result of equality comparison ( <code>rs1_val == operand_b</code> ).                       |
| <code>not_comparator</code>           | 1-bit  | Result of inequality comparison ( <code>rs1_val != operand_b</code> ).                     |
| <code>add_overflow</code>             | 1-bit  | Overflow flag for addition.                                                                |
| <code>shift_overflow</code>           | 1-bit  | Overflow flag for shifting (used for left shift).                                          |

## ◆ Module Functionality

### 1. Operand Selection ( `operand_b` )

- If `alu_src = 1`, `operand_b` is `imm` (for immediate instructions).
- If `alu_src = 0`, `operand_b` is `rs2_val` (for register-register instructions).

### 2. ALU Operation Selection ( `alu_op` )

#### • Arithmetic Operations:

- `4'b0010` → Addition ( `add_result` )
- `4'b0110` → Subtraction ( `sub_result` )

#### • Logical Operations:

- `4'b0000` → AND ( `and_result` )
- `4'b0001` → OR ( `or_result` )
- `4'b1001` → XOR ( `xor_result` )

- **Shift Operations:**

- `4'b0011` → Left Shift (`shifted_left`)
- `4'b0100` → Logical Right Shift (`shifted_right_logical`)
- `4'b0101` → Arithmetic Right Shift (`shifted_right_arithmetic`)

- **Comparison Operations:**

- `4'b0111` → Unsigned Less Than (`lt_unsigned`)
- `4'b1000` → Signed Less Than (`lt_signed`)
- `4'b1010` → Equality Check (`comparator`)
- `4'b1011` → Inequality Check (`~comparator`)

### 3. Overflow Handling

- **Addition** (`add_result`) and **Subtraction** (`sub_result`) may cause overflow, which is stored in `overflow`.
- **Shift operations** may also cause overflow (only left shift considered).

## ◆ Component Instantiations

- **Arithmetic Modules:** `add`, `sub`
- **Logical Modules:** `and_gate`, `or_gate`, `xor_gate`
- **Shifting Modules:** `left_shifter`, `right_shifter_logical`, `right_shifter_arithmetic`
- **Comparison Modules:** `signed_lt_comparator`, `unsigned_lt_comparator`, `comparator`

Each module computes its respective function, and the results are **multiplexed using a case statement** inside an `always @(*)` block.

```
vvp processor_tb
VCD Info: dumpfile processor_tb.vcd opened for output.
Time=0 | Instr=00000000001100010000000010110011 | Reg1= 9223372036854775807 | Reg2=
Time=10 | Instr=00000000001100010000000010110011 | Reg1=-9223372036854775808 | Reg2=
Time=20 | Instr=00000000001100010000000010110011 | Reg1= 9223372036854775807 | Reg2=-9223372036854775808 | Result=-1 | Result= 9223372036854775807 | Overflow=1
Time=30 | Instr=00000000001100010000000010110011 | Reg1=-90 | Reg2= 40 | Result=-50 | Overflow=0
Time=40 | Instr=01000000001100010000000010110011 | Reg1= 9223372036854775807 | Reg2=-1 | Result=-9223372036854775808 | Overflow=1
Time=50 | Instr=01000000001100010000000010110011 | Reg1= 9223372036854775808 | Reg2= 1 | Result= 9223372036854775807 | Overflow=1
Time=60 | Instr=01000000001100010000000010110011 | Reg1= 9223372036854775807 | Reg2= 1 | Result=-9223372036854775808 | Overflow=1
Time=70 | Instr=01000000001100010000000010110011 | Reg1=-50 | Reg2= -89 | Result= 39 | Overflow=0
Time=70 | Instr=01000000001100010000000010110011 | Reg1=-50 | Reg2= -89 | Result= 39 | Overflow=0
Time=80 | Instr=0000000000110001011100000010110011 | Reg1= -28147068188896 | Reg2= 71777214294589695 | Result= 71776119077928966 | Overflow=0
Time=90 | Instr=0000000000110001011100000010110011 | Reg1= -28147068188896 | Reg2= 71777214294589695 | Result= -280375465148161 | Overflow=0
Time=100 | Instr=0000000000110001011100000010110011 | Reg1= -28147068188896 | Reg2= 71777214294589695 | Result= -72056494543077121 | Overflow=0
Time=110 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 1 | Result= -2 | Overflow=1
Time=120 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 31 | Result= 2147483648 | Overflow=0
Time=130 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 3 | Result= 8 | Overflow=1
Time=140 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 54 | Result= -1801439859481984 | Overflow=0
Time=150 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 1 | Result= 4611606018427387963 | Overflow=0
Time=160 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 62 | Result= 0 | Overflow=0
Time=170 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 62 | Result= 0 | Overflow=0
Time=180 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 45 | Result= 262143 | Overflow=0
Time=190 | Instr=0000000000110001000000010110011 | Reg1= 9223372036854775807 | Reg2= 9 | Result= 1801439859481983 | Overflow=0
Time=200 | Instr=0100000000011000100000010110011 | Reg1= 9223372036854775807 | Reg2= 4 | Result= 0 | Overflow=0
Time=210 | Instr=0100000000011000100000010110011 | Reg1= 9223372036854775807 | Reg2= 34 | Result= -536870912 | Overflow=0
Time=220 | Instr=0100000000011000100000010110011 | Reg1= -1 | Reg2= 1 | Result= -1 | Overflow=0
Time=230 | Instr=000000000011000100000010110011 | Reg1= -1 | Reg2= -30 | Result= 0 | Overflow=0
Time=240 | Instr=000000000011000100000010110011 | Reg1= 9223372036854775807 | Reg2= -9223372036854775808 | Result= 0 | Overflow=0
Time=250 | Instr=000000000011000100000010110011 | Reg1= 30 | Reg2= -15 | Result= 0 | Overflow=0
Time=260 | Instr=000000000011000100000010110011 | Reg1= -40 | Reg2= 50 | Result= 1 | Overflow=0
Time=270 | Instr=000000000011000100000010110011 | Reg1= 1 | Reg2= 30 | Result= 1 | Overflow=0
Time=280 | Instr=000000000011000100000010110011 | Reg1= 9223372036854775807 | Reg2= 9223372036854775807 | Result= 0 | Overflow=0
Time=290 | Instr=000000000011000100000010110011 | Reg1= 30 | Reg2= 15 | Result= 0 | Overflow=0
Time=300 | Instr=000000000011000100000010110011 | Reg1= 40 | Reg2= 50 | Result= 1 | Overflow=0
src/processor_tb.v:79: $fin called at 310 (s)
gtkwave processor_tb.vcd &
```

## MEMORY STAGE

The `memory` module acts as the **data memory unit** in a processor pipeline, providing **load (memread)** and **store (memwrite) operations**. It interfaces with the **ALU result** for memory addressing and interacts with **register data** for storing values.

## ◆ Module Functionality

## 1 Inputs

| Signal     | Width  | Description                                      |
|------------|--------|--------------------------------------------------|
| alu_result | 64-bit | Memory address (used for both read and write)    |
| clk        | 1-bit  | Clock signal (used for write operations)         |
| memread    | 1-bit  | Control signal; 1 enables memory read            |
| memwrite   | 1-bit  | Control signal; 1 enables memory write           |
| rs2        | 64-bit | Data to be written into memory when memwrite = 1 |

## 2 Outputs

| Signal   | Width  | Description                          |
|----------|--------|--------------------------------------|
| mem_data | 64-bit | Data read from memory if memread = 1 |

## ◆ Memory Structure

- The module uses a **256-entry register array** (`data_memory [0:255]`).
- Each memory location holds a **64-bit value**.
- Memory addressing is done using the lower 8 bits** of `alu_result`, divided by 8 (`alu_result[7:0] >> 3`) to align **64-bit word addressing**.

## ◆ Behavior

### 1 Memory Read ( `memread = 1` )

- Reads data from `data_memory` at the address specified by `(alu_result[7:0] >> 3)`.
- The **lower 8 bits** of `alu_result` determine the **word-aligned address**.

### 2 Memory Write ( `memwrite = 1` )

- Stores `rs2` into `data_memory` at the address specified by `(alu_result[7:0] >> 3)`.
- The write operation happens **asynchronously**, which may cause issues in actual hardware.

```
[Running] memory_tb.v
Read Data from Address 0:          0
Read Data from Address 16:         42
Read Data from Address 16 after Overwrite:
memory_tb.v:72: $finish called at 70000 (1ps)
Read Data from Uninitialized Address 200:
[Done] exit with code=0 in 0.06 seconds
```

## Design Description of the `writeback` Module

The `writeback` module is responsible for selecting the final value that will be written back to the **register file** in a processor pipeline. It determines whether the **ALU result** or **memory data** should be written based on control signals.

## ◆ Module Functionality

### 1 Inputs

| Signal        | Width  | Description                                                                         |
|---------------|--------|-------------------------------------------------------------------------------------|
| MemtoReg_reg  | 1-bit  | Control signal: 1 selects mem_data for register write-back, 0 selects ex_alu_result |
| Regwrite_reg  | 1-bit  | Control signal: 1 enables writing back to the register                              |
| ex_alu_result | 64-bit | Computed result from the ALU in the EX stage                                        |
| mem_data      | 64-bit | Data read from memory in the MEM stage                                              |

### 2 Output

| Signal      | Width  | Description                                             |
|-------------|--------|---------------------------------------------------------|
| registerout | 64-bit | The final value to be written back to the register file |

## ◆ Behavior

### 1 Memory Write-back ( MemtoReg\_reg = 1 )

- If MemtoReg\_reg is set, the module outputs mem\_data, meaning the **data from memory** is written to the register file.

### 2 ALU Result Write-back ( MemtoReg\_reg = 0 and Regwrite\_reg = 1 )

- If MemtoReg\_reg is **not set**, and Regwrite\_reg = 1, the module outputs ex\_alu\_result, meaning the **ALU result** is written to the register file.

### 3 Default Case (No Write-back)

- If neither condition is met, registerout is **set to 0**, ensuring no unintended writes.

```
[Running] write_back_tb.v
TC1: registerout = 0000000000000000 (Expected: 0000000000000000)
TC2: registerout = 5a5a5a5a5a5a5a5a (Expected: 5A5A5A5A5A5A5A5A)
TC3: registerout = a5a5a5a5a5a5a5 (Expected: A5A5A5A5A5A5A5)
TC4: registerout = 5a5a5a5a5a5a5a (Expected: 5A5A5A5A5A5A5A)
write_back_tb.v:49: $finish called at 40000 (1ps)
[Done] exit with code=0 in 0.077 seconds
```

## Design Description of the processor Module

The processor module implements a **basic 64-bit RISC-V processor sequential**, covering **instruction fetch, decode, execution, memory, and write-back stages**. It manages a **program counter (PC)**, **register file**, and **control signals** to execute instructions sequentially while handling memory operations and branch conditions.

## ◆ Module Overview

| Component         | Functionality                                     |
|-------------------|---------------------------------------------------|
| instruction_fetch | Fetches 32-bit instructions from memory           |
| decode            | Decodes instructions and extracts control signals |

|                                 |                                                                                 |
|---------------------------------|---------------------------------------------------------------------------------|
| <code>ex_stage</code> (Execute) | Performs ALU operations based on decoded instruction                            |
| <code>memory</code>             | Reads/writes data memory based on ALU result                                    |
| <code>writeback</code>          | Selects the final value to be written back to registers                         |
| <b>Program Counter (PC)</b>     | Tracks instruction sequence and updates on branches                             |
| <b>Register File</b>            | Stores general-purpose 64-bit registers ( <code>x0</code> to <code>x31</code> ) |

The processor follows a **5-stage pipeline** with individual modules performing each stage's operations.

### 1 Instruction Fetch ( `instruction_fetch` )

- Uses the PC (`pc`) to fetch a 32-bit instruction (`if_instruction`).
- Increments PC by **4** unless a branch occurs.
- Handles termination when encountering a **halt instruction** (`opcode == 7'b1111111`).

### 2 Instruction Decode ( `decode` )

- Decodes the **instruction format** and extracts:
  - **Register addresses** (`id_rs1`, `id_rs2`, `id_rd`).
  - **Control signals** (`branch`, `alu_op`, `memread`, `memwrite`, `memtoreg`, `regwrite`).
  - **Immediate values** (`imm`) for arithmetic and branching.

### 3 Execute ( `ex_stage` )

- **Performs ALU operations** based on control signals.
- Selects between **register value** (`rs2`) or **immediate** (`imm`) as the second operand.
- Computes the **ALU result** (`ex_alu_result`).
- Detects **overflow**.

### 4 Memory Access ( `memory` )

- If `Memread = 1`, **reads memory** at `ex_alu_result` and stores it in `mem_data`.
- If `Memwrite = 1`, **writes** `rs2` **into memory** at `ex_alu_result`.

### 5 Writeback ( `writeback` )

- If `MemtoReg = 1`, selects **memory data** (`mem_data`) for writing back.
- Otherwise, writes back **ALU result** (`ex_alu_result`).
- Writes final value to **destination register** (`id_rd`).

## ◆ Register File

- **32 general-purpose 64-bit registers** (`register_file[0:31]`).
- `id_rs1_val` and `id_rs2_val` are assigned from `register_file`.
- Register write (`id_rd = registerout1`) happens after the **writeback stage**.

## ◆ Program Counter (PC) Management

- **Default PC increment:** `pc = pc + 4` (sequential execution).
- **Branch Handling:**

- o `pc_up = branch & ex_alu_result[0]`
- o If `pc_up = 1`, `pc = pc + (imm << 1)`.
- o Uses the ALU result to determine branch execution.

## TEST CASES:

assembly code : for simpler instruction

```

beq x1, x2, label1    # if (x1 == x2) jump to label1 (inst_mem[3])
add x5, x3, x2        # x5 = x3 + x2
beq x0, x0, exit      # Unconditional jump to exit
label1:
sub x5, x3, x2        # x5 = x3 - x2

bne x2, x3, label2    # if (x2 != x3) jump to label2 (inst_mem[7])
or x6, x3, x4          # x6 = x3 | x4
beq x0, x0, exit      # Unconditional jump to exit
label2:
and x6, x3, x2        # x6 = x3 & x2

exit:
exit                  # Infinite loop (halt execution)

```

```

Fetching instruction at Address: 1
Instruction: 00000000000000000000000000000000 | OPCODE: 00000000 | R0: 6 | FUNCT3: 100 | RSI: 1 | RS2: 2 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 6 | Control Signals: Branch=1, ALU_OP=0101, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 10 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 0 | PC : 0 |
Time: 100000 | PC: 1 | Instruction: 10000000000000000000000000000000 | ALU Result: 0 | Mem Data: 0 | Branch: 1|pc_branch| 0|registerout= 0

Fetching instruction at Address: 1
Instruction: 00000000000000000000000000000000 | OPCODE: 01000110 | R0: 6 | FUNCT3: 100 | RSI: 3 | RS2: 2 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 6 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 10 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 0 | PC : 0 |
Time: 100000 | PC: 1 | Instruction: 10000000000000000000000000000000 | ALU Result: 0 | Mem Data: 0 | Branch: 0|pc_branch| 0|registerout= 0

Fetching instruction at Address: 2
Instruction: 00000000000000000000000000000000 | OPCODE: 11000110 | R0: 5 | FUNCT3: 000 | RSI: 0 | RS2: 0 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 4 | Control Signals: Branch=1, ALU_OP=0101, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 16 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 32 | X3 : 32 | X4 : 32 | X5 : 32 | X6 : 0 | PC : 8 |
Time: 500000 | PC: 8 | Instruction: 10000000000000000000000000000000 | ALU Result: 1 | Mem Data: 0 | Branch: 1|pc_branch| 0|registerout= 0

Fetching instruction at Address: 4
Instruction: 00000000000000000000000000000000 | OPCODE: 00000000 | R0: 6 | FUNCT3: 000 | RSI: 2 | RS2: 3 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 4 | Control Signals: Branch=1, ALU_OP=0101, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 16 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 32 | X3 : 32 | X4 : 32 | X5 : 32 | X6 : 0 | PC : 20 |
Time: 500000 | PC: 14 | Instruction: 10000000000000000000000000000000 | ALU Result: 0 | Mem Data: 0 | Branch: 0|pc_branch| 0|registerout= 0

Fetching instruction at Address: 5
Instruction: 00000000000000000000000000000000 | OPCODE: 00000000 | R0: 6 | FUNCT3: 100 | RSI: 3 | RS2: 4 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 5 | Control Signals: Branch=0, ALU_OP=0000, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 16 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 32 | X3 : 32 | X4 : 32 | X5 : 32 | X6 : 16 | PC : 20 |
Time: 500000 | PC: 14 | Instruction: 10000000000000000000000000000000 | ALU Result: 100000 | Mem Data: 0 | Branch: 0|pc_branch| 0|registerout= 16

Fetching instruction at Address: 6
Instruction: 00000000000000000000000000000000 | OPCODE: 00000000 | R0: 4 | FUNCT3: 000 | RSI: 0 | RS2: 0 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 6 | Control Signals: Branch=1, ALU_OP=0101, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 16 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 32 | X3 : 32 | X4 : 32 | X5 : 32 | X6 : 16 | PC : 32 |
Time: 500000 | PC: 20 | Instruction: 10000000000000000000000000000000 | ALU Result: 1 | Mem Data: 0 | Branch: 1|pc_branch| 0|registerout= 16

Fetching instruction at Address: 6
Instruction: 00000000000000000000000000000000 | OPCODE: 00000000 | R0: 4 | FUNCT3: 101 | RSI: 2 | RS2: 1 | FUNCT7: 00000000 | Immediate: 16 | X3 : 16 | X4 : 16 | X5 : 16 | X6 : 6 | Control Signals: Branch=0, ALU_OP=0000, ALU_SRC=0, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register x1 : 1 | x2 : 16 | x3 : 16 | x4 : 16 | x5 : 16 | x6 : 32 | X3 : 32 | X4 : 32 | X5 : 32 | X6 : 16 | PC : 32 |
Time: 500000 | PC: 20 | Instruction: 10000000000000000000000000000000 | ALU Result: 0 | Mem Data: 0 | Branch: 0|pc_branch| 0|registerout= 32

```



## 2nd Test case

Fibananaco

```

Id x3, 0(x15)    # inst_mem[0] | Load value from address x15 into x3
Id x4, 16(x15)   # inst_mem[1] | Load value from address (x15 + 16) into x4

```

```
ld x30, 32(x15)    # inst_mem[2] | Load value from address (x15 + 32) into x30

beq x10, x30, EXIT  # inst_mem[3] | Branch to EXIT if x10 == x30
add x7, x3, x4      # inst_mem[4] | x7 = x3 + x4
add x3, x4, x0      # inst_mem[5] | x3 = x4
add x4, x7, x0      # inst_mem[6] | x4 = x7
addi x11, x10, 1     # inst_mem[7] | x11 = x10 + 1
add x10, x11, x0     # inst_mem[8] | x10 = x11

beq x0, x0, -6      # inst_mem[9] | Unconditional jump to loop (-6 instructions back)
```

EXIT: 0xFFFFFFFF # inst\_mem[10] | Exit instruction (not a real RISC-V instruction)

```

Fetching Instruction at Address: 8
Instruction: 00000000000000000000000000000000 | Opcode: 00000000 | Rd: 0 | Function3: 0000 | RS1: 0 | RS2: 0 | 0 | Funct7: 00000000 | Immediate: 0 | Control Signals: Branch=0, ALU_OP=0010, ALU_SRC=1, MemRead=0, MemWrite=0, MemToReg=0, RegWrite=0
Register X3 (x): 0 | x(4): 0 | x(8): 0 | x(16): 0 | x(32): 0 | x(64): 0 | x(128): 0 | x(256): 0 | x(512): 0 | x(1024): 0 | x(2048): 0 | x(4096): 0 | x(8192): 0 | x(16384): 0 | x(32768): 0 | x(65536): 0 | x(131072): 0 | x(262144): 0 | x(524288): 0 | x(1048576): 0 | x(2097152): 0 | x(4194304): 0 | x(8388608): 0 | x(16777216): 0 | x(33554432): 0 | x(67108864): 0 | x(134217728): 0 | x(268435456): 0 | x(536870912): 0 | x(1073741824): 0 | x(2147483648): 0 | x(4294967296): 0 | x(8589934592): 0 | x(17179869184): 0 | x(34359738368): 0 | x(68719476736): 0 | x(137438953472): 0 | x(274877856944): 0 | x(549755713888): 0 | x(1099511427776): 0 | x(2199022855552): 0 | x(4398045711104): 0 | x(8796091422208): 0 | x(17592182844416): 0 | x(35184365688832): 0 | x(70368731377664): 0 | x(140737462755328): 0 | x(281474925510656): 0 | x(562949851021312): 0 | x(112589970204264): 0 | x(225179940408528): 0 | x(450359880817056): 0 | x(900719761634112): 0 | x(1801439523268224): 0 | x(3602879046536448): 0 | x(7205758093072896): 0 | x(14411516186145792): 0 | x(28823032372291584): 0 | x(57646064744583168): 0 | x(115292129489166336): 0 | x(230584258978332672): 0 | x(461168517956665344): 0 | x(922337035913310688): 0 | x(184467407182662176): 0 | x(368934814365324352): 0 | x(737869628730648704): 0 | x(1475739257461297408): 0 | x(2951478514922594816): 0 | x(5902957029845189632): 0 | x(11805914059690379264): 0 | x(23611828119380758528): 0 | x(47223656238761517056): 0 | x(94447312477523034112): 0 | x(188894624955046068224): 0 | x(377789249910092136448): 0 | x(755578499820184272896): 0 | x(1511156999640368545792): 0 | x(3022313999280737091584): 0 | x(6044627998561474183168): 0 | x(12089255997122948366336): 0 | x(24178511994245896732672): 0 | x(48357023988491793465344): 0 | x(96714047976983586930688): 0 | x(193428095953967173861376): 0 | x(386856191907934347722752): 0 | x(773712383815868695445504): 0 | x(1547424767631737390891008): 0 | x(3094849535263474781782016): 0 | x(6189699070526949563564032): 0 | x(12379398141053899127128064): 0 | x(24758796282107798254256128): 0 | x(49517592564215596508512256): 0 | x(99035185128431193017024512): 0 | x(19807037025686238603404800): 0 | x(39614074051372477206809600): 0 | x(79228148102744954413619200): 0 | x(158456296205489908827238400): 0 | x(316912592410979817654476800): 0 | x(633825984821959635308953600): 0 | x(126765196964391927061788800): 0 | x(253530393928783854123577600): 0 | x(507060787857567708247155200): 0 | x(101412157571513541649430800): 0 | x(202824315143027083298861600): 0 | x(405648630286054166597723200): 0 | x(811297260572108333195446400): 0 | x(1622594521144216666388892800): 0 | x(3245189042288433332777785600): 0 | x(6490378084576866665555571200): 0 | x(12980756169153733331111442400): 0 | x(25961512338307466662222884800): 0 | x(51923024676614933324445769600): 0 | x(103846049353229866648895339200): 0 | x(207692098706459733297780678400): 0 | x(415384197412919466595561356800): 0 | x(830768394825838933191122713600): 0 | x(1661536789651677866382244427200): 0 | x(3323073579303355732764488854400): 0 | x(6646147158606711465528977708800): 0 | x(13292294317213422911057955417600): 0 | x(26584588634426845822115910835200): 0 | x(53169177268853691644231821670400): 0 | x(106338354537707383288463643340800): 0 | x(212676709075414766576927286681600): 0 | x(425353418150829533153854573363200): 0 | x(850706836301659066307709146726400): 0 | x(1701413672603318132615418293452800): 0 | x(3402827345206636265230836586905600): 0 | x(6805654690413272530461673173811200): 0 | x(1361130938082654560892334634762240): 0 | x(2722261876165309121784669269524480): 0 | x(5444523752322618243569338539048960): 0 | x(1088904750464523648713867707809760): 0 | x(2177809500929047297427735415619520): 0 | x(4355619001858094594855470831239040): 0 | x(8711238003716189189710941662478080): 0 | x(17422476007432378379421883324956160): 0 | x(34844952014864756758843766649812320): 0 | x(69689904029729513517687533299624640): 0 | x(13937980805945902703537066659849320): 0 | x(27875961611891805407074133319698640): 0 | x(55751923223783610814148266639397280): 0 | x(111503846447567221628296533278794560): 0 | x(223007692895134443256593066557589120): 0 | x(446015385790268886513186133115178240): 0 | x(892030771580537773026372266230356480): 0 | x(1784061543161075546052744532460712960): 0 | x(3568123086322151092105489064921425920): 0 | x(7136246172644302184210978129842851840): 0 | x(14272492345288604368421956259685703680): 0 | x(28544984690577208736844912519371407360): 0 | x(57089969381154417473689825038742814720): 0 | x(114179938762308834947377650077485629440): 0 | x(228359877524617669894755300154971258880): 0 | x(456719755049235339789510600309942517760): 0 | x(913439510098470679579021200619845035520): 0 | x(1826879020196941391558044001238880071040): 0 | x(3653758040393882783116088002477760142080): 0 | x(7307516080787765566232176004955520284160): 0 | x(1461503216157553113246432009911040568320): 0 | x(2923006432315106226492864019822080136640): 0 | x(5846012864630212452985728039644160273280): 0 | x(11692025729260424905974456079288320546560): 0 | x(23384051458520849811948912184576641093120): 0 | x(46768102917041699623897824368153282186240): 0 | x(93536205834083399247795648736306564372480): 0 | x(18707241166816679849559129575261312744960): 0 | x(37414482333633359699118259150522625489920): 0 | x(74828964667266799398236518301045250979840): 0 | x(149657929334533598796473036602090501959680): 0 | x(29931585866906719759294607320418100391360): 0 | x(59863171733813439518589214640836200782720): 0 | x(119726343467626879037178429281672401565440): 0 | x(239452686935253758074356858563344803130880): 0 | x(478905373870507516148713717126689606261760): 0 | x(957810747741015032297427434253379212523520): 0 | x(1915621495482030064594854868506758425047040): 0 | x(3831242990964060129189709737013516850094080): 0 | x(766248598192812025837941947402703370018160): 0 | x(1532497196385624051675883894805406740036320): 0 | x(3064994392771248023351767789610813480072640): 0 | x(6129988785542496046703535579221626960145280): 0 | x(12259977571084992093407071158443253840290560): 0 | x(24519955142169984186814142316886507680581120): 0 | x(4903991028433996837362828463377301536116240): 0 | x(9807982056867993674725656926754603072232480): 0 | x(19615964113735987349451313853509206144464960): 0 | x(3923192822747197469890262770701841228892960): 0 | x(7846385645494394939780525541403682457785920): 0 | x(15692771290988789879561051082807364915571840): 0 | x(31385542581977579759122102165614729831143680): 0 | x(62771085163955159518244204331229459662287360): 0 | x(125542170327910319036488408662458919324574720): 0 | x(251084340655820638072976817324917838649149440): 0 | x(502168681311641276145953634649835677298298880): 0 | x(100433736262328255229190726929767135586597760): 0 | x(200867472524656510458381453859534271173195520): 0 | x(401734945049313020856762907718568542346391040): 0 | x(803469890098626041713525815437137084692782080): 0 | x(160693978019725208342705630887428168985564160): 0 | x(321387956039450416685411261774856337971128320): 0 | x(642775912078900833370822523549712675942256640): 0 | x(1285551824157801666741645047099425351885013280): 0 | x(2571103648315603333483290094198850703770026560): 0 | x(5142207296631206666966580188397701407540053120): 0 | x(10284414593262413333933160376795402815080026240): 0 | x(20568829186524826667866320753590805630160052480): 0 | x(41137658373049653335732641507181611260320029920): 0 | x(82275316746099306671465283014363222520640059840): 0 | x(164550633492198613342910566028726445041280119680): 0 | x(329101266984397226685821132057452890082560239360): 0 | x(658202533968794453371642264114905780165120478720): 0 | x(1316405067937588906743284528229515560322408957440): 0 | x(2632810135875177813486569056458531120644817914880): 0 | x(5265620271750355626973138112911706241289635829760): 0 | x(1053124054350071125954627624583401282577271659520): 0 | x(2106248108700142251909255249166802565554543319040): 0 | x(4212496217400284503818510498333605131109086638080): 0 | x(8424992434800569007637020996667210262218173376160): 0 | x(16849984869601138015244041993334420524436346752320): 0 | x(33699969739202276030488083986668841048872694504640): 0 | x(67399939478404552060976167973337682097745389009280): 0 | x(134799878956809104121952335946675364195490778018560): 0 | x(269599757913618208243874671893350728389881556037120): 0 | x(53919951582723641648754934378670145677773111207440): 0 | x(10783990316544728329750968655734029135554622414880): 0 | x(21567980633089456659501937311468058271109244829760): 0 | x(43135961266178913319003874622936116542218489659520): 0 | x(86271922532357826638007749245872233084436979319040): 0 | x(172543845064715653276015498491744466168873958638080): 0 | x(345087690129431306552030996983488932337747917276160): 0 | x(690175380258862613104061993966977864675495834552320): 0 | x(138035076057732522620823997993395572935091666904640): 0 | x(276070152115465045241647995986791145870983333809280): 0 | x(552140304230930090483295991973582291741966667618560): 0 | x(1104280608461850180966591983947164583483933355237120): 0 | x(2208561216923700361933183967894329116867866705474240): 0 | x(4417122433847400723866367935788658233735733410948480): 0 | x(8834244867694801447732735871577316467471466821896960): 0 | x(17668489735397602895465477543154632948942936443793920): 0 | x(35336979470795205790930955086309265897885872887587840): 0 | x(70673958941590411581861910172618531795771745775175680): 0 | x(14134791788318082316372382034523706359154349155035360): 0 | x(28269583576636164632744764068547412782308698310070720): 0 | x(56539167153272329265489528137094825564617396620141440): 0 | x(11307833430654465853097905627418965112923479324028280): 0 | x(22615666861308931706195811254837930225846986480565560): 0 | x(45231333722617863412391622509675860451693972961131120): 0 | x(90462667445235726824783245019351720903389455922262240): 0 | x(180925334890471453649566490038703411806778911844524480): 0 | x(361850669780942907299132980077406823603557823689048960): 0 | x(723701339561885814598265960015413647207115647378097920): 0 | x(1447402679123771629195319320030827284414231294756195840): 0 | x(2894805358247543258385638640061654568828462949512391680): 0 | x(5789610716495086516771277280123279113756925899024783360): 0 | x(11579221432980173033544554560246558227513851798049566720): 0 | x(23158442865960346067089109604923116455027703596099133440): 0 | x(46316885731920692134178219209846232900554417192198266880): 0 | x(92633771463841384268356438419688465801108834384396533760): 0 | x(18526754292768276853671287683937693160221766876879306720): 0 | x(37053508585536553707342575367875386320443533753787613440): 0 | x(74107017171073107414685506735750772608867067507575226880): 0 | x(14821403434214621482935101347150154521773413501515045360): 0 | x(29642806868429242965870202694300309043546826703030090720): 0 | x(59285613736858485931740405388600618086833653406060181440): 0 | x(11857122747371697186348081077720123617666730681212036280): 0 | x(23714245494743394372696162155440247235333461362424072560): 0 | x(47428490989486788745392324310880494467666862724848145120): 0 | x(94856981978973577490784648621760988933337725449696290240): 0 | x(19971396395794715495556937724352197786675450899392580480): 0 | x(39942792791589430991113875448670395557351080798785160960): 0 | x(79885585583178861982227750897340791114702161597570321920): 0 | x(15977117116635772396445550179468158222840432319514063920): 0 | x(31954234233271544792891100358936316445680864639028137840): 0 | x(63908468466543089585782200717872632891361730138056275680): 0 | x(12781693693308617917156440143574526578272346227611255120): 0 | x(25563387386617235834312880287148553156544692455222507280): 0 | x(51126774773234471668625760574297106313089384910445014560): 0 | x(102253549546468943337251521145894212627787779820890029120): 0 | x(204507099092937886674503042291788425255575597641780058240): 0 | x(409014198185875773349006084583576850511151195323560116480): 0 | x(81802839637175154669801316966715370102230238664712023360): 0 | x(163605679274350309339602633933435402044606473329440466720): 0 | x(327211358548700618679205267866870804088812946658880933440): 0 | x(654422717097401237358410535733741608177625893317761866880): 0 | x(130884543419480267471682067466752321635325786663552373360): 0 | x(261769086838960534943364134933504643267651573327104746720): 0 | x(523538173677921069886728269867009286535303146654209493440): 0 | x(104707634735584213977345653973401857267060631308411986880): 0 | x(209415269471168427954691307946803704554121262616823973760): 0 | x(418830538942336855909382615893607409088242525233647947520): 0 | x(837661077884673711818765231787214818176485050467294895040): 0 | x(1675322155769347423637534463574428363553770000934589700080): 0 | x(3350644311538694847275068927148856727107440001869179400160): 0 | x(6701288623077389694550137854297713444200000037383588000320): 0 | x(1340257724615477938910027578859426888400000074767176000640): 0 | x(2680515449230955877820055157718853776800000149534352001280): 0 | x(536103089846191175564010235543770755360000299068704002560): 0 | x(10722061796923
```



## Pipeline Design



### Changes Made in the Pipelined Version:

#### Instruction Fetch Stage:

##### New Input Signals:

| Signal Name | Type  | Purpose                                                 |
|-------------|-------|---------------------------------------------------------|
| pc_write    | Input | Controls whether the PC should update (hazard handling) |
| clk         | Input | Synchronizes pipeline stages with clock                 |

##### Added Clock Synchronization (`clk`):

- In the sequential version, the `always @(*)` block fetched instructions asynchronously.
- The pipelined version does not explicitly use `clk` in `always @(*)`, but the presence of `pc_write` suggests that control logic is used to stall or update the instruction fetch stage.

#### Instruction Decode Stage:

##### 2.1 Addition of Pipeline Registers

- Introduced new outputs such as `id_ex_MemRead`, `id_ex_MemtoReg`, `id_ex_Regwrite`, `id_ex_Memwrite`, `id_ex_write`, `id_ex_rs1`, `id_ex_rs2`, and `id_ex_rd` etc to hold values for the next stage.
- These registers ensure that control signals and operands are passed correctly to the execution stage.

## 2.2 Hazard Detection Mechanism

- Added inputs:
  - `id_ex_MemRead`: Used to check if a load instruction in the previous stage could cause a data hazard.
  - `id_ex_Rd`, `if_id_Rs1`, `if_id_Rs2`: Used to identify read-after-write (RAW) dependencies.
  - `hazard_detected`: Control signal indicating the presence of a hazard.
- Added control signals:
  - `PCWrite`, `IF_ID_Write`: These determine whether the program counter and instruction fetch registers should be updated or stalled to prevent hazards.

## 2.3 Control Signal Gating Using `Control` MUX

- Introduced a control signal (`Control`) that is disabled when a hazard is detected.
- If a hazard is present, `PCWrite` and `IF_ID_Write` are disabled, preventing erroneous instruction execution.

The pipelined version of the decode stage improves performance by reducing stalls and ensuring proper data forwarding. The introduction of hazard detection and control gating mechanisms makes the pipeline more robust and efficient. These modifications are crucial for achieving higher throughput in the RISC-V processor design.

```
[Running] decode_tb.v
PCWrite=1, IF_ID_Write=1, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=1, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 2, id_ex_rs2= 1, id_ex_rd= 3,
opcode=0110011, alu_src=0, alu_op=0010, branch=0, imm= 0
PCWrite=1, IF_ID_Write=1, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=1, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 1, id_ex_rs2= 5, id_ex_rd= 3,
opcode=0010011, alu_src=1, alu_op=0010, branch=0, imm= 5
PCWrite=1, IF_ID_Write=1, id_ex_Memread=1, id_ex_MemtoReg=1, id_ex_Regwrite=1, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 1, id_ex_rs2= 4, id_ex_rd= 3,
opcode=0000011, alu_src=1, alu_op=0010, branch=0, imm= 4
PCWrite=1, IF_ID_Write=1, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=0, id_ex_Memwrite=1, id_ex_write=0, id_ex_rs1= 1, id_ex_rs2= 3, id_ex_rd= 4,
opcode=0100011, alu_src=1, alu_op=0010, branch=0, imm= 4
PCWrite=1, IF_ID_Write=1, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=0, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 2, id_ex_rs2= 1, id_ex_rd= 3,
opcode=1100011, alu_src=0, alu_op=1010, branch=1, imm= 2050
PCWrite=0, IF_ID_Write=0, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=0, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 2, id_ex_rs2= 1, id_ex_rd= 3,
opcode=1100011, alu_src=0, alu_op=0000, branch=0, imm= 0
decode_tb.v:94: $finish called at 60000 (ips)
PCWrite=1, IF_ID_Write=1, id_ex_Memread=0, id_ex_MemtoReg=0, id_ex_Regwrite=0, id_ex_Memwrite=0, id_ex_write=0, id_ex_rs1= 2, id_ex_rs2= 1, id_ex_rd= 3,
opcode=1100011, alu_src=0, alu_op=1010, branch=1, imm= 2050
[Done] exit with code=0 in 0.051 seconds
```

### Execute Stage:

#### (A) Operand Handling

##### Sequential Code:

- Uses `rs1_val` and `rs2_val` directly from the decoder.
- Immediate value (`imm`) is selected based on `alu_src`.

##### Pipelined Code:

- `operandA` and `operandB` replace `rs1_val` and `rs2_val`.
- `operandB` is determined using `alu_src` in ID/EX stage to support immediate-based instructions.

##### Reason for Change:

- The sequential version directly uses register values.
- The pipeline introduces `ID/EX` pipeline registers that pass operands from the decode stage to ensure correct data forwarding.

| Mux control   | Source | Explanation                                                                    |
|---------------|--------|--------------------------------------------------------------------------------|
| ForwardA = 00 | ID/EX  | The first ALU operand comes from the register file.                            |
| ForwardA = 10 | EX/MEM | The first ALU operand is forwarded from the prior ALU result.                  |
| ForwardA = 01 | MEM/WB | The first ALU operand is forwarded from data memory or an earlier ALU result.  |
| ForwardB = 00 | ID/EX  | The second ALU operand comes from the register file.                           |
| ForwardB = 10 | EX/MEM | The second ALU operand is forwarded from the prior ALU result.                 |
| ForwardB = 01 | MEM/WB | The second ALU operand is forwarded from data memory or an earlier ALU result. |

## (B) Pipeline Registers for Data Transfer

### Sequential Code:

- Outputs results (`alu_result`, `overflow`) immediately.

### Pipelined Code:

- Introduces **EX/MEM pipeline registers** (`ex_mem_*` signals) to store computed ALU results and control signals for the memory stage.

### New Signals Introduced:

| Register Name                  | Bit-width   | Description                                                                                                  |
|--------------------------------|-------------|--------------------------------------------------------------------------------------------------------------|
| <code>ex_mem_alu_result</code> | 64 (signed) | Stores the result of the ALU operation from the Execute stage to be forwarded to the Memory stage.           |
| <code>ex_mem_rd</code>         | 5           | Stores the destination register address for the instruction being executed.                                  |
| <code>ex_mem_Memwrite</code>   | 1           | Control signal indicating whether the instruction writes data to memory (1 = Write, 0 = No write).           |
| <code>ex_mem_Memread</code>    | 1           | Control signal indicating whether the instruction reads data from memory (1 = Read, 0 = No read).            |
| <code>ex_mem_MemtoReg</code>   | 1           | Control signal determining if the data from memory should be written back to the register (1 = Yes, 0 = No). |
| <code>ex_mem_Regwrite</code>   | 1           | Control signal indicating whether the instruction writes data to a register (1 = Write, 0 = No write).       |
| <code>ex_mem_overflow</code>   | 1           | Flag indicating if an overflow occurred during the ALU operation (1 = Overflow, 0 = No overflow).            |

### Reason for Change:

- In pipelined execution, each stage needs to pass values forward to the next stage using pipeline registers.

## (C) ALU Control Handling

### Sequential Code:

- Uses an **always @(\*) block** to select the ALU operation based on `alu_op`.
- Directly updates `alu_result`.

### Pipelined Code:

- Uses **non-blocking assignments** (`<=`) to ensure proper pipeline execution.

- Stores ALU results into `ex_mem_alu_result` instead of updating immediately.

#### Reason for Change:

- The pipeline needs to prevent hazards by buffering values in `ex_mem_*` registers before forwarding them.

## (D) Memory and Writeback Control Signals

#### Sequential Code:

- No explicit memory-related control signals.

#### Pipelined Code:

- Introduces `id_ex_Memwrite`, `id_ex_Memread`, `id_ex_MemtoReg`, `id_ex_Regwrite`, and forwards them to `ex_mem_*` registers.

#### Reason for Change:

- Ensures that memory read/write operations happen at the correct cycle in the **Memory (MEM) stage**.

| Signal Name                                                                                                              | Purpose                                         |
|--------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------|
| <code>operandA</code> , <code>operandB</code>                                                                            | Inputs to ALU from pipeline registers           |
| <code>ex_mem_alu_result</code>                                                                                           | Stores ALU result for the MEM stage             |
| <code>ex_mem_rd</code>                                                                                                   | Stores destination register for WB stage        |
| <code>ex_mem_Memwrite</code> , <code>ex_mem_Memread</code> , <code>ex_mem_MemtoReg</code> , <code>ex_mem_Regwrite</code> | Control signals for memory and writeback stages |
| <code>ex_mem_overflow</code>                                                                                             | Stores overflow flag from ALU                   |

```
[Running] exstage_tb.v
Time=0 | ALU_OP=0010 | OperandA=      10 | OperandB=      5 | Result=      15 | Overflow=
Time=10000 | ALU_OP=0110 | OperandA=      10 | OperandB=      5 | Result=      5 | Overflow=0
Time=20000 | ALU_OP=0000 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=30000 | ALU_OP=0001 | OperandA=      10 | OperandB=      5 | Result=      15 | Overflow=0
Time=40000 | ALU_OP=1001 | OperandA=      10 | OperandB=      5 | Result=      15 | Overflow=0
Time=50000 | ALU_OP=0011 | OperandA=      10 | OperandB=      5 | Result=      320 | Overflow=0
Time=60000 | ALU_OP=0100 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=70000 | ALU_OP=0101 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=80000 | ALU_OP=0111 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=90000 | ALU_OP=1000 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=100000 | ALU_OP=1010 | OperandA=      10 | OperandB=      5 | Result=      0 | Overflow=0
Time=110000 | ALU_OP=0101 | OperandA=      10 | OperandB=      5 | Result=      1 | Overflow=0
Time=120000 | ALU_OP=0010 | OperandA= 9223372036854775807 | OperandB=      5 | Result=-9223372036854775804 | Overflow=1
exstage_tb.v:123: $finish called at 140000 (1ps)
[Done] exit with code=0 in 0.659 seconds
```

## Memory Stage:

### Changes in Memory Read and Write Operations

- Memory Read ( `mem_wb_mem_data` )**
  - In the sequential design, `mem_data` is assigned directly based on `memread`.
  - In the pipelined design, `mem_wb_mem_data` is assigned conditionally from `data_memory[ex_mem_alu_result[7:0]]` when `ex_mem_Memread` is high.
- Memory Write ( `data_memory[ex_mem_alu_result[7:0]] <= ex_mem_rs2` )**
  - Instead of using `memwrite` from the sequential design, the pipelined version now uses `ex_mem_Memwrite` to check if memory should be written.

#### Reason for the Change:

- Separation of Memory Read and Write:** In the pipeline, the memory stage must handle both operations without interfering with other stages.
- Control Signal Dependency:** The new approach ensures that memory writes happen only when explicitly enabled by control signals passed from the EX stage.

## Removal of Sequential Control and Direct Memory Access

- In the sequential design, `mem_data` was updated synchronously within an `always @(*)` block.
- The pipelined version ensures that memory operations are only carried out based on control signals from the **EX/MEM pipeline registers**.

### Reason for the Change:

- Pipeline Stages Must Operate Independently:** The memory stage should only execute memory operations when the EX stage has issued the appropriate control signals.
- Ensuring Data Flow Continuity:** Instead of direct memory access, data is now forwarded through pipeline registers.

## Summary of New Signals (Wires and Registers)

| Signal Name                    | Type  | Purpose                                                                   |
|--------------------------------|-------|---------------------------------------------------------------------------|
| <code>mem_wb_rd</code>         | Reg   | Holds the destination register from MEM to WB stage.                      |
| <code>mem_wb_MemtoReg</code>   | Reg   | Control signal to determine if memory data or ALU result is written back. |
| <code>mem_wb_alu_result</code> | Reg   | Holds ALU result for the WB stage.                                        |
| <code>mem_wb_mem_data</code>   | Reg   | Holds memory read data for the WB stage.                                  |
| <code>mem_wb_RegWrite</code>   | Reg   | Control signal indicating if instruction writes back to a register.       |
| <code>ex_mem_rd</code>         | Input | Destination register from EX stage.                                       |
| <code>ex_mem_Memwrite</code>   | Input | Control signal indicating a memory write operation.                       |
| <code>ex_mem_Memread</code>    | Input | Control signal indicating a memory read operation.                        |
| <code>ex_mem_MemtoReg</code>   | Input | Control signal determining if memory data or ALU result is written back.  |
| <code>ex_mem_Regwrite</code>   | Input | Control signal for register write-back.                                   |
| <code>ex_mem_alu_result</code> | Input | ALU result from EX stage.                                                 |
| <code>ex_mem_rs2</code>        | Input | Register value used for memory write operations.                          |

```
[Running] memory_tb.v
Memory Values:          10,           90,           6
Time=0 | clk=0 | MemWrite=0 | MemRead=1 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=5000 | clk=1 | MemWrite=0 | MemRead=1 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Memory Values:          10,           90,           6
Memory Values:          10,           90,           42
Time=10000 | clk=0 | MemWrite=1 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=15000 | clk=1 | MemWrite=1 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Memory Values:          10,           90,           42
Time=20000 | clk=0 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=25000 | clk=1 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
memory_tb.v:68: $finish called at 30000 (ips)
Time=30000 | clk=0 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
[Done] exit with code=0 in 0.053 seconds
```

## Write Back Stage:

- Selection Between ALU Result and Memory Data**
  - In the sequential version, `registerout` was determined using `MemtoReg_reg` and `Regwrite_reg`.
  - In the pipelined version, `wb_registerout` is selected using `mem_wb_MemtoReg`, ensuring it comes from either memory (`mem_wb_mem_data`) or ALU (`mem_wb_alu_result`).
- Register Write Enable Forwarding**
  - The `RegWrite` control signal (`wb_RegWrite`) is explicitly forwarded from the MEM/WB stage to the WB stage.

### Reason for the Change:

- Separating Control Logic from Data Flow** ensures better **pipeline efficiency** and eliminates dependency on earlier stages.

| Signal Name       | Type  | Purpose                                                 |
|-------------------|-------|---------------------------------------------------------|
| wb_RegWrite       | Reg   | Holds the write enable signal for register file.        |
| wb_rd             | Reg   | Holds the destination register number for the WB stage. |
| wb_registerout    | Reg   | Final data value written to the register file.          |
| mem_wb_MemtoReg   | Input | Determines if data comes from memory or ALU.            |
| mem_wb_RegWrite   | Input | Write enable signal from MEM/WB.                        |
| mem_wb_rd         | Input | Destination register from MEM/WB.                       |
| mem_wb_alu_result | Input | ALU result from MEM/WB stage.                           |
| mem_wb_mem_data   | Input | Memory read data from MEM/WB stage.                     |

```
[Running] memory_tb.v
Memory Values:          10,           90,           6
Time=0 | clk=0 | MemWrite=0 | MemRead=1 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=5000 | clk=1 | MemWrite=0 | MemRead=1 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Memory Values:          10,           90,           6
Memory Values:          10,           90,           42
Time=10000 | clk=0 | MemWrite=1 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=15000 | clk=1 | MemWrite=1 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Memory Values:          10,           90,           42
Time=20000 | clk=0 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
Time=25000 | clk=1 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
memory_tb.v:68: $finish called at 30000 (ips)
Time=30000 | clk=0 | MemWrite=0 | MemRead=0 | MemtoReg=0 | RegWrite=0 | rd= 0 | ALU_Result=
[Done] exit with code=0 in 0.053 seconds

[Running] writeback_tb.v
Time=0 | MemtoReg=0 | RegWrite=0 | RD= 0 | ALU_Result=
Time=10000 | MemtoReg=0 | RegWrite=1 | RD=10 | ALU_Result=
Time=20000 | MemtoReg=1 | RegWrite=1 | RD=15 | ALU_Result=
Time=30000 | MemtoReg=1 | RegWrite=0 | RD=15 | ALU_Result=
writeback_tb.v:62: $finish called at 50000 (ips)
[Done] exit with code=0 in 0.049 seconds
```

## Hazard Module

### Design Description of the Hazard Detection Module

The **Hazard Detection Unit** is a crucial part of a pipelined processor. It is responsible for detecting **load-use hazards**, which occur when an instruction in the **ID/EX** stage is reading from memory (`MemRead = 1`), and the next instruction in the **IF/ID** stage needs the result of that memory load.

## Inputs & Outputs

| Signal        | Width | Type   | Description                                                                                                                                         |
|---------------|-------|--------|-----------------------------------------------------------------------------------------------------------------------------------------------------|
| ID_EX_MemRead | 1-bit | Input  | Indicates whether the instruction in the <b>ID/EX</b> stage is performing a memory read.                                                            |
| ID_EX_Rd      | 5-bit | Input  | Destination register ( <code>Rd</code> ) of the instruction in the <b>ID/EX</b> stage.                                                              |
| IF_ID_Rs1     | 5-bit | Input  | First source register ( <code>Rs1</code> ) of the instruction in the <b>IF/ID</b> stage.                                                            |
| IF_ID_Rs2     | 5-bit | Input  | Second source register ( <code>Rs2</code> ) of the instruction in the <b>IF/ID</b> stage.                                                           |
| PCWrite       | 1-bit | Output | Controls whether the <b>Program Counter (PC)</b> should update. ( <code>0</code> = Stall, <code>1</code> = Normal execution).                       |
| IF_ID_Write   | 1-bit | Output | Controls whether the <b>IF/ID</b> register should update. ( <code>0</code> = Stall, <code>1</code> = Normal execution).                             |
| ControlMux    | 1-bit | Output | Determines whether the control signals for the current instruction should be stalled. ( <code>1</code> = Stall, <code>0</code> = Normal execution). |

## Hazard Detection Logic

The hazard detection module stalls the pipeline if:

1. A load instruction (`MemRead = 1`) is in the **ID/EX** stage.
2. The destination register (`ID_EX_Rd`) of that load instruction matches the source register (`Rs1` or `Rs2`) of the next instruction in the **IF/ID** stage.

### 3. The destination register is not x0 ( `ID_EX_Rd ≠ 0` ).

## Pipeline Stalling Example

| Clock Cycle | Instruction in IF/ID Stage  | Instruction in ID/EX Stage           | Hazard? | Action Taken     |
|-------------|-----------------------------|--------------------------------------|---------|------------------|
| 1           | <code>LD x2, 0(x3)</code>   | -                                    | No      | Normal Execution |
| 2           | <code>ADD x5, x2, x4</code> | <code>LD x2, 0(x3)</code>            | Yes     | Stall Pipeline   |
| 3           | <code>ADD x5, x2, x4</code> | <code>LD x2, 0(x3) (stall)</code>    | Yes     | Stall Pipeline   |
| 4           | <code>ADD x5, x2, x4</code> | <code>LD x2, 0(x3) (finished)</code> | No      | Resume Execution |

- The pipeline is **stalled for one cycle** to allow `LD x2, 0(x3)` to complete, ensuring correct execution of `ADD x5, x2, x4`.

```
VCD info: dumpfile Hazard_tb.vcd opened for output.
Test 1: No hazard => PCWrite: 1, IF_ID_Write: 1, ControlMux: 0
Test 2: Hazard on Rs1 => PCWrite: 0, IF_ID_Write: 0, ControlMux: 1
Test 3: Hazard on Rs2 => PCWrite: 0, IF_ID_Write: 0, ControlMux: 1
Test 4: Hazard on Rs1 & Rs2 => PCWrite: 0, IF_ID_Write: 0, ControlMux: 0
Test 5: ID_EX_Rd is zero => PCWrite: 1, IF_ID_Write: 1, ControlMux: 0
Hazard.v:115: $finish called at 50000 (ips)
```

## Design Description of the Forwarding Unit ( `forwarding_unit` )

### ◆ Overview

The **Forwarding Unit** is a critical component of a **pipelined processor** that helps resolve **data hazards** by forwarding data from later pipeline stages (**EX/MEM** and **MEM/WB**) back to the **EX stage**, preventing the need for stalls in certain cases.

In **RISC-V**, data hazards occur when an instruction depends on the result of a previous instruction that has not yet completed writing back to the register file. The forwarding unit detects these hazards and **bypasses the correct value** from the later pipeline stages to the ALU inputs ( `ForwardA` and `ForwardB` ).

### ◆ Inputs & Outputs

| Signal                       | Width | Direction | Description                                                                                          |
|------------------------------|-------|-----------|------------------------------------------------------------------------------------------------------|
| <code>EX_MEM_RegWrite</code> | 1-bit | Input     | Indicates if the instruction in <b>EX/MEM</b> stage writes to a register.                            |
| <code>MEM_WB_RegWrite</code> | 1-bit | Input     | Indicates if the instruction in <b>MEM/WB</b> stage writes to a register.                            |
| <code>EX_MEM_Rd</code>       | 5-bit | Input     | Destination register ( <code>Rd</code> ) of the instruction in <b>EX/MEM</b> stage.                  |
| <code>MEM_WB_Rd</code>       | 5-bit | Input     | Destination register ( <code>Rd</code> ) of the instruction in <b>MEM/WB</b> stage.                  |
| <code>ID_EX_Rs1</code>       | 5-bit | Input     | Source register 1 ( <code>Rs1</code> ) of the instruction in <b>ID/EX</b> stage.                     |
| <code>ID_EX_Rs2</code>       | 5-bit | Input     | Source register 2 ( <code>Rs2</code> ) of the instruction in <b>ID/EX</b> stage.                     |
| <code>ForwardA</code>        | 2-bit | Output    | Control signal for <b>ALU input A</b> (selects forwarded data from <b>EX/MEM</b> or <b>MEM/WB</b> ). |
| <code>ForwardB</code>        | 2-bit | Output    | Control signal for <b>ALU input B</b> (selects forwarded data from <b>EX/MEM</b> or <b>MEM/WB</b> ). |

### ◆ Forwarding Logic

The forwarding unit checks whether the **destination register ( `Rd` ) of a previous instruction** matches the **source registers ( `Rs1` or `Rs2` ) of the current instruction** in the **ID/EX stage**.

If a match is found and the **previous instruction writes to a register**, the unit **forwards** the data to prevent stalls.

### Forwarding Conditions:

- Forward from EX/MEM stage ( ForwardA = 10 or ForwardB = 10 )**
  - The EX/MEM instruction writes ( EX\_MEM\_RegWrite = 1 )
  - The destination register ( EX\_MEM\_Rd ) matches the source register ( ID\_EX\_Rs1 or ID\_EX\_Rs2 )
  - EX\_MEM\_Rd is not register x0 (to avoid forwarding invalid data).
- Forward from MEM/WB stage ( ForwardA = 01 or ForwardB = 01 )**
  - The MEM/WB instruction writes ( MEM\_WB\_RegWrite = 1 )
  - The destination register ( MEM\_WB\_Rd ) matches the source register ( ID\_EX\_Rs1 or ID\_EX\_Rs2 )
  - MEM\_WB\_Rd is not register x0 (to avoid forwarding invalid data).
- No Forwarding ( ForwardA = 00 , ForwardB = 00 )**
  - If no match is found, the ALU receives the **original register values** from the register file.

## ◆ Example Forwarding Scenarios

| Cycle | Instruction in ID/EX | Instruction in EX/MEM | Instruction in MEM/WB | ForwardA | ForwardB |
|-------|----------------------|-----------------------|-----------------------|----------|----------|
| 1     | add x5, x1, x2       | -                     | -                     | 00       | 00       |
| 2     | sub x6, x5, x3       | add x5, x1, x2        | -                     | 10       | 00       |
| 3     | or x7, x6, x4        | sub x6, x5, x3        | add x5, x1, x2        | 10       | 01       |

- Cycle 2:** x5 is needed, but the add instruction is in EX/MEM → ForwardA = 10 .
- Cycle 3:** x6 is needed, but the sub instruction is in EX/MEM, while x5 is in MEM/WB → ForwardA = 10 , ForwardB = 01 .

```
• eswar-kumar@eswar-kumar-HP-Laptop-15s-fq2xxx:~/Downloads/pipelined(1)/pipelined/hazard_test_bench$ iverilog forwarding.v
• eswar-kumar@eswar-kumar-HP-Laptop-15s-fq2xxx:~/Downloads/pipelined(1)/pipelined/hazard_test_bench$ vvp a.out
Time=0 | EX_MEM_RegWrite=0, MEM_WB_RegWrite=0, EX_MEM_Rd= 0, MEM_WB_Rd= 0, ID_EX_Rs1= 1, ID_EX_Rs2= 2 | ForwardA=00, ForwardB=00
Time=10000 | EX_MEM_RegWrite=1, MEM_WB_RegWrite=0, EX_MEM_Rd= 0, MEM_WB_Rd= 0, ID_EX_Rs1= 3, ID_EX_Rs2= 4 | ForwardA=10, ForwardB=00
Time=20000 | EX_MEM_RegWrite=1, MEM_WB_RegWrite=0, EX_MEM_Rd= 4, MEM_WB_Rd= 0, ID_EX_Rs1= 5, ID_EX_Rs2= 4 | ForwardA=00, ForwardB=10
Time=30000 | EX_MEM_RegWrite=0, MEM_WB_RegWrite=1, EX_MEM_Rd= 0, MEM_WB_Rd= 6, ID_EX_Rs1= 6, ID_EX_Rs2= 7 | ForwardA=01, ForwardB=00
Time=40000 | EX_MEM_RegWrite=0, MEM_WB_RegWrite=1, EX_MEM_Rd= 0, MEM_WB_Rd= 7, ID_EX_Rs1= 8, ID_EX_Rs2= 7 | ForwardA=00, ForwardB=01
Time=50000 | EX_MEM_RegWrite=1, MEM_WB_RegWrite=1, EX_MEM_Rd= 9, MEM_WB_Rd= 9, ID_EX_Rs1= 9, ID_EX_Rs2=10 | ForwardA=10, ForwardB=00
Time=60000 | EX_MEM_RegWrite=1, MEM_WB_RegWrite=1, EX_MEM_Rd=11, MEM_WB_Rd=12, ID_EX_Rs1=11, ID_EX_Rs2=12 | ForwardA=10, ForwardB=01
forwarding.v:144: $finish called at 70000 (lps)
```

## 1. Stages Separated into Pipeline Registers

- In the sequential design, all operations (Fetch → Decode → Execute → Memory → Writeback) were happening in a single clock cycle.
- In the pipelined version, each stage now has its own **pipeline register** that stores intermediate results between stages.
- This separation allows overlapping of instruction execution, significantly improving performance.

## 2. Introduction of Pipeline Registers

Pipeline registers are **special registers** introduced between stages to hold the intermediate results. They are updated on the **positive edge of the clock (posedge clk)**.

Here are the pipeline registers introduced:

| Pipeline Register | Purpose                                                                      |
|-------------------|------------------------------------------------------------------------------|
| <b>IF/ID</b>      | Holds instruction fetched from memory (IF stage)                             |
| <b>ID/EX</b>      | Holds decoded instruction data (rs1, rs2, rd, imm, control signals)          |
| <b>EX/MEM</b>     | Holds ALU result, memory write data, control signals                         |
| <b>MEM/WB</b>     | Holds memory read data or ALU result to be written back to the register file |

### 3. Pipeline Stalls and Hazard Detection

#### 💡 Problem: Data Hazard

- In sequential design, if **one instruction depends on the result of a previous instruction**, the processor would give the wrong result.
- Example:

```
add x1, x2, x3
sub x4, x1, x5 // x1 value from previous instruction not available yet
```

- To solve this:
  - **Hazard detection unit** was introduced to detect data hazards.
  - **Pipeline stalls** were introduced when a hazard is detected.

#### ✓ Signals Introduced in Pipeline Processor

Here is a summary of the major signals introduced:

| Signal Name                                                              | Purpose                                                                                 |
|--------------------------------------------------------------------------|-----------------------------------------------------------------------------------------|
| <b>pc_write</b>                                                          | Controls whether the PC (Program Counter) should update or stall.                       |
| <b>if_id_write</b>                                                       | Controls whether the IF/ID register should update or stall.                             |
| <b>forwardA, forwardB</b>                                                | Forwarding control signals to select the correct value for ALU inputs (prevent hazard). |
| <b>hazard_detected</b>                                                   | This signal is raised when a data hazard occurs (stall the pipeline).                   |
| <b>MemtoReg</b>                                                          | Determines if the value written back to the register is from memory or ALU.             |
| <b>RegWrite</b>                                                          | Controls if the register should be written with a new value.                            |
| <b>id_ex_MemRead, id_ex_MemWrite, id_ex_MemtoReg, id_ex_Regwrite</b>     | Control signals from Decode to Execute stage.                                           |
| <b>ex_mem_MemRead, ex_mem_MemWrite, ex_mem_MemtoReg, ex_mem_Regwrite</b> | Control signals from Execute to Memory stage.                                           |
| <b>mem_wb_RegWrite, mem_wb_MemtoReg</b>                                  | Control signals from Memory to Writeback stage.                                         |

#### ✓ 4. How Pipeline Registers Are Updated (Data Flow)

Let's break down how the pipeline registers are used and updated:

##### Stage 1: Instruction Fetch (IF)

**Purpose:** Fetch the instruction from instruction memory.

### Pipeline Register: IF/ID

| Field Name               | Purpose                        |
|--------------------------|--------------------------------|
| <b>if_id_instruction</b> | Holds the instruction fetched. |
| <b>if_id_rs1</b>         | Holds source register 1.       |
| <b>if_id_rs2</b>         | Holds source register 2.       |
| <b>if_id_rd</b>          | Holds destination register.    |

### How it Updates:

```
always @(posedge clk) begin
    if (if_id_write) begin
        if_id_instruction <= if_instruction;
        if_id_rs1 <= if_instruction[19:15];
        if_id_rs2 <= if_instruction[24:20];
        if_id_rd <= if_instruction[11:7];
    end
end
```

- The instruction and register addresses are **passed to the next stage** via the IF/ID register.

### Stage 2: Instruction Decode (ID)

**Purpose:** Decode the instruction and generate control signals.

### Pipeline Register: ID/EX

| Field Name             | Purpose                                  |
|------------------------|------------------------------------------|
| <b>id_ex_rs1_val</b>   | Value of register rs1.                   |
| <b>id_ex_rs2_val</b>   | Value of register rs2.                   |
| <b>id_ex_imm</b>       | Immediate value (sign-extended).         |
| <b>id_ex_rd</b>        | Destination register.                    |
| <b>Control Signals</b> | MemRead, MemWrite, ALUOp, RegWrite, etc. |

### How it Updates:

```
always @(posedge clk) begin
    id_ex_rs1_val <= register_file[id_ex_rs1];
    id_ex_rs2_val <= register_file[id_ex_rs2];
    id_ex_imm <= id_ex_imm_wire;
    id_ex_MemRead <= id_ex_MemRead_wire;
    id_ex_RegWrite <= id_ex_RegWrite_wire;
end
```

- The control signals and operands are passed to the **Execute stage**.

### Stage 3: Execute (EX)

**Purpose:** Perform ALU operations or branch calculations.

### Pipeline Register: EX/MEM

| Field Name               | Purpose                                |
|--------------------------|----------------------------------------|
| <b>ex_mem_alu_result</b> | Result from ALU.                       |
| <b>ex_mem_rs2</b>        | Value of rs2 (for memory writes).      |
| <b>ex_mem_rd</b>         | Destination register.                  |
| <b>Control Signals</b>   | MemRead, MemWrite, RegWrite, MemtoReg. |

### How it Updates:

```
always @(posedge clk) begin
    ex_mem_alu_result <= ex_mem_alu_result_wire;
    ex_mem_rd <= ex_mem_rd_wire;
    ex_mem_MemWrite <= ex_mem_MemWrite_wire;
end
```

- The ALU result and control signals are passed to the **Memory stage**.

### Stage 4: Memory (MEM)

**Purpose:** Read/Write from memory.

### Pipeline Register: MEM/WB

| Field Name               | Purpose                                   |
|--------------------------|-------------------------------------------|
| <b>mem_wb_mem_data</b>   | Data read from memory (if MemRead=1).     |
| <b>mem_wb_alu_result</b> | ALU result (for non-memory instructions). |
| <b>mem_wb_rd</b>         | Destination register.                     |
| <b>Control Signals</b>   | RegWrite, MemtoReg.                       |

### How it Updates:

```
always @(posedge clk) begin
    mem_wb_mem_data <= mem_wb_mem_data_wire;
    mem_wb_rd <= mem_wb_rd_wire;
    mem_wb_RegWrite <= mem_wb_RegWrite_wire;
end
```

- The data is passed to the **Writeback stage**.

### Stage 5: Writeback (WB)

**Purpose:** Write the result back to the register file.

### Write Logic:

```
always @(*) begin
    if (wb_RegWrite) begin
```

```

    register_file[wb_rd] = wb_registerout;
  end
end

```

- If `wb_RegWrite` is high, the result from ALU/memory is written back.

## ✓ 5. How Forwarding Works (Forwarding Unit)

### Problem: Data Hazard

- Example:

```

add x1, x2, x3
sub x4, x1, x5

```

- The value of `x1` is not available in `ID` stage.

### Forwarding Solution

- The **Forwarding Unit** uses:

```

assign ForwardA[1] = fwdA_EX;
assign ForwardA[0] = fwdA_MEM & ~fwdA_EX;

```

- It automatically **forwards the value** from MEM/WB or EX/MEM if needed.

## ✓ 6. Hazard Detection (Stall Control)

- When `MemRead` is high and `id_ex_rd` matches `if_id_rs1`, it stalls the pipeline.
- This prevents the next instruction from executing until data is ready.

| Feature         | Sequential Processor | Pipelined Processor         |
|-----------------|----------------------|-----------------------------|
| Execution Time  | Slow                 | Faster (5x improvement)     |
| Pipeline        | No                   | Yes                         |
| Hazard Control  | No                   | Yes (Forwarding + Stalling) |
| Control Signals | Less                 | More                        |

### Test cases :

```

ld x3 0(x0)
add x4 x3 x2
sd x4 8(x0)
ld x5 8(x0)

```

```
reg [63:0] data_memory [0:255]; // Memory array
reg wb_write_reg;
// Initialize memory with sample values
initial begin
    data_memory[0] = 64'd7;
    data_memory[1] = 64'd34;
    data_memory[2] = 64'd6;
end
```

```
// Initialize registers manually  
uut.register_file[0] = 64'd0000;  
// uut.register_file[1] = 64'h0001;  
uut.register_file[2] = 64'd6;  
// uut.register_file[3] = 64'd14;  
// uut.register_file[4] = 64'd0;  
// uut.register_file[6] = 64'd4;  
// uut.register_file[7] = 64'd9;
```

```
uut.pc = 32'h0000;
```



this test case handles load-use data hazard, double data hazard.

```
Id x3 0(x0)
add x5 ×3 ×4
add x7 ×6 ×5
```

```
module memory (
    output reg [63:0] mem_wb_mem_data,
    output reg mem_wb_RegWrite
);

reg [63:0] data_memory [0:255]; // Memory array
reg wb_write_reg;
// Initialize memory with sample values
initial begin
    data_memory[0] = 64'd10;
    data_memory[1] = 64'd1;
    data_memory[2] = 64'd6;
end

// Write operation (MEMORY WRITE)
always @(*) begin
    //if(memwrite)begin
```

```
processor_tb >  processor_tb
module processor_tb();
    clk = 1'b0;
    instr_count = 0;

    // Initialize registers manually
    // Initialize registers manually
    uut.register_file[0] = 64'd0000;
    uut.register_file[1] = 64'h0001;
    uut.register_file[6] = 64'd3;
    // uut.register_file[3] = 64'd14;
    uut.register_file[4] = 64'd10;
    // uut.register_file[3] = 64'd14;
    // uut.register_file[4] = 64'd0;
    // uut.register_file[6] = 64'd4;
    // uut.register_file[7] = 64'd9;

    uut.pc = 32'h0000;
```



this handles double data hazard, load use data hazard

```
inst_mem[0] = 32'b00000000_00110_00010_000_00111_0110011; // add x7 <= x2 <= x6  
inst_mem[1] = 32'b0000000000111_00000_011_00000_0100011; // sd x7, 0(x0)
```

```

processor_tb.v
3 module processor_tb();
30 initial begin
32     instr_count = 0;
33
34     // Initialize registers manually
35     uut.register_file[0] = 64'h0000;
36     uut.register_file[1] = 64'h0001;
37     uut.register_file[2] = 64'h0;
38     // uut.register_file[3] = 64'd14;
39     uut.register_file[5] = 64'd1;
40     uut.register_file[6] = 64'd1;
41     uut.register_file[7] = 64'd9;
42
43     uut.pc = 32'h0000;
44     uut.pc_write = 1'b1;
45     uut.if_id_write = 1'b1;
46     // uut.id_ex_write = 1'b0;
47     uut.id_ex_Memread = 1'b0;

```

---

```

wire--- id_ex_rst : x | id_ex_rs2 : x | id_ex_rd : x | id_ex_rst_val : x | id_ex_rs2_val : x | id_ex_imm : 0 |
id_ex_Memread : 0 | id_ex_Memwrite : 0 | id_ex_MemtoReg : 0 | id_ex_Regwrite : 0 |alu_src : 0 |alu_op : 0000 | branch : 0
reg---id_ex_rst : x | id_ex_rs2 : x | id_ex_rd : x | id_ex_rst_val : x | id_ex_rs2_val : x | id_ex_imm : 0 |
id_ex_Memread : 0 | id_ex_Memwrite : 0 | id_ex_MemtoReg : 0 | id_ex_Regwrite : 0 |alu_src : 0 |alu_op : 0000 | branch : 0

```

---

**EX\_MEM REGISTER DETAILS**

---

```

operandA- x
operandB- x
wire---opranddummy= x ex_mem_alu_result : x | ex_mem_Memwrite : 0 | ex_mem_Memread : 0 | ex_mem_MemtoReg : 0 | ex_mem_Regwrite : 0 |
| reg---ex_mem_rd : x | ex_mem_rs2 : x | ex_mem_alu_result : x | ex_mem_Memwrite : 0 | ex_mem_Memread : 0 | ex_mem_MemtoReg : 0 | ex_
mem_Regwrite : 0 || forward A : 00 | forward B : 00

```

---

**WRITE BACK DETAILS**

---

```

wire---pc= 40||TIME : 105000 || clk : 1 || Mem_data: 0 || || mem_wb_MemtoReg: 0 || mem_wb_Regwrite: 0 || rd: xxxx imm: 0
reg---pc= 40||TIME : 105000 || clk : 1 || Mem_data: 0 || || mem_wb_MemtoReg: 0 || mem_wb_Regwrite: 0 || rd: xxxx, imm: 0

```

---

**MEM\_DATA REGISTER DETAILS**

---

```

wire---mem_wb_rd : x | mem_wb_alu_result : x | mem_wb_mem_data : 0 | mem_wb_Regwrite : 0 | mem_wb_MemtoReg : 0
reg---mem_wb_rd : x | mem_wb_alu_result : x | mem_wb_mem_data : 0 | mem_wb_Regwrite : 0 | mem_wb_MemtoReg : 0

```

---

```

Time: 0 | clk: 1 | MemtoReg: 0 | MemRead: 0 | MemWrite: 0 | rd: x | ALU_Result: x | MemData: 0
Memory Values: 2, 1, x
Time: 0 | clk: 0 | MemtoReg: 0 | MemRead: 0 | MemWrite: 0 | rd: x | ALU_Result: 2, 1, x | MemData: 0
Memory Values: 2, 1, x
processor>vii5: $finish called at 115000 (ips)
TIME : 115000
clk : 1

```



```

wire [11:0] forwardA_forwards;
forwarding_unit fwd_unit (
    .EX_MEM_Regwrite(ex_mem_Regwrite),
    .MEM_WB_Regwrite(mem_wb_Regwrite),
    .EX_MEM_Rd(ex_mem_rd),
    .MEM_WB_Rd(mem_wb_rd),
    .ID EX_R1(id_ex_r1),
    .ID EX_R2(id_ex_r2),
    .ForwardA(forwardA),
    .ForwardB(forwardB)
);
reg [63:0] operandBdummy;
always @(*)
begin
    // Determine operandA based on ForwardA
    case (fwd_unit)
        case (forwards)
            2'bo0: operandA = id_ex_r1_val;           // No forwarding
            2'bo1: operandA = mem_wb_mem_data;         // Forward from MEM/WB stage
            2'b10: operandA = ex_mem_alu_result;       // Forward from EX/MEM stage
            default: operandA = id_ex_r1_val;          // Default to register value
        endcase

        // Determine operandB based on ForwardB
        case (forwards)
            2'bo0: operandBdummy = id_ex_r2_val; // Use r2_val or immediate
            2'bo1: operandBdummy = mem_wb_mem_data; // Forward from MEM/WB stage
            2'b10: operandBdummy = ex_mem_alu_result; // Forward from EX/MEM stage
            default: operandBdummy = id_ex_r2_val; // Default to register value
        endcase
    endcase
    case (alu_src)
        1'bo0: operandB = operandBdummy;
        1'bo1: operandB = id_ex_imm;
    endcase
end
end

```

Id x3 0(x0)  
Id x4 8(x0)  
add x5 ×3 ×4

```
17
18     reg [63:0] data_memory [0:255]; // Memory array
19     reg wb_write_reg;
20     // Initialize memory with sample values
21     initial begin
22         data_memory[0] = 64'd10;
23         data_memory[1] = 64'd12;
24         data_memory[2] = 64'd6;
25     end
26
27     // Write operation (MEMORY_WRITE)
```



This handles the load-use hazard, but for the second load and add, there will be a stall. As a stall occurs, this error is rectified.

```
ld x7 0(x0)
beq x7 x3
```

this also gives hazard detected when

```
id_ex_mem_read==1&&if_id_branch==1&&(if_id_rs1==id_ex_rd||if_id_rs2==id_ex_rd)==1
```

## Forwarding from EX/MEM Stage

In your code, you have implemented **data forwarding from EX/MEM stage** using a **forwarding unit**. The conditions for forwarding are:

### Condition 1 (EX/MEM Forwarding):

If the **EX/MEM stage** has a valid result to write back (`RegWrite=1`) and the destination register (`EX/MEM.Rd`) matches the source register of the current instruction (`ID/EX.Rs1` or `ID/EX.Rs2`), then the result is **forwarded** directly from the EX stage.

Your code implementation for this:

```
case (forwardA)
  2'b00: operandA = id_ex_rs1_val;      // No forwarding
  2'b01: operandA = mem_wb_mem_data;    // Forward from MEM/WB stage
  2'b10: operandA = ex_mem_alu_result; // Forward from EX/MEM stage
  default: operandA = id_ex_rs1_val;
endcase
```

### Explanation:

- If **EX/MEM.RegWrite == 1** and **EX/MEM.Rd == ID/EX.Rs1**, then the result from the EX stage (`ex_mem_alu_result`) is forwarded to **OperandA**.
- Similarly, for **OperandB**, the EX stage result is forwarded if required.

## Forwarding from MEM/WB Stage

If the instruction result has already reached the **MEM/WB stage**, the value is forwarded from **MEM/WB.MemData** or **MEM/WB.ALUResult**.

### Condition 2 (MEM/WB Forwarding):

```
case (forwardB)
  2'b00: operandBdummy = id_ex_rs2_val;
  2'b01: operandBdummy = mem_wb_mem_data;
  2'b10: operandBdummy = ex_mem_alu_result;
  default: operandBdummy = id_ex_rs2_val;
endcase
```

### Explanation:

- If the result is available in the **MEM/WB stage**, it is forwarded to the execution stage to prevent incorrect data usage.
- This effectively eliminates the need for the instruction to wait for the write-back stage.

## Conditions for Forwarding

The forwarding conditions in your design are checked in the **Forwarding Unit** module:

```
if (EX_MEMORY_RegWrite  
    && (EX_MEMORY_Rd != 0)  
    && (EX_MEMORY_Rd == ID_EX_Rs1))  
    ForwardA = 10;  
  
if (MEMORY_RegWrite  
    && (MEMORY_WB_Rd != 0)  
    && (MEMORY_WB_Rd == ID_EX_Rs1))  
    ForwardA = 01;
```

- This prevents data hazards when **ID/EX** instruction depends on **EX/MEM** or **MEM/WB** stage results.

## 2. Load-Use Data Hazard Handling Using Stall Mechanism

A **Load-Use hazard** occurs specifically when an instruction **loads data from memory** and the very next instruction uses the loaded value.

**Example:**

```
ld x3, 0(x1)  
add x4, x3, x5
```

In this scenario, the **add** instruction needs the value of **x3** from memory, but the **lw** instruction is still in the **MEM stage**. Without a stall, the **add** instruction would receive an invalid value.

## Stall Condition in Design

In your processor, you have effectively handled the **Load-Use Hazard** using a **Hazard Detection Unit**. The unit generates a **stall** signal based on the following condition:

### Condition for Stall (Load-Use Hazard):

```
if (id_ex_MemRead &&  
    ((id_ex_rd == if_id_rs1) || (id_ex_rd == if_id_rs2))) begin  
    hazard_detected = 1;  
end
```

**Explanation:**

- If the **ID/EX.MemRead** signal is high ( **Load instruction** ).
- And either **ID/EX.Rd == IF/ID.Rs1** or **ID/EX.Rd == IF/ID.Rs2** (dependency exists).
- Then, the pipeline is stalled for **one clock cycle**.

## Stall Control Mechanism

The stall mechanism works by freezing the **PC** and **IF/ID registers** for one cycle:

```
always @(posedge clk) begin  
    if_id_write = if_id_write_wire;
```

```

pc_write = pc_write_wire;
endPCWrite = 0 → Prevents fetching the next instruction.

```

- **IF/ID\_Write = 0** → Prevents updating the IF/ID pipeline register.
- This provides time for the `lw` instruction to complete and update the register file.

### ✓ Effect of Stall

- One cycle is delayed for dependent instruction.
- After the `load` instruction completes, the next instruction executes normally.

## 3. Correct Write-Back Handling

### ✓ Handling Write-Back Hazards

A write-back hazard can occur when two instructions try to write to the same register. In your processor, you have ensured that:

- If the **MEM/WB stage** has a valid write (`RegWrite=1`) and the destination register (`Rd`) is non-zero, it writes back to the register file.
- Your code:

```

always @(*) begin
    if (wb_RegWrite) begin
        register_file[wb_rd] = wb_registerout;
    end
end

```

- This ensures **no overwrite** happens in the pipeline and maintains data consistency.

## ✓ 5. Conditions for Forwarding and Stalls

| Hazard Type              | Condition (from code)                                                                             | Action Taken                          |
|--------------------------|---------------------------------------------------------------------------------------------------|---------------------------------------|
| <b>EX/MEM Forwarding</b> | <code>(EX/MEM.RegWrite == 1) &amp;&amp; (EX/MEM.Rd == ID/EX.Rs1)</code>                           | Forward EX/MEM.ALU_Result to EX stage |
| <b>MEM/WB Forwarding</b> | <code>(MEM/WB.RegWrite == 1) &amp;&amp; (MEM/WB.Rd == ID/EX.Rs1)</code>                           | Forward MEM/WB.Data to EX stage       |
| <b>Load-Use Hazard</b>   | <code>(ID/EX.MemRead == 1) &amp;&amp; ((ID/EX.Rd == IF/ID.Rs1) or (ID/EX.Rd == IF/ID.Rs2))</code> | Stall Pipeline by one cycle           |

### Work distribution :

#### Eswar Kumar Anantha

- **Test benches for all stages(pipeline)**
- **EX(sequential) WB(sequential)**
- Integrating(sequential)
- Hazard detection and forwarding (debugging)

**K.Roshan Lal**

- Test Benches for all stages (sequential )
- Debugging test cases (Pipeline,sequential)
- ID stage (sequential)
- Integration for pipeline and debugging it

**M.Radheshyam**

- Generating test cases (Sequential) (Pipeline)
- Report for Sequential Execution, Pipeline Execution
- generate both memory block (instruction and memory) ,compartor( external function)
- changes in pipeline all stages