

## **8-bit Simple Processor**

## Abstract

This report presents the design and implementation of an **8-bit simple processor**, developed for educational purposes. The processor consists of key components such as an **Arithmetic Logic Unit (ALU)**, **Register File**, **Program Counter (PC)**, **Instruction Decoder**, and **Memory**. It follows a **fetch-decode-execute cycle** to process a small instruction set, including arithmetic, logical, and memory operations. The processor is implemented using **Verilog HDL**, with code examples provided for the **ALU**, **instruction set**, and **testbench**. The goal of this design is to offer a basic yet functional model of a **Reduced Instruction Set Computer (RISC)-like architecture**, making it suitable for learning and experimentation in digital design and computer architecture.



## **Table of Contents**

1. Introduction
2. Processor Architecture
3. Block Diagram
4. Processor Operation
5. Instruction Set
6. Verilog Implementation
7. Simulation and Testing
8. Conclusion



## 1. Introduction

This document describes a simple 8-bit processor designed for educational purposes. It features a basic ALU, a small instruction set, and a simple memory architecture.

An 8-bit processor is a type of microprocessor that processes data in 8-bit chunks, meaning that its registers, ALU (Arithmetic Logic Unit), and data bus are all 8 bits wide. This means it can handle a maximum value of 255 ( $2^8 - 1$ ) per operation. These processors were widely used in early computers, embedded systems, and microcontrollers due to their simplicity, low power consumption, and cost-effectiveness.

A **processor** is the fundamental unit of any computing system, responsible for executing instructions and performing operations on data. This report presents the design and implementation of a **simple 8-bit processor**, developed as an educational model to illustrate the core concepts of processor architecture and digital logic design.

This processor is based on a **Reduced Instruction Set Computer (RISC) approach**, featuring a limited yet efficient instruction set that simplifies processing. It consists of key components such as:

- Arithmetic Logic Unit (ALU)
- Register File
- Program Counter (PC)
- Instruction Decoder
- Memory Unit

The processor follows a **fetch-decode-execute cycle**, ensuring systematic execution of instructions. It has been implemented using **Verilog HDL**, allowing for simulation and testing, making it a valuable tool for learners and researchers exploring processor design.

This report provides an in-depth analysis of the processor's **architecture, instruction set, Verilog implementation, and performance testing**, offering insights into fundamental computing operations.

## 1. Arithmetic Logic Unit (ALU)

The Arithmetic Logic Unit (ALU) is a fundamental component of a computer processor responsible for performing arithmetic and logical operations. It executes operations such as addition, subtraction, multiplication, and division, as well as bitwise operations (AND, OR, XOR, NOT) and comparisons. The ALU interacts with registers, the control unit, and memory to process data efficiently.

Modern ALUs are designed for high-speed calculations and may include floating-point units (FPUs) for complex mathematical operations. The performance of a CPU heavily depends on the efficiency of its ALU, as it handles all computational tasks required for executing instructions in a program. In advanced processors, pipelining and parallel execution techniques are used to enhance ALU performance.

## 2. Register File

A **register file** is a collection of **high-speed memory registers** used to **store temporary data** for immediate processing. It is an essential part of a **CPU or microcontroller** that provides **fast access** to frequently used values, reducing the need to access slower main memory.

Registers in the register file store:

- **Operands for the ALU**
- **Intermediate computation results**
- **Instruction addresses**

The size of a register file (e.g., **8-bit, 16-bit, 32-bit, or 64-bit registers**) determines the CPU's **data processing capacity**. Common registers include **general-purpose registers (GPRs)**, **special-purpose registers (SPRs)**, and **status registers**. Efficient register allocation significantly improves the **speed and efficiency of instruction execution**.

## 3. Program Counter (PC)

The **Program Counter (PC)** is a crucial **control unit register** that keeps track of the **address of**

**the next instruction** to be executed in a program. Every time an instruction is fetched, the **PC is updated** to point to the next instruction in memory.

Functions of the Program Counter include:

- **Sequential Execution:** Ensures instructions are executed in order.
- **Branching and Jumping:** Updates the address when conditional/unconditional jumps occur.
- **Exception Handling:** Adjusts program flow during interrupts or exceptions.

In modern CPUs, **pipeline architectures** allow multiple instructions to be fetched simultaneously, requiring efficient PC management. A well-designed PC ensures **efficient control flow and program execution**.



#### 4. Instruction Decoder

The **Instruction Decoder** is a component of the CPU that interprets **machine code instructions** fetched from memory and translates them into **control signals** for execution. It converts binary instructions into micro-operations, directing the CPU's **ALU, registers, and control unit** accordingly.

Key functions of an instruction decoder:

- **Opcode Identification:** Determines the operation type (arithmetic, logical, load/store, branch).
- **Operand Fetching:** Retrieves required operands from registers or memory.
- **Control Signal Generation:** Activates components like the ALU, memory unit, or I/O devices.

Modern CPUs use **microprogramming or hardwired logic** for instruction decoding, allowing efficient execution of **complex instruction sets (CISC)** or **simplified instruction sets (RISC)**.

## 5. Memory Unit

The **Memory Unit** is responsible for **storing data and instructions** required by the CPU during execution. It consists of **primary (RAM, cache) and secondary (HDD, SSD) memory**, ensuring efficient data access and storage.

Types of memory within the CPU include:

- **RAM (Random Access Memory):** Temporary storage for actively running programs.
- **Cache Memory:** Small, fast memory that reduces access time to frequently used data.
- **ROM (Read-Only Memory):** Stores permanent firmware and boot instructions.

The memory unit is structured in a **hierarchical model**, with **faster cache memory** near the processor and **slower main memory** used for bulk storage. Efficient memory management ensures **high processing speed, reduced latency, and smooth execution of programs**.

### Example of an 8-Bit Processor in Action

#### Adding Two Numbers Using an 8-Bit Processor

1. The processor **fetches the instruction:** ADD B (Add register B to accumulator A).
2. The **decoder interprets** it as an arithmetic operation.
3. The **ALU performs the addition:**  $A = A + B$ .
4. The **result is stored in the Accumulator (A)**.
5. The **PC moves to the next instruction**, continuing the execution cycle.

## Applications of 8-Bit Processors

Despite their **limited processing power**, 8-bit processors are widely used in:

- **Microcontrollers** (e.g., **8051, AVR, PIC**) for **embedded systems**.
- **Appliances** like washing machines, microwaves, and calculators.
- **Industrial automation systems**.
- **Retro computers and gaming consoles** (e.g., Nintendo Entertainment System).

An **8-bit processor** is a fundamental computing unit that processes **8-bit data per cycle**. It operates through a structured cycle of **fetch, decode, execute, and update PC**, allowing it to execute instructions efficiently. While modern processors have advanced to **32-bit and 64-bit architectures**, 8-bit processors remain **relevant in embedded systems, low-power devices, and industrial applications** due to their **simplicity, cost-effectiveness, and efficiency**.

### Flowchart of 8-Bit Processor Instruction Cycle





Fetch Operands (Registers/Memory)



Execute Instruction (ALU/Control)



Store Result (Register/Memory)



Update Program Counter (PC + 1 or Branch)



Next Instruction?



Yes   No



→ Repeat Cycle



End of Program



## Diagram of an 8-Bit Processor Architecture



| +-----+ |  
|| **Arithmetic Logic Unit (ALU)** || **Performs arithmetic/logical ops**  
| +-----+ |

| | |  
| ▼ |  
| | |

| +-----+ |  
|| **Memory Unit (RAM, ROM)** || **Stores data and instructions**  
| +-----+ |



| | **Control Unit** | | **Manages overall processor operation**  
| +-----+ |

| | |  
| ▼ |  
| | |

| | **External Buses (Data, Address, Control)** |  
+-----+

## Flowchart of Addition Operation in an 8-Bit Processor

Example: Adding two numbers using an 8-bit processor



End

### Data Flow During Instruction Execution in an 8-Bit Processor



| ALU Execution | <--- Performs arithmetic/logical operation

+-----+



+-----+

| Store Result in Memory|

+-----+



+-----+

| Update Program Counter|

+-----+



Next Instruction Execution



These flowcharts and diagrams provide a **clear step-by-step representation** of how an **8-bit processor functions**, processes instructions, performs arithmetic, and manages data.

## 2. Processor Architecture

The processor has the following key components:

- **ALU (Arithmetic Logic Unit):** Performs arithmetic and logical operations.
- **Register File:** Stores temporary data (regA, regB).
- **Program Counter (PC):** Holds the address of the next instruction to be executed.
- **Memory:** Stores instructions and data.
- **Instruction Decoder:** Decodes the opcode from the instruction.

A **Processor Architecture** consists of several key components that work together to execute instructions efficiently.

- ALU (Arithmetic Logic Unit): The ALU performs mathematical (addition, subtraction, multiplication, division) and logical (AND, OR, XOR, NOT, comparisons) operations, essential for computation and decision-making in a CPU.
- Register File: A high-speed memory unit that stores temporary values, such as operands and computation results (e.g., regA, regB), ensuring quick access for the processor.
- Program Counter (PC): This register holds the memory address of the next instruction to be executed, enabling sequential program flow and branch execution.
- Memory: Stores both instructions (program code) and data needed during execution. It includes RAM (volatile) and ROM (non-volatile) memory.
- Instruction Decoder: Interprets the opcode (operation code) from the fetched instruction and translates it into control signals, directing CPU components like the ALU and registers to execute the command.

Together, these components form the core of a processor, enabling efficient execution of instructions in computers, microcontrollers, and embedded systems.

### 3. Block Diagram

[https://www.researchgate.net/figure/A-simple-RISC-processor-architecture\\_fig20\\_322978966](https://www.researchgate.net/figure/A-simple-RISC-processor-architecture_fig20_322978966)



## 4. Operation

1. **Fetch:** The PC holds the address of the next instruction. The instruction is fetched from memory.
2. **Decode:** The instruction decoder extracts the opcode and operands from the instruction.
3. **Execute:**
  - o For ALU operations (ADD, SUB, AND, OR, XOR), the ALU performs the operation on the operands (regA, regB), and the result is stored in regA.
  - o For LOAD, data is fetched from memory and stored in regA.
  - o For STORE, the value in regA is stored in memory.
  - o For JMP, the PC is updated to the jump address.
4. **Update PC:** The PC is incremented or updated based on the instruction.
5. **Repeat:** The process repeats from step 1.

The **processor operation** follows a structured sequence known as the **instruction cycle**, which consists of four main steps: **Fetch, Decode, Execute, and Update PC**. This cycle is repeated continuously, allowing the processor to execute instructions efficiently and ensure smooth program execution.

The first step, **Fetch**, involves retrieving the next instruction from memory. The **Program Counter (PC)** holds the address of the instruction to be executed. This address is sent to the memory unit, which fetches the instruction and loads it into the **Instruction Register (IR)**. Fetching ensures a continuous flow of instructions to the processor.

In the **Decode** phase, the instruction is analyzed by the **Instruction Decoder**. The decoder determines the **operation type** (arithmetic, logical, control, or data transfer) and identifies whether additional **operands** need to be fetched from registers or memory. The CPU's control unit then prepares the necessary components, such as the **ALU, registers, and memory**, for execution.

During the **Execute** phase, the processor performs the required operation. If it is an **arithmetic instruction**, the **ALU (Arithmetic Logic Unit)** carries out the computation. If it is a **data transfer instruction**, data is moved between registers and memory. The result is stored in a register or written back to memory.

Finally, in the **Update PC** phase, the **Program Counter** is updated to point to the next instruction.

If a branch or jump instruction is executed, the PC is modified to the new address; otherwise, it moves to the next sequential instruction in memory. This cycle repeats continuously, allowing the processor to execute instructions in a structured and efficient manner.

The **processor operation** follows a sequence known as the **instruction cycle**, which consists of four key steps: **Fetch, Decode, Execute, and Update PC**. This cycle ensures smooth execution of programs in a **CPU, microcontroller, or embedded system**.

### 1. Fetch

In this step, the **processor retrieves the next instruction from memory (RAM)**. The **Program Counter (PC)** holds the **memory address** of the instruction to be fetched. The instruction is then **loaded into the Instruction Register (IR)** for processing.

### 2. Decode

Once fetched, the **Instruction Decoder** interprets the **opcode (operation code)** and identifies **which components of the CPU (ALU, registers, memory, etc.) need to be activated**. The decoder also determines whether **operands** need to be fetched from registers or memory for execution.

### 3. Execute

The **ALU (Arithmetic Logic Unit)** or other functional units perform the required operation, such as **mathematical calculations (ADD, SUB, MUL, DIV), logical operations (AND, OR, XOR), data movement, or branching**. The result is stored in a **register or memory** as needed.

### 4. Update PC

After execution, the **Program Counter (PC)** is **updated** to point to the **next instruction**. If the instruction was a jump or branch, the PC is modified accordingly. Otherwise, it simply moves to the **next sequential address**.

This cycle repeats continuously, ensuring the **smooth execution of programs** in digital processors.

## 5. Instruction Set

| Opcode (Binary) | Instruction | Description                                    |
|-----------------|-------------|------------------------------------------------|
| 000             | ADD         | $\text{regA} = \text{regA} + \text{regB}$      |
| 001             | SUB         | $\text{regA} = \text{regA} - \text{regB}$      |
| 010             | AND         | $\text{regA} = \text{regA} \& \text{regB}$     |
| 011             | OR          | $\text{regA} = \text{regA} \vee \text{regB}$   |
| 100             | XOR         | $\text{regA} = \text{regA} \wedge \text{regB}$ |
| 101             | LOAD        | $\text{regA} = \text{mem}[\text{address}]$     |
| 110             | STORE       | $\text{mem}[\text{address}] = \text{regA}$     |
| 111             | JMP         | $\text{PC} = \text{address}$                   |



### Instruction Set in an 8-Bit Processor

An instruction set in an 8-bit processor consists of a set of predefined machine-level commands that the processor can execute. These instructions manipulate 8-bit data and control the processor's operations. The instruction set is categorized into several types based on functionality.

#### 1. Data Transfer Instructions

These instructions move data between registers, memory, and the accumulator.

- MOV: Transfers data between registers (e.g., MOV A, B).

- MVI: Moves an immediate value to a register (e.g., MVI A, 0x05).
- LDA/STA: Loads or stores data from/to memory (e.g., LDA 2000H).

## 2. Arithmetic Instructions

Perform basic mathematical operations.

- ADD: Adds a register value to the accumulator (ADD B).
- SUB: Subtracts a register value from the accumulator (SUB C).
- INR/DCR: Increments or decrements a register (INR A).

## 3. Logical Instructions

Perform bitwise operations and condition checking.

- ANA/OR/A/XRA: AND, OR, and XOR operations on registers.
- CPI: Compares an immediate value with the accumulator.

## 4. Control Instructions

Control the execution flow of the program.

- JMP: Jumps to a specified memory address (JMP 3000H).
- CALL/RET: Calls a subroutine and returns from it.
- HLT: Halts the processor.

The 8-bit instruction set enables efficient processing for small-scale computing, embedded systems, and microcontroller applications.

## 6. Verilog Code

opcodes.v:

Verilog

```
`define OPCODE_ADD 3'b000
`define OPCODE_SUB 3'b001
`define OPCODE_AND 3'b010
`define OPCODE_OR  3'b011
`define OPCODE_XOR 3'b100
`define OPCODE_LOAD 3'b101
`define OPCODE_STORE 3'b110
`define OPCODE_JMP 3'b111
```

alu.v:

Verilog

```
module ALU (
    input [7:0] a,
    input [7:0] b,
    input [2:0] opcode,
    output reg [7:0] result
);

    always @(*) begin
        case (opcode)
            3'b000: result = a + b;
            3'b001: result = a - b;
            3'b010: result = a & b;
            3'b011: result = a | b;
            3'b100: result = a ^ b;
            default: result = 8'h00;
        endcase
    end

endmodule
```



## processor.v:

Verilog

```
\include "opcodes.v"

module Processor (
    input clk,
    input rst,
    input [7:0] instruction,
    input [7:0] data_in,
    output reg [7:0] data_out,
    output reg [7:0] address_out
);

reg [7:0] regA, regB, regPC;
wire [7:0] alu_result;
wire [2:0] opcode;
reg [7:0] mem[0:255];

assign opcode = instruction[7:5];

ALU alu (
    .a(regA),
    .b(regB),
    .opcode(opcode[2:0]),
    .result(alu_result)
);

always @(posedge clk or posedge rst) begin
    if (rst) begin
        regA <= 8'h00;
        regB <= 8'h00;
        regPC <= 8'h00;
        data_out <= 8'h00;
        address_out <= 8'h00;
    end else begin
        case (opcode)
            `OPCODE_ADD: begin
                regA <= alu_result;
                regPC <= regPC + 1;
            end
            `OPCODE_SUB: begin
                regA <= alu_result;
                regPC <= regPC + 1;
            end
            `OPCODE_AND: begin
                regA <= alu_result;
```



```

regPC <= regPC + 1;
end
`OPCODE_OR: begin
  regA <= alu_result;
  regPC <= regPC + 1;
end
`OPCODE_XOR: begin
  regA <= alu_result;
  regPC <= regPC + 1;
end
`OPCODE_LOAD: begin
  regA <= mem[instruction[4:0]];
  regPC <= regPC + 1;
end
`OPCODE_STORE: begin
  mem[instruction[4:0]] <= regA;
  regPC <= regPC + 1;
end
`OPCODE_JMP: begin
  regPC <= instruction[4:0];
end
default: begin
  regPC <= regPC + 1;
end
endcase
end
end

always @(posedge clk) begin
  address_out <= regPC;
  data_out <= mem[regPC];
end

endmodule

```



### processor\_tb.v:

Verilog

```

`include "opcodes.v"

module Processor_tb;
  reg clk, rst;
  reg [7:0] instruction, data_in;
  wire [7:0] data_out, address_out;

```

```

Processor dut (
    .clk(clk),
    .rst(rst),
    .instruction(instruction),
    .data_in(data_in),
    .data_out(data_out),
    .address_out(address_out)
);

initial begin
    clk = 0;
    forever #5 clk = ~clk;
end

initial begin
    rst = 1;
    instruction = 8'h00;
    data_in = 8'h00;
    #10;
    rst = 0;

    instruction = {`OPCODE_LOAD, 5'd10}; #10;
    instruction = {`OPCODE_STORE, 5'd11}; #10;
    instruction = {`OPCODE_LOAD, 5'd12}; #10;
    instruction = {`OPCODE_ADD, 5'd2}; #10;
    instruction = {`OPCODE_STORE, 5'd03}; #10;
    instruction = {`OPCODE_JMP, 5'd0}; #10;

    #100;
    $finish;
end

initial begin
    $dumpfile("processor_tb.vcd");
    $dumpvars(0, Processor_tb);
end

initial begin
    $monitor("Time=%t, PC=%h, Instruction=%b, regA=%h, regB=%h", $time, dut.regPC, instruction,
dut.regA, dut.regB);
    end
endmodule

```

```

`include "opcodes.v"

module Processor (
    input clk,
    input rst,
    input [7:0] instruction,
    input [7:0] data_in,
    output reg [7:0] data_out,
    output reg [7:0] address_out
);

reg [7:0] regA, regPC;
wire [7:0] alu_result;
wire [2:0] opcode;
wire [7:0] regB;
assign regB = instruction [4:0];
reg [7:0] mem[0:31];

assign opcode = instruction[7:5];

ALU alu (
    .a(regA),
    .b(regB),
    .opcode(opcode[2:0]),
    .result(alu_result)
);
integer i;
always @(posedge clk or posedge rst) begin
    if (rst) begin
        regA <= 8'h00;
        //regB <= 8'h00;
        regPC <= 8'h00;
        data_out <= 8'h00;
        address_out <= 8'h00;
        mem[ 0] <= 8'haf;
        mem[ 1] <= 8'h0e;
        mem[ 2] <= 8'h2d;
        mem[ 3] <= 8'h3a;
        mem[ 4] <= 8'h4c;
        mem[ 5] <= 8'h5f;
        mem[ 6] <= 8'h67;
        mem[ 7] <= 8'h90;
        mem[ 8] <= 8'h00;
        mem[ 9] <= 8'haa;
        mem[10] <= 8'hbb;
        mem[11] <= 8'hcc;
        mem[12] <= 8'hdd;
        mem[13] <= 8'hee;
        mem[14] <= 8'hff;
        mem[15] <= 8'h01;
    end
end

```



```

for (i=16; i <=31; i++) begin
    mem[i] <= 0;
end

end else begin
    case (opcode)
        `OPCODE_ADD: begin
            regA <= alu_result;
            regPC <= regPC + 1;
        end
        `OPCODE_SUB: begin
            regA <= alu_result;
            regPC <= regPC + 1;
        end
        `OPCODE_AND: begin
            regA <= alu_result;
            regPC <= regPC + 1;
        end
        `OPCODE_OR: begin
            regA <= alu_result;
            regPC <= regPC + 1;
        end
        `OPCODE_XOR: begin
            regA <= alu_result;
            regPC <= regPC + 1;
        end
        `OPCODE_LOAD: begin
            regA <= mem[instruction[4:0]];
            regPC <= regPC + 1;
        end
        `OPCODE_STORE: begin
            mem[instruction[4:0]] <= regA;
            regPC <= regPC + 1;
        end
        `OPCODE_JMP: begin
            regPC <= instruction[4:0];
        end
        default: begin
            regPC <= regPC + 1;
        end
    endcase
end
end

always @(posedge clk) begin
    address_out <= regPC;

    data_out <= alu_result;//mem[regPC];
end

endmodule

```



## Opcode Definitions

In an 8-bit processor, an opcode (operation code) is a binary-coded instruction that tells the processor what operation to perform. The instruction decoder reads the opcode and activates the necessary CPU components. Examples include:

- MOV A, B (Opcode: 78H) → Moves data from register B to A.
- ADD A, B (Opcode: 80H) → Adds values in registers A and B.
- JMP 2000H (Opcode: C3H) → Jumps to memory address 2000H.

Each opcode has a fixed bit pattern, allowing the processor to recognize and execute it.

## ALU Module

The Arithmetic Logic Unit (ALU) is responsible for performing arithmetic and logical operations in an 8-bit processor. It takes 8-bit inputs from registers, processes them, and stores the result in an accumulator or memory. It handles:

- Arithmetic Operations: ADD, SUB, INC, DEC.
- Logical Operations: AND, OR, XOR, NOT.
- Comparisons: CMP (Compare two values).

The ALU works with status flags (Zero, Carry, Sign, Overflow) to determine the result's conditions.

## Processor Module

The processor module integrates all key CPU components, including the ALU, registers, program counter, instruction decoder, and control unit. It executes instructions in four stages:

1. Fetch: Retrieves instruction from memory.
2. Decode: Interprets opcode and fetches operands.
3. Execute: Performs the operation (via ALU or memory).
4. Update PC: Moves to the next instruction.

The processor also manages interrupt handling, clock cycles, and memory access.

## Testbench for Simulation

A testbench is used to simulate and verify an 8-bit processor's functionality before hardware implementation. It is written in Verilog or VHDL and provides:

- Instruction execution testing (e.g., ADD, MOV).
- Register behavior verification.
- ALU operations and flag testing.

Simulation tools like ModelSim or Xilinx ISE analyze the testbench output, ensuring the processor functions correctly under various conditions.



## 7. Simulation and Testing

Simulation and testing are crucial steps in verifying the functionality of an 8-bit processor before it is implemented in hardware. This process ensures that all components, such as the ALU, registers, memory, instruction decoder, and control unit, work correctly under different conditions. HDL (Hardware Description Language) like Verilog or VHDL is used to design and test the processor through simulation tools.

### 1. Steps in Simulating an 8-Bit Processor

#### Step 1: Design the Processor Components

The ALU, Register File, Program Counter, Instruction Decoder, and Memory are designed as individual modules using Verilog or VHDL.

The processor's architecture is defined based on its instruction set and opcode format.

#### Step 2: Develop a Testbench

A testbench is written in HDL to simulate the processor. It includes:

Stimulus Inputs: Sample instruction sequences to test different operations.

Expected Outputs: Correct results for comparison.

Clock and Reset Signals: To control processor execution cycles.

Example of a Verilog testbench snippet for an ADD instruction:

```
initial begin
    opcode = 8'b10000000; // ADD instruction opcode
    regA = 8'b00000101; // Register A = 5
    regB = 8'b00000011; // Register B = 3
    #10; // Wait for execution
    $display("Result: %d", result); // Check output
end
```

### Step 3: Run the Simulation

The processor's behavior is analyzed cycle by cycle using simulation tools like ModelSim, Xilinx ISE, or Quartus.

Waveform outputs (such as register values and ALU results) are observed.

Errors like incorrect opcode decoding, memory access failures, or logic errors are identified and fixed.

## 2. Testing the 8-Bit Processor

### Functional Testing

- Ensures that all instructions **execute correctly** (e.g., ADD, SUB, MOV, JMP).
- Compares **actual results vs. expected results** for validation.

### Performance Testing

- Measures execution time, **clock cycles per instruction (CPI)**, and memory access delays.
- Ensures proper synchronization with external components like memory and I/O devices.

### Edge Case Testing

- Tests scenarios like **register overflow, divide-by-zero, and invalid opcode handling**.
- Ensures error flags (Zero, Carry, Overflow) are set correctly.

## 3. Finalizing the Processor for Hardware Implementation

Once simulation is successful, the processor can be:

Synthesized into FPGA hardware for real-world testing.

Implemented as an ASIC (Application-Specific Integrated Circuit) for dedicated applications.

By performing simulation and testing, designers ensure the 8-bit processor operates accurately and efficiently, avoiding costly hardware errors.

## **8. Conclusion**

The 8-bit simple processor designed in this report demonstrates a fundamental yet practical approach to understanding processor architecture. With its basic ALU, instruction set, and memory management, it provides a clear foundation for learning digital design concepts. The implementation in Verilog allows for further testing, modification, and expansion, making it an excellent tool for students and engineers exploring computer architecture and embedded system design. Future improvements could include pipeline processing, additional instructions, and memory hierarchy enhancements to create a more powerful and efficient processor model.

