

---

**team BOES**

Computer Architecture  
ECE 09243 | Summer 2021

---

Professor Muhlbaier

---

Jeremy Beal  
Michael Onyeije  
Ali Elhamawi

---

---

**BOES-16JAMB Processor Datasheet**  
**A 16-bit Processor**

---

# Section 1: The Overview

The BOES-16JAMB is a 16-bit processor based on Harvard Architecture, meaning that it makes use of two separate memory spaces for program instructions and data. There are a few advantages that come with this. A processor's operating bandwidth can improve, and a designer can incorporate different bus sizes between the two forms of memory. Our processor uses two separate memory components, a ROM (read-only memory) and a RAM (random access memory). These two forms of memory will be described in more detail in the [\*\*MEMORY SECTION\*\*](#). The BOES-16JAMB uses a register file with 8 registers, all (at least) 16-bits. Data can be written from a desired register, or read from two desired registers at a time. The ALU in our processor is capable of doing common arithmetic and logic instructions along with logical shifts. Both the ALU and register file handle (at least) 16-bit input. More individualized information on each group member's processor can be found [\*\*HERE\*\*](#). These sections will shed more light on specific memory sizes, architecture implementation, peripherals, and clock speed.

# Table of Contents

|                                                 |    |
|-------------------------------------------------|----|
| <u>The Overview</u> .....                       | 2  |
| <u>Architecture Overview</u> .....              | 4  |
| <u>List of Figures</u> .....                    | 5  |
| <u>List of Tables</u> .....                     | 7  |
| <u>The Register File</u> .....                  | 8  |
| <u>The ALU</u> .....                            | 10 |
| <u>The Memory</u> .....                         | 14 |
| <u>The Datapath</u> .....                       | 15 |
| <u>The Control Unit</u> .....                   | 19 |
| <u>The CPU</u> .....                            | 23 |
| <u>Peripherals</u> .....                        | 28 |
| <u>Instruction Set Architecture</u> .....       | 30 |
| <u>Example Program</u> .....                    | 38 |
| <u>Performance</u> .....                        | 40 |
| <u>Personal Enhancements and Features</u> ..... | 41 |

## Section 2: Architecture Overview

Figure 2-1: Simple Block Diagram (Jeremy)



This is a very simple version of my finished datapath. Data flows into the program counter, the program counter value flows into the separate ROM which passes instructions to the instruction register. The instruction register will load instructions into the control unit which will generate a control word that tells everything else what to do. More information about this process can be found in the [datapath section](#). However, from an architecture standpoint, there are indeed two separate memory units, and the bulk of the selection happens using tri-state buffers (as seen with the AS and DS signals in the above figure).

# **Section 3: List of Figures**

## **Architecture Overview**

- Figure 2-1: Simple Block Diagram (Jeremy)

## **The Register File**

- Figure 5-1: 8x16 Register File
- Figure 5-2: Test bench of the Register File

## **The ALU**

- Figure 6-1: 16-bit ALU from 4, 4-bit ALU's
- Figure 6-2: Testbench of ALU (logic)
- Figure 6-3: Testbench of ALU (arithmetic)

## **The Memory**

- Figure 7-1: Memory Layout

## **The Datapath**

- Figure 8-1: Instruction Register Symbol
- Figure 8-2: Program Counter Symbol
- Figure 8-3: Datapath using Tri-state Buffers
- Figure 8-4: Datapath using Muxes
- Figure 8-5: Control Word

## **The Control Unit**

Figure 6-1: Control Unit Final Schematic (1 of 2)

- Figure 9-2: Control Unit Final Schematic (2 of 2)
- Figure 9-3: Control Unit Decoders Register
- Figure 9-4: Control Unit State Machine
- Figure 9-5: Control Unit Test Bench (1)
- Figure 9-5: Control Unit Test Bench (2)

## **The CPU**

- Figure 10-1: Complete Processor Datapath
- Figure 10-2: Testing LRI Instruction
- Figure 10-3: Testing BRN Instruction (1 of 2)
- Figure 10-4: Testing BRN Instruction (2 of 2)
- Figure 10-5: Testing STI Instruction

## **The Peripherals**

- Figure 11-1: VGA Block Connected to Processor
- Figure 11-2: Horizontal Timing For VGA

## **Instruction Set Architecture**

- [\*\*Figure 12-1: Instruction Set Summary\*\*](#)
- [\*\*Figure 12-2: Instruction Set Formats\*\*](#)
- [\*\*Figure 12-3: Detailed Instruction Set List with Descriptions \(1 of 5\)\*\*](#)
- [\*\*Figure 12-4: Detailed Instruction Set List with Descriptions \(2 of 5\)\*\*](#)
- [\*\*Figure 12-5: Detailed Instruction Set List with Descriptions \(3 of 5\)\*\*](#)
- [\*\*Figure 12-6: Detailed Instruction Set List with Descriptions \(4 of 5\)\*\*](#)
- [\*\*Figure 12-7 Detailed Instruction Set List with Descriptions \(5 of 5\)\*\*](#)

## **Example Program**

- [\*\*Figure 13-1: Example Program Code \(Jeremy\)\*\*](#)

## **Performance**

## **Personal Enhancements and Features (Jeremy)**

- [\*\*Figure 15-1: ALU Flags\*\*](#)
- [\*\*Figure 15-2: Overflow Example\*\*](#)
- [\*\*Figure 15-3: Multiplier Test\*\*](#)
- [\*\*Figure 15-4: Stack Test\*\*](#)
- [\*\*Figure 15-5: Call and Return\*\*](#)

## **Appendix**

- [\*\*Figure 16-1: Register File Decoder\*\*](#)
- [\*\*Figure 16-2: Register File Mux\*\*](#)
- [\*\*Figure 16-3: ALU Cell\*\*](#)
- [\*\*Figure 16-4: Instruction Register Verilog\*\*](#)
- [\*\*Figure 16-5: Program Counter Verilog\*\*](#)
- [\*\*Figure 16-6: Better View of Datapath \(Jeremy\)\*\*](#)
- [\*\*Figure 16-7: Clock Divider\*\*](#)
- [\*\*Figure 16-8: VGA Horizontal Counter\*\*](#)
- [\*\*Figure 16-9: VGA Vertical Counter\*\*](#)
- [\*\*Figure 16-10: VGA Top\*\*](#)
- [\*\*Figure 16-11: Multiplier Schematic\*\*](#)
- [\*\*Figure 16-12: Stack Verilog \(1\)\*\*](#)
- [\*\*Figure 16-13: Stack Verilog \(2\)\*\*](#)



## Section 4: List of Tables

### The Register File

- Table 5-1: Inputs and Outputs of Register File

### The ALU

- Table 6-1: Function Select Table

### The Datapath

- Figure 8-1: Table of Signals

### Instruction Set Architecture

- Table 12-1: Opcode Field Descriptions
- Table 12-2: Table of Operations and Control Words (Opcodes starting with 01)  
*(1 of 2)*
- Table 12-3: Table of Operations and Control Words (Opcodes starting with 01)  
*(2 of 2)*
- Table 12-4: Table of Operations and Control Words (Opcodes starting with 10)  
*(1 of 2)*
- Table 12-5: Table of Operations and Control Words (Opcodes starting with 10)  
*(2 of 2)*
- Table 12-6: Table of Operations and Control Words (Opcodes starting with 00)

# Section 5: The Register File

Figure 5-1: 8x16 Register File



Figure 1-1 shows an 8x16 register file, 8 registers 16 bits each. There is a 4 bit destination select that chooses which register to load data into (this is sent through a 3-to-8 decoder). It has a write input which will load data when write is 1, and does not load data when write is 0. There are also 2, 16 bit 8-to-1 multiplexers that select the registers to connect to the outputs. The write signal and decoder outputs are sent through 8 AND gates before going into each of the registers.

**Figure 5-2: Test bench of the Register File**



Here, we can prove that the register file is working properly by looking at the wave diagram in ModelSim. Write is 1, meaning that data will be loaded. The data to be loaded is located on the D input (in orange). Specifically, it will be loaded onto register 011 based on our destination address (in blue). On the next clock edge, we get that.

**Table 5-1: Inputs and Outputs of Register File**

| Inputs / Outputs | Description                                           |
|------------------|-------------------------------------------------------|
| D                | 16-bit data input that loads into the registers.      |
| DA               | 3 / 4 bit address that selects register to load into. |
| A                | 16-bit output from selected register.                 |
| AA               | 3 / 4 bit output to select A for output.              |
| B                | 16-bit output from selected register.                 |
| BA               | 3 / 4 bit output to select B for output.              |
| WR               | Write a signal to enable load.                        |
| clock            | Input to the clock of all registers.                  |
| reset            | Resets all registers to zero.                         |
| R0 - R7/R15      | Direct 16-bit register outputs.                       |

## Section 6: The ALU

The chosen ALU is constructed, essentially, of 16 1-bit ALU's. A 4-bit symbol file was created using 4 1-bit ALU's which made it relatively easy to implement a 16-bit version. This ALU can perform arithmetic operations (add / subtract etc.), logical operations (AND / OR / NOT etc.), and shifts to the right or left. A carry-look-ahead adder was implemented into this design for gate delay optimization. Schematics of the individual cells can be found in the [APPENDIX](#).

**Figure 6-1: 16-bit ALU from 4, 4-bit ALU's**



Also included in this section is a very useful function select (FS) table complete with correct  $C_0$  values that describes each function select input and what operation it executes. Some values are omitted due to an obscure output that wouldn't necessarily be used.

**Table 6-1: Function Select Table**

| FS[4] | FS[3] | FS[2] | FS[1] | FS[0] | C <sub>0</sub> | Operation            |
|-------|-------|-------|-------|-------|----------------|----------------------|
| 0     | 0     | 0     | 0     | 0     | x              | F = 0                |
| 0     | 0     | 0     | 0     | 1     | x              | F = A ~  B (NOR)     |
| 0     | 0     | 0     | 1     | 0     | x              | F = (~A) & B         |
| 0     | 0     | 0     | 1     | 1     | x              | F = ~A               |
| 0     | 0     | 1     | 0     | 0     | x              | F = A & (~B)         |
| 0     | 0     | 1     | 0     | 1     | x              | F = ~B               |
| 0     | 0     | 1     | 1     | 0     | x              | F = A ^ B (XOR)      |
| 0     | 0     | 1     | 1     | 1     | x              | F = A ~& B (NAND)    |
| 0     | 1     | 0     | 0     | 0     | x              | F = A & B            |
| 0     | 1     | 0     | 0     | 1     | x              | F = A ~^ B (not XOR) |
| 0     | 1     | 0     | 1     | 0     | x              | F = B                |
| 0     | 1     | 0     | 1     | 1     | x              | Omitted              |
| 0     | 1     | 1     | 0     | 0     | x              | F = A                |
| 0     | 1     | 1     | 0     | 1     | x              | Omitted              |
| 0     | 1     | 1     | 1     | 0     | x              | F = A   B            |
| 0     | 1     | 1     | 1     | 1     | x              | F = 1'b1             |
| 1     | 0     | 0     | 0     | 0     | 0              | F = A                |
| 1     | 0     | 0     | 0     | 0     | 1              | F = A + 1            |
| 1     | 0     | 0     | 0     | 1     | 0              | F = ~A               |
| 1     | 0     | 0     | 0     | 1     | 1              | F = -A               |
| 1     | 0     | 0     | 1     | 0     | 0              | F = A + 1            |
| 1     | 0     | 0     | 1     | 0     | 1              | F = A + 2            |
| 1     | 0     | 0     | 1     | 1     | 0              | F = -A               |
| 1     | 0     | 0     | 1     | 1     | 1              | F = 1-A              |

|   |   |   |   |   |   |                       |
|---|---|---|---|---|---|-----------------------|
| 1 | 0 | 1 | 0 | 0 | 0 | $F = A + B$           |
| 1 | 0 | 1 | 0 | 0 | 1 | $F = A + B + 1$       |
| 1 | 0 | 1 | 0 | 1 | 0 | $F = \sim A + B$      |
| 1 | 0 | 0 | 1 | 1 | 1 | $F = B - A$           |
| 1 | 0 | 1 | 1 | 0 | 0 | $F = A + \sim B$      |
| 1 | 0 | 1 | 1 | 0 | 1 | $F = A - B$           |
| 1 | 0 | 1 | 1 | 1 | 0 | $F = \sim A + \sim B$ |
| 1 | 0 | 1 | 1 | 1 | 1 | $F = \text{Omitted}$  |
| 1 | 1 | 0 | 0 | 0 | 0 | $F = A \ll 0$         |
| 1 | 1 | 0 | 0 | 0 | 1 | $F = A \ll 1$         |
| 1 | 1 | 0 | 0 | 1 | 0 | $F = A \gg 1$         |
| 1 | 1 | 0 | 0 | 1 | 1 | $F = A \ggg 1$        |

The remaining bits will simply perform the previous three shift operations, they will each repeat three times (until 11111) and were omitted from the table. Also, the omitted bits still have a function, that said function was just deemed unnecessary for showcasing.

**Figure 6-2: Testbench of ALU (logic)**



The figure above shows a testbench of the ALU, specifically the initial logic functions. The functions being performed are above in red and are working as expected. The blue dots represent the zero flag at work and the red dots represent the negative flag (along with Cout).

**Figure 6-3: Testbench of ALU (arithmetic)**



This snippet of the testbench shows some of the arithmetic operations (switched every 10 ticks) as listed at the top in red. Again everything seems to be working properly and the flags changes are shown with blue and red dots.

# Section 7: The Memory

Figure 7-1: Memory Layout



Our processor is equipped with individual ROM and RAM modules. The size of the actual memory will vary depending on the group member as each member was required to have a slightly different size, this given size is just for figure purposes. At this size the RAM spans from 0x000 to 0x3FF while the ROM caps out at 0x1FF. The RAM accepts 10-bit addresses (variable) and 16-bit data, and has a write enable input. It outputs 16-bit data that is fed into a tri-state buffer for selecting purposes. The ROM has an 8-bit address input (variable) with a 16-bit output that also goes directly to a tri-state buffer. Advanced features and modifications can be found in some individual sections [HERE](#).

# Section 8: The Datapath

Figure 8-1: Instruction Register Symbol



The instruction register is essentially a normal register with an IR\_load input which when enabled will allow the IR to load instructions. The instruction register feeds directly into the control unit which will output specific control word values based on the bits of the IR output. The Verilog code for the instruction register can be found in the [APPENDIX](#).

Figure 8-2: Program Counter Symbol



The program counter chosen in the design has a 2-bit selector, at 00 the program counter does nothing, 01 causes the program counter to increment by 1, while 11 will increment the program counter by a literal value (named ex). That literal value feeds into the PC\_offset output and a tri-state buffer can select the PC output of the program counter (although not

used often). The PC\_in output will feed into the different components of the datapath (RAM, ALU, Register File). The Verilog code for the program counter can be found in the [APPENDIX](#).

**Figure 8-3: Datapath using Tri-state Buffers**



Note: An annotated version of this datapath can be found in the [APPENDIX](#).

**Figure 8-4: Datapath using Muxes**



**Figure 8-5: Control Word**

|        |     |        |        |        |    |     |        |     |        |    |       |    |                                                                                    |
|--------|-----|--------|--------|--------|----|-----|--------|-----|--------|----|-------|----|------------------------------------------------------------------------------------|
| 44..43 | 42  | 41..39 | 38..36 | 35..33 | 32 | 31  | 30..26 | 25  | 24..19 | 18 | 17..2 | 1  |  |
| PS     | IRL | SA     | SB     | DR     | WR | RST | FS     | Cin | DS     | AS | ex    | MW | NS                                                                                 |

**Figure 5-1: Table of Signals**

|           |                                                                                                                                                                            |
|-----------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| PS[45:44] | Program Counter Selector, selects what the program counter is doing                                                                                                        |
| IRL[43]   | Instruction Register Load, is enabled the instruction register will load instructions                                                                                      |
| SA[42:40] | Source A from the register file, will be assigned to a register in the file                                                                                                |
| SB[39:37] | Source B from the register file, will be assigned to a register in the file                                                                                                |
| DR[36:34] | Destination Register from the register file, will be assigned to a register in the file                                                                                    |
| WR[33]    | Write Register, when enabled data will be written onto a register                                                                                                          |
| RST[32]   | Reset, clears registers                                                                                                                                                    |
| FS[31:27] | Function Select, controls the ALU and what operations it will execute                                                                                                      |
| Cin[26]   | Cin value, paired with the ALU, will perform different operations based on the value of Cin (in some cases)                                                                |
| DS[25:21] | Data Bus Selector, selects the source of data to be loaded onto the data bus. DS[0] correlates to ROM, DS[1] to PC, DS[2] to the ALU, DS[3] to the RAM, DS[4] to the stack |
| AS[20]    | Address Selector, selects either the literal value or B output of register                                                                                                 |
| ex[19:4]  | Literal Value, 16 bits, used as an address or immediate value for different instructions                                                                                   |
| MW[3]     | Write Memory, when enabled memory will be written to                                                                                                                       |
| SS[2:1]   | Stack Selector, selects between the inputs of the hardware stack                                                                                                           |
| NS[0]     | Next State                                                                                                                                                                 |

In reference to figure 4-3, this datapath was designed using tri-state buffers. Tri-state buffers select the output source that is loaded onto the data bus (DS) and two more tri-state buffers select between the literal value and B output of the register file. From a design standpoint, the components were connected by wire name to make everything look cleaner (an annotated version is included to help see what goes where). The datapath is of Harvard Architecture, there are separate buses for memory data addresses, along with two separate blocks for ROM and RAM. More benefits of this architecture will come in the control unit section.

# Section 9: The Control Unit

Figure 9-1: Control Unit Final Schematic (1 of 2)



Figure 9-2: Control Unit Final Schematic (2 of 2)



Note:  $ControlWordOut_{XX}$  outputs connect to DXX inputs.

**Figure 9-3: Control Unit Decoders Register**



Note: out output connects to final (fourth) state input of 11 decoder

In order to create the control unit, four separate decoders were created for different opcodes starting with 00, 01, 10, and 11. After breaking the instructions down into their respective most significant two bits, tables were made to make things more readable and easier to work with (found in the instruction set architecture section). Some of the signals can be easily reduced to wires (for example if SA always belongs to the same bits of IR you already have your answer). Once those are dealt with, you can solve for the slightly more complex signals. K-maps were the method of choice for this process. Negative and Zero status signals were also included for branch instructions. Symbol files were created for each decoder, which were put into a schematic file and linked together. Each state input linked to the next state input until decoder 11 fed into the output of a register that held that data. The control word was broken down and bus/wire named into their signal and whether they corresponded to a 00, 01, 10, or 11 instruction. A symbol file was then created for that file, and brought into the file seen in the first two figures. As previously stated, the two most significant bits of IR are used as select bits, and are fed into a 46-bit 4-to-1 mux. The DXX inputs receive the different control words and the output ends up being the final control word based on the IR input.

**Figure 9-4: Control Unit State Machine**



The datapath utilizes Harvard Architecture. When looking at the state machine for the control unit, only two states are needed. Most instructions are single cycle because the instruction fetch and EX0 states are combined into one. This can be seen on the left of the diagram. There are only two instructions that require two cycles, CALL and LRRI. To do this, the program counter needs to be able to add an offset, and memory instructions need to be single cycle. Inverting the RAM clock solved that issue.

**Figure 9-5: Control Unit Test Bench (1)**



In this ModelSim screenshot, the increment operation is being executed. The two most significant bits of IR are 01, and along with the opcode and desired registers, this selects the corresponding control word that is shown on the screen. In an effort to make things more readable and easier to break down, most signals were broken down into their corresponding place in the control word and wired in the testbench. This makes it a bit easier to visualize where everything is going and what goes where. For the increment operation, The opcode goes in first followed by the destination register, source A register, and source B register (which really is a don't care). Although the literal value is set here, the AS signal is low so it is not being incorporated. Cin is high which is crucial for this operation since Cin must be 1 for the operation to actually be incremented

The following figures will not be labeled this way so it would be best to refer to the above figure or the control word figure located [HERE](#).

**Figure 9-6: Control Unit Test Bench (2)**



This snippet shows the control word generation for a store operation. A few key things here: The AS signal is 1 meaning the literal value is in use, the ex literal value is set to a specific address correlating to the one chosen in the IR input, write to memory is on and write to register is off, and SA is set to the correct value based on IR input. This would store whatever is in SA to address 41 (hex) in memory.

# Section 10: The CPU

Figure 10-1: Complete Processor Datapath



Note: Smaller snippets of this complete datapath can be found in the [APPENDIX](#).

The testbench for our complete CPU was fairly simple. Outputs of the control word were wired for monitoring purposes and the clock was connected as an input. Once that was done, the processor did everything else as far as generating control words via instructions in the ROM. In the following testbench figures, we tested a few of the required instructions, loading to each register to make sure everything was working correctly. Again, there will be specific sections for individual optional instructions included and validating their functionality.

**Figure 10-2: Testing LRI Instruction**



Here, the LRI instruction is being tested, so a value based on the literal value ex is passed onto the desired register. In this case the instruction called for a zero to be loaded into R0, a one to be loaded in R1, and so on. You can see that before a one is loaded into R1, the source was 001, and the literal value ex was also 0001. The instruction value can be seen in the IR signal, and the next instruction to be loaded can be seen in the ROM signal.

**Figure 10-3: Testing BRN Instruction (1 of 2)**



This is an example of the branch not being taken. The program counter continues to increment even though a branch instruction is being executed. This happens because this specific instruction is checking if R1 is negative, and in this case it's not. Therefore, the program counter continues its normal process.

**Figure 10-4: Testing BRN Instruction (2 of 2)**



Here, the branch is taken because R1 is now negative. This sets the PS bits to 11 which will increment the program counter by an offset (in this case -15). This takes the PC to -3 and afterwards it returns to incrementing (where it will eventually get back to 0 and restart).

**Figure 10-4: Testing STI Instruction**



For the storing at an immediate address the corresponding bits are marked in yellow.  
Source A is one so it will store the data in R1 to address ex which is 41 in hex. Another thing to note is that MW (write to memory) is on and it is.

**Figure 10-5: Testing STI Instruction**



Here we are loading data (LDI). The address is 41 in hex, the data is available in time for the next clock cycle, and is loaded in DR 100 or R4. It is important to note that memory write is off and write to register is on.

The above tests go into detail about some of the required instructions, optional instructions (if supported) will be shown in individual sections.

# Section 11: The Peripherals

## VGA (Jeremy)

Figure 11-1: VGA Block Connected to Processor



At its core, my VGA peripheral is really just a VGA controller with a few tweaks so it can act as a peripheral. In order to make the controller, I referred to the DE-10 Lite Manual which had timing specifications for general VGA purposes. It required a 25Mhz clock and I chose to implement counters that would sync the vertical and horizontal components.

Figure 11-2: Horizontal Timing For VGA



Looking at the horizontal timing, there is a display interval when the horizontal counter reaches a specific number, which will then trigger the vertical counter to start counting which will eventually enable the vertical sync signal. In Verilog, it wasn't extremely difficult to assign the RGB colors to that display interval, where they could then be driven by another signal when true, or zero when false. This is where the special function registers come in. There were several ways to implement the special function registers, but I ended up using only one. This was named 'color\_config'. The SFR only received data if its address and write were enabled. An address selector was used to output data based on the input of an address. Address zero would output a 12-bit 1. The address itself would go into the address input of the VGA block and the first bit of the address would be ANDed with the write to memory signal. Therefore, 'color\_config' is technically mapped to 12'h001. Another signal was used to reserve the first address for the SFR, as it would only allow the output of RAM to pass through if none of the SFR addresses were being selected (using a tri-state buffer). The Verilog code for the horizontal sync and vertical sync, along with a clock divider can be found in the [APPENDIX](#)

# Section 12: Instruction Set Architecture

Table 12-1: Opcode Field Descriptions

|            |                                                        |
|------------|--------------------------------------------------------|
| DR         | Destination Register Address for Register File         |
| SA         | Source A Address for Register File                     |
| SB         | Source B Address for Register File                     |
| AD         | Literal Address Value                                  |
| lit        | Literal Data Value                                     |
| PC         | Program Counter (Will increment unless told otherwise) |
| zf         | Zero Fill, extends literal with zeroes                 |
| se         | Sign Extend, extends literal with sign bit             |
| R[address] | Register File Register at Address                      |
| M[address] | Data Memory at Address                                 |

Figure 12-1: Instruction Set Summary

| Mnemonic         | Description                                | Opcode  | RTL                                  |                                             |
|------------------|--------------------------------------------|---------|--------------------------------------|---------------------------------------------|
| NOP              | No Operation                               | 000000  | N/A                                  | Arithmetic and Shift Instructions           |
| INC              | Increment A by 1                           | 0110000 | $R[DR] \leftarrow R[SA] + 1$         |                                             |
| ADD              | Add A + B                                  | 0110100 | $R[DR] \leftarrow R[SA] + R[SB]$     |                                             |
| SUB              | Subtract A - B                             | 0110110 | $R[DR] \leftarrow R[SA] - R[SB]$     |                                             |
| DEC              | Decrement A by 1                           | 0110010 | $R[DR] \leftarrow R[SA] - 1$         |                                             |
| MUL              | Multiply A and B                           | 0110111 | $R[DR] \leftarrow R[SA] * R[SB]$     |                                             |
| ADDC             | Add with carry A + B + C                   | 0110101 | $R[DR] \leftarrow R[SA] + R[SB] + C$ |                                             |
| NEG              | Negative -A                                | 0110001 | $R[DR] \leftarrow \sim R[SA] + 1$    |                                             |
| SHR              | Shift A right by 1                         | 0111001 | $R[DR] \leftarrow sr R[SA]$          |                                             |
| SHL              | Shift A left by 1                          | 0111000 | $R[DR] \leftarrow sl R[SA]$          |                                             |
| CLR              | Clear A to zero                            | 0100000 | $R[DR] \leftarrow 0$                 | Logic Instructions                          |
| SET              | Set all A bits to 1                        | 0101111 | $R[DR] \leftarrow FFFF$              |                                             |
| NOT              | Invert all A bits                          | 0100011 | $R[DR] \leftarrow \sim R[SA]$        |                                             |
| AND              | AND A & B                                  | 0101000 | $R[DR] \leftarrow R[SA] \& R[SB]$    |                                             |
| OR               | OR A   B                                   | 0101110 | $R[DR] \leftarrow R[SA]   R[SB]$     |                                             |
| XOR              | XOR A ^ B                                  | 0100110 | $R[DR] \leftarrow R[SA] XOR R[SB]$   |                                             |
| MOVA             | Transfer A                                 | 0101100 | $R[DR] \leftarrow R[SA]$             |                                             |
| MOV <sub>B</sub> | Transfer B                                 | 0101010 | $R[DR] \leftarrow R[SB]$             |                                             |
| ADI              | Add A + Literal                            | 00001   | $R[DR] \leftarrow R[SA] + se-lit$    | Arithmetic and Logic with Immediate Operand |
| SBI              | Subtract A - Literal                       | 00010   | $R[DR] \leftarrow R[SA] - se-lit$    |                                             |
| ANI              | AND A & Literal                            | 00011   | $R[DR] \leftarrow R[SA] \& zf-lit$   |                                             |
| ORI              | OR A   Literal                             | 00101   | $R[DR] \leftarrow R[SA]   zf-lit$    |                                             |
| XRI              | XOR A ^ Literal                            | 00110   | $R[DR] \leftarrow R[SA] XOR zf-lit$  |                                             |
| LRI              | Load literal value to register             | 11      | $R[DR] \leftarrow se-lit$            | Loading a literal value                     |
| RLI              | Load register with UI literal              | 1000010 | $R[DR] \leftarrow lit$               |                                             |
| LDI              | Load data from memory at immediate address | 10100   | $R[DR] \leftarrow M[zf-AD]$          | Memory Access Instructions                  |
| STI              | Store data to memory at immediate address  | 10101   | $M[zf-AD] \leftarrow R[SA]$          |                                             |
| LDR              | Load data from memory at register address  | 1000100 | $R[DR] \leftarrow M[R[SA]]$          |                                             |
| STR              | Store data to memory at register address   | 1000101 | $M[R[DR]] \leftarrow R[SA]$          |                                             |
| PUSH             | Push register value on stack               | 1000000 | PUSH R[SA]                           | Stack Instructions                          |
| POP              | Pop value from stack on register           | 1000001 | $R[DR] \leftarrow POP$               |                                             |
| CALL             | Call a subroutine                          | 1001110 | PUSH PC + 1, PC <= zf-AD             | Program Counter Instructions                |
| RET              | Return from a subroutine                   | 1001111 | PC <= POP                            |                                             |
| JUMPR            | Jump to address in register                | 1001101 | PC <= R[SA]                          |                                             |
| BRZ              | Branch to address if register is zero      | 10110   | if (R[SA] == 0) PC <= PC + se-AD     | Program Control Instructions                |
| BRN              | Branch to address if register is negative  | 10111   | if (R[SA] < 0) PC <= PC + se-AD      |                                             |

In our instruction set architecture, there are a total of 36 instructions. The most significant bits of each instruction will always be the opcode. There are several different formats used.

**Figure 12-2: Instruction Set Formats**

|                                                                                                                                                                            |                                                 |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------|----------------------|----------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|------------------|----------------------|--|--|
| <b>Three Register Format</b>                                                                                                                                               | <b>One Register and 8-bit Literal Format</b>    |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| <table border="1" style="margin-left: auto; margin-right: auto;"> <tr> <td>7-bit Opcode</td> <td>3-bit Destination</td> <td>3-bit SA</td> <td>3-bit SB</td> </tr> </table> | 7-bit Opcode                                    | 3-bit Destination    | 3-bit SA             | 3-bit SB                                                                                                                                                                                                              | <table border="1" style="margin-left: auto; margin-right: auto;"> <tr> <td>5-bit Opcode</td> <td>3-bit Address</td> <td>8-bit Literal Value</td> </tr> </table> | 5-bit Opcode      | 3-bit Address    | 8-bit Literal Value  |  |  |
| 7-bit Opcode                                                                                                                                                               | 3-bit Destination                               | 3-bit SA             | 3-bit SB             |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| 5-bit Opcode                                                                                                                                                               | 3-bit Address                                   | 8-bit Literal Value  |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| <b>One Register and 11-bit Literal Format</b>                                                                                                                              | <b>One Register Two Instruction Word Format</b> |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| <table border="1" style="margin-left: auto; margin-right: auto;"> <tr> <td>2-bit Opcode</td> <td>3-bit Destination</td> <td>11-bit Literal Value</td> </tr> </table>       | 2-bit Opcode                                    | 3-bit Destination    | 11-bit Literal Value | <table border="1" style="margin-left: auto; margin-right: auto;"> <tr> <td>7-bit Opcode</td> <td>3-bit Destination</td> <td>6-bit Don't Care</td> </tr> <tr> <td colspan="3">16-bit Literal Value</td> </tr> </table> | 7-bit Opcode                                                                                                                                                    | 3-bit Destination | 6-bit Don't Care | 16-bit Literal Value |  |  |
| 2-bit Opcode                                                                                                                                                               | 3-bit Destination                               | 11-bit Literal Value |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| 7-bit Opcode                                                                                                                                                               | 3-bit Destination                               | 6-bit Don't Care     |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| 16-bit Literal Value                                                                                                                                                       |                                                 |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| <b>Program Counter Instruction Format</b>                                                                                                                                  |                                                 |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| <table border="1" style="margin-left: auto; margin-right: auto;"> <tr> <td>7-bit Opcode</td> <td>9-bit Literal Value</td> </tr> </table>                                   | 7-bit Opcode                                    | 9-bit Literal Value  |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |
| 7-bit Opcode                                                                                                                                                               | 9-bit Literal Value                             |                      |                      |                                                                                                                                                                                                                       |                                                                                                                                                                 |                   |                  |                      |  |  |

The 'Three Register Format', located in the top left corner of figure 7-2 will be used for:

- NOP
- Arithmetic Instructions (INC, ADD, SUB, DEC, MUL, ADDC, NEG)
- Shift Instructions (SHR, SHL)
- Logic Instructions (CLR, SET, NOT, AND, OR, XOR, MOVA, MOVB)
- Jump to Address (JMPR)
- Stack Instructions (PUSH, POP)
- Memory Access Instructions (LDR, STR)

The 'One Register and 8-bit Literal Format', located in the top right corner of figure 7-2 will be used for:

- Arithmetic Instructions with Immediate Operand (ADI, SBI)
- Logic Instructions with Immediate Operand (ANI, ORI, XRI)
- Memory Access Instructions (LDI, STI)
- Program Control Instructions (BRZ, BRN)

The 3-bit address is used as a destination register, source A register, or both destination and source A register.

The 'One Register Two Instruction Word Format' is only used for loading an immediate long literal value to a register (LRLI), while the 'One Register and 11-bit literal Format' will only be used for loading an immediate value to a register (LRI).

The 'Program Counter Instruction Format' will be used for program counter instructions RET and CALL.

**Table 12-2: Table of Operations and Control Words  
(Opcodes starting with 01)  
(1 of 2)**

**Table 12-3: Table of Operations and Control Words  
(Opcodes starting with 01)  
(2 of 2)**

**Table 12-4: Table of Operations and Control Words  
(Opcodes starting with 10)  
(1 of 2)**

| ...       | LRLI       | <<<<<     | LDI        | STI        | STR        | LDR        | PUSH       | POP        |
|-----------|------------|-----------|------------|------------|------------|------------|------------|------------|
| OPCODE    | 1000010    | 1000010   | 10100      | 10101      | 1000101    | 1000100    | 1000000    | 1000001    |
|           | <i>C</i>   | <i>O</i>  | <i>N</i>   | <i>T</i>   | <i>L</i>   |            |            |            |
| PS[46:45] | 2'b01      | 2'b01     | 2'b01      | 2'b01      | 2'b01      | 2'b01      | 2'b01      | 2'b01      |
| IRL[44]   | 1'b0       | 1'b1      | 1'b1       | 1'b1       | 1'b1       | 1'b1       | 1'b1       | 1'b1       |
| SA[43:41] | x x x      | x x x     | x x x      | IR[10:8]   | IR[8:6]    | x x x      | IR[5:3]    | x x x      |
| SB[40:38] | x x x      | x x x     | x x x      | x x x      | IR[2:0]    | IR[2:0]    | x x x      | x x x      |
| DR[37:35] | IR[8:6]    | IR[8:6]   | IR[10:8]   | x x x      | x x x      | IR[8:6]    | x x x      | IR[8:6]    |
| WR[34]    | 1'b1       | 1'b0      | 1'b1       | 1'b0       | 1'b0       | 1'b1       | 1'b0       | 1'b1       |
| CLR[33]   | 1'b0       | 1'b0      | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       |
| FS[32:28] | 5'b00000   | 5'b01111  | 5'b10001   | 5'b01000   | 5'b01100   | 5'b01010   | 5'b01110   | 5'b000110  |
| Cin[27]   | x          | 0         | x          | x          | x          | x          | x          | x          |
| DS[26:21] | 5'b000100  | 5'b000100 | 5'b000100  | 5'b000100  | 5'b000100  | 5'b000100  | 5'b000100  | 5'b000100  |
| AS[20]    | x          | 1'b1      | 1'b1       | 1'b1       | 1'b0       | 1'b0       | x          | x          |
| ex[19:4]  | xxxxxxxxxx | IR[15:0]  | zf(IR[7:0] | zf(IR[7:0] | xxxxxxxxxx | xxxxxxxxxx | xxxxxxxxxx | xxxxxxxxxx |
| MW[3]     | 1'b0       | 1'b0      | 1'b0       | 1'b1       | 1'b1       | 1'b0       | 1'b0       | 1'b0       |
| SS[2:1]   | 2'b00      | 2'b00     | 2'b00      | 2'b00      | 2'b00      | 2'b00      | 2'b01      | 2'b10      |
| NS[0]     | 1'b1       | 1'b0      | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       |
| S         | 1'b0       | 1'b1      | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       | 1'b0       |

**Table 12-5: Table of Operations and Control Words  
(Opcodes starting with 10)  
(2 of 2)**

| CALL       | <<<<<       | RET        | BRZ (take)  | BRZ (don't) | BRN (take)  | BRN (don't) | JMPR        |
|------------|-------------|------------|-------------|-------------|-------------|-------------|-------------|
| 1001110    | 1001110     | 1001111    | 10110       |             | 10111       |             | 1001101     |
|            |             |            | <i>W</i>    | <i>O</i>    | <i>R</i>    | <i>D</i>    |             |
| 2'b10      | 2'b01       | 2'b10      | 2'b11       | 2'b01       | 2'b11       | 2'b01       | 2'b11       |
| 1'b0       | 1'b1        | 1'b1       | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        |
| x x x      | x x x       | x x x      | IR[10:8]    | x x x       | IR[10:8]    | x x x       | x x x       |
| x x x      | x x x       | x x x      | x x x       | x x x       | x x x       | x x x       | x x x       |
| x x x      | x x x       | x x x      | x x x       | x x x       | x x x       | x x x       | x x x       |
| 1'b0       | 1'b0        | 1'b0       | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| 1'b0       | 1'b0        | 1'b0       | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| 5'b10000   | 5'b10100    | 5'b10100   | 5'b10110    | 5'b10110    | 5'b10011    | 5'b11001    | 5'b11000    |
| 1'b1       | 1'b0        | 1'b1       | 1'b1        | 1'b1        | x           | 1'b1        | x           |
| 5'b000100  | 5'b000100   | 5'b000100  | 5'b000100   | x x x x     | 5'b000100   | x x x x     | 5'b000100   |
| x          | 1'b1        | 1'b1       | 1'b1        | x           | 1'b1        | x           | 1'b1        |
| xxxxxxxxxx | zf(IR[8:0]) | xxxxxxxxxx | zf(IR[7:0]) | xxxxxxxxxx  | zf(IR[7:0]) | xxxxxxxxxx  | zf(IR[8:0]) |
| 1'b0       | 1'b0        | 1'b0       | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| 2'b01      | 2'b00       | 2'b10      | 2'b00       | 2'b00       | 2'b00       | 2'b00       | 2'b00       |
| 1'b1       | 1'b0        | 1'b0       | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| 1'b0       | 1'b1        | 1'b0       | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |

**Table 12-6: Table of Operations and Control Words  
(Opcodes starting with 00)**

| ...       | ADI         | SBI         | ANI         | ORI         | XRI         | LRI         |
|-----------|-------------|-------------|-------------|-------------|-------------|-------------|
| OPCODE    | 00001       | 00010       | 00011       | 00101       | 00110       | 00011       |
|           |             | C           |             | W           |             |             |
| PS[46:45] | 2'b01       | 2'b01       | 2'b01       | 2'b01       | 2'b01       | 2'b01       |
| IRL[44]   | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        |
| SA[43:41] | IR[10:8]    | IR[10:8]    | IR[10:8]    | IR[10:8]    | IR[10:8]    | x x x       |
| SB[40:38] | x x x       | x x x       | x x x       | x x x       | x x x       | x x x       |
| DR[37:35] | IR[10:8]    | IR[10:8]    | IR[10:8]    | IR[10:8]    | IR[10:8]    | IR[13:11]   |
| WR[34]    | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        |
| CLR[33]   | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| FS[32:28] | 5'b10100    | 5'b10110    | 5'b01000    | 5'b01110    | 5'b00110    | 5'b01010    |
| Cin[27]   | x           | 1'b0        | x           | x           | x           | x           |
| DS[26:21] | 5'b000100   | 5'b000100   | 5'b000100   | 5'b000100   | 5'b000100   | 5'b000100   |
| AS[20]    | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        | 1'b1        |
| ex[19:4]  | zf(IR[7:0]) | zf(IR[7:0]) | zf(IR[7:0]) | zf(IR[7:0]) | zf(IR[7:0]) | zf(IR[7:0]) |
| MW[3]     | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| SS[2:1]   | 2'b00       | 2'b00       | 2'b00       | 2'b00       | 2'b00       | 2'b00       |
| NS[0]     | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |
| S         | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        | 1'b0        |

As previously stated in the control unit section, the opcodes were broken down and sorted by their two most significant bits, and four decoders were made for 00, 01, 10, and 11. So, in these past figures (7-5 through 7-8) the operations are sorted by that system instead of being grouped by their function.

The following figures will describe each operation in detail, including a description of the operation, the operands, the syntax, and the exact operation they perform

**Figure 12-3: Detailed Instruction Set List with Descriptions**

(1 of 5)

| NOP                                                                 | No Operation             | INC                                                                   | Increment | ADD                                                                          | Add          | SUB                                                                   | Subtract |
|---------------------------------------------------------------------|--------------------------|-----------------------------------------------------------------------|-----------|------------------------------------------------------------------------------|--------------|-----------------------------------------------------------------------|----------|
| Syntax: NOP                                                         |                          | Syntax: INC DR, SA                                                    |           | Syntax: ADD DR, SA, SB                                                       |              | Syntax: SUB DR, SA, SB                                                |          |
| Operands: None                                                      |                          | Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                    |           | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$        |              | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |          |
| Operations: None                                                    |                          | Operations:<br>$R[DR] \leftarrow R[SA] + 1$                           |           | Operations:<br>$R[DR] \leftarrow R[SA] + R[SB]$                              |              | Operations:<br>$R[DR] \leftarrow R[SA] - R[SB]$                       |          |
| Description:                                                        | No Operation is Executed | Description:<br>Increment SA register value by 1, store value in DR   |           | Description:<br>Add values in registers SA and SB, store in DR               |              | Description:<br>Subtract values in registers SA and SB, store in DR   |          |
| DEC                                                                 | Decrement                | MUL                                                                   | Multiply  | ADDC                                                                         | Add w/ Carry | NEG                                                                   | Negative |
| Syntax: DEC DR, SA                                                  |                          | Syntax: MUL DR, SA, SB                                                |           | Syntax:<br>ADDC DR, SA, SB, C                                                |              | Syntax: NEG DR, SA                                                    |          |
| Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                  |                          | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |           | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$        |              | Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                    |          |
| Operations:<br>$R[DR] \leftarrow R[SA] - 1$                         |                          | Operations:<br>$R[DR] \leftarrow R[SA] * R[SB]$                       |           | Operations:<br>$R[DR] \leftarrow R[SA] + R[SB] + C$                          |              | Operations:<br>$R[DR] \leftarrow \sim R[SA] + 1$                      |          |
| Description:<br>Decrement SA register value by 1, store value in DR |                          | Description:<br>Multiply values in registers SA and SB, store in DR   |           | Description:<br>Add values in registers SA, SB, and carry value, store in DR |              | Description:<br>Negate value in register SA, store in DR              |          |

**Figure 12-4: Detailed Instruction Set List with Descriptions**

(2 of 5)

| SHR                                                                  | Shift Right | SHL                                                                   | Shift Left  | CLR                                                                   | Clear Register | SET                                                                   | Set         |
|----------------------------------------------------------------------|-------------|-----------------------------------------------------------------------|-------------|-----------------------------------------------------------------------|----------------|-----------------------------------------------------------------------|-------------|
| Syntax: SHR DR, SA                                                   |             | Syntax: SHL DR, SA                                                    |             | Syntax: CLR DR                                                        |                | Syntax: SET DR                                                        |             |
| Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                   |             | Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                    |             | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |                | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |             |
| Operations:<br>$R[DR] \leftarrow sr R[SA]$                           |             | Operations:<br>$R[DR] \leftarrow sl R[SA]$                            |             | Operations:<br>$R[DR] \leftarrow 0$                                   |                | Operations:<br>$R[DR] \leftarrow FFFF$                                |             |
| Description:<br>Shift value in register SA to the right, store in DR |             | Description:<br>Shift value in register SA to the left, store in DR   |             | Description:<br>Clear bits in register DR                             |                | Description:<br>Set bits in register DR to 1                          |             |
| NOT                                                                  | Logical NOT | AND                                                                   | Logical AND | OR                                                                    | Logical OR     | XOR                                                                   | Logical XOR |
| Syntax: NOT DR, SA                                                   |             | Syntax: AND DR, SA, SB                                                |             | Syntax: OR DR, SA, SB                                                 |                | Syntax: XOR DR, SA, SB                                                |             |
| Operands: $0 \leq DR \leq 7$<br>$0 \leq SA \leq 7$                   |             | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |             | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |                | Operands: $0 \leq DR \leq 7$<br>$0 \leq SB \leq 7$ $0 \leq SA \leq 7$ |             |
| Operations:<br>$R[DR] \leftarrow \sim R[SA]$                         |             | Operations:<br>$R[DR] \leftarrow R[SA] \& R[SB]$                      |             | Operations:<br>$R[DR] \leftarrow R[SA]   R[SB]$                       |                | Operations:<br>$R[DR] \leftarrow R[SA] ^ R[SB]$                       |             |
| Description:<br>NOT value of register SA, store in DR                |             | Description:<br>AND values in registers SA and SB, store in DR        |             | Description:<br>OR values in registers SA and SB, store in DR         |                | Description:<br>XOR values in registers SA and SB, store in DR        |             |

**Figure 12-5: Detailed Instruction Set List with Descriptions**

(3 of 5)

| MOVA                                                  | Transfer A    | MOVB                                                 | Transfer B         | ADI                                                   | Add Immediate      | SBI                                                        | Subtract Immediate |
|-------------------------------------------------------|---------------|------------------------------------------------------|--------------------|-------------------------------------------------------|--------------------|------------------------------------------------------------|--------------------|
| Syntax: MOVA DR, SA                                   |               | Syntax: MOVB DR, SB                                  |                    | Syntax: ADI DR, SA, ex                                |                    | Syntax: SBI DR, SA, ex                                     |                    |
| Operands: $0 \leq DR \leq 7$                          |               | Operands: $0 \leq DR \leq 7$                         |                    | Operands: $0 \leq DR \leq 7$                          |                    | Operands: $0 \leq DR \leq 7$                               |                    |
| $0 \leq SA \leq 7$                                    |               | $0 \leq SB \leq 7$                                   |                    | $0 \leq ex \leq 255$                                  | $0 \leq SA \leq 7$ | $0 \leq ex \leq 255$                                       | $0 \leq SA \leq 7$ |
| Operations:                                           |               | Operations:                                          |                    | Operations:                                           |                    | Operations:                                                |                    |
| $R[DR] \leftarrow R[SA]$                              |               | $R[DR] \leftarrow R[SB]$                             |                    | $R[DR] \leftarrow R[SA] + zf\text{-lit}$              |                    | $R[DR] \leftarrow R[SA] - zf\text{-lit}$                   |                    |
| Description:                                          |               | Description:                                         |                    | Description:                                          |                    | Description:                                               |                    |
| Move value in SA to DR                                |               | Move value in SB to DR                               |                    | Add value in register SA with zf literal, store in DR |                    | Subtract value in register SA with zf literal, store in DR |                    |
| ANI                                                   | AND Immediate | ORI                                                  | OR Immediate       | XRI                                                   | XOR Immediate      | LRI                                                        | Load Immediate     |
| Syntax: ANI DR, SA ex                                 |               | Syntax: ORI DR, SA, ex                               |                    | Syntax: XRI DR, SA, ex                                |                    | Syntax: LRI DR, ex                                         |                    |
| Operands: $0 \leq DR \leq 7$                          |               | Operands: $0 \leq DR \leq 7$                         |                    | Operands: $0 \leq DR \leq 7$                          |                    | Operands: $0 \leq DR \leq 7$                               |                    |
| $0 \leq SA \leq 7$                                    |               | $0 \leq ex \leq 255$                                 | $0 \leq SA \leq 7$ | $0 \leq ex \leq 255$                                  | $0 \leq SA \leq 7$ | $0 \leq ex \leq 255$                                       | $0 \leq SA \leq 7$ |
| Operations:                                           |               | Operations:                                          |                    | Operations:                                           |                    | Operations:                                                |                    |
| $R[DR] \leftarrow R[SA] \& zf\text{-lit}$             |               | $R[DR] \leftarrow R[SA]   zf\text{-lit}$             |                    | $R[DR] \leftarrow R[SA] ^  zf\text{-lit}$             |                    | $R[DR] \leftarrow zf\text{-lit}$                           |                    |
| Description:                                          |               | Description:                                         |                    | Description:                                          |                    | Description:                                               |                    |
| AND value in register SA with zf literal, store in DR |               | OR value in register SA with zf literal, store in DR |                    | XOR value in register SA with zf literal, store in DR |                    | Load zf literal value into register DR                     |                    |

**Figure 12-6: Detailed Instruction Set List with Descriptions**

(4 of 5)

| LRLI Load Long Immediate                             | Load from<br>LDI Immediate Address        | Store to<br>STI Immediate Address       | Load from<br>LDR Register Address                                  |
|------------------------------------------------------|-------------------------------------------|-----------------------------------------|--------------------------------------------------------------------|
| Syntax: LRLI DR, ex                                  | Syntax: LDI DR, ex                        | Syntax: STI ex, SA                      | Syntax: SBI DR, SA                                                 |
| Operands: $0 \leq DR \leq 7$                         | Operands: $0 \leq DR \leq 7$              | Operands: $0 \leq SA \leq 7$            | Operands: $0 \leq SA \leq 7$                                       |
| $0 \leq ex \leq 65535$                               | $0 \leq ex \leq 255$                      | $0 \leq ex \leq 255$                    | $0 \leq ex \leq 255$                                               |
| Operations:                                          | Operations:                               | Operations:                             | Operations:                                                        |
| $R[DR] \leftarrow lit$                               | $R[DR] \leftarrow M[zf\text{-AD}]$        | $M[zf\text{-AD}] \leftarrow R[SA]$      | $R[DR] \leftarrow M[R[SA]]$                                        |
| Description:                                         | Description:                              | Description:                            | Description:                                                       |
| Load Long literal value ex to register DR in state 0 | Load data from memory to DR at address ex | Store data from SA to memory address ex | Load data from memory to DR at address SA                          |
| STR Store to Register Address                        | PUSH Push Register Value                  | POP Pop register value                  | CALL Call subroutine                                               |
| Syntax: STR DR, SA                                   | Syntax: PUSH SA                           | Syntax: POP DR                          | Syntax: CALL ex                                                    |
| Operands: $0 \leq DR \leq 7$                         | Operands: $0 \leq SA \leq 7$              | Operands: $0 \leq DR \leq 7$            | Operands: $0 \leq ex \leq 255$                                     |
| $0 \leq SA \leq 7$                                   |                                           |                                         |                                                                    |
| Operations:                                          | Operations:                               | Operations:                             | Operations:                                                        |
| $M[R[DR]] \leftarrow R[SA]$                          | $PUSH R[SA]$                              | $R[DR] \leftarrow POP$                  | $PUSH PC + 1, PC \leftarrow zf\text{-AD}$                          |
| Description:                                         | Description:                              | Description:                            | Description:                                                       |
| Store data to DR from memory address SA              | Push register value SA onto stack         | Pop value from stack on register DR     | Call a subroutine<br>Push PC + 1 onto stack, PC gets zf ex address |

**Figure 12-7: Detailed Instruction Set List with Descriptions**  
 (5 of 5)

| RET         | Return from Subroutine | Jump to JUMPR register address                                                                                                          | BRZ                                                                                                                                                                                                                                 | Branch on Zero                                                                                                                                                                                                                        | BRN Branch on Negative |
|-------------|------------------------|-----------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------|
| Syntax: RET | Operands: None         | Syntax: JUMPR SA<br>Operands: $0 \leq SA \leq 7$<br>Operations:<br>$PC \leftarrow R[SA]$<br>Description:<br>Jump to register address SA | Syntax: BRZ SA, ex<br>Operands: $0 \leq SA \leq 7$<br>$0 \leq ex \leq 255$<br>Operations:<br>if ( $R[SA] == 0$ )<br>$PC \leftarrow PC + se\text{-}AD$<br>Description:<br>Program counter is incremented by se literal if SA is zero | Syntax: BRN SA, k<br>Operands: $0 \leq SA \leq 7$<br>$0 \leq ex \leq 255$<br>Operations:<br>if ( $R[SA] < 0$ )<br>$PC \leftarrow PC + se\text{-}AD$<br>Description:<br>Program counter is incremented by se literal if SA is negative |                        |

# Section 13: Example Program

Figure 13-1: Example Program Code (Jeremy)

```
1  module rom_case_rainbow(out, address); //10
2    output reg[15:0] out;
3    input [7:0] address; //address- 8 deep memory
4
5    // This program will cycle through 12 bits of RGB color by loading select bits into
6    // different register file registers and storing them in address 0 (color_config register)
7    // an address selector will (in this case) always chose address zero which outputs
8    // 12'b000000000001 which is ANDed with MW (mem write). If that is true, write is enabled
9    // Although the address stored into is 0, the address selector takes that zero and outputs a 1
10   // meaning the SFR color_config is actually mapped to a 12-bit 1. If both address and write are
11   // 1, color_config <= data. For the color to actually be seen a clock divider is required
12   // or else you'll just see vertical / diagonal colored lines.
13
14   // Order of color: White -- Red -- Orange -- Yellow -- Green -- Blue -- Indigo -- Violet
15
16   // This program could be easily modified to show combinations of colors or simply a white screen.
17
18   always @(address) begin
19     case(address)
20       8'h00: out = 16'b1111000011111111; // LRI R6, 255 // Setting bits to Multiply
21       8'h01: out = 16'b0111000110110000; // SHL R6, R6 // Shifting to the left
22       8'h02: out = 16'b0011011011111111; // XRI R6, 1 // XOR with lit to get 1000000001
23       8'h03: out = 16'b1110100011111111; // LRI R5, 255 // Load 11111111 into R5
24       8'h04: out = 16'b01101111110101; // MUL R7, R6, R5 // Multiply R6 with R5 to get all 1's
25
26       8'h05: out = 16'b1100000000001111; // LRI R0, 15 = Red
27       8'h06: out = 16'b1100100000101111; // LRI R1, 47 = Orange
28       8'h07: out = 16'b1101000000111111; // LRI R2, 63 = Yellow
29       8'h08: out = 16'b1101010001111000; // LRI R3, 240 = Green
30       8'h09: out = 16'b0100011100101101; // NOT R4, R5 // setting bits for Blue
31       8'h0A: out = 16'b0101100101100010; // MOVA R5, R4 // Setting Indigo
32       8'h0B: out = 16'b0110000101101000; // INC R5, R5 // ^^^^^^^^^^^^^^^^^^^^^^
33       8'h0C: out = 16'b0110000110101000; // INC R6, R5 // Setting Violet
34       8'h0D: out = 16'b0110000110110000; // INC R6, R6 // ^^^^^^^^^^^^^^^^^^^^^^
35
36       8'h0E: out = 16'b1010100000000000; // STI R0, 0 // Storing Red
37       8'h0F: out = 16'b1010100100000000; // STI R1, 0 // Storing Orange
38       8'h10: out = 16'b1010101000000000; // STI R2, 0 // Storing Yellow
39       8'h11: out = 16'b1010101100000000; // STI R3, 0 // Storing Green
40       8'h12: out = 16'b1010110000000000; // STI R4, 0 // Storing Blue
41       8'h13: out = 16'b1010110100000000; // STI R5, 0 // Storing Indigo
42       8'h14: out = 16'b1010111000000000; // STI R6, 0 // Storing Violet
43       8'h15: out = 16'b1010111100000000; // STI R7, 0 // Storing White
44       8'h16: out = 16'b1010111100000000; // STI R7, 0
45
46       8'h17: out = 16'b101111111110110; // BRN R7, -10 // Loop back to storing values if
47                                // R7 is negative, always true
48
49
50       default: out = 16'b0000000000000000; //NOP
51     endcase
52   end
53 endmodule // rom_case
```

Above is my complete example program using my VGA peripheral. Comments on what the program does and how it works are included before the 'always' block. A general summary is that it changes the RGB colors in a looped 'rainbow' sequence. The program also makes use of the advanced multiply instruction to set register 7 to 65535, meaning all colors will be active and the screen will be white.

## Section 14: Performance

Unfortunately for me (Jeremy), I was unable to get the overclock ROM to function properly with my processor. Trying to debug was a nightmare as compile times were now in the 10-15 minute range and because of the delay in the program it was hard to see everything in ModelSim. I think the issue may have been caused by my registers being 32 bits, making the delay hundreds of billions of clock cycles. While I tried to make a VGA overclock test, compile times were just as long and I didn't have the time to get everything straightened out.

I'm confident that in the right circumstances my processor could be tested for max frequency, and in that case I could shine some light on how it could be improved further.

While I have no current idea on how I would do this for my datapath, a pipeline would boost performance a decent bit. Would take a lot of careful thought during the process and to get the full benefit there would be a system to negate hazards.

Changing the instruction set architecture could also boost performance. I debated making the instructions 32 bits, initially with more room for literal values, but it could also be implemented to incorporate multiple instructions in one.

I feel like those two additions / changes would be the right initial steps in making a higher - performance processor.

(Ali Elhamawi)

I was unable to get the ROM to work correctly and was not able to test the overclocking. I am unsure of the issue as of now. I will hopefully keep attempting to figure out why it is now working.

I also had issues with my DE0 board and it has caused complications in some testing.



# Section 15: Personal Enhancements and Features (Jeremy)

This is a section intended to showcase the optional / advanced features of my processor. Some of the features included in here may overlap with those in the overall processor, but I aim to give more of a detailed description in this section.

**ALU:** My ALU is quite similar to the one in the design guide, it can add with carry, has every status flag, but cannot do arithmetic shifts.

Figure 15-1: ALU Flags

```
assign N = ALU_Result[31];
assign Z = ~|ALU_Result;
assign F = ALU_Result;
assign V = (C & Inner_Carry[31]) ? 1'b1 : 1'b0;
```

Figure 15-2: Overflow Example



Adding two positive numbers, and the result is negative. This causes the overflow flag to be set.

**Multiplier:** My processor includes a separate hardware multiplier capable of multiplying two 16-bit numbers. However, this could output a number larger than 16 bits, up to 32-bits. Because of this, the registers in the register file were increased to 32 bits to accommodate.

This issue could have been avoided by implementing an 8x8 multiplier instead of a 16x16 multiplier. The multiplier was built as a schematic using AND gates, half-adders, and full-adders. In the datapath, it has a separate output select apart from the ALU. Because of the 16-bit increase in size, I either concatenated or increased the different signals connected to the data bus in order for everything to match up width wise.

**Figure 15-3: Multiplier Test**



Here, 3 and 255 (decimal) are multiplied together and the result is then multiplied by 3 and so on. The DS signal is selecting the output of the multiplier and everything is working properly. The schematic for the multiplier can be found [HERE](#).

**Stack:** I managed to implement a hardware stack for my processor. The stack will be utilized for the advanced instructions PUSH, POP, CALL, and RET. It is controlled by a 3-bit signal inside of the control word. 001 will PUSH, 010 will POP, and 100 will reset / clear the stack. Inside the stack is a stack pointer that is able to increment or decrement depending on the instruction. Data is read and written based on the address of the stack pointer. It also has flags for when the stack is full and when it is empty, which also depends on the value of the stack pointer. The stack can be several different sizes (which also changes the size of the stack pointer), but for demonstration purposes it will be set to a smaller size (i.e 8 deep) to showcase the full and empty signals. Similarly to the RAM, the clock needed to be inverted in order for data to be available in time (single cycle). The stack in the finished processor is 128 deep, resulting in a 7-bit stack pointer. The Verilog code for the stack can be found [HERE](#).

Figure 15-4: Stack Test



(Pushing zoomed in)

The stack is currently set to 8 deep, meaning that it can hold 8 values in 8 different places in the stack with a 3-bit stack pointer. First, we can see that the empty signal is high at first before going low when data starts to be pushed onto the stack. In the tail end of this zoomed view you can see the full signal go high once 8 values have been pushed. SA determines what value is getting pushed, and SA for each instruction is highlighted in a different color which will match its corresponding destination once popped in the next zoomed view. Whatever is pushed onto the stack last is also the first to come off (last in first out). The final thing to note here is that stack select is 001, signifying that the stack is pushing.



(Popping zoomed in)

There are a few key things happening here. Stack select is now 010, meaning the stack will be popping data. Although not visible here, write is now set to 1, and DS is selecting the output of the stack instead of the ALU. The DR signal determines where the data is being popped to, which is color coordinated with the corresponding data that was pushed. The top right shows the empty signal going high again after the stack pops all data off. Looking at the actual data, the last value that was pushed onto the stack was from R4 (1ae5 hex), that is the first value that popped from the stack, which is written to DR R6. The rest of the simulation follows the same rule, and the most recent pushed data pops off first.

**Figure 15-5: Call and Return**

The screenshot shows two memory dump windows from a debugger. The top window displays memory starting at address 1100000000000000. It includes columns for address, register name, and value. A blue arrow points from the value of CPU\_tb/PC (2d) to the stack area. The bottom window is a zoomed-in view of the same memory starting at 1100000000000000. It highlights the PC value (2d) and the stack pointer (40). Labels 'Push' and 'Pop' are overlaid on the stack area.

|                      |                     |                     |                            |                                     |                        |                        |                                     |
|----------------------|---------------------|---------------------|----------------------------|-------------------------------------|------------------------|------------------------|-------------------------------------|
| + CPU_tb/ControlWord | 1100000000000000... | 1000000000000101... | 01000000000000001010000... | 11000000000000000000000000000000... | 1100100000100011010... | 0100011111100110000... | 11000000000000000000000000000000... |
| + CPU_tb/R0          | 00000000            | 00000000            | 00000000                   | 00000000                            | 00000000               | 00000000               | 00000000                            |
| + CPU_tb/R1          | 0000003f            | 00000000            | 00000000                   | 00000000                            | 00000000               | 00000000               | 00000000                            |
| + CPU_tb/R2          | 0000003f            | 000000ff            | 000000ff                   | 000000ff                            | 000000ff               | 000000ff               | 000000ff                            |
| + CPU_tb/R3          | 000000fc            | 000008f7            | 000008f7                   | 000008f7                            | 000008f7               | 000008f7               | 000008f7                            |
| + CPU_tb/R4          | 000001f9            | 00001ae6            | 00001ae6                   | 00001ae6                            | 00001ae6               | 00001ae6               | 00001ae6                            |
| + CPU_tb/R5          | 000001f8            | 00001ae5            | 00001ae5                   | 00001ae5                            | 00001ae5               | 00001ae5               | 00001ae5                            |
| + CPU_tb/R6          | 00000000            | 00001ae5            | 00001ae5                   | 00001ae5                            | 00001ae5               | 00001ae5               | 00001ae5                            |
| + CPU_tb/R7          | 0000001c            | 000050af            | 000050af                   | 000050af                            | 000050af               | 000050af               | 000050af                            |
| + CPU_tb/FS          | 00000               | 01010               | 00000                      | 00110                               | 01100                  | 00000                  | 00000                               |
| + CPU_tb/IR          | 0000000000000000    | 1001110000100000    | 0000000000000000           | 0001100011111111                    | 1001111000000000       | 0000000000000000       | 0000000000000000                    |
| + CPU_tb/ROM         | 0000000000000000    | 10000001110110110   | 0000000000000000           | 0011000111111111                    | 1001111000000000       | 0000000000000000       | 10000001110110110                   |
| + CPU_tb/PC          | 2d                  | 1c                  | 1d                         | 40                                  | 41                     | 42                     | 1e                                  |
| + CPU_tb/Data        | 0000000000000000    | 0000000000000000    | 0000000000000000           | 0000000000000000                    | 0000000000000000       | 0000000000000000       | 0000000000000000                    |
| + CPU_tb/DR          | 000                 | 000                 | 000                        | 001                                 | 111                    | 000                    | 000                                 |
| + CPU_tb/SA          | 000                 | 000                 | 000                        | 001                                 | 000                    | 000                    | 000                                 |
| + CPU_tb/SB          | 000                 | 000                 | 000                        | 111                                 | 000                    | 000                    | 000                                 |
| + CPU_tb/ex          | 0000000000000000    | 0000000000000000    | 0000000000000000           | 0000000000000000                    | 0000000000000000       | 0000000000000000       | 0000000000000000                    |
| + CPU_tb/DS          | 000100              | 000010              | 000100                     | 0000000000000000                    | 0100000000000000       | 0100000000000000       | 0001000000000000                    |
| + CPU_tb/PS          | 01                  | 01                  | 10                         | 01                                  | 10                     | 01                     | 01                                  |
| + CPU_tb/S_Sel       | 000                 | 001                 | 000                        | 000                                 | 000                    | 000                    | 000                                 |
| + CPU_tb/NS          | St0                 | St0                 | St0                        | St0                                 | St0                    | St0                    | St0                                 |
| + CPU_tb/WR          | St0                 | St0                 | St0                        | St0                                 | St0                    | St0                    | St0                                 |
| + CPU_tb/MW          | St0                 | St0                 | St0                        | St0                                 | St0                    | St0                    | St0                                 |

  

|                      |                     |                        |                            |                                     |                                     |                                     |                                     |
|----------------------|---------------------|------------------------|----------------------------|-------------------------------------|-------------------------------------|-------------------------------------|-------------------------------------|
| + CPU_tb/ControlWord | 1100000000000000... | 1000000000000000101... | 01000000000000001010000... | 11000000000000000000000000000000... | 11000000000000000000000000000000... | 11000000000000000000000000000000... | 11000000000000000000000000000000... |
| + CPU_tb/R0          | 00000000            | 00000000               | 00000000                   | 00000000                            | 00000000                            | 00000000                            | 00000000                            |
| + CPU_tb/R1          | 0000003f            | 00000000               | 00000000                   | 00000000                            | 00000000                            | 00000000                            | 00000000                            |
| + CPU_tb/R2          | 0000003f            | 000000ff               | 000000ff                   | 000000ff                            | 000000ff                            | 000000ff                            | 000000ff                            |
| + CPU_tb/R3          | 000000fc            | 000008f7               | 000008f7                   | 000008f7                            | 000008f7                            | 000008f7                            | 000008f7                            |
| + CPU_tb/R4          | 000001f9            | 00001ae6               | 00001ae6                   | 00001ae6                            | 00001ae6                            | 00001ae6                            | 00001ae6                            |
| + CPU_tb/R5          | 000001f8            | 00001ae5               | 00001ae5                   | 00001ae5                            | 00001ae5                            | 00001ae5                            | 00001ae5                            |
| + CPU_tb/R6          | 00000000            | 00001ae5               | 00001ae5                   | 00001ae5                            | 00001ae5                            | 00001ae5                            | 00001ae5                            |
| + CPU_tb/R7          | 0000001c            | 000050af               | 000050af                   | 000050af                            | 000050af                            | 000050af                            | 000050af                            |
| + CPU_tb/FS          | 00000               | 01010                  | 00000                      | 00110                               | 01100                               | 00000                               | 00000                               |
| + CPU_tb/IR          | 0000000000000000    | 1001110000100000       | 0000000000000000           | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    |
| + CPU_tb/ROM         | 0000000000000000    | 10000001110110110      | 0000000000000000           | 0011000111111111                    | 1001111000000000                    | 0000000000000000                    | 10000001110110110                   |
| + CPU_tb/PC          | 2d                  | 1c                     | 1d                         | 40                                  | 41                                  | 42                                  | 1e                                  |
| + CPU_tb/Data        | 0000000000000000    | 0000000000000000       | 0000000000000000           | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    |
| + CPU_tb/DR          | 000                 | 000                    | 000                        | 001                                 | 111                                 | 000                                 | 000                                 |
| + CPU_tb/SA          | 000                 | 000                    | 000                        | 001                                 | 000                                 | 000                                 | 000                                 |
| + CPU_tb/SB          | 000                 | 000                    | 000                        | 111                                 | 000                                 | 000                                 | 000                                 |
| + CPU_tb/ex          | 0000000000000000    | 0000000000000000       | 0000000000000000           | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    | 0000000000000000                    |
| + CPU_tb/DS          | 000100              | 000010                 | 000100                     | 0000000000000000                    | 0100000000000000                    | 0100000000000000                    | 0001000000000000                    |
| + CPU_tb/PS          | 01                  | 01                     | 10                         | 01                                  | 10                                  | 01                                  | 01                                  |
| + CPU_tb/S_Sel       | 000                 | 001                    | 000                        | 000                                 | 000                                 | 000                                 | 000                                 |
| + CPU_tb/NS          | St0                 | St0                    | St0                        | St0                                 | St0                                 | St0                                 | St0                                 |
| + CPU_tb/WR          | St0                 | St0                    | St0                        | St0                                 | St0                                 | St0                                 | St0                                 |
| + CPU_tb/MW          | St0                 | St0                    | St0                        | St0                                 | St0                                 | St0                                 | St0                                 |

(Left Side Zoom)

This program tests the functionality of the CALL and RET instructions. Here, I'm calling a subroutine at address 40 (hex), where a literal value will be XOR'd with a register value. After that, the program counter should jump back to where it was prior to the subroutine. It does so by pushing the program counter value onto the stack (with PC output selected). For my implementation, CALL is a two cycle instruction.

The screenshot shows a memory dump and assembly code. The assembly code is as follows:

```

11001000001100011010... 0100011111100110000...
110000000000000000000000
00110 01100 00000
00110001111111111111 10011111000000000000
10011111000000000000 00000000000000000000
41 42 1c
00000000000000000000... 00000000... 00000000... 0000000000000000
001 111 000
001 000
111
0000000011111111 0000000000000000
010000 000100
10 01
010 000

```

The memory dump shows the same data as the assembly code. A blue arrow points from the value at address 41 (41) to the value at address 42 (42). The value at address 42 is highlighted in blue. The assembly code at address 42 is a RET instruction, which pops the value from the stack (000000ff) onto the program counter. The value 000000ff is also highlighted in blue.

(Right Side Zoom)

On the left, the XRI instruction happens as the PC counts up from 40. At address 42, the instruction register is executing the RET (return) instruction, which pops the previously stored program counter value onto the data bus. The program counter selects it set to 10, meaning it will take whatever is on the data bus as its own value. That's how the jump from 42 back to 1c happens.

# Section 16: Appendix

Figure 16-1: Register File Decoder

```
module decoder3to8(in0, in1, in2, out0, out1, out2, out3, o
input in0, in1, in2;
output reg out0, out1, out2, out3, out4, out5, out6, out7;

always @ (in2,in1,in0)
begin
    out0=0;
    out1=0;
    out2=0;
    out3=0;
    out4=0;
    out5=0;
    out6=0;
    out7=0;
begin
    case ({in2,in1,in0})
        3'b000: out0=1;
        3'b001: out1=1;
        3'b010: out2=1;
        3'b011: out3=1;
        3'b100: out4=1;
        3'b101: out5=1;
        3'b110: out6=1;
        3'b111: out7=1;
    default: begin
        out0=0;
        out1=0;
        out2=0;
        out3=0;
        out4=0;
        out5=0;
        out6=0;
        out7=0;
    end
    endcase
end
end
endmodule
```

**Figure 16-2: Register File Mux**



**Figure 16-3: ALU Cell**



**Figure 16-4: Instruction Register Verilog**

```
module IR (clock, load, D_in, D_out);
    parameter n = 16;
    input clock, load;
    input [n-1:0]D_in;
    output reg [n-1:0]D_out;
    always @ (posedge clock) begin
        if(load) begin
            D_out <= D_in;
        end
    end
endmodule
```

(Basically normal register)

**Figure 16-5: Program Counter Verilog**

```
module PC (clock, PS, PC_in, PC_offset, PC);
    input clock;
    input [15:0] PC_in;
    input [15:0] PC_offset;
    input [1:0]PS;
    output reg [15:0]PC;
    always @(posedge clock)begin
        case (PS)
            2'b00: PC = PC; // do nothing
            2'b01: PC = PC+1'b1; // increment
            2'b10: PC = PC_in; // program counter gets data from data bus
            2'b11: PC = PC+PC_offset; // program counter gets pc + offset
        endcase
    end
endmodule
```

Figure 16-6: Better View of Datapath (Jeremy)



Figure 16-7: Clock Divider

```

module clock_divider(
    input wire clk,
    output reg div_clk
);

parameter div_value = 0; //div_value = 50Mhz / 2*desried freq) -1
integer counter_value = 0;

always@ (posedge clk)
begin
    if (counter_value == div_value)
        counter_value <= 0;
    else
        counter_value <= counter_value + 1;
end

always@ (posedge clk)
begin
    if (counter_value == div_value)
        div_clk <= ~div_clk;
    else
        div_clk <= div_clk;
end
endmodule

```

**Figure 16-8: VGA Horizontal Counter**

```
module horizontal_counter(
    input clk_25,
    output reg enable_V_Counter =0,
    output reg [15:0] H_Count_value
);

    always @ (posedge clk_25) begin
        if (H_Count_value < 799) begin
            H_Count_value <= H_Count_value + 1;
            enable_V_Counter <= 0;
        end
        else begin
            H_Count_value <= 0;
            enable_V_Counter <= 1;
        end
    end
endmodule
```

**Figure 16-9: VGA Vertical Counter**

```
module vertical_counter(
    input clk_25,
    input enable_V_Counter,
    output reg [15:0] V_Count_value =0
);

    always @ (posedge clk_25) begin
        if (enable_V_Counter == 1'b1) begin
            if (V_Count_value < 524)
                V_Count_value <= V_Count_value + 1;
            else V_Count_Value <= 0;
        end
    end
endmodule
```

Figure 16-10: VGA Top

```
module vga_top(
    input clk,
    input [9:0]sw,
    input [1:0]KEY,
    output Hsync,
    output Vsync,
    output [3:0] Red,
    output [3:0] Green,
    output [3:0] Blue
);

wire clk_25;
wire enable_v_counter;
wire [15:0] H_Count_value;
wire [15:0] V_Count_value;
wire red_switch0, red_switch1, red_switch2, red_switch3;
wire green_switch0, green_switch1, green_switch2, green_switch3;
wire blue_switch0, blue_switch1, blue_button0, blue_button1;
wire [3:0] red_switches, green_switches, blue_switches;

clock_divider VGA_clock_gen (clk, clk_25);
horizontal_counter VGA_H (clk_25, enable_v_counter, H_Count_value);
vertical_counter VGA_V (clk_25, enable_v_counter, V_Count_value);

assign red_switch0 = sw[0];
assign red_switch1 = sw[1];
assign red_switch2 = sw[2];
assign red_switch3 = sw[3];

assign green_switch0 = sw[4];
assign green_switch1 = sw[5];
assign green_switch2 = sw[6];
assign green_switch3 = sw[7];

assign blue_switch0 = sw[8];
assign blue_switch1 = sw[9];
assign blue_button0 = ~KEY[0];
assign blue_button1 = ~KEY[1];

assign red_switches = {red_switch0, red_switch1, red_switch2, red_switch3};
assign green_switches = {green_switch0, green_switch1, green_switch2, green_switch3};
assign blue_switches = {blue_switch0, blue_switch1, blue_button0, blue_button1};

assign Hsync = (H_Count_value < 96) ? 1'b1: 1'b0;
assign Vsync = (V_Count_value < 2) ? 1'b1: 1'b0;

assign Red = (H_Count_value < 784 && H_Count_value > 143 && V_Count_value < 515 && V_Count_value > 34) ? red_switches : 4'h0;
assign Green = (H_Count_value < 784 && H_Count_value > 143 && V_Count_value < 515 && V_Count_value > 34) ? green_switches : 4'h0;
assign Blue = (H_Count_value < 784 && H_Count_value > 143 && V_Count_value < 515 && V_Count_value > 34) ? blue_switches : 4'h0;

endmodule
```

**Figure 16-11: Multiplier Schematic**



(Started with 2 bit multiplier, worked up to 16 bit using different sized adders)

**Figure 16-12: Stack Verilog (1)**

```
module stack(d_in, d_out, push, pop, reset, empty, full, clock);
    input [15:0] d_in;
    input push, pop, reset, clock;
    output reg empty, full; // signals that determine if the stack is empty or full
    output reg [15:0] d_out;
    reg [15:0] stack_mem[0:127]; // 128 deep, can be modified but must also change width of SP
    reg [6:0] SP; // stack pointer max value of 1111111 = 127
    integer i;
    always @ (posedge clock)
    begin
        if (reset==1) begin
            SP      = 7'b1000000; // reset will set SP to this value
            empty   = SP[6]; // reset will empty stack so empty <= 1
            d_out   = 4'h0;
            for (i=0;i<128;i=i+1) begin
                stack_mem[i]= 0; // clearing stack
            end
        end
        else if (reset==0) begin
            full = SP ? 0:1;
        end
    end
endmodule
```

**Figure 16-13: Stack Verilog (2)**

```
empty  = SP[6];
d_out  = 16'h0000;
if (full == 1'b0 && push == 1'b1) begin // not full and pushing
    SP      = SP-1'b1; // stack pointer decrements
    full    = SP ? 0:1; // check if full
    empty   = SP[6];
    stack_mem[SP] = d_in; // getting data based on SP address
end
else if (empty == 1'b0 && pop == 1'b1) begin // not full and popping
    d_out  = stack_mem[SP]; // outputs gets stack at SP address
    stack_mem[SP] = 0; // fill address with zero
    SP    = SP+1; // stack pointer increments
    full  = SP? 0:1; // check full
    empty = SP[6]; // check empty
end
else;
```

