



## 本科实验报告

课程名称：计算机组成  
姓名：曾一欣

学院：竺可桢学院

班级：求是科学班（计算机科学与技术）

专业：计算机科学与技术

学号：3180105144

指导老师：姜晓红  
2020.5.6

# 浙江大学实验报告

课程名称：计算机组成      实验类型：综合

实验项目名称：Lab4 Multi Cycle CPU

学生姓名：曾一欣      专业：计算机科学与技术      学号：3180105144

同组学生姓名：None      指导老师：姜晓红

实验地点：无      实验日期：2020 年 6 月 19 日

note: In this experiment I designed the SCPU by myself instead of follow the instructions in the slides. So the report might be different from lab PPTs because my steps to construct the SCPU is different.

## 一、实验目的和要求

1. 构建数据通路
2. 构建控制器
3. 设计数据通路的核心部件
4. 连接完成MCPU

## 二、实验内容和原理

**Decomposability of computer systems**



### Digital circuit



DataPath



Controller



## ALU

use the same module ALU in SCPU

| ALU Control Lines | Function                | note |
|-------------------|-------------------------|------|
| <b>000</b>        | <b>And</b>              | 兼容   |
| <b>001</b>        | <b>Or</b>               | 兼容   |
| <b>010</b>        | <b>Add</b>              | 兼容   |
| <b>110</b>        | <b>Sub</b>              | 兼容   |
| <b>111</b>        | <b>Set on less than</b> |      |
| <b>100</b>        | <b>nor</b>              | 扩展   |
| <b>101</b>        | <b>srl</b>              | 扩展   |
| <b>011</b>        | <b>xor</b>              | 扩展   |



### Register file

use the same module in SCPU



## 三、 实验过程和数据记录及结果分析

### Construct the circuit

construct by myself:



follow the lead of the slides:



## Construct the Ctrl part

| 看PPT的疑问: Branch和PC Write Cond 有什么区别? |          |         |             |      |         |         |          |        |       |          |          | ↓        |        |       |       |        |
|--------------------------------------|----------|---------|-------------|------|---------|---------|----------|--------|-------|----------|----------|----------|--------|-------|-------|--------|
|                                      | 状态       | PCWrite | PCWriteCond | Load | MemRead | IRWrite | MemWrite | MemLog | PCSrc | AllSIC_A | AllSIC_B | RegWrite | RegDst | MemID | AllOp | branch |
| ✓ IF                                 | 0000(1)  | 1       | 0           | 0    | 1       | 0       | 1        | 00     | 00    | 0        | 01       | 0        | 00     | 1     | 00    | 0      |
| ✓ ID                                 | 0001(11) | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 0        | 11       | 0        | 00     | 0     | 00    | 0      |
| ✓ Mem_Ex                             | 0010(2)  | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 10       | 0        | 00     | 0     | 00    | 0      |
| ✓ Mem_RD                             | 0011(3)  | 0       | 0           | 1    | 1       | 0       | 0        | 00     | 00    | 1        | 10       | 1        | 00     | 0     | 00    | 0      |
| ✓ LW_WB                              | 0100(4)  | 0       | 0           | 0    | 0       | 0       | 0        | 01     | 00    | 0        | 10       | 1        | 00     | 0     | 00    | 0      |
| ✓ Mem_WD                             | 0101(5)  | 0       | 0           | 1    | 0       | 1       | 0        | 00     | 00    | 1        | 10       | 0        | 00     | 1     | 00    | 0      |
| ✓ R_Exe                              | 0110(6)  | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 00       | 0        | 00     | 0     | 10    | 0      |
| ✓ R_WB                               | 0111(7)  | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 00       | 1        | 01     | 0     | 00    | 0      |
| ✓ Beq_Exe                            | 1000(8)  | 0       | 1           | 0    | 0       | 0       | 0        | 00     | 01    | 1        | 00       | 0        | 00     | 0     | 01    | 1      |
| ✓ J                                  | 1001(9)  | 1       | 0           | 0    | 0       | 0       | 0        | 00     | 10    | 0        | 11       | 0        | 00     | 0     | 00    | 0      |
| ✓ I_Exe                              | 1010(10) | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 10       | 0        | 00     | 0     | 11    | 0      |
| ✓ I_WB                               | 1011(11) | 0       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 10       | 1        | 00     | 0     | 00    | 0      |
| ✓ Lui_WB                             | 1100(12) | 0       | 0           | 0    | 0       | 0       | 0        | 10     | 00    | 0        | 11       | 1        | 00     | 0     | 00    | 0      |
| ✓ Bne_Exe                            | 1101(13) | 0       | 1           | 0    | 0       | 0       | 0        | 00     | 01    | 1        | 00       | 0        | 00     | 0     | 01    | ?      |
| ✓ Jr                                 | 1110(14) | 1       | 0           | 0    | 0       | 0       | 0        | 00     | 00    | 1        | 00       | 0        | 00     | 0     | 00    | 0      |
| Jal                                  | 1111(15) | 1       | 0           | 0    | 0       | 0       | 0        | 11     | 10    | 0        | 11       | 1        | 10     | 0     | 00    | 0      |

  

The diagram illustrates the control flow and data dependencies between various instruction types. Nodes include IF, ID, R\_Exe, Mem\_Ex, Beq\_Exe, J, I\_Exe, I\_WB, Lui, Bne\_Exe, and Jal. Arrows indicate transitions between stages (IF to ID, ID to R\_Exe, etc.) and control flow from conditional branches (J, Beq\_Exe) to their target blocks.

ALU: R-type, I type → 单操作  
Bne, Beq, Sub  
其他操作

SHOT ON MI 8  
AI DUAL CAMERA

## Modules

### ALU\_SCPU

use the module ALU4b last semester

```
module ALU32b(
    input [31:0]A,
    input [31:0]B,
    input [1:0]S,
    output [31:0]C,
    output Co
);
```

```

wire c0;
ALU4b m1(.A(A[3:0]), .B(B[3:0]), .S(S), .C(C[3:0]), .Ci(S[0]), .Co(C0));
wire c1;
ALU4b m2(.A(A[7:4]), .B(B[7:4]), .S(S), .C(C[7:4]), .Ci(C0), .Co(C1));
wire c2;
ALU4b m3(.A(A[11:8]), .B(B[11:8]), .S(S), .C(C[11:8]), .Ci(C1), .Co(C2));
wire c3;
ALU4b m4(.A(A[15:12]), .B(B[15:12]), .S(S), .C(C[15:12]), .Ci(C2), .Co(C3));
wire c4;
ALU4b m5(.A(A[19:16]), .B(B[19:16]), .S(S), .C(C[19:16]), .Ci(C3), .Co(C4));
wire c5;
ALU4b m6(.A(A[23:20]), .B(B[23:20]), .S(S), .C(C[23:20]), .Ci(C4), .Co(C5));
wire c6;
ALU4b m7(.A(A[27:24]), .B(B[27:24]), .S(S), .C(C[27:24]), .Ci(C5), .Co(C6));
ALU4b m8(.A(A[31:28]), .B(B[31:28]), .S(S), .C(C[31:28]), .Ci(C6), .Co(Co));

endmodule

```

```

module ALUSCPU(
    input [31:0]A,
    input [31:0]B,
    input [2:0]op, //operation
    output [31:0]C,
    output zero,
    output [1:0]S,
    output [31:0]result1
);

wire [1:0]S=0;
assign S[0]=(op==6|op==1|op==7)?1:0;
assign S[1]=(op==0|op==1)?1:0;
wire co;
wire [31:0]result1; //ALU result
ALU32b m1(A,B,S,result1,co);

wire [31:0]result2; //SLL result
wire [31:0]result3; //SRL result
wire [31:0]result4; //SLT result
assign result4=(result1[31]==1)?1:0;
SLL m2(A,B[10:6],result2);
SRL m3(A,B[10:6],result3);

wire [31:0]temp1;
wire [31:0]temp2;
assign temp1=(op==7)?result4:result1;
assign temp2=(op==4)?result3:result2;

assign C=(op==3|op==4)?temp2:temp1;
assign
zero=C[0]|C[1]|C[2]|C[3]|C[4]|C[5]|C[6]|C[7]|C[8]|C[9]|C[10]|C[11]|C[12]|C[13]|C[14]|C[15]|C[16]|C[17]|C[18]|C[19]|C[20]|C[21]|C[22]|C[23]|C[24]|C[25]|C[26]|C[27]|C[28]|C[29]|C[30]|C[31];
endmodule

```

simulation:



## Regfile

```
module Regfile(
    input rst,
    input clk,
    input Wreg,
    input [4:0]RN1,
    input [4:0]RN2,
    input [4:0]Des,
    input [31:0]datain,
    output [31:0]A,
    output [31:0]B
);
reg [31:0]a[1:31];
integer i;
assign A=(RN1==0)?0:a[RN1];
assign B=(RN2==0)?0:a[RN2];

always@(posedge rst)
begin
    for(i=1;i<32;i=i+1)
        a[i]<=0;
end

always@(negedge clk)
begin
    if(Des!=0&&Wreg==1)
        a[Des]<=datain;
end

endmodule
```

## REG32

```
module REG32(
    input rst,
    input clk,
    input CE,
    input [31:0]D,
    output [31:0]Q
);
reg [31:0]a;
assign Q=a;

always@(posedge rst)
begin
    a=0;
end

always@(posedge clk)
```

```

begin
  if(CE==1)
    a<=D;
end

endmodule

```

## SLL

In last experiment, I left SLL and SRL for a better solution. In this experiment, I try to calculate the 32 result of SLL from 0 to 31 and use a 32T1 MUX to output the asked one. I thought this method is not good and wish to learn a better way to do SLL from others.



## Ctrl

```

module LuiExt(
  input [15:0]imm,
  output [31:0]LuiData
);
  assign LuiData[31:16]=imm;
  assign LuiData[15:0]=0;

endmodule

```



## Top





## simulation

```
module test_ctrl;

// Inputs
reg clk;
reg reset;
reg [31:0] Inst_in;
reg zero;
reg overflow;
reg MIO_ready;

// Outputs
wire MemRead;
wire MemWrite;
wire [2:0] ALU_operation;
wire [4:0] state_out;
wire CPU_MIO;
wire IorD;
wire IRWrite;
wire [1:0] RegDst;
wire RegWrite;
wire [1:0] MemtoReg;
wire ALUSrcA;
wire [1:0] ALUSrcB;
wire [1:0] PCSource;
wire PCWrite;
wire PCWriteCond;
wire Branch;
wire [1:0] ALUop;

// Instantiate the Unit Under Test (UUT)
ctrl uut (
    .clk(clk),
    .reset(reset),
    .Inst_in(Inst_in),
    .zero(zero),
```

```

    .overflow(overflow),
    .MIO_ready(MIO_ready),
    .MemRead(MemRead),
    .MemWrite(MemWrite),
    .ALU_operation(ALU_operation),
    .state_out(state_out),
    .CPU_MIO(CPU_MIO),
    .IorD(IorD),
    .IRWrite(IRWrite),
    .RegDst(RegDst),
    .RegWrite(RegWrite),
    .MemtoReg(MemtoReg),
    .ALUSrcA(ALUSrcA),
    .ALUSrcB(ALUSrcB),
    .PCSource(PCSource),
    .PCWrite(PCWrite),
    .PCWriteCond(PCwriteCond),
    .Branch(Branch),
    .ALUop(ALUop)
);

initial begin
    // Initialize Inputs
    clk <= 0;
    reset = 1;
    Inst_in = 32'h08000008;
    zero = 0;
    overflow = 0;
    MIO_ready = 1;
    #10;
    reset=0;
    forever #10 clk<=~clk;
    #100;
    reset=1;
end

endmodule

```

1.MIO\_ready is always zero so CPU is not allowed to read instruction. Under this condition we can have a look at default status output.



2.MIO\_ready=1 for add instruction. The method to check the other instrument is just the same.



```
timescale 1ns / 1ps

module M_datapath_M_datapath_sch_tb();
    // Inputs
    reg reset;
    reg clk;
    reg IRWrite;
    reg [31:0] data2CPU;
    reg [1:0] RegDst;
    reg [1:0] MemtoReg;
    reg RegWrite;
    reg ALUSrcA;
    reg [1:0] ALUSrcB;
    reg [2:0] ALU_operation;
    reg IorD;
    reg [1:0] PCSource;
    reg Branch;
    reg PCWriteCond;
    reg PCWrite;
    reg MIO_ready;

    // Output
    wire [31:0] Inst;
    wire [31:0] data_out;
    wire zero;
    wire [31:0] PC_Current;
    wire [31:0] M_addr;

    // Bidirs

    // Instantiate the UUT
    M_datapath UUT (
        .reset(reset),
        .clk(clk),
        .IRWrite(IRWrite),
        .data2CPU(data2CPU),
        .Inst(Inst),
        .RegDst(RegDst),
        .MemtoReg(MemtoReg),
        .Branch(Branch),
        .PCWriteCond(PCWriteCond),
        .PCWrite(PCWrite),
        .MIO_ready(MIO_ready)
    );

```

```

.RegWrite(RegWrite),
.ALUSrcA(ALUSrcA),
.ALUSrcB(ALUSrcB),
.data_out(data_out),
.ALU_operation(ALU_operation),
.zero(zero),
.PC_Current(PC_Current),
.IorD(IorD),
.M_addr(M_addr),
.PCSOURCE(PCSource),
.Branch/Branch),
.PCWriteCond(PCWriteCond),
.PCWrite(PCWrite),
.MIO_ready(MIO_ready)
);
// Initialize Inputs
initial begin
reset = 1;
clk = 0;
IRWrite = 0;
data2CPU = 0;
RegDst = 0;
MemtoReg = 0;
RegWrite = 0;
ALUSrcA = 0;
ALUSrcB = 0;
ALU_operation = 0;
IorD = 0;
PCSource = 0;
Branch = 0;
PCWriteCond = 0;
PCWrite = 0;
MIO_ready = 0;
#10;
reset=0;
forever #10 clk=~clk;
#100;
reset=1;
end
endmodule

```

1. test default status output to make sure the datapath correct. M\_addr is blue because MIO\_ready=0



2. test IF and ID instruction; the other instuctions are the same

```

initial begin
    reset = 1;
    clk = 0;
    IRWrite = 1;
    data2CPU = 0;
    RegDst = 0;
    MemtoReg = 0;
    RegWrite = 0;
    ALUSrcA = 0;
    ALUSrcB = 2'b01;
    ALU_operation = 3'b010;
    IorD = 0;
    PCSource = 0;
    Branch = 0;
    PCWriteCond = 0;
    PCwrite = 1;
    MIO_ready = 1;
    #10;
    reset=0;
    forever #10 clk=~clk;
    #100;
    reset=1;
end

```



## 四、讨论与心得

In this experiment, I could design my own multi cycle CPU. The most difficult part is to figure out the transfer between states and fulfill the state transfer table. It's really difficult to find out errors in the table and design a check method that is efficient. When it comes to wiring, actually, after doing the wiring job in the previous exeperiment, this part is not so difficult now. I have found some easier way to design the schematic and became much more familliar with ise software.

I also met some problems in the experiment. One of them that I haven't understand now is the wire design between Branch and PCWriteCond, I can't really understand whether it is meaningful to depart these two signal. In my mind, they actually have the same function. Then there's another question about how to simulate, I can't think of a solution good enough now. Maybe I have to find a more guaranteed methond to do simulation before final checking.