

Q1

(a) RESET



With initialise =  $A \leftarrow 0, P \leftarrow n-1$

Clear\_C =  $C \leftarrow 0$

Load =  $A \leftarrow A+B, C \leftarrow C_{out}$

Shift\_dec =  $C || A || Q \leftarrow sr C || A || Q, P \leftarrow P-1$

(b) In a microcoded control solution, control words are now stored and as such they cannot be made dynamically depend on the value of control input. Hence, additional states must be introduced to supply these alternative control words.

As a result, conditional output boxes will be replaced with state boxes in the ASM chart of the microcoded solution.

- Q2. (a) • When it is first turned on, the Program Counter (PC) and Control Address Register (CAR) are both reset.
- The PC specifies the address of the next microprogram in the Memory M, while the CAR specifies the address of the next set of machine code instructions in the Control Memory.
  - When the CAR is reset, it points to the machine code for fetching the next instruction.
  - When the PC is reset, it points to the first microprogram in memory.
  - This microprogram is then loaded into the IR register, where it is split into a 7-bit opcode, and three 3-bit values that are either used for register selection or as values (for immediate values or PC offset).
  - The opcode is loaded into MUX C, and, as the control memory is in the Instruction Fetch state, it is loaded from MUX C into the CAR and then used to select the next set of machine code in the Control Memory.
  - The Control Memory will carry out one or more instructions before going back into instruction fetch mode.
  - The Control Memory controls the inputs to all of the multiplexers, the functional unit, the PC, IR, CAR, and memory.

NOTE Not sure how much detail is needed, so just adding in extra for the sake of it.

- Q2. (a cont.)
- Values coming from either the output of the functional unit or memory (as selected by MUX D) can be loaded into registers in the register file if RW is set high. The destination register is selected by DR coming from the IR, or TD (Per the temp register) coming from the control memory.
  - The functional unit takes in values from the register file (selected by SA, SB, TA or TB) or immediate values and performs arithmetic, logic or shifting operations on these values. The status bits from the functional unit are used for conditional instructions in the microprogrammed control.
  - Values from the register files can also be stored in memory if MW is set.
  - The processor is pipelined, so a number of operations can be happening simultaneously, i.e. instruction fetching, execution of a previous instruction, and writing of the results of a previous instruction to the register file.

(b) NOTE Not sure what Immediate instruction means

## Function Unit

Q3-(a)



### Shifter



Note: The shifter contains n multiplexers. The input B is split into individual bits which are input to the multiplexers, all of the multiplexers share the same select bit S.



KVH

CS 2022 Paper 2013

Q3 (a cont.) Note Not sure how low level is needed.



B input logic (1-bit slice)



Ripple Adder



Full Adder



Q3 (b) The boolean expressions implement a carry look-ahead adder.

The equation  $C_{i+1} = g_i + p_i C_i$  defines the carry out for each stage of the adder.

A carry out occurs when either both inputs are 1 ( $g_i = x_i y_i$ ) or when there is a carry in for that stage and at least one of the inputs is 1 ( $p_i = x_i + y_i$ ).

At each stage, the equation is fully expanded, so as to reduce the propagation delay, at the expense of redundant gates.

The alternative (as in the ripple adder) is to simply take the  $p_i$  AND  $C_i$  of the previous stage, which incurs a great propagation delay, linear relative to the adder length.

The carry lookahead design instead causes (at best) a logarithmic propagation delay relative to the adder length, with each further stage having a longer delay on its carry in.

The carry lookahead design ~~uses~~ uses increasingly large gates, which have a limited fanout, so the actual efficiency (in terms of propagation delay, power usage and physical size) of this method decreases the larger the adder is.

Q4 (a)

reg4.vhd

library ieee;

use ieee.std\_logic\_1164.all;

entity reg4 is

Port ( D : in std\_logic\_vector (3 downto 0);

Load, Clk : in std\_logic;

(Q : out std\_logic\_vector (3 downto 0));

end reg4;

architecture behavioral of reg4 is

begin

process (Clk)

begin

if (rising-edge (Clk)) then

if (Load = '1') then

Q &lt;= D after 5ns;

end if;

end if;

end process;

end behavioral;

out puts ((IA xor OA) &gt; 00

out puts ((IA xor OA) &gt; 10

out puts ((IA xor OA) &gt; 50

out puts ((IA xor OA) &gt; 80

## mux 2 - 4bit.vhd

library ieee;

use ieee.std\_logic\_1164.all;

entity mux2\_4bit is

Port( In0, In1 : in std\_logic\_vector (3 downto 0);

S: in std\_logic;

Z: out std\_logic\_vector (3 downto 0));

end mux2\_4bit;

architecture behavioral of mux2\_4bit is

begin

Z <= In0 after 5ns when S = '0' else

In1 after 5ns when S = '1' else

"0" after 5ns;

end behavioral;

## decoder - 2to4.vhd

library ieee;

use ieee.std\_logic\_1164.all;

entity decoder\_2to4 is

Port( A0, A1: in std\_logic;

Q0, Q1, Q2, Q3: out std\_logic);

end decoder\_2to4;

architecture behavioral of decoder\_2to4 is

begin

Q0 <= ((not A0) and (not A1)) after 5ns;

Q1 <= (A0 and (not A1)) after 5ns;

Q2 <= ((not A0) and A1) after 5ns;

Q3 <= (A0 and A1) after 5ns;

end behavioral;

Q4 (a cont.)

### mux4\_bit.vhd

Library ieee;

use ieee.std\_logic\_1164.all;

entity mux4\_bit is

Port ( In0, In1, In2, In3 : in std\_logic\_vector(3 downto 0);  
S0, S1 : in std\_logic;  
Z : out std\_logic\_vector(3 downto 0));

end mux4\_bit;

architecture behavioral of mux4\_bit is

begin

Z <= In0 after 5ns when S0 = '0' and S1 = '0' else  
In1 after 5ns when S0 = '1' and S1 = '0' else  
In2 after 5ns when S0 = '0' and S1 = '1' else  
In3 after 5ns when S0 = '1' and S1 = '1' else  
x "0" after 5ns;

end behavioral;

## register\_file.vhd

Q4. library IEEE;

use IEEE.STD\_LOGIC\_1164.ALL;

use IEEE.STD\_LOGIC\_ARITH.ALL;

use IEEE.STD\_LOGIC\_UNSIGNED.ALL;

entity register\_file is

Port (src\_S0, src\_S1, des\_A0, des\_A1: in std\_logic;

Clk, data\_src: in std\_logic;

data: in std\_logic\_vector (3 downto 0);

reg0, reg1, reg2, reg3: out std\_logic\_vector (3 downto 0);

end register\_file;

architecture behavioral of register\_file is

component reg4

Port (D: in std\_logic\_vector (3 downto 0);

Load, Clk: in std\_logic;

Q: out std\_logic\_vector (3 downto 0);

end component;

component decoder\_2to4

Port (AO, AI: in std\_logic;

Q0, Q1, Q2, Q3: out std\_logic);

end component;

component mux2\_4bit

Port (In0, In1: in std\_logic\_vector (3 downto 0);

S: in std\_logic;

Z: out std\_logic\_vector (3 downto 0);

end component;

component mux4-bit

Port (In0, In1, In2, In3 : in std\_logic\_vector (3 downto 0);

S0, S1 : in std\_logic;

Z : out std\_logic\_vector (3 downto 0));

end component;

signal Load\_Reg0, Load\_Reg1, Load\_Reg2,

Load\_Reg3 : std\_logic;

signal reg0-q, reg1-q, reg2-q, reg3-q,

data\_src\_mux\_out, src-reg : std\_logic\_vector (3 downto 0);

begin

reg00 : reg4 port map(

D => data\_src\_mux\_out,

Load => Load\_Reg0,

Clk => Clk,

Q => reg0-q

);

reg01 : reg4 port map(

D => data\_src\_mux\_out,

Load => Load\_Reg1,

Clk => Clk,

Q => reg1-q

);

reg02 : reg4 port map(

D => data\_src\_mux\_out,

Load => Load\_Reg2,

Clk => Clk,

Q => reg2-q

);

reg03: reg4 port map(  
D => data\_src\_mux\_out,  
local => local-reg3,  
Clk => Clk,  
Q => reg3-q  
);

des-decoder-2to4: decoder-2to4 port map(  
A0 => des-A0,  
A1 => des-A1,  
Q0 => local-reg0,  
Q1 => local-reg1,  
Q2 => local-reg2,  
Q3 => local-reg3  
);

data\_src\_mux2-bit: mux2-bit port map(  
In0 => data,  
In1 => src-reg,  
S => data\_src,  
Z => data\_src\_mux\_out  
);

Inst-mux4-bit: mux4-bit port map(  
In0 => reg0-q,  
In1 => reg1-q,  
In2 => reg2-q,  
In3 => reg3-q,  
S0 => src-S0,  
S1 => src-S1,  
Z => src-reg  
);

## KUH CS2022 Paper 2013

```
reg0 <= reg0-q;  
reg1 <= reg1-q;  
reg2 <= reg2-q;  
reg3 <= reg3-q;  
end behavioural;
```

- Q6 (b) This register file can load data from external sources using the data(3:0) input, one register at a time. Alternatively, it can copy data from one register to another using the src and des selection inputs.

The data-src input decides whether external source or register source data will be loaded into the destination register.

On every clock cycle, one of the registers will be updated, so it is difficult to have all registers keep their values unless one is careful to always copy the value of a register back to itself.