



# CAO

- sign of mantissa:

$$\begin{array}{l} 0 \rightarrow + \\ 1 \rightarrow - \end{array}$$

xOR  
AND  
logic gates



## # Lec 2-4 :

- Comp. Architecture — user's point of view (external view of computer)

Comp. Organisation — internal view of computer & roles various internal components play during program execution.

Comp. design — Hardware design of computer

- logic gates:

i)



AND



OR



NOT



Buffer



NAND



NOR



XOR



XNOR

$$XOR = A \oplus B = A'B + AB'$$

$$XNOR = (A \oplus B)' = A'B' + AB$$

eg.  $AB \times A'C = F$

Sol:



- Identities:

$$(x+y)' = x'y'$$

$$x+yz = (x+y)(x+z)$$

$$(xy)' = x'+y'$$



OR - Invert



Invert - AND



AND - Invert



- K-Map :

Minterm : for 1 output

| A | B | C | f |
|---|---|---|---|
| 0 | 0 | 0 | 0 |
| 0 | 0 | 1 | 1 |
| 0 | 1 | 0 | 0 |
| 0 | 1 | 1 | 0 |
| 1 | 0 | 0 | 1 |
| 1 | 0 | 1 | - |
| 1 | 1 | 0 | 1 |
| 1 | 1 | 1 | 1 |

$A \nearrow B, C$

|    |    |    |    |
|----|----|----|----|
| 00 | 01 | 11 | 10 |
| 0  | 1  | 3  | 2  |
| 4  | 5  | 7  | 6  |

$$f(A, B, C) = \sum (1, 4, 5, 6, 7)$$

K-Maps :

2 variables

$A \nearrow B, C$

|   |   |   |
|---|---|---|
| 0 | 0 | 1 |
| 0 | 1 | 0 |
| 1 | 2 | 3 |

3 variables

$A \nearrow B, C$

|    |    |    |    |
|----|----|----|----|
| 00 | 01 | 11 | 10 |
| 0  | 1  | 3  | 2  |
| 4  | 5  | 7  | 6  |

4 variables

$A, B \nearrow C, D$

|    |    |    |    |
|----|----|----|----|
| 00 | 01 | 10 | 11 |
| 00 | 0  | 1  | 3  |
| 01 | 4  | 5  | 7  |
| 10 | 12 | 13 | 15 |
| 11 | 8  | 9  | 11 |

eg.

|   |     | 00 | 01 | 11 | 10 |
|---|-----|----|----|----|----|
|   |     | 0  | 0  | 0  | 0  |
|   |     | 1  | 1  | 1  | 1  |
| A | B,C | 0  | 0  | 1  | 0  |
| 1 |     | 1  | 0  | 1  | 1  |

$$F = BC + AC' + AB$$

eg.  $F(A,B,C,D) = \sum(1, 3, 4, 5, 6, 7, 11, 12, 13, 14)$

|     |     | 00 | 01 | 11 | 10 |
|-----|-----|----|----|----|----|
|     |     | 00 | 0  | 1  | 1  |
|     |     | 01 | 1  | 1  | 0  |
| A,B | C,D | 1  | 1  | 0  | 1  |
| 11  |     | 1  | 1  | 0  | 1  |
| 10  |     | 0  | 1  | 1  | 0  |

Sum of product = minterms

- Minterms: (product of sum - POS)

- for 0 output

- 5 variable K-map:

|     |     | 00 | 01 | 11 | 10 |
|-----|-----|----|----|----|----|
|     |     | 0  | 1  | 3  | 2  |
|     |     | 4  | 5  | 7  | 6  |
| B,C | D,E | 12 | 13 | 15 | 14 |
| 10  |     | 8  | 9  | 11 | 10 |

Made with Goodnotes

|     |   | 00 | 01 | 11 | 10 |
|-----|---|----|----|----|----|
|     |   | 16 | 17 | 19 | 18 |
|     |   | 20 | 21 | 23 | 22 |
| D,E | C | 28 | 29 | 31 | 30 |
| 10  |   | 26 | 25 | 27 | 26 |

## - 5 variable Reflection K-Map :

| A,B |    | 000 | 001 | 011 | 010 | 110 | 111 | 101 | 100 |
|-----|----|-----|-----|-----|-----|-----|-----|-----|-----|
|     |    | 00  | 0   | 1   | 2   | 6   | 7   | 5   | 4   |
|     |    | 01  | 8   | 9   | 18  | 10  | 14  | 15  | 13  |
| 00  | 00 | 0   | 1   | 2   | 1   | 0   | 1   | 1   | 0   |
| 01  | 01 | 8   | 9   | 18  | 10  | 14  | 15  | 13  | 12  |
| 11  | 11 | 24  | 25  | 21  | 26  | 30  | 31  | 27  | 28  |
| 10  | 10 | 16  | 17  | 11  | 18  | 22  | 23  | 21  | 20  |

$$B'C'E + AB'C'D + A'C'DE + ABCE + A'B'E' + A'CDE' + AB'DE$$

## - Overlap Maps

| Maps: |   | C, D, E | 000 | 001 | 011 | 010 | 100 | 101 | 111 | 110 |
|-------|---|---------|-----|-----|-----|-----|-----|-----|-----|-----|
| A     | B | 00      | 00  | 10  | 31  | 20  | 40  | 51  | 71  | 61  |
|       |   | 01      | 80  | 91  | 11  | 10  | 11  | 12  | 13  | 15  |
|       |   | 10      | 24  | 25  | 24  | 26  | 28  | 29  | 31  | 50  |
|       |   | 11      | 16  | 17  | 18  | 18  | 20  | 21  | 23  | 22  |
|       |   |         | 0   | 1   | 1   | 1   | 1   | 1   | 0   | 1   |

|   |                  | 00 | 01 | 11 | 10 |   |
|---|------------------|----|----|----|----|---|
|   |                  | 0  | 1  | x  | x  | 1 |
|   |                  | 1  | 0  | x  | 1  | 0 |
| C | B <sub>1</sub> C | 0  | 1  | 0  | 1  | 0 |

$$f = \sum (0, 2, 1) + d(1, 3, 5)$$

50

$$f = A' + C$$

## Combinational Circuits:

half adder : Adds two bits

full adder : Adds three bits

Half Adder :



$$S = A'B + AB' = A \oplus B$$

$$C = AB$$

| Input |   | output |   |
|-------|---|--------|---|
| A     | B | S      | C |
| 0     | 0 | 0      | 0 |
| 0     | 1 | 1      | 0 |
| 1     | 0 | 1      | 0 |
| 1     | 1 | 0      | 1 |

full Adder:

| A | B | C <sub>in</sub> | Sum | C <sub>out</sub> |
|---|---|-----------------|-----|------------------|
| 0 | 0 | 0               | 0   | 0                |
| 0 | 0 | 1               | 1   | 0                |
| 0 | 1 | 0               | 1   | 0                |
| 0 | 1 | 1               | 0   | 1                |
| 1 | 0 | 0               | 1   | 0                |
| 1 | 0 | 1               | 0   | 1                |
| 1 | 1 | 0               | 0   | 1                |
| 1 | 1 | 1               | 1   | 1                |



$$\text{Sum} = A \oplus B \oplus C_{in}$$

$$\text{Carry} = AB + (A \oplus B)C_{in}$$

## #lec 5 ( Sequential logic - flip flops )



### - Flip flops

i) always have a clock signal.      doesn't have a clock signal.

ii) checks input & changes output only at times defined by clock signal.

iii) classified into synchronous

2 asynchronous.

iv) edge triggered device

hatch

— — — changes the output immediately used to store either 1 or 0 at any given time.

no such classification

level triggered device

### SR flipflop:



Truth table for SR flip flop

| S | R | Q(N+1) | Q(N) |
|---|---|--------|------|
| 0 | 0 | Q(N)   | Q(N) |
| 0 | 1 | 0      | Q(N) |
| 1 | 0 | Q(N)   | 0    |
| 1 | 1 | X      | X    |

Characteristic table  
for SR flip flop

| S | R | Q(N) | Q(N+1) |
|---|---|------|--------|
| 0 | 0 | 0    | 0      |
| 0 | 0 | 1    | 1      |
| 0 | 1 | 0    | 0      |
| 0 | 1 | 1    | 0      |
| 1 | 0 | 0    | 1      |
| 1 | 0 | 1    | 1      |
| 1 | 1 | 0    | x      |
| 1 | 1 | 1    | x      |

$$Q(N+1) = S + R'Q(N)$$

JK flip flops: refinement of SR flip flop



| J | K | Q(N+1) |            |
|---|---|--------|------------|
| 0 | 0 | Q(N)   | Hold       |
| 0 | 1 | 0      | Clear to 0 |
| 1 | 0 | 1      | Set to 1   |
| 1 | 1 | Q'(N)  | Toggle     |

Characteristic table  
of JK flip flop

| J | K | Q(N) | Q(N+1) |
|---|---|------|--------|
| 0 | 0 | 0    | 0      |
| 0 | 0 | 1    | 1      |
| 0 | 1 | 0    | 0      |
| 0 | 1 | 1    | 0      |
| 1 | 0 | 0    | 1      |
| 1 | 0 | 1    | 1      |
| 1 | 1 | 0    | 1      |
| 1 | 1 | 1    | 0      |

$$Q(N+1) = JQ(N) + K'Q(N)$$

Made with Goodnotes

## Delay flip flop (D flip flop):

also called transparent latch as input & output are equal  
 SR is converted to D by putting inverter b/w S & R.



| D | Q(n+1) |
|---|--------|
| 0 | 0      |
| 1 | 1      |

clear to 0  
Set to 1

Characteristic table for D flip flop

$$| \quad Q(n+1) = D$$

| D | Q(n) | Q(n+1) |
|---|------|--------|
| 0 | 0    | 0      |
| 0 | 1    | 0      |
| 1 | 0    | 1      |
| 1 | 1    | 1      |

## Toggle flip flop (T flip flop):

obtained from JK flip flop. In JK flip flop when inputs J & K are connected to a single input by T



| T | Q(n+1) |
|---|--------|
| 0 | Q(n)   |
| 1 | Q(n)   |

Characteristic table :  
for T flip flop

| T | Q(N) | Q(N+1) |
|---|------|--------|
| 0 | 0    | 0      |
| 0 | 1    | 1      |
| 1 | 0    | 1      |
| 1 | 1    | 0      |

$$Q(N+1) = T \oplus Q(N) = T'Q(N) + TQ'(N)$$

## # lec 6 : Multiplexer & Demultiplexer

### - Multiplexer :

-  $2^n$  inputs, single output

### - $2 \times 1$ MUX :



Truth table :

| Input               | Output              |
|---------------------|---------------------|
| S <sub>0</sub><br>0 | Y<br>A <sub>0</sub> |
| 1                   | A <sub>1</sub>      |

Logical circuit :



$$Y = A_0 \bar{S}_0 + A_1 S_0$$

### - $4 \times 1$ MUX :



| S <sub>1</sub> | S <sub>0</sub> | Y              |
|----------------|----------------|----------------|
| 0              | 0              | A <sub>0</sub> |
| 0              | 1              | A <sub>1</sub> |
| 1              | 0              | A <sub>2</sub> |
| 1              | 1              | A <sub>3</sub> |



$$Y = S_1' S_0' A_0 + S_1' S_0 A_1 + S_1 S_0' A_2 + S_1 S_0 A_3$$

when input is 0, 1, 2, 3 output is 1 greater than input  
 n n n 4, 5, 6, 7 " " 1 less "

Design circuit.

Sol:

| x | y | z |
|---|---|---|
| 0 | 0 | 0 |
| 0 | 0 | 1 |
| 0 | 1 | 0 |
| 0 | 1 | 1 |
| 1 | 0 | 0 |
| 1 | 0 | 1 |
| 1 | 1 | 0 |
| 1 | 1 | 1 |

| A | B | C | S | C |
|---|---|---|---|---|
| 0 | 0 | 1 | 0 | 0 |
| 0 | 1 | 0 | 1 | 0 |
| 0 | 1 | 1 | 1 | 0 |
| 1 | 0 | 0 | 0 | 1 |
| 0 | 1 | 1 | 1 | 0 |
| 1 | 0 | 1 | 0 | 1 |
| 1 | 1 | 0 | 0 | 1 |
| 1 | 1 | 1 | 1 | 1 |

Draw the circuit using table.  
 (full adder)



# Lecture - 7 (Decoder & Encoder): with an additional input of enable



4 to 16 line decoder :



4 to 2 encoder



| Inputs |       |       |       | Outputs |       |
|--------|-------|-------|-------|---------|-------|
| $Y_3$  | $Y_2$ | $Y_1$ | $Y_0$ | $A_1$   | $A_0$ |
| 0      | 0     | 0     | 1     | 0       | 0     |
| 0      | 0     | 1     | 0     | 0       | 1     |
| 0      | 1     | 0     | 0     | 1       | 0     |
| 1      | 0     | 0     | 0     | 1       | 1     |

$$A_1 = Y_3 \oplus Y_2$$

$$A_0 = Y_1 + Y_3$$



(\*) 8 to 3 encoder = octal to binary

# Boolean Algebra :

$$\text{Properties : } ① A + AB = A(1+B) = A \quad \text{or} \quad A(A+B) = A$$

$$② A + A'B = A + B \quad \text{or} \quad A' + A \cdot B = A' + B$$

$$③ (A + B)' = A' \cdot B' \quad \text{or} \quad (A \cdot B)' = A' \cdot B'$$

for ②

$$A + A'B$$

$$A + (1-A)B$$

$$A + B - AB$$

eg.  $f = (A + B + C)(A + B' + C)(A + B \cdot C)$

$$\begin{aligned} &= (x + c)(x + c') (A + B' + c) \quad (A + B = x) \\ &= (x^2 + x \cdot c' + x \cdot c) (A + B' + c) \\ &= (x^2 + x) (A + B' + c) \\ &= x (A + B' + c) \\ &= (A + B) (A + B' + c) \\ &= \underline{\underline{A}} + \underline{\underline{AB'}} + AC + \underline{\underline{AB}} + BC \\ &= \underline{\underline{A}} + AC + BC \\ &= A + BC \end{aligned}$$

eg.  $G_1 = (A + B)(A + B')(A' + B)(A' + B')$

sol'  $G_1 = (A + AB' + AB) (A' + A'B' + BA')$

$$\begin{aligned} &= (A) (A') \\ &= 0 \end{aligned}$$

minterm - 90s

Made with Goodnotes

minterm - 30s

①

- Minterm & minterm expressions
- Kmap

## # Numericals on full & half adder :

Q1.

| x | y | z | f |
|---|---|---|---|
| 0 | 0 | 0 | 1 |
| 0 | 0 | 1 | 1 |
| 0 | 1 | 0 | 1 |
| 0 | 1 | 1 | 0 |
| 1 | 0 | 0 | 0 |
| 1 | 0 | 1 | 0 |
| 1 | 1 | 0 | 0 |
| 1 | 1 | 1 | 0 |



$$f = x'y' + z'x'$$



## # Unit 2 :



operations performed on this data is microoperations

A microoperation is an elementary operation performed on the info stored in one or more registers.

e.g. shift, load, clear, count

# Internal Hardware Organization of a digital computer

Set of registers the computer contains

Sequence of microoperations performed on the info. stored in the registers

The control that initiates the sequence of microoperations

Register Transfer Language: Symbolic notation to denote microoperation transfer among registers



- Registers are denoted using MAR, PC, IR.

↓      ↗  
Memory access      Program Counter  
Register      Instruction Register



- Register Transfer:

① Replacement operator

$$R_2 \leftarrow R_1$$

Made with **Xamfnotes** from  $R_1$  to  $R_2$

② control signal

if ( $P=1$ ) then  $R_2 \leftarrow R_1$

or

$P : R_2 \leftarrow R_1$



when control variable  $P=1$ ,  $n$  bits are transferred from  $R_1$  to  $R_2$ .

### # Bus Transfer:

- communication lines are required for transfer b/w multiple registers & to ↓ the no. of lines common bus transfer is used.
- Bus transfer using 4 registers:  $n$  bits  $\Rightarrow$   $n$  multiplexer

T: Bus  $\leftarrow C$ ,  $R_1 \leftarrow$  Bus

$R_1 \leftarrow C$

### # Three State Bus Buffer:

Rather than using MUX, we can use three state gates

States of a 3 state gate

Logic 0

Logic 1

High-impedance State  
(open circuit)



# Memory Transfer:

- operations

Read

Write

Read :  $DR \leftarrow M[AR]$

This causes a transfer of info. from memory word M selected by address in AR into DR.

Write :  $M[AR] \leftarrow R_1$

This causes a transfer of info. from  $R_1$  into memory word M selected by address in AR.

## # Microoperations :

$$R_3 \leftarrow R_1 + R_2$$

$$R_3 \leftarrow R_1 - R_2$$

$$R_3 \leftarrow \overline{R}_2$$

$$R_3 \leftarrow \overline{R}_2 + 1$$

$$R_3 \leftarrow R_1 + \overline{R}_2 + 1$$

$$R_1 \leftarrow R_1 + 1$$

$$R_1 \leftarrow R_1 - 1$$

Content of  $R_1$  plus  $R_2$  transferred to  $R_3$

Content of  $R_1$  minus  $R_2$  transferred to  $R_3$

1's complement of  $R_2$

2's complement of  $R_2$  (negate)

$R_1$  plus 2's complement of  $R_2$  (subtraction)

Increment content of  $R_1$

Decrement content of  $R_1$

- 4 bit binary adder

Binary adder-subtractor

Binary Incrementer

Binary Adder : A binary adder is constructed with full adder circuits connected in cascade, output carry converted as input carry of next adder.



n-bit binary adder requires n full adders

## Binary Adder-Subtractor:

- 1's complement — can be implemented with inverters and added to sum through input carry
- 2's complement — Take 1's complement and add 1 to least significant pair of bits  
(subtraction)

To calculate  $A - B$ , take 2's complement of  $B$  and add it to  $A$ .



## Binary Incrementer: Half Adders are used.

0110  $\xrightarrow{\text{Increment by 1}}$  0111



## Arithmetic Circuit

| Select                            | Input     | Output                | Microoperation       |
|-----------------------------------|-----------|-----------------------|----------------------|
| S <sub>1</sub> S <sub>0</sub> Cin | y         | D = A + y + Cin       | Add                  |
| 0 0 0                             | B         | D = A + B             | Add with carry       |
| 0 0 1                             | B         | D = A + B + 1         | Subtract with borrow |
| 0 1 0                             | $\bar{B}$ | D = A + $\bar{B}$     | Subtract             |
| 0 1 1                             | $\bar{B}$ | D = A + $\bar{B}$ + 1 |                      |
| 1 0 0                             | 0         | D = A                 | Transfer             |
| 1 0 1                             | 0         | D = A + 1             | Increment A          |
| 1 1 0                             | 1         | D = A - 1             | Dec                  |
| 1 1 1                             | 1         | D = A                 | Transfer             |

④ '+' in control statement means 'OR' , '+' in microop. denotes arithmetic sum.

$$P+Q : R_1 \leftarrow R_2 + R_3, R_4 \leftarrow R_5 \vee R_6$$

↓

OR      Add      OR

## Applications of Logic Microoperations:

selective clear :

$$\begin{array}{r}
 1010 \\
 1100 \\
 \hline
 0010
 \end{array} \quad A$$

*reg*

$$\begin{array}{r}
 1010 \\
 0011 \\
 \hline
 0010
 \end{array} \quad B$$

*reg*

A (after)

selective set :

set bits of A to 1 for corresponding 1's in B

$$A \leftarrow A \wedge \bar{B}$$

selective complement :

Made with Goodnotes  
complement positions of A where there are 1's in B.

Mask operation :  
(AND)

$$\begin{array}{r} 1010 \\ 1100 \\ \hline 1000 \end{array} \quad A$$
$$B$$
$$A \text{ (after)}$$

Insert operation : ?

trying to mask the  
4 leftmost bits



$$\begin{array}{r} 0110 1010 \\ 0000 1111 \\ \hline 0000 1010 \end{array}$$

A (before)

B (mask)

A (after masking)

and then inserting  
the new value

$$\begin{array}{r} 0100 1010 \\ 1001 0000 \\ \hline 1001 1010 \end{array}$$

A (after masking)

B (insert)

A (after insertion)

clear operation :  $A \leftarrow A \oplus B$

The mask operation is AND microoperation and insert operation is OR. Clear is achieved by XOR.

Shift Microoperations : Logical, Circular, Arithmetic

Logical

discard  $R_1 \leftarrow \text{shl } R_1$



00011001  $R_1$  (before)

00110010  $R_1$  (after logical shift-left)

extra 0 is added



Arithmetic Shift : shifts a signed binary no. to left or right.  
The sign bit (i.e. the leftmost bit)  
must remain unchanged.



In case of arithmetic left shift, the signed bit is lost resulting in erroneous results and this phenomenon is called "overflow".

## Hardware Implementation of a Circuit Shifter ??

Arithmetic Logic Shift Unit : combination of all shift circuits



function table for arithmetic logic shift unit ???

# questions :

q<sup>1</sup> : y<sup>T</sup> 2 :  $R_2 \leftarrow R_1$  ,  $R_1 \leftarrow R_2$



Q<sup>2</sup>

$$\begin{aligned}
 T_0 &\leftarrow R_5 \leftarrow R_0 \\
 T_1 &\leftarrow R_5 \leftarrow R_1 \\
 T_2 &\leftarrow R_5 \leftarrow R_2 \\
 T_3 &\leftarrow R_5 \leftarrow R_3
 \end{aligned}
 \quad \text{hardware} = ?$$



on  
outputs  
to that  
logical  
circuit

| Input |       | Output |          |
|-------|-------|--------|----------|
| $S_0$ | $S_1$ | $T_0$  | $R_{S0}$ |
| 0     | 0     | $T_0$  | $R_{S0}$ |
| 0     | 1     | $T_1$  | $R_1$    |
| 1     | 0     | $T_2$  | $R_2$    |
| 1     | 1     | $T_3$  | $R_3$    |

Time signals  
will be the  
inputs

| $T_0$ | $T_1$ | $T_2$ | $T_3$ | $S_0$ | $S_1$ | output of MUX |
|-------|-------|-------|-------|-------|-------|---------------|
| 0     | 0     | 0     | 0     | x     | x     | -             |
| 1     | 0     | 0     | 0     | 0     | 0     | $R_0$         |
| 0     | 1     | 0     | 0     | 0     | 1     | $R_1$         |
| 0     | 0     | 1     | 0     | 1     | 0     | $R_2$         |
| 0     | 0     | 0     | 1     | 1     | 1     | $R_3$         |

K-map for  $S_0$

|       |       | $T_0, T_1$ | $T_2, T_3$ |    |    |
|-------|-------|------------|------------|----|----|
|       |       | 00         | 01         | 11 | 10 |
| $T_0$ | $T_1$ | 00         | 01         | 11 | 10 |
| 0     | 0     | x          | 1          | x  | 1  |
| 0     | 1     | 0          |            |    |    |
| 1     | 0     |            |            |    |    |
| 1     | 1     | 0          |            |    |    |

$$S_0 = \overline{T_0} \overline{T_1}$$

K-map for  $S_1$



??

Alternative  
K-map  
for some  
reason



$$S_1 = \overline{T_0} \overline{T_2}$$



$$\text{Load} = T_0 + T_2 + T_1 + T_3$$

$R_S$  must be connected to all the time signals.

g3 Q :  $R_1 \leftarrow R_2$

Q'Q :  $R_1 \leftarrow R_3$

g4



q/b 16 register of 32 bits each

Sol: a)  $16 \times 1$  size of multiplexers

b) 4 selection lines

$$(16 = 2^4)$$

c) 32 multiplexers (one bit for each register)

- q/b
- $R_2 \leftarrow M[AR]$
  - $M[AR] \leftarrow R_3$
  - $R_5 \leftarrow M[R_5]$

Sol: a) read the memory word specified by address AR into  $R_2$ .

b) write the content of  $R_3$  into memory word specified by address in the AR.

c) same as a) (this will overwrite the prev. value of  $R_5$ )

q/b  $x + yz : AR \leftarrow AR + BR$

AR & BR are n-bit registers



$$9/9 \quad xyT_0 + T_1 + y'T_2 : AR \leftarrow \underline{AR + 1}$$

*n-bit binary counter*



$$9/10 \quad xT : R_1 \leftarrow R_1 + R_2$$

$$x'T : R_1 \leftarrow R_2$$

two 4 bit registers

(use 4 bit adder & a quadruple  
2-to-1 line mux.)

9/10<sup>m</sup>

$$T=1 \quad \begin{cases} R_1 \leftarrow R_1 + R_2 & x=1 \\ R_1 \leftarrow R_2 & x=0 \end{cases}$$



$$9/12 \quad \begin{array}{c|cc|c} \text{Input Mode} & \text{Output Mode} & & \\ M & A & B & \\ \hline 0 & 0111 & 0110 & \\ 0 & 1000 & 1001 & \\ 1 & 1100 & 1000 & \\ 1 & 0101 & 1010 & \\ 1 & 0000 & 0001 & \end{array}$$

Sop<sup>1</sup>

## Adder subtractor circuit



when  $M = 0$  — adder  
 $M = 1$  — subtractor

$$\begin{array}{r} 11 \\ 0111 \\ 0110 \\ \hline 1101 \end{array}$$

$$\begin{array}{r} 1000 \\ 1001 \\ 1001 \\ \hline 0001 \end{array}$$

$$\begin{array}{r} 1100 \\ 1000 \\ 1000 \\ \hline 0100 \end{array}$$

$$5 - 9 = 4$$

| $M$ | $A$  | $B$  | Sum / Diff | Carry ( $C_4$ ) |
|-----|------|------|------------|-----------------|
| 0   | 0111 | 0110 | 1101       | 0               |
| 0   | 1000 | 1001 | 0001       | 1               |
| 1   | 1100 | 1000 | 0100       | 1               |
| 1   | 0101 | 1010 | 1011       | 0               |
| 1   | 0000 | 0001 | 1111       | 0               |

0x01  
1010  
1011  
1011  
 $(0-1=1)$   
you take a  
borrow  
Sop<sup>1</sup>

4 bit decrementer using 4 full adder



$$A - 1 = A + (2^{\text{complement of } 1})$$

$$\begin{array}{r} 0010 \\ 1010 \\ 1010 \\ 0001 \\ \hline 1111 \end{array}$$

Made with Goodnotes  $\rightarrow (1110) + 1 \rightarrow 1111$

$$\Rightarrow A_3 A_2 A_1 A_0 + 1111$$



we were given a 4 bit arithmetic circuit, it had to be connected to form an 8 bit circuit

q/15

q/16, 17, 18 → easy

q/19 : a)  $AR \leftarrow AR + BR$

Sof a) ①  $\begin{array}{r} 111111 \\ 11110010 \\ + 11111111 \\ \hline AR = 11110001 \end{array}$

b)  $CR \leftarrow CR \wedge DR, BR \leftarrow BR + 1$

b)  $\begin{array}{r} 10111001 \\ 11101010 \\ \hline \text{AND} \quad \begin{array}{r} 11111111 \\ \hline 00000000 = BR \end{array} \end{array}$

$CR = 10101000$

c)  $AR \leftarrow AR - CR$

$\begin{array}{r} 10101000 \\ 11110010 \\ - 10111001 \\ \hline 01111001 = AR \end{array}$

*sequence main hei*

$\begin{array}{r} 11110000 \\ 10101000 \\ \hline AR = 01001001 \end{array}$

q/20  
 $10011100$

Arithmetic shift right

## # unit 3:

- Input - Output Configuration : flow of instruction transfer



- Initially,  $F_{G1}$  is set to zero. When a key is pressed on Keyboard, an 8 bit alphanumeric code is shifted into INPR and  $F_{G1}$  is set to 1.
- if  $F_{G1}=1$ , the info. from INPR is transferred in parallel to AC
- Initially,  $F_{GO}$  is set to 1. The comp. checks the flag if flag is 1, the info. is transferred from AC to OUT R & flag cleared to 0.

- The printer accepts the coded info, prints the corresponding character, and when this is done.  
FGO is set to 1.

- I/O instructions :

$D_3 T_3 = p$  (common to all instructions)

$IR(i) = b_i$  (bit in IR (6-11) that specifies the instruction)

|     |                                                                  |                     |
|-----|------------------------------------------------------------------|---------------------|
| INP | $p: SC \leftarrow 0$                                             | Input character     |
|     | $p B_{11}: AC \leftarrow INPR, FGO \leftarrow 0$                 |                     |
| OUT | $p B_{10}: OUTR \leftarrow AC, FGO \leftarrow 0$                 | Output character    |
| SKI | $p B_9: \text{if } (FGI = 1) \text{ then } PC \leftarrow PC + 1$ | Skip on input flag  |
| SKO | $p B_8: \text{if } (FGO = 1) \text{ then } PC \leftarrow PC + 1$ | Skip on output flag |
| ION | $p B_7: IEN \rightarrow 1$                                       | Enable Interrupt on |
| IOF | $p B_6: IEN \rightarrow 0$                                       | Enable " off        |

- Programmed Control Transfer : (PCT)

In this type of communication, the program checks the flag bit, & when it finds the bit is set, it initiates an info. transfer.

PCT is inefficient because of diff. of info. flow rates b/w processor and input/output device.

Program interrupt : A more efficient method is to let the I/O device inform the processor when it's free and in the mean time the processor is free to do other task.

Working :

Step 1 — When comp. is running a program, it doesn't check the flags

Step 2 — When a flag is set, the comp. is interrupted from its current program and is informed that a flag has been set.

Step 3 — The comp. momentarily deviates from what it is doing to take care of I/O task & return to its original task.

Flowchart of Interrupt Cycle :





### Microoperations for Interrupt Cycle :

1.  $RT_0 : AR \leftarrow 0, TR \leftarrow PC$

During first timing signal, AR is cleared to zero. The content of PC is transferred to TR.

2.  $RT_1 : M[AR] \leftarrow TR, PC \leftarrow 0$

During second timing signal, the return address is stored in memory location zero & PC is cleared to 0.

3.  $RT_2 : PC \leftarrow PC + 1, IEN \leftarrow 0, R \leftarrow 0, SC \leftarrow 0$

During the third timing signal, PC is incremented. IEN, R, SC are cleared to zero.

# # Complete Computer Description:

- ① Flowchart for Computer operation → Most Important
- ② Design of basic computer

①



②

A memory unit with 4096 words with 16 bit each

9 register

7 flip flops (I, S, E, R, LEN, FG1, FG0)

2 decoder ( $3 \times 8$  of decoder,  $4 \times 16$  timing decoder)

16 bit common bus

Control logic gates

Adder & logic circuit connected to input of AE

# Exercise for unit 3 :

q1 256 K words of 32 bit each

Sol a) Indirect bit = 1  
opcode =  $256K = 2^8 \times 2^{10} = 18$  bits

Register code = 64 registers =  $2^6$ , 6 bits

Address code =  $32 - 18 - 6 - 1 = 7$

b)

|   |        |          |         |
|---|--------|----------|---------|
| 1 | 18     | 6        | 7       |
|   | opcode | Register | Address |

c) Data input = 32

Address input =

Direct  $\rightarrow$  2 read references (addr, operand)

Indirect  $\rightarrow$  3 (., effective address, operand)

q2

Made with Goodnotes

9 2

?

9 4

?

q6

a)  $IR \leftarrow M[PC]$

$$AR \leftarrow PC$$

$$IR \leftarrow AR$$

b)  $AC \leftarrow AC + TR$

$$DR \leftarrow TR$$

$$AC \leftarrow AC + DR$$

c)  $DR \leftarrow DR + AC$  (AC doesn't change)

Switch content of DR & AC

Addition

switch again

$$DR \leftarrow AC, AC \leftarrow DR$$

$$AC \leftarrow DR + AC$$

$$DR \leftarrow AC, AC \leftarrow DR$$

q6

a)  $\begin{array}{r} 0001 \\ + 0000 \\ \hline 0010 \end{array}$   $0100$   $= (1024)_{16}$

1            0            2            4

Add  $(024)_{16}$

Add  $M[024]$  to AC

b)  $\begin{array}{r} 2^3 \\ 1011 \\ + 2^2 \\ \hline B \end{array}$   $0001$   $0010$   $0100$   $= (B124)_{16}$

Store AC in  $M[M[124]]$

$1+2+4$  c)  $\begin{array}{r} 0111 \\ + 0000 \\ \hline 0010 \end{array}$   $0000$   $= (\text{?}020)_{16}$

Register reference Increment AC

97

CLE : Clear E

(To set  $E \rightarrow 1$ )

CME : Complement E

98

$C_7 T_3$  :  $SC \leftarrow 0$



# 9.9 Register Reference Instructions

⑤ LSL(AC)

$$AC = A937$$



⑥ CIL(AC)

$$AC = A937$$



⑦ skip to next (SNA) if AC is negative:

|             | E | AC   | PC  | AR  | IR   |
|-------------|---|------|-----|-----|------|
| Initial SNA | 1 | A937 | 021 | -   | -    |
|             | 1 | A937 | 023 | 008 | 7008 |

In case of  
if AC is +ve  
if AC is zero  
we incremented PC  
by 1 only

Q10 go over the instruction cycle

$$M[083] = B8F2$$

$$AC = A937$$

|                | PC  | AR  | DR | AC   | IR   |
|----------------|-----|-----|----|------|------|
| Initial<br>BUN | 021 | -   | -  | A937 | -    |
|                | 083 | 083 | -  | A937 | 4083 |

BUN :  $PC \leftarrow AR$

|                | PC  | AR  | DR | AC   | IR   |
|----------------|-----|-----|----|------|------|
| Initial<br>BSA | 021 | -   | -  | A937 | -    |
|                | 084 | 084 | -  | A937 | 5083 |

BSA :  $M[AR] \leftarrow PC$ ,  $PC \leftarrow AR + 1$   
 $AR \leftarrow PC$

ISZ :  $M[AR] \leftarrow M[AR] + 1$

If  $M[AR] + 1 = 0$  then  $PC \leftarrow PC + 1$ ,  $SC \leftarrow 0$

PC = 7FF

$M[7FF] = EA9F$

$M[A9F] = 0C35$

$M[C35] = FFFF$

?

Q12

$$PC = 3AF$$

$$AC = 7EC3$$

$$M[3AF] = 932E$$

$$M[32E] = 09AC$$

$$M[9AC] = 8B9F$$

Sol a)

Memory

| Memory |      |
|--------|------|
| 3AF    | 932E |
| 32E    | 09AC |
| 9AC    | 8B9F |

PC has 3AF and at 3AF we have 932E.

$$9 = 1001$$

$$\underline{1001} \quad 32E = \boxed{1} \boxed{\text{ADD}} \boxed{32E}$$

b)  $AC = 7EC3$

Q  
A  
B  
C  
D  
E  
F



Add 7EC3 and 32E, we get 0A62.

$$E = 1$$

①  $\begin{array}{r} 111 \\ 7EC3 \\ + 32E \\ \hline 0A62 \end{array}$

$3+F=16=16+2$

goes in carry

c)

$$\boxed{1 \quad \text{ADD} \quad | \quad 32E} \rightarrow 09AC \rightarrow 8B9F \rightarrow DR$$

$$E = 1, I = 1, SC \rightarrow 0$$

$$PC = 3AF + 1 = 3B0$$

$$AC = 0A62$$

$$DR = 8B9F$$

$$AR = 9AC$$

$$IR = 932E$$

$$\begin{array}{r} 1 \\ 3AF \\ + 32E \\ \hline 3B0 \end{array}$$

7.13

Symbol of code      Symbolic designation

$$\text{ii) XOR} \quad 000 \quad AC \leftarrow AC \oplus M[EA]$$

$$\text{? iii) ADM} \quad 001 \quad M[EA] \leftarrow M[EA] + AC \quad - ?$$

$$\text{iv) SUB} \quad 010 \quad AC \leftarrow AC - M[EA]$$

$$\text{v) XCH} \quad 011 \quad AC \leftarrow M[EA], M[EA] \leftarrow AC$$

$$\text{vi) SEQ} \quad 100 \quad \text{if } (M[EA] = AC) \text{ then } PC \leftarrow PC + 1$$

$$\text{vii) BRA} \quad 101 \quad \text{if } (AC > 0) \text{ then } PC \leftarrow EA$$

~~Sop~~

ii) original

$$AC \leftarrow AC + M[AR],$$

$$E \leftarrow \text{Carry}$$

$$D_1 T_4 : DR \leftarrow M[AR]$$

$$D_1 T_5 : AC \leftarrow AC + DR, \\ E \leftarrow \text{Carry}, SC \leftarrow 0$$

New instruction

$$M[EA] \leftarrow M[EA] + AC$$

$$D_1 T_4 : DR \leftarrow M[AR]$$

$$D_1 T_5 : DR \leftarrow AC, AC \leftarrow AC + DR$$

$$D_1 T_6 : M[AR] \leftarrow AC, AC \leftarrow DR, SC \leftarrow 0$$

g14

$$S_2 S_1 S_0 = 000$$

LDC Address  $CTR \leftarrow M[\text{Address}]$

(Replace ISZ instruction that loads a no. into CTR)

Add a register reference ICSZ

Sof

Conversion from memory reference instruction to register reference instruction.

The new instruction ICSZ can be executed at  $T_3$  instead of  $T_6$ .

g15

$65536 \times 16$ ,  $\overset{(I=1)}{\text{address part of instruction residing in}}$  position 0 through 11

$I=0$ , the address of instruction is given by 16 bits in next word following the instruction.

Sof





g16  $\underbrace{PC, AR, TR, AC, DR, IR}_{16}$ ,  $\underbrace{IR}_{8}$  , 8 bit opcode , 16 bit address

All operands are 8 bits . No indirect bit



b)



c) T<sub>0</sub>: IR  $\leftarrow M[PC]$ , PC  $\leftarrow PC + 1$

T<sub>1</sub>: AR  $\leftarrow M[PC]$ , PC  $\leftarrow PC + 1$   
(0-7)

T<sub>2</sub>: AR(8-15)  $\leftarrow M[PC]$ , PC  $\leftarrow PC + 1$

T<sub>3</sub>: DR  $\leftarrow M[AR]$

g17 16384 words, 40 bits per word, 6 bits for operation part,  
14 bits for address part, 2 instructions are packed in 1  
memory word, 40 bit IR is available.



1. Read 40 bit double instruction from memory to IR, increment PC.
2. Decode opcode 1
3. Execute instruction 1 using address 1.
4. Decode opcode 2.
5. Execute instruction 2 using address 2.
6. Go to step 1.

g18 FG0 = 1, EN = 1



b)



q19

$$x_3' x_1 : R \leftarrow M[AR]$$

read memory word into R

$$x_1' x_2 : R \leftarrow AC$$

transfer AC to R

$$x_1' x_3 : M[AR] \leftarrow R$$

write to memory

sel



$$x_3' x_1 = \text{Read}$$
$$x_1' x_3 = \text{write}$$

q22, 23, 24



### Unit - 3 :

- Computer instruction  $\rightarrow$  instruction code = opcode + address  
 ↓  
 operation to be performed

$4096 \times 16$  memory  $\rightarrow 4096 = 2^{12} \Rightarrow 12$  bits to specify an address  
 $16$  bit memory  $= 2^4 \Rightarrow 4$  bit opcode word  
 ↓  
 $16$  operations can be performed

- Addressing modes : Immediate, Direct, Indirect  
 ↓                   ↓                   ↓  
 specifies operand   specifies address of operand   code contains address of memory in which address of operand is present



Immediate



Direct



Indirect

|      |    |                                   |                              |
|------|----|-----------------------------------|------------------------------|
| PC   | 12 | program counter                   | Holds address of instruction |
| AR   | 12 | address register                  | Holds address for memory     |
| IR   | 16 | Instruction register              | Holds instruction code       |
| TR   | 16 | Temporary register                | Holds temporary data         |
| DR   | 16 | Data register                     | Holds memory operand         |
| AC   | 16 | Process register<br>(accumulator) | Process register             |
| INPR | 8  |                                   | Holds input character        |
| OUTR | 8  |                                   | Holds output character       |

000 to 110



Memory reference instruction



Register reference instruction



I/O reference instruction

- Timing of all registers is controlled by a Master Control Generator.



Block diagram  
of typical control  
unit

- Relationship b/w Data transfer, Clock Transition, Timing Signals

To :  $AR \leftarrow PC$

- Instruction cycle :
  - fetch the instruction from memory
  - Decode the instruction
  - Read effective address from memory
  - Execute the instruction

(Diagram for fetch phase)

- Memory Reference Instructions :

| Symbol | Operation decoder | Symbolic description                                                            |
|--------|-------------------|---------------------------------------------------------------------------------|
| AND    | D <sub>0</sub>    | $AC \leftarrow AC \wedge M[AR]$                                                 |
| ADD    | D <sub>1</sub>    | $AC \leftarrow AC + M[AR], E \leftarrow \text{cout}$                            |
| LDA    | D <sub>2</sub>    | $AC \leftarrow M[AR]$                                                           |
| STA    | D <sub>3</sub>    | $M[AR] \leftarrow AC$                                                           |
| BSA    | D <sub>5</sub>    | $M[AR] \leftarrow PC, PC \leftarrow AR + 1$                                     |
| ISZ    | D <sub>6</sub>    | $M[AR] \leftarrow M[AR] + 1,$<br>if $M[AR] + 1 = 0$ then $PC \leftarrow PC + 1$ |
| BUN    | D <sub>4</sub>    | $PC \leftarrow AR$                                                              |

Execution of memory reference starts with T<sub>4</sub>.

BSA :  $M[AR] \leftarrow PC$ ,  $PC \leftarrow PC + 1$

I/O instruction configuration:

$FG0 = 1$



$FG1 = 0$

# lec-1: (Unit-1)

- Binary to octal

$$\begin{array}{r} 110 \quad 101 \\ \hline 6 \quad 5 \end{array} = 65$$

- Binary to hexadecimal (form groups of 4)

for integer - right to left  
for fractional - left to right

- Octal to hexadecimal: use binary as intermediate

0 → ne  
1 → -ne

~~PYQ~~

# UCS510 (1) :



$$g = ?$$

~~sol~~ "  $g = PQ + \bar{h}\bar{Q}$  ,  $h = PR + \bar{Q}\bar{R}$



Analysis :



$$f_1 = \bar{W} \bar{S}_1 + \bar{W} S_1 = W \oplus S_1$$

$$F = f_1 \bar{S}_2 + \bar{f}_1 S_2 = f_1 \oplus S_2 = W \oplus S_1 \oplus S_2$$

$$F = W \oplus S_1 \oplus S_2$$

$$g = PQ + \bar{T} \bar{Q}$$

$$h = PR + \bar{Q} \bar{R}$$

$$\begin{aligned} \Rightarrow g &= PQ + (\overline{PR + \bar{Q}\bar{R}}) \bar{Q} \\ &= PQ + (\overline{PR})(\overline{QR})(\bar{Q}) \end{aligned}$$

$$= PQ + 0$$

$$g = PQ$$

$$g \text{ (b) } f(A, B, C, D) = \sum(5, 7, 11, 12) + d(3, 11, 13, 15)$$

sol minimal exp. = SOP

| $A, B$ | CD<br>00 | 01 | 11 | 10 |
|--------|----------|----|----|----|
| 00     | 0        | 1  | 2  | 2  |
| 01     | 4        | 5  | 7  | 6  |
| 11     | 12       | 13 | 15 | 14 |
| 10     | 8        | 9  | 11 | 10 |

| $A, B$ | CD<br>00 | 01 | 11 | 10 |
|--------|----------|----|----|----|
| 00     |          |    | X  |    |
| 01     |          | X  | X  |    |
| 11     | X        | X  | X  |    |
| 10     |          |    |    | X  |

$$f = CD + ABC + A'BD$$

9/29/2021

| S | $C_{in} = 0$ | $C_{in} = 1$ |
|---|--------------|--------------|
| 0 | $D = A+B$    | $D = A-B$    |
| 1 | $D = A+1$    | $D = A-1$    |

2 bit input  $\rightarrow A, B$

Carry input  $\rightarrow C_{in}=0, C_{in}=1$

selection line  $\rightarrow S$

Sofn ②)

reference:

| $S$ | $C_{in} = 0$ | $C_{in} = 1$     |
|-----|--------------|------------------|
| 0   | $D = A + B$  | $D = A + 1$      |
| 1   | $D = A - 1$  | $D = A + B' + 1$ |

for add"  $\rightarrow$  Adder , carry  $\rightarrow$  full adder  
all outputs have A  $\rightarrow$  A should be one of inputs  
we use a multiplexer to take care of the selection  
variable  $\rightarrow$  one of the inputs from mux into Adder

Now converting  
table,

| $S$ | $C_{in}$ | $X$ | $Y$  |
|-----|----------|-----|------|
| 0   | 0        | A   | B    |
| 0   | 1        | A   | 0    |
| 1   | 0        | A   | 1    |
| 1   | 1        | A   | $B'$ |

concept :- sum = A ⊕ B ⊕ Cin  
 Cout = AB + (A ⊕ B) Cin

Q. 2(b)

e.g.  $(45 \cdot 45)_0 = (?)_2$  using IEEE 754 format

$$\text{Soln} \quad (45)_{10} = 32 \quad 16 \quad 8 \quad 4 \quad 2 \quad 1 = (101101)_2$$

$$\begin{array}{r} 1 \quad 0 \quad 1 \quad 1 \quad 0 \quad 1 \end{array}$$

$$(0.45)_{10} = 101101 \cdot 01110011 = 1.0110101110011 \times 2^5$$

$$\begin{array}{r} 0 \quad 0.45 \\ \underline{-} \quad 2 \\ 0 \quad 0.90 \\ \underline{-} \quad 2 \\ 1 \quad 0.80 \\ \underline{-} \quad 2 \\ 1 \quad 0.60 \\ \underline{-} \quad 2 \\ 1 \quad 0.20 \\ \underline{-} \quad 2 \\ 0 \quad 0.40 \\ \underline{-} \quad 2 \\ 1 \quad 0.80 \\ \underline{-} \quad 2 \\ 1 \quad 0.60 \end{array}$$

01110011

~~127 + 5 = 132~~

$$132 = 128 \ 64 \ 32 \ 16 \ 8 \ 4 \ 2 \ 1$$

$$\quad \quad \quad 1 \ 0 \ 0 \ 0 \ 0 \ 1 \ 0 \ 0$$



**Binary to decimal :**  $(-1)^s \times 2^{e-127} \times 1.m$

not  $1.m$

**S** = signed bit

**e** = exponent number = 10000100 = (in decimal)

**m** = mantissa = \_\_\_\_\_ (in decimal)

e.g.  $(85.125)_{10} = (?)_2$  using IEEE

$$(85)_{10} = 64 \ 32 \ 16 \ 8 \ 4 \ 2 \ 1 = (1010101)_2$$

$$\quad \quad \quad 1 \ 0 \ 1 \ 0 \ 1 \ 0 \ 1$$

$$(0.125)_{10} = \begin{array}{r} 0 \\ 0 \end{array} \frac{0.125}{2} = 001\bar{0}$$

$$\begin{array}{r} 0 \\ 1 \end{array} \frac{0.250}{2} \\ \begin{array}{r} 0 \\ 0 \end{array} \frac{0.500}{2} \\ \begin{array}{r} 0 \\ 0 \end{array} \frac{*0.000}{2} \\ \begin{array}{r} 0 \\ 0 \end{array} \frac{0.000}{2} \end{array}$$

$$g \cdot 5.125 = 1010101 \cdot 001$$

$$= 1 \cdot \underline{010101001} \times 2^6$$

$$6+127 = 133$$

$$133 = 128 \ 64 \ 32 \ 16 \ 8 \ 4 \ 2 \ 1$$

1 0 0 0 0 1 0 1



0100 0010 1000 0100 1000000000000000  
4 2 A A y 0 0 0

Q3 b) R two operand instruction

L zero operand instruction



two operand instructions =  $2^{10} = 1024$

$$\text{number of one Operand instructions} = \frac{2^{16} - L - 2^{12} K}{2^6}$$

$2^{16}$  = total encoding possible

L = zero operand instruction

$$2^6 \times 2^6 \times K = \text{two } " \quad "$$

Q4 a)  $PC = F2C$



ISZ : Increment and Skip if zero

$$M[AR] \leftarrow M[AR] + 1$$

if  $M[AR] + 1 = 0$  then  $PC \leftarrow PC + 1$

execution:  $D_6 T_4 : DR \leftarrow M[AR]$

$D_6 T_5 : DR \leftarrow DR + 1$

$D_5 T_6 : M[AR] \leftarrow DR$ , if ( $DR = 0$ ) then  $(PC \leftarrow PC + 1)$   
 $SC \leftarrow 0$

| Timing Signal  | PC  | AR  | DR   | IR   | Address | Memory |
|----------------|-----|-----|------|------|---------|--------|
| initial        | F2C | F2C |      |      |         |        |
| T <sub>0</sub> | F2C | F2C |      | E78F |         |        |
| T <sub>1</sub> | F2C | F2C |      | E78F | F2C     | E78F   |
| T <sub>2</sub> | F2C | 78F |      | E78F | 78F     | 895C   |
| T <sub>3</sub> | F2C | 95C |      | E78F | 95C     | FFF0   |
| T <sub>4</sub> | F2C | 95C | FFF0 | E78F |         |        |
| T <sub>5</sub> | F2C | 95C | FFF1 | E78F |         |        |
| T <sub>6</sub> | F2C | FF1 | FFF1 | E78F |         |        |

6)  $PC = x_1 - x_2 - x_3 - x_4$

UCS5510 (2) :

Q(a)

| A | B | C | F |
|---|---|---|---|
| 0 | 0 | 0 | 0 |
| 0 | 0 | 1 | 0 |
| 0 | 1 | 0 | 0 |
| 0 | 1 | 1 | 1 |
| 0 | 0 | 0 | 0 |
| 1 | 0 | 1 | 1 |
| 1 | 1 | 0 | 1 |
| 1 | 1 | 1 | 1 |

$$\begin{aligned} F &= \underline{A'BC + AB'C + ABC'} + \underline{ABC} \\ &= BC + A(B \oplus C) \end{aligned}$$

$$b) \quad \begin{array}{r} 1 \underline{10000011} \\ 2^7 + 2^1 \\ 131 \end{array} \quad \begin{array}{r} 010\ldots \text{(21 zeros)} = ( )_{10} \\ \hline \\ \downarrow \\ 2^{22} = 4194304 \end{array}$$

$$(-) \quad 131 - 127 = 4$$

$$\begin{array}{c} \downarrow \\ (2^4) \end{array}$$

$$-1. \quad \underline{4194304} \times 2^4$$

$$-1.4194304 \times 16 = -22.7108864$$

Q2. Adder/subtractor with  $C_{in} = 0$  & two inputs A & B.

when  $C_{in} = 0 \rightarrow A + B$

$C_{in} = 1 \rightarrow A - B$



Q3 b)  $AC \leftarrow AB98$ ,  $E = 0$ ,  $IR \leftarrow 7200$

$PC = 2FF$

|         | AC   | IR   | AR  | E |               |
|---------|------|------|-----|---|---------------|
| Initial | AB98 | 7200 |     | 0 |               |
| CMA     | 5467 | 7200 | 200 | 0 | complement AC |

| CME | AB98 | 7100 | 100 | 1 | complement E |
|-----|------|------|-----|---|--------------|
|     |      | 7080 | 080 | 0 |              |



**CIR, CIL?**



$$PC = 300, RI = 500 \quad XR = 200$$

$$\begin{array}{lll}
 M[499] = 550 & M[600] = 900 & M[800] = 650 \\
 M[500] = 800 & M[700] = 1000 & M[802] = 350 \\
 M[900] = 400 & M[902] = 700 &
 \end{array}$$

Sol i) Direct address:

Instruction specifies an address field = 600

$\Rightarrow$  EA (effective address) for = 900

direct address mode

AC is loaded with content of memory location 600.

$$\boxed{AC = 900}$$

ii) Immediate operand:

In immediate mode, second word of instruction is the actual data.

AC is loaded with address part directly.

$$\boxed{AC = 600.}$$

iii) Indirect operand:

$$M[600] = 900$$

$$M[900] = 400 \Rightarrow \boxed{AC = 400}$$

iv) Relative Address:

$$\text{Address part in instruction} + PC = 600 + 300 = 900$$

$$M[900] = 400 \Rightarrow \boxed{AC = 400}$$

v) Indexed Address

(using XR)

address field = 600

XR = 200

AC is loaded with sum of address part and content of XR.

$$\cancel{600 + 200 = 800}, \quad M[800] = 650 \Rightarrow \boxed{AC = 650}$$

vi) Register Indirect:

In this mode, the address part of the instruction contains the address of memory location that contains the effective address

RI = 500

$M[RI] = 800$

$M[800] = 650$

$$\Rightarrow \boxed{AC = 650}$$

~~QCS510 (3)~~

Q3. Explain various addressing modes with eg.

(2)



$$PC = \underline{FFF}, AC = \underline{FEC3}$$

$\rightarrow 8BCD$

$\rightarrow B420$

$\rightarrow \underline{\text{FFFF}}$

Set Memory reference instructions, six more times

i) AND  $\rightarrow$  AND memory word to AC

|         | PC   | AR | DR | AC   | IR |
|---------|------|----|----|------|----|
| Initial | FFF  |    |    | FEC3 |    |
| AND     | 1000 |    |    |      |    |

$$DR \leftarrow M[AR]$$

$$AC \leftarrow AC \wedge DR$$

$$\begin{aligned} FFF + 1 &= \\ 1000 & \end{aligned}$$

QCS 10(4) :

Q1 a) PC = 3AC, AC = FEC3



opcode of ADD = 001  
AND = 010

Sol - what instruction will be fetched & executed next.

The instruction that will be fetched & executed next depends on content of memory at 3AC. The current value of PC is 3AC and content of memory at address 3AC is 935E. The opcode bits of ADD and AND is 001 and 010. Therefore instruction at memory address 3AC is not AND and ADD instruction.

|              |        |               |               |
|--------------|--------|---------------|---------------|
| Indirect bit | opcode | register code | address field |
|--------------|--------|---------------|---------------|

$$935E = \underline{\quad 100\quad} \underline{\quad 0011\quad} \underline{\quad 0101\quad} \underline{\quad 1110\quad}$$

↓      ↓      ↓      ↓  
Indirect    opcode    register    address field  
bit code      code      code

- a) PC contains address of next instruction to be executed  
 $\Rightarrow$  instruction at SAC will be fetched & executed
- b) Content of SAC = 935E

$935E = \underline{\underline{1001}} \underline{\underline{00110101}} \underline{\underline{1110}}$

↓      ↓      ↓  
 Indirect mode    op code    address  
 ||  
 ADD

Address of operand = 35E

Address of operand at ~~35E~~ is 08AC  $\Rightarrow$   
 Operand is at 08AC  $\Rightarrow$  Operand is 7A9F

$$DR \leftarrow M[AR]$$

$$AC \leftarrow AC + DR$$

$$AC = 7EC3$$

$$M[AR] = 7A9F$$

$$\begin{array}{r} 12+9 \\ 21 \\ 16+5 \\ 10+14=24 \\ 16+8=18 \end{array}$$

$$\begin{array}{r} 7EC3 \\ + 7A9F \\ \hline 3 FCE \end{array}$$

$$\begin{aligned} 3+F &= 12 \text{ (in hex)} \\ &= C \end{aligned}$$

$$C+A = 15 = F$$

$$E+A = 18 = 1B + 3 = 3$$

$$7+7 = E$$

Q2

Reference Q :

|        |     |
|--------|-----|
| Opcode | 500 |
|--------|-----|

| Load | 5'te | Mode |
|------|------|------|
| 200  | 500  |      |
| 201  | 500  |      |
|      |      |      |
|      |      |      |
|      |      |      |



|                                                      |                |      |
|------------------------------------------------------|----------------|------|
| a) Direct                                            | 500            | 1100 |
| b) Immediate                                         | 201            | 500  |
| c) Indirect                                          | 1100           | 1200 |
| d) PC relative                                       | 201+1+80       | 1302 |
| e) Displacement                                      | 500+100        | 1200 |
| f) Register                                          | R <sub>i</sub> | 400  |
| g) Register Indir                                    | 400            | 1000 |
| h) Autodecoding<br>with inc,<br>using R <sub>j</sub> | 400            | 1000 |

$R_i$  [401]

a → diff b/w mem. reference instruction & register reference instruction using eg.  
diff b/w interrupt I/O and programmed I/O

$$q : A = 01000001$$

$$B = 10000100$$

Sol i)  $A = 65$  } unsigned  
 $B = 132$  }

ii)  $A = 65$  } signed  
 $B = -124$  }

$$\begin{aligned} 10000100 &= 01111011 + 1 \\ &= 01111100 \\ &= 124 \end{aligned}$$

$$\Rightarrow -124$$

sign bit of result → whether result is zero  
 C, S, V, Z bit → whether there was an overflow  
 ↓  
 carry out of leftmost bit

There is an overflow if the sum of two +ve nos is -ve and sum of two -ve nos is +ve.

VCSS10(5)

| Q1. a) | A | B | C | D | Output (Y) |
|--------|---|---|---|---|------------|
| 0      | 0 | 0 | 0 | 0 | 0          |
| 0      | 0 | 0 | 1 | 0 | 0          |
| 0      | 0 | 1 | 1 | 1 | 0          |
| 0      | 1 | 0 | 0 | 0 | 0          |

|   |   |   |   |  |   |
|---|---|---|---|--|---|
| 0 | 1 | 0 | ) |  | 1 |
| 0 | ) | ) | 0 |  | ) |
| 0 | ) | ) | ) |  | ) |
| ) | 0 | 0 | 0 |  | ) |
| ) | 0 | 0 | ) |  | ) |

(5)

b) PC = 16 bits, K, DR, IR = 8 bits

Q2. a) R<sub>1</sub> = 1100

$$\begin{array}{r}
 R_2 = \frac{1010}{\overline{1010}} \quad \sqrt{21}
 \end{array}$$

b) K-map

(x3 a) 64 registers and uses 16 bit instruction format

(B) I type : opcode + Register + 4 bit immediate value

(I) I type : opcode + 2 h names

I type uses  $\rightarrow$  6 bit for opcode

$$8 \times 6 = 56 \text{ remaining opcodes}$$

R type uses  $\rightarrow$  4 bit for opcode

$$\frac{56}{4} = 14$$

# Unit - ~~4~~ Central processing unit :



Register set stores the intermediate data during the execution of instructions.

ALU performs the required micro-operation for executing the instructions.

The control unit supervises the transfer of information among the registers and instructs ALU as to which operation to perform.

- Advantage of having many registers:

- ① Transfer b/w registers within processor is really fast.
- ② Going "off the processor" to access ~~memory~~ is slow.

Bus organization for 7 CPU Registers : (slide 8)

#### A BUS ORGANIZATION FOR SEVEN CPU REGISTERS

- The control unit directs the information flow through ALU by
  - Selecting various components in the system.
  - Selecting the Function of ALU.
- Example:  $R1 \leftarrow R2 + R3$ 
  1. MUX A selector (SELA): BUS A  $\leftarrow$  R2.
  2. MUX B selector (SELB): BUS B  $\leftarrow$  R3.
  3. ALU operation selector (OPR): ALU to ADD.
  4. Decoder destination selector (SELD):  $R1 \leftarrow$  Out Bus
- The entire information is passed through the Control Word.



| Register Selection Field Encoding |       |       |      | ALU operations encoding |           |                |      |
|-----------------------------------|-------|-------|------|-------------------------|-----------|----------------|------|
| Binary Code                       | SELA  | SELB  | SELD | OPR Select              | Operation | Symbol         |      |
| 000                               | Input | Input | None | 0                       | 00000     | Transfer A     | TSFA |
| 001                               | R1    | R1    | R1   | 1                       | 00001     | Increment A    | INCA |
| 010                               | R2    | R2    | R2   | 2                       | 00010     | Add A + B      | ADD  |
| 011                               | R3    | R3    | R3   | 5                       | 00101     | Subtract A - B | SUB  |
| 100                               | R4    | R4    | R4   | 6                       | 00110     | Decrement A    | DECA |
| 101                               | R5    | R5    | R5   | 8                       | 01000     | ADD A and B    | AND  |
| 110                               | R6    | R6    | R6   | 10                      | 01010     | OR A and B     | OR   |
| 111                               | R7    | R7    | R7   | 12                      | 01100     | XOR A and B    | XOR  |
|                                   |       |       |      | 14                      | 01110     | Complement A   | COMA |
|                                   |       |       |      | 16                      | 10000     | Shift right A  | SHRA |
|                                   |       |       |      | 24                      | 11000     | Shift left A   | SHLA |

- Control word format : 14 bits



| Binary code | SEL A          | SEL B          | SEL D          | OPR select | operation  | Symbol |
|-------------|----------------|----------------|----------------|------------|------------|--------|
| 000         | R <sub>1</sub> | R <sub>1</sub> | R <sub>1</sub> | 00000      | Transfer A | TSFA   |
| 001         | R <sub>2</sub> | R <sub>2</sub> | R <sub>2</sub> |            |            | ,      |
| :           | :              |                |                |            |            |        |
| 1           | 1              |                |                | 1          | ;          | ;      |

- Stack Org. : (push/pop operation)

ordered linear list, LIFO

stack pointer = (top pointer of stack) pointing to the register storing the address of topmost element

Types of stack org :

Register stack : It is built using a register-

memory stack : It is logical part of memory allocated as stack. The logically partitioned part of RAM is used to create stack.

## - Register Stack Org.

e.g.



stack of registers

64 registers

address =  $(0, 1, 2 \dots 63)$   
denoted by

SP = stack pointer of 6 bits

$$\therefore 2^6 = 64$$

made of flip-flops  $\leftarrow$  Flags = FULL EMPTY

full = 1, empty = 0  $\rightarrow$  stack is full

DR is the data register through which data is transferred  
to and from the stack.

zero address instructions are used in register stack org. i.e.  
the address that doesn't contain address of operands.

# - Push operation & Pop operation code

| PUSH Operation                                | POP Operation              |
|-----------------------------------------------|----------------------------|
| For the PUSH operation initially,             |                            |
| $SP \leftarrow 0$                             |                            |
| $Full \leftarrow 0$                           |                            |
| $EMTY \leftarrow 1$                           |                            |
| <br>                                          |                            |
| $SP \leftarrow SP + 1$                        | Increment stack pointer    |
| $M[SP] \leftarrow DR$                         | Write on top of the stack  |
| If ( $SP = 63$ ) then ( $FULL \leftarrow 1$ ) | Check if the stack is full |
| $EMTY \leftarrow 0$                           | Stack is not empty         |

## Memory Stack Org. :

a processor register is used as a stack pointer.  
Stack pointer is a CPU register that specifies the stack's initial memory.



## RAM is divided into three parts:

- ① Program — part of RAM where programs are stored
- ② Data — " " " " " data is in
- ③ Stack — part of " " used to implement stack

- Most computers don't provide hardware to check overflow/underflow, so upper limit / lower limit registers are used. (software)

### PUSH Operation

$SP \leftarrow SP - 1$   
 $M[SP] \leftarrow DR$

Decrement stack pointer  
 Write on top of the stack

### POP Operation

$DR \leftarrow M[SP]$   
 $SP \leftarrow SP + 1$

Read from the top of the stack  
 Increment the stack pointer

### - Reverse Polish Notation:

Postfix notation are used by stack org. & prefix are used by general register org.

eg.  $(A+B)^* \times (C^* (D+E) + F)$

postfix :  $AB + CDE^* + F^* +$

-  $(3^* 4) + (5^* 6)$



### - Processor Organization:

- 1) Single register organization (Accumulator)
- 2) General register organization  $\rightarrow$  Any register can be used as source or dest.
- 3) Stack organization (Hardware stack)  $\rightarrow$  All operations are done using hardware stack

## - Instruction format :

Machine instructions contain no. of bits. These bits are grouped together & called fields

- Single Accumulator org : use only one address field

eg.  $AC \leftarrow AC + M[x]$

the instruction for arithmetic add" is defined by an instruction "ADD".

General Register Org : uses two or three address fields

An instruction symbolized by ADD, R<sub>1</sub>, X specifies the operation  $R_1 \leftarrow R_1 + M[x]$

This instruction has two address fields : R<sub>1</sub> and X.

Stack Organization : Doesn't require address field as the operation is performed on the two items on top of stack using push and pop operations

- Instruction format representation :

← Machine instruction



- instruction format type : 0, 1, 2, 3 address inst. format

- zero address instruction format :

|      |        |
|------|--------|
| Mode | opcode |
|------|--------|

Stack organization supports this

doesn't have operand field, they are implicitly defined  
arithmetic exp.  $\rightarrow$  RPN

One address instruction format: uses only one address field

one operand is in the accumulator and other is in  
memory location.

|      |        |         |
|------|--------|---------|
| Mode | opcode | operand |
|------|--------|---------|

It has only one operand.

It has two special instructions LOAD and STORE.

Load : transfer data to accumulator

Store : move data from accumulator to memory

Two address instruction :

|      |        |          |          |
|------|--------|----------|----------|
| Mode | opcode | operand1 | operand2 |
|------|--------|----------|----------|

has two operands

requires shorter assembly language instructions  
result can be stored in different locations

MOV : instruction to transfer the operands to the memory from processor registers

Three address instruction :

|      |        |          |          |          |
|------|--------|----------|----------|----------|
| Mode | opcode | operand1 | operand2 | operand3 |
|------|--------|----------|----------|----------|

The operand1 and operand2 contain the data or address that the CPU will operate. Operand 3 contains the result's address

- Addressing Modes:

Addressing modes define the rules and mechanisms by which the processor calculates the effective memory address or operand location for data operations.

The purpose for using addressing modes :

- ① to provide versatility to the user
- ② to reduce the number of bits in addressing field of inst.

Types of Addressing Modes :

③ Implied / Implicit : Stack

Made with Goodnotes

e.g. CMA, CLA, PUSH, POP



### ③ Immediate :

e.g. `ADD 8` will increment the value stored in the accumulator by 8.



### ④ Register Direct :



e.g. `MOV R1, 35H`

### ⑤ Register Indirect :



e.g. `MOV R1, [R2]`

### ⑥ Auto-increment :

Made with Goodnotes post increment approach

eg. INC R1



$$(R_1) + 1 = 179$$

① Auto-Decrement: pre-decrement approach

eg. DEC R1



② Direct add. mode: eg. ADD, [1000]<sub>R</sub> (1 reference)

③ Indirect add. mode: eg. ADD M (2 references)  
slow

## Q) Displacement Addressing Mode :

e.g. MOV R1 , [1000H + 09H]

$$EA = A + R$$

↓  
displacement value

[opcode | Register R] Address A



## Q) Relative addressing mode :

e.g. MOV R1 , [PC + Address field]

$$EA = A + PC$$

Q) Indexed Addressing Modes : The index register's content is added to instructions' address to obtain the effective address.

e.g. MOV R1 , [SI + 1000H]

$$EA = \text{content of index register} + \text{instruction address part}$$

## Q) Base Register : $EA = A + (R)$

A : Inst. add.

R : pointer to base address

## Q) Stack Addressing modes

# - Applications, adv. & disadv. of addressing modes

## APPLICATIONS OF ADDRESSING MODES

| Addressing Mode                                                  | Applications                                                                                           |
|------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------|
| Immediate Addressing Mode                                        | Initialize the register to a constant value.                                                           |
| Direct Addressing Modes<br>Register Direct Addressing Mode       | Helps access static data and implement variables.                                                      |
| Indirect Addressing Modes<br>Register Indirect Addressing Mode   | Helps implement pointers and pass arrays as parameters.                                                |
| Relative Addressing Mode                                         | Helps in program relocation at runtime. And in changing the sequence of instructions during execution. |
| Index Addressing Mode                                            | Helps in the array and record implementation.                                                          |
| Base Register Addressing Mode                                    | Helps in writing relocatable code and handling recursive procedures.                                   |
| Auto-Increment Addressing Mode<br>Auto-Decrement Addressing Mode | Helps implements loops and stacks.                                                                     |

Adv :

improve efficiency, reduce latency

used to implement complex ds

program sizes can be reduced

gives you diff way of specifying address of operands

eg.

## EXAMPLE: ADDRESSING MODES

| Addressing Mode                              | Effective Address | Content of AC |
|----------------------------------------------|-------------------|---------------|
| Immediate AM                                 | 201               | 500           |
| Register AM                                  | -                 | 400           |
| Register Indirect AM                         | 400               | 700           |
| Auto-Increment AM                            | 400               | 700           |
| Auto-Decrement AM                            | 399               | 450           |
| Direct AM                                    | 500               | 800           |
| Indirect AM                                  | 800               | 300           |
| Relative AM (EA=PC + Address Field)          | 702               | 325           |
| Base Register AM (EA=XR + Address Field/)    | 600               | 900           |
| Indexed Register AM (EA=XR + Address Field/) | 600               | 900           |

| Address | Memory           |
|---------|------------------|
| 200     | Load to AC Mode  |
| 201     | Address = 500    |
| 202     | Next Instruction |
| 399     | 450              |
| 400     | 700              |
| 500     | 800              |
| 600     | 900              |
| 702     | 325              |
| 800     | 300              |

## - Data Transfer and Manipulation :

| Addressing Mode | opcode | operand |
|-----------------|--------|---------|
|-----------------|--------|---------|

Types of instructions :

- ① Data Transfer Instructions
- ② Data Manipulation Instructions
- ③ Program Control Instruction

Arithmetic  
Logical & bit manipulation  
Shift

- Data transfer inst. : They move data from one location to another. also called copy inst.  
(without changing binary content)

- Program control :

program control instructions modify or change flow of a program.

It changes value of PC leading to change in execution.

features :

- i) conditional / unconditional
- ii) flags / control flow & jump inst. called jumps, calls, returns
- iii) Subroutine & subroutine handling
- iv) Loop & loop handling inst.

## PROGRAM CONTROL INSTRUCTIONS

| Program Control Instructions | Description                                                                                                                                                                                                                                                                                               |
|------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Branch (BR)                  | Branch which means it is an unconditional jump. It is unconditional branching wherever we specify the address we need to branch.                                                                                                                                                                          |
| Skip (SKP)                   | Skip instructions is used to skip one(next) instruction. It can be conditional or unconditional. It does not need an address field. In the case of conditional skip instruction, the combination of conditional skip and an unconditional branch can be used as a replacement for the conditional branch. |
| Jump (JMP)                   | The jump instruction transfers the program sequence to the memory address given in the operand based on the specified flag.                                                                                                                                                                               |
| Compare (CMP)                | The Compare instruction performs a comparison via a subtraction, with difference not retained. CMP compares register sized values, with one exception.                                                                                                                                                    |
| CALL and RETURN              | The CALL and RETURN instructions interrupt the flow of a program by passing control to an internal or external subroutine. An external subroutine is another program. The RETURN instruction returns control from a subroutine back to the calling program and optionally returns a value.                |
| TEST                         | TEST instructions perform the AND of two operands without retaining the result, and so on.                                                                                                                                                                                                                |

- Status bit / flag / processor status word :



V = - of overflow

Z = if ALU output = 0 then Z = 1 otherwise Z = 0

C = if ALU output gives carry " C = 1 " C = 0

S = Sign bit , S = 0 if +ve



- Conditional branch instruction: Check the condition and branch

| Mnemonic                                   | Branch condition           | Tested condition |
|--------------------------------------------|----------------------------|------------------|
| BZ                                         | Branch if zero             | Z = 1            |
| BNZ                                        | Branch if not zero         | Z = 0            |
| BC                                         | Branch if carry            | C = 1            |
| BNC                                        | Branch if no carry         | C = 0            |
| BP                                         | Branch if plus             | S = 0            |
| BM                                         | Branch if minus            | S = 1            |
| BV                                         | Branch if overflow         | V = 1            |
| BNV                                        | Branch if no overflow      | V = 0            |
| <i>Unsigned compare conditions (A - B)</i> |                            |                  |
| BHI                                        | Branch if higher           | A > B            |
| BHE                                        | Branch if higher or equal  | A ≥ B            |
| BLO                                        | Branch if lower            | A < B            |
| BLOE                                       | Branch if lower or equal   | A ≤ B            |
| BE                                         | Branch if equal            | A = B            |
| BNE                                        | Branch if not equal        | A ≠ B            |
| <i>Signed compare conditions (A - B)</i>   |                            |                  |
| BTG                                        | Branch if greater than     | A > B            |
| BGE                                        | Branch if greater or equal | A ≥ B            |
| BLT                                        | Branch if less than        | A < B            |
| BLE                                        | Branch if less or equal    | A ≤ B            |
| BE                                         | Branch if equal            | A = B            |
| BNE                                        | Branch if not equal        | A ≠ B            |

- Subroutine call and return:

performs a self-contained sequence of instructions that performs a computational task.

names for instruction that : Call subroutine, jump to subroutine, Transfer program control  
branch to subroutine, branch & save the address

- Most imp. operations : ① branch to the beginning of subroutine  
② Save the address (return) to get address of location in calling program upon exit from subroutine

- Locations for storing return addresses : in subroutine, in memory, processor register, memory stack

e.g. In memory stack



### - Program Interrupt :

An interrupt is a signal emitted by device attached to a computer, which temporarily stops or terminates a service of the current process

- Types of interrupts : External, Internal, Software

### 3. PROGRAM INTERRUPT



- Interrupt Procedure:

## RISC and CISC :

"CISC"  
=

- 1) Complex Instruction Set Computer
- 2) Large Number of Instructions
- 3) Variable Length Instruction format
- 4) Large No. of addressing modes
- 5) Cost is High
- 6) More Powerful
- 7) Several Cycle Instructions
- 8) Manipulation directly in Memory
- 9) Microprogrammed Control Unit
- 10) Examples: Mainframes, Motorola 6800, Intel 8080

"RISC"  
=

- 1) Reduced Instruction Set Computer
- 2) Less No. of Instructions
- 3) Fixed Length Instruction format
- 4) Few no. of ALU
- 5) Less cost
- 6) Less Powerful
- 7) Single Cycle Instructions
- 8) Only In Registers
- 9) Hardwired Control Unit
- 10) MIPS, ARM, SPARC, Fugaku



LOAD A, 2:2  
LOAD B, 3:3  
PROD A, B  
STORE 2:3, A

## RISC :

RISC is a microprocessor architecture that uses a simple set of instructions than can be substantially modified.

each instruction cycle has one clock per cycle and each cycle consists of three parameters : fetch, decode, execute.



## features of RISC processors :

- ① RISC use 1 CPI ( 1 clock per instruction cycle ).  
each CPI comprises of fetch, decode, execute.
- ② Multiple registers in RISC allow them to hold instruction ready fast to complete.
- ③ RISC uses pipelining to execute multiple parts of instruction efficiently.
- ④ RISC has simple addressing mode & fixed instruction length for pipeline execution.
- ⑤ Uses LOAD and STORE to access memory location.

- CISC :



## features :

- CISC may take longer than a single clock cycle to execute code.

Length of code is short, so it requires minimal RAM.

Enables more accessible programming in assembly language.

Focuses ~~on~~ creating instructions on hardware rather than software, ∵ they are faster to develop.

It comprises of fewer registers and more addressing modes, typically 5 to 20.

## **DIFFERENCE BETWEEN RISC AND CISC**

|    | RISC                                                                                                                  | CISC                                                                                                                                  |
|----|-----------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------|
| 1  | <b>Instructions of a fixed size</b>                                                                                   | Instructions of <b>variable size</b>                                                                                                  |
| 2  | Most instructions take <b>same time to fetch</b> .                                                                    | Instructions have <b>different fetching times</b> .                                                                                   |
| 3  | Instruction set <b>simple and small</b> .                                                                             | Instruction set <b>large and complex</b> .                                                                                            |
| 4  | <b>Less addressing modes</b> as most operations are register based.                                                   | <b>Complex addressing modes</b> as most operations are memory based.                                                                  |
| 5  | <b>Compiler design is simple</b>                                                                                      | <b>Compiler design is complex</b>                                                                                                     |
| 6  | <b>Total size of program is large</b> as many instructions are required to perform a task as instructions are simple. | <b>Total size of program is small</b> as few instructions are required to perform a task as instructions are complex & more powerful. |
| 7  | Instructions use a <b>fixed number of operands</b> .                                                                  | Instructions have <b>variable number of operands</b> .                                                                                |
| 8  | Ideal for processors performing a <b>dedicated operation</b> .                                                        | Ideal for processors performing a <b>variety of operations</b> .                                                                      |
| 9  | Since instructions are simple, they can be decoded by a <b>hardwired control unit</b> .                               | Since instructions are complex, they require a <b>Micro-programmed Control Unit</b> .                                                 |
| 10 | Execution speed is <b>faster</b> as most operations are register based.                                               | Execution speed is <b>slower</b> as most operations are memory based.                                                                 |
| 11 | As No. of cycles per instruction is fixed, it gives a <b>better degree of pipelining</b>                              | Since number of cycles per instruction varies, <b>pipelining has more bubbles</b> or stalls.                                          |
| 12 | E.g.: <b>ARM7, PIC 18</b> Microcontrollers.                                                                           | E.g.: Intel <b>8085, 8086</b> Microprocessors.                                                                                        |

# # Pipelining (Unit -4) :

In non-pipelined (sequential) architecture, all the instructions of a program are executed sequentially one after the other. Pipelining is referred as

A technique in which a given task is divided into a number of subtasks that need to be performed in sequence.

One of the processes of arranging the hardware so that simultaneous execution of multiple instructions takes place, thus, improving the overall performance.

The main advantage of pipelining is the simultaneous execution of various subtasks, which improves the system's throughput.

In pipelined execution, the first instruction will take  $T$  clock cycles to process, but the other instructions will take just 1 more clock cycle.

The speedup can be calculated as:

Speedup ( $S$ ) = Cycles of non-pipelined processor/ cycle of pipelined processor

$$S = (n*t_n)/((k+n-1) * t_p)$$

$n$  = number of tasks

$k$  =  $k$  = number of segments pipeline

$t_n$  = clock cycle time (non-pipelined)

$t_p$  = clock cycle time (pipelined)

As the number of tasks increases,

Take the limit as  $n$  approaches to infinity,  $(n+k-1)$  approaches to  $n$ , resulting in theoretical speedup (max) of:

$$\lim_{n \rightarrow \infty} S_{max} = t_n / t_p$$

If time it takes to process a task is the same in pipelined and non-pipelined circuits then,

$$S_{max} = (k*t_p) / t_p = k$$

$$S = \frac{n t_n}{(n+k-1) t_p}$$

$$S_{max} = \frac{t_n}{t_p}$$

~~Instruction pipelines are used to divide the task of executing a stream of instructions into subtasks to be executed in different pipeline segments to improve the throughput of the computer system.~~

For example, if we have a stream of instructions, then one segment of the pipeline can read the instructions while another segment can decode the previous instruction. In this way, more than one instruction will be handled simultaneously by the computer system which will improve its throughput. The instruction pipeline will be more efficient if the instructions are divided into equal-duration segments.

A typical example of an instruction pipeline used by computer systems consists of the following segments:

Segment 1: This segment will fetch the instruction from the memory

Segment 2: This segment will decode the instruction and find out the effective address

Segment 3: This segment will fetch the operands from the memory

Segment 4: This segment will execute the instruction

Types of pipelining —

- ① Instruction
- ② Arithmetic

↓  
read from slides  
(do the diagram)

- There are several stages of processing an instruction. A pipeline can be of three, four, five, or six stages.

| 3-stage | 4-stage                   | 5-stage                    | 6-stage                                        |
|---------|---------------------------|----------------------------|------------------------------------------------|
| Fetch   | Fetching the instruction  | Fetching the instruction   | Fetching the instruction                       |
| Decode  | Decoding the instruction  | Decoding the instruction   | Decoding the instruction                       |
| Execute | Executing the instruction | Memory Access for operands | Calculate the effective address of the operand |
|         | Write Back                | Executing the instruction  | Memory Access for operands                     |
|         |                           | Write Back                 | Executing the instruction                      |
|         |                           |                            | Write Back                                     |



Pipeline Hazards: situations that prevent the next instruction in the instruction stream from executing during its designated clock cycles.

→ Data Hazards

Control Hazards or Instruction hazards

Structural Hazards

## - Sequential Vs pipelined execution of instructions :



Performance Measures for goodness of a pipeline :

### **PERFORMANCE MEASURES FOR THE GOODNESS OF A PIPELINE**

- In order to formulate the performance measures for the goodness of a pipeline in processing a series of instructions
  - A space-time chart (called the Gantt's chart) is used.
- In this chart, the vertical axis represents the segments (four in this case) and the horizontal axis represents time (the time (T) taken by each subunit to perform its task is the same, therefore, known as unit time)

13 time units are required to execute 10 instructions using 4-stage pipelining



The space-time chart (Gantt chart)

## - Data Hazard :

It occurs when either source or destination operands of an instruction are not available at the time expected in the pipeline.

A data hazard is any condition in which either the source or the destination operands of an instruction are not available at the time expected in the pipeline. As a result, some operation has to be delayed, and the pipeline stalls.

When the execution of an instruction is dependent on the results of a prior instruction that's still being processed in a pipeline, data hazards occur. If the execution is done in a pipelined processor, it is highly likely that the interleaving of these two instructions can lead to incorrect results due to data dependency between the instructions. Thus the pipeline needs to be stalled as and when necessary to avoid errors.

### Flow/True Data Dependency [RAW (or Read after Write)]:

This is when one instruction makes use of data from a previous instruction.

Example,

ADD X0, X1, X2

SUB X4, X3, X0

### Anti-Data Dependency [WAR (or Write after Read)]

When the second instruction is written to a register before the first instruction is read, this is known as a race condition. In the case of a simple structure of a pipeline, this is uncommon. WAR, on the other hand, can occur in some machines having complex and specific instructions.

Example,

ADD X2, X1, X0

SUB X0, X3, X4

### Output data dependency [WAW (or Write after Write)]

Made with Goodnotes  
This is a situation where two simultaneous instructions must write the same register in the same sequence they were issued.

## Output data dependency [WAW (or Write after Write)]

This is a situation where two simultaneous instructions must write the same register in the same sequence they were issued.

Example,

ADD X0, X1, X2

SUB X0, X4, X5

WAW and WAR hazards can only occur when instructions are executed in parallel or out of order. These occur because the same register numbers have been allotted by the compiler although avoidable.

This situation is fixed by renaming one of the registers by the compiler or by delaying the updating of a register until the appropriate value has been produced.

Modern CPUs not only have incorporated Parallel execution with multiple ALUs but also out of order issues and execution of instructions along with many stages of pipelines.

## Structural Hazard

Hardware resource conflicts among the instructions in the pipeline cause structural hazards. Memory, a GPR Register, or an ALU might all be used as resources here.

When more than one instruction in the pipe requires access to the very same resource in the same clock cycle, a resource conflict is said to arise.

In an overlapping pipelined execution, this is a situation where the hardware cannot handle all potential combinations.

Solution: For a portion of the pipeline, instructions must be performed in series rather than parallel.

## Control Hazard

Control hazards are called Branch hazards and are caused by Branch Instructions. Branch instructions control the flow of program/ instructions execution. Recall that we use conditional statements in the higher-level language either for iterative loops or with conditions checking (correlate with for, while, if, and case statements). These are transformed into one of the variants of BRANCH instructions. It is necessary to know the value of the condition being checked to get the program flow.

Thus a Conditional hazard occurs when the decision to execute an instruction is based on the result of another instruction like a conditional branch, which checks the condition's resultant value.

The branch and jump instructions decide the program flow by loading the appropriate location in the Program Counter(PC). The PC has the value of the next instruction to be fetched and executed by CPU. Consider the following sequence of instructions.

Solution For control Hazards :

Stall:

*SPRD*

Stall the given pipeline as soon as any branch instructions are decoded. Just don't allow IF anymore. Stalling reduces throughput as it always does. According to statistics, at least 30% of the instructions in a program are BRANCH. With Stalling, the pipeline is effectively operating at 50% capacity.

Prediction:

Consider a for or a while loop that is repeated 100 times. We know the program would run 100 times without the given branch condition being met. The program only exits the loop for the 101st time. As a result, it's better to let the pipeline run its course and then flush/undo when the branch condition is met. This has less of an impact on the pipeline's throttle and stalling.



Dynamic Branch Prediction :

A history record is maintained with the help of Branch Table Buffer (BTB). The BTB is a kind of cache, which has a set of entries, with the PC address of the Branch Instruction and the corresponding effective branch address. This is maintained for every branch instruction that occurs.

Reordering Instructions:

Made with GoodNotes  
Delayed branching entails reordering the instructions to move the branch instruction later in the sequence, allowing safe and beneficial instructions that are unaffected by the result of a branch to be brought in earlier in the sequence, delaying the fetch of the branch instruction. If such instructions are not available, NOP is used. The Compiler is used

# 1. DATA HAZARDS

- **Solution 1:** At the IF stage of the SUB instruction, add three bubbles. This will make it easier for SUB – ID (Instruction Decoder) to work at t6. As a result, all subsequent instructions in the pipe are similarly delayed.
- **Solution 2:** Forwarding of Data – Data forwarding is the process of sending a result straight to that functional unit that needs it: a result is transferred from one unit's output to another's input. The goal is to have the solution ready for the next instruction as soon as possible.

|                  | I1 | I2 | I3    | I4    | I5    | I6    | I7  | I8  | I9 |
|------------------|----|----|-------|-------|-------|-------|-----|-----|----|
| ADD X3, X6, X5   | IF | ID | IE    | MEN   | RW X3 | --    | --  | --  | -- |
| SUB X4, X3, X5   | -- | IF | ID X3 | IE    | MEM   | RW    | --  | --  | -- |
| OR X6, X3, X7    | -- | -- | IF    | ID X3 | IE    | MEM   | RW  | --  | -- |
| AND X8, X3, X9   | -- | -- | --    | IF    | ID X3 | IE    | MEM | RW  | -- |
| XOR X12, X3, X11 | -- | -- | --    | --    | IF    | ID X3 | IE  | MEM | RW |

Result of ADD available at ALU output here      Data forwarding

The diagram illustrates data forwarding. A red arrow points from the 'RW X3' cell in the fourth row (ADD X3, X6, X5) to the 'ID X3' cell in the fifth row (SUB X4, X3, X5). Another red arrow points from the 'X3' value in the 'RW X3' cell to the 'ID X3' cell, indicating that the result of the ADD operation is being forwarded directly to the ID stage of the SUB instruction.

Reordering instructions: Delayed branching entails reordering the instructions to move the branch instruction later in the sequence, allowing safe and beneficial instructions that are unaffected by result of a branch to be brought in earlier in the sequence, delaying the fetch of the branch instruction. If such instructions are not available, NOP is used. The compiler is used to implement this delayed branch.

### \* Numericals :

e.g. No. of clock cycles to process 200 tasks in a six-segment pipeline.

$$\text{Sol} \quad (n + m - 1) = (6 + 200 - 1) = 205$$

Draw a space-time diagram for a six-segment pipeline showing the time it takes to process eight tasks.

Solution:

| Segment | 1              | 2              | 3              | 4              | 5              | 6              | 7              | 8              | 9              | 10             | 11             | 12             | 13             |
|---------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|----------------|
| 1       | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |                |                |                |                |                |
| 2       |                | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |                |                |                |                |
| 3       |                |                | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |                |                |                |
| 4       |                |                |                | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |                |                |
| 5       |                |                |                |                | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |                |
| 6       |                |                |                |                |                | T <sub>1</sub> | T <sub>2</sub> | T <sub>3</sub> | T <sub>4</sub> | T <sub>5</sub> | T <sub>6</sub> | T <sub>7</sub> | T <sub>8</sub> |

$$(n + m - 1) = 6 + 8 - 1 = 13 \text{ cycles}$$

eg.  $t_n$  (non-pipeline) = 50 ns

$t_p$  (pipeline) = 10 ns

$n = 100$  (no. of task)

$k = 6$  (segments)

Sol: Speedup (s) =  $\frac{n^k t_n}{(n+k-1)t_p} = \frac{100 \times 500}{105 \times 10} = 4.76$

max<sup>m</sup> speedup (S<sub>max</sub>) =  $\frac{t_n}{t_p} = \frac{50}{10} = 5$

eg. 05

## QUESTION 5

Solution:

(a)  $t_p = 45 + 5 = 50$  ns      k = 3  
 (b)  $t_n = 40 + 45 + 15 = 100$  ns

(c)  $S = \frac{nt_n}{(k+n-1)t_p} = \frac{10 \times 100}{(3+9)50} = 1.67$       for n = 10

$= \frac{100 \times 100}{(3+99)50} = 1.96$       for n = 100

(d)  $S_{max} = \frac{t_n}{t_p} = \frac{100}{50} = 2$



## # Unit - 5 (Memory Organization) :

Memory hierarchy

Main memory

Aux Memory

Associative Memory

Cache Memory

Virtual Memory

Memory Management hardware

- Memory Hierarchy : Memory hierarchy is to obtain highest possible access speed while minimizing the total cost of memory system.



Main memory : (Primary mem | Volatile memory) is used to communicate directly with the CPU. CPU is mainly used to execute any task in the computer. CPU can execute only that program which will be present into the main

memory. If we are going to save any program by default it gets saved into the secondary memory. But during execution DMA / OS transfers the program to the main memory.

Auxiliary memory: (Secondary mem / Non-volatile memory) is used to store the data permanently into the comp. Eg. are magnetic disk & tapes. CPU can't access the data of auxiliary memory.

Cache memory: Small and faster memory which has higher accessibility. used to store data which is frequently used.

Main memory access time is slow but cache memory access time is fast. So if there are frequently used instructions then OS / I/O transfers the instructions of main memory to cache memory so that access will be fast.

I/O processor: is mainly used to transfer data from secondary memory to main memory.

- Main memory:

① Typical RAM chips

- Types of memory  
i) Static  
ii) Dynamic



| CS1 | CS2 | RD | WR | Memory f <sup>n</sup> | State of data bus    |
|-----|-----|----|----|-----------------------|----------------------|
| 0   | 0   | x  | x  | Inhibit               | High Impedance       |
| 0   | 1   | x  | x  | Inhibit               | High Impedance       |
| 1   | 0   | 0  | 0  | Inhibit               | High Impedance       |
| 1   | 0   | 0  | 1  | Write                 | Input data to RAM    |
| 1   | 0   | 1  | x  | Read                  | Output data from RAM |
|     |     | x  | x  | Inhibit               | High Impedance       |

Typic ROM chip



Uses of ROM

- ① Bootstrap Loader
- ② Computer startup

The **static RAM** consists essentially of internal flip-flops that store the binary information. The stored information remains valid as long as power is applied to the unit. It is very fast but the cost is high. Cache memory is designed with the help of SRAM. The **dynamic RAM** stores the binary information in the form of electric charges that are applied to capacitors. The capacitors are provided inside the chip by MOS transistors. The stored charge on the capacitors tend to discharge with time and the capacitors must be periodically recharged by refreshing the dynamic memory. Refreshing is done by cycling through the words every few milliseconds to restore the decaying charge. The dynamic RAM offers reduced power consumption and larger storage capacity in a single memory chip. The static RAM is easier to use and has shorter read and write cycles. **Main memory** is designed with the help of DRAM.

**RAM** is a volatile memory when we switch off the computer the entire contents of memory will be lost but **ROM** is a permanent memory that means it stores contents permanently. Most portion of the memory is RAM only but less is ROM.

**Bootstrap loader:** Among other things, the ROM portion of main memory is needed for storing an initial program called a bootstrap loader. The bootstrap loader is a program whose function is to start the computer software operating when power is turned on. Since RAM is volatile, its contents are destroyed when power is turned off. The contents of ROM remain unchanged after power is turned off and on again. The startup of a computer consists of turning the power on and starting the execution of an initial program. Thus when power is turned on, the hardware of the computer sets the program counter to the first address of the bootstrap loader. The bootstrap program loads a portion of the operating system from disk to main memory and control is then transferred to the operating system, which prepares the computer for general use. RAM and ROM chips are available in a variety of sizes. If the memory needed for the computer is larger than the capacity of one chip, it is necessary to combine a number of chips to form the required memory size. To demonstrate the chip interconnection, we will show an example of a 1024 x 8 memory constructed with 128 x 8 RAM chips and 512 x 8 ROM chips.

- Memory Address Map :

memory connection to CPU

- RAM and ROM chips are connected to a CPU through the data and address buses
- The low-order lines in the address bus select the byte within the chips & other lines in the address bus

select a particular chip through its chip select inputs

### CONNECTION OF MEMORY TO CPU



### Auxiliary Memory:

- ① Info org. on magnetic tapes
- ② Organization of disk hardware — Moving head disk, fixed head disk

### - characteristics of Auxiliary memory :

- ① Access mode
- ② Access Time
- ③ Transfer rate
- ④ Capacity
- ⑤ Cost

- Access Time : ① Seek Time  
② Transfer Time

## Associative Memory:

- Accessed by content of data rather than by an address
- Also called Content Addressable Memory (CAM)

Hardware org.



## - Cache memory:

### Locality of Reference

- The references to memory at a given time interval tend to be confined with localized areas.
- This area contains set of info. & the membership changes gradually over-time.

Temporal Locality: The info. which will be used in near future is likely to be in use already. (e.g. Reuse of info. in loops)

Spatial Locality: If a word is accessed, near words are likely to be accessed soon.  
(e.g. Related data items (arrays) are usually stored together; instructions are executed sequentially)

The property of Locality of References makes Cache system work.

### Performance of Cache:

- ① Memory Access — All the memory accesses are directed first to Cache.  
If the word is in Cache, access Cache to provide it to CPU.  
If " " " not " , bring a block including that word to replace one of the existing blocks.

- ② Performance:

$$T_c = T_c + (1-h) T_m$$

$h$  = hit ratio

$T_c$  = effective memory access time in Cache memory

$T_m$  = main memory access time

$T_c$  = Cache access time

## Memory and Cache Mapping:

Mapping fn: specification of correspondence b/w main and memory blocks and cache blocks

Associative mapping      Direct Mapping      Set-Associative Mapping



where do you place the words in cache → Mapping



size of line = size of block

Direct mapping : 

|     |          |           |
|-----|----------|-----------|
| Tag | line no. | Block no. |
|-----|----------|-----------|



Made with Goodnotes



fully associative mapping → koi bhi block kahi bhi aa sakte hai

↳ adv : No conflict miss (no. of hits ↑)

Physical address

| Block no. | Block offset |
|-----------|--------------|
|-----------|--------------|

tells at present kosa block woh jagah pe hai

|   |   |
|---|---|
| S | 2 |
|---|---|

$$2^S = 32 \text{ blocks} \quad 2^2 = 4 \text{ words}$$

no. of words in each block



↳ disadv. : Tag bit length has increased.

Also no. of comparisons ↑, as any block can be anywhere. (Comparisons = no. of tag bits)

## Set associative

Block replacement policy — LRU

Cache write — Write through, write back (not updated)

Virtual mem —

no. of blocks in memory = m

Made with Goodnotes

no. of pages in virtual address space = n

page replacement algo : FIFO, OPT, LRU



### - Memory Management Hardware :

- ① Dynamic Storage Relocation—mapping (Basic f's of MM)
- ② Provision for sharing common info.
- ③ Protection of info. against unauthorized access

- Segmentation : set of logically related instructions or data elements associated with a given name.

- User view of memory



## Implementation of page and Segment Tables

- Two memory accesses are needed to access a word, one for pg table, one for the word

- Logical Address

Physical Address



- Memory protection:

protection info. can be stored in the segment table or segment register

format of typical segment descriptor



Specifies Access

Rights to a particular segment

full read & write

read only

execute only

System only

FRES

$$\text{Q1} \quad \text{i) } \frac{2048 \times 8}{128 \times 8} = 16$$

$128 \times 8$  RAM to 2048  
chip byte

ii)  $2048 = 2^n \rightarrow 11$  lines

$128 = 2^7 \rightarrow 7$  common lines

iii)  $11 - 7 = 4$  lines for decoding

4x16 decoder

### Slide 15-29 :

- Association mapping

| Address | Data |
|---------|------|
| 01000   | 3450 |
| 02227   | 6710 |

Mapping table is associated in associative memory → fast, very expensive

Mapping table stores both address and the content of memory word

- Direct Mapping : (random gives unique)

- ① Each memory has only one place to load in Cache
- ② Mapping Table is made of RAM instead of CAM.
- ③ n-bit address contain 2 parts : K bit index, n-K tag field
- ④ Made with Goodnotes
- ⑤ n bit address is used to access main memory, K bit index for cache

## Addressing Relationships



## Direct Mapping Cache Organization



Operation :

- CPU generates a memory request with (Tag, Index)  
Access cache using Index ;  
(tag, data) compare Tag and tag
- If matches → Hit  
Provide Cache[Index](data) to CPU
- If not match → Miss  
 $M[\text{tag}, \text{Index}] \leftarrow \text{Cache}[\text{Index}](\text{data})$   
 $\text{Cache}[\text{Index}] \leftarrow (\text{Tag}, M[\text{Tag}, \text{Index}])$   
 $\text{CPU} \leftarrow \text{Cache}[\text{Index}](\text{data})$



## - Set Associative Mapping :

Each memory block has set of locations in the cache to load

| Index | Tag   Data | Tag   Data |
|-------|------------|------------|
|       | :          | :          |
|       | :          | :          |
|       |            |            |

← Set Ass.  
mapping cache  
with set size  
two

## Operation :

- CPU generates a memory address (Tag, Index)
- Access cache with index (Cache word = ( $\text{tag}_0, \text{data}_0$ ); ( $\text{tag}_1, \text{data}_1$ ))
- Compare Tag and  $\text{tag}_0$  and then  $\text{tag}_1$
- If  $\text{tag}_i = \text{Tag} \rightarrow \text{Hit}, \text{CPU} \leftarrow \text{data}_i$
- If  $\text{tag}_i \neq \text{Tag} \rightarrow \text{Miss}$

Replace either ( $\text{tag}_0, \text{data}_0$ ) or ( $\text{tag}_1, \text{data}_1$ )

Assume ( $\text{tag}_0, \text{data}_0$ ) is selected for replacement

$M[\text{tag}_0, \text{Index}] \leftarrow \text{Cache}[\text{Index}] (\text{data}_0)$

$(\text{cache}[\text{Index}] (\text{tag}_0, \text{data}_0) \leftarrow (\text{tag}, M[\text{tag}, \text{Index}])$

$\text{CPU} \leftarrow \text{Cache}[\text{Index}] (\text{data}_0)$

# Block Replacement Policy:

## BLOCK REPLACEMENT POLICY

Many different block replacement policies are available

LRU(Least Recently Used) is most easy to implement

Cache word = (tag 0, data 0, U0);(tag 1, data 1, U1),  $U_i = 0$  or 1(binary)

Implementation of LRU in the Set Associative Mapping with set size = 2

### Modifications

Initially all  $U_0 = U_1 = 1$

When Hit to (tag 0, data 0, U0),  $U_1 \leftarrow 1$ (least recently used)

(When Hit to (tag 1, data 1, U1),  $U_0 \leftarrow 1$ (least recently used))

When Miss, find the least recently used one( $U_i=1$ )

If  $U_0 = 1$ , and  $U_1 = 0$ , then replace (tag 0, data 0)

$M[\text{tag } 0, \text{INDEX}] \leftarrow \text{Cache}[\text{INDEX}](\text{data } 0)$

$\text{Cache}[\text{INDEX}](\text{tag } 0, \text{data } 0, U_0) \leftarrow (\text{TAG}, M[\text{TAG, INDEX}], 0); U_1 \leftarrow 1$

If  $U_0 = 0$ , and  $U_1 = 1$ , then replace (tag 1, data 1)

Similar to above;  $U_0 \leftarrow 1$

If  $U_0 = U_1 = 0$ , this condition does not exist

If  $U_0 = U_1 = 1$ , Both of them are candidates,

Take arbitrary selection

## Cache Write:

### Write Through

When writing into memory

If Hit, both Cache and memory is written in parallel

If Miss, Memory is written

For a read miss, missing block may be overloaded onto a cache

block

Memory is always updated

-> Memory is always updated DMA I/O are both executing

Slow important when Cache and DMA I/O are both executing ]

Write Back (Copy Back) memory access time

When writing into memory

If Hit, only Cache is written

When writing into memory

If Miss, missing block is brought to Cache and write into Cache

If Hit, only Cache is written

For a read miss, candidate block must be written back to memory

Memory read miss, candidate block must be

written back to memory

Cache and memory may have different value ]

Memory is not up-to-date, i.e., the same item in

Cache and memory may have different value

- Virtual memory : Logical address

memory mapping Table for virtual address



- Address Mapping : Representation of the above diagram  
for an eg.

- Associative memory page table :

Assume that

Number of Blocks in memory = m

Number of Pages in Virtual Address Space = n

Page Table

- Straight forward design -> n entry table in memory  
Inefficient storage space utilization  
<- n-m entries of the table is empty
- More efficient method is m-entry Page Table  
Page Table made of an Associative Memory m words; (Page Number:Block Number)

# - Page Fault :

## PAGE FAULT

1. Trap to the OS
2. Save the user registers and program state
3. Determine that the interrupt was a page fault
4. Check that the page reference was legal and determine the location of the page on the backing store(disk)
5. Issue a read from the backing store to a free frame
  - a. Wait in a queue for this device until serviced
  - b. Wait for the device seek and/or latency time
  - c. Begin the transfer of the page to a free frame
6. While waiting, the CPU may be allocated to some other process
7. Interrupt from the backing store (I/O completed)
8. Save the registers and program state for the other user
9. Determine that the interrupt was from the backing store
10. Correct the page tables (the desired page is now in memory)
11. Wait for the CPU to be allocated to this process again
12. Restore the user registers, program state, and new page table, then resume the interrupted instruction.



Processor architecture should provide the ability to restart any instruction after a page fault.

# - Page Replacement :

Decision on which page to displace to make room for an incoming page when no free frame is available

### Modified page fault service routine

1. Find the location of the desired page on the backing store
2. Find a free frame
  - If there is a free frame, use it
  - Otherwise, use a page-replacement algorithm to select a **victim** frame
  - Write the victim page to the backing store
3. Read the desired page into the (newly) free frame
4. Restart the user process



## - Page Replacement Algo. :

① FIFO: selects page that has been in the memory the longest time

uses queue

easy to implement

may result in frequent pg fault

② Optimal Replacement : Lowest page fault rate of all algs

Replace the pg which will not be used for longest period of time

③ LRU : OPT is difficult to implement since it requires future knowledge, Replace the pg which has not been used for longest period of time

LRU may require substantial hardware assistance

The problem is to determine an order for the frames defined by the time of last use

LRU replacement methods : (Slide 28)

① Stark

② Counter

③ LRU Approximation

## Segmentation :

A memory management scheme which supports user's view of memory

A logical address space is a collection of segments

Each segment has a name and a length

Address specify both the segment name and offset within segment  
for simplicity of implementation, segments are numbered



## segmented by System :



# Implementation of page and segment tables :

## Implementation of the Page Table

- Hardware registers (if the page table is reasonably small)
- Main memory
  - Page Table Base Register(PTBR) points to PT
  - Two memory accesses are needed to access a word; one for the page table, one for the word
- Cache memory (TLB: Translation Lookaside Buffer)
  - To speedup the effective memory access time, a special small memory called associative memory, or cache is used

## Implementation of the Segment Table

Similar to the case of the page table

Q2 : i)  $\frac{1024 \times 8}{1024 \times 1} = 8 \text{ chips}$

ii)  $\frac{16 \times 2^{10} \times 8}{1024 \times 1} = 128$

iii)  $16R = 2^{14}$   
14 total lines

$1024 = 2^{10}$  — Common lines  $\rightarrow$  10 lines specify chip address  
4 decoder lines

4 lines are decoded into 16 chip select lines

Q3 1024x8 ROM chip has 4 select line inputs, 5V power supply

Sol



① output lines = 8

① 1024x8

$2^{10} \rightarrow 10$  common lines

② 4 select lines

③ Power pin = 2

$$\text{Total pins} = 8 + 10 + 4 + 2 \\ = 24$$

Q4 RAM chips =  $256 \times 8$

ROM chips =  $1024 \times 8$

Need = 2K RAM, 4K ROM

4 interface unit  
4 register

Sol  $\frac{2 \times 2^{10} \times 8}{256 \times 8} = 2^3 = 8$  RAM chips

$\frac{4 \times 2^{10} \times 8}{1024 \times 8} = 4$  ROM chips

$4 \times 4 = 16$  register ( $2^4 = 16$ )

| Component | Address   | 16 | 15 | 14 | 13 | 12                                   | 11                                   | 10 | 9    | 8765 | 4321 |
|-----------|-----------|----|----|----|----|--------------------------------------|--------------------------------------|----|------|------|------|
| RAM       | 0000-07FF | 0  | 0  | 0  | 0  | 0                                    | $\xleftarrow{3 \times 8}$<br>decoder |    | xxxx | xxxx |      |
| ROM       | 4000-4FFF | 0  | 1  | 0  | 0  | $\xleftarrow{2 \times 4}$<br>decoder |                                      | xx | xxxx | xxxx |      |
| Interface | 8000-800F | 1  | 0  | 0  | 0  | 0                                    | 0                                    | 0  | 0000 | xxxx |      |

D5 128 segments, Each seg. - 32 pages, 4K words

4K blocks of 4K words in each, logical & phy. address format?

$$\text{SOL} \quad \text{Logical address space} = 128 \times 32 \times 4K = 2^7 \times 2^5 \times \underbrace{2^2 \times 2^{10}}_{2^4} = 2^{24}$$

$$\text{Phy} = 4K \times 4K = 2^{12} \times 2^{12} = 2^{24}$$



## # Multiplication Algorithm :



$$\text{eg. } 7 \times 3 = 21$$

Sol<sup>n</sup>

$$7 = 0111 \text{ (M)}$$

$$3 = 0011 \text{ (Q)}$$

$$1) \quad \begin{array}{ccc} A & Q & Q_{-1} \\ 0000 & 0011 & 0 \end{array}$$

operations

$$A - M \Rightarrow A + 2^3 C \text{ of } M \quad 1001$$

$$0000 + 1000 + 1$$

$$2) \quad \begin{array}{ccc} & 1001 & 0011 \\ \swarrow \searrow & \swarrow \searrow & \swarrow \searrow \\ 1100 & 1001 & 0 \end{array}$$

shift operation

$$3) \quad \begin{array}{ccc} & 1110 & 0100 \\ \swarrow \searrow & \swarrow \searrow & \swarrow \searrow \\ 0101 & 0100 & 1 \end{array}$$

$$A + M \Rightarrow 1110 + 0111 = 1\ 0101$$

$$4) \quad \begin{array}{ccc} & 0010 & 1010 \\ \swarrow \searrow & \swarrow \searrow & \swarrow \searrow \\ 0001 & 0101 & 0 \end{array}$$

shift operation

$N=0$

$$7 \times 3 = 0001\ 0101 = 21$$



e.g.  $-9 \times -13 = ?$

Sol:  $-9 = 11001$  (M) | 10111  
 $-13 = 11101$  (Q) | 10011

|   | A     | Q     | Q_{-1} | operation                                    |
|---|-------|-------|--------|----------------------------------------------|
| ① | 00000 | 10011 | 0      | $00000 + \overline{10111} + 1 = 01001$ (sub) |
|   | 01001 | 10011 | 0      | step                                         |

↓↓↓↓↓    ↓↓↓↓↓  
00100    11001    |

② 00100    11001    |    Shr  
↓↓↓↓↓    ↓↓↓↓↓  
00010    01100    |

③ 00010 01100 |    Add"  
↓↓↓↓↓    ↓↓↓↓↓  
11001 01100 |  
↓↓↓↓↓    ↓↓↓↓↓  
11100 10110 0    Shr

④ 11100 10110 0    Shr  
↓↓↓↓↓    ↓↓↓↓↓  
01110 01011

$$\begin{array}{r} & 1 \\ 00010 & - \\ 10111 & - \\ \hline 11001 & \end{array}$$

# DMA :

- Input / Output Interface :

Provides a method for transferring info. b/w internal storage and external I/O devices

