

①

## UNIT - 3

### CONTROL UNIT

Instruction format :-

what is instruction?

program: A sequence of instructions

```

main()
{
    ...
    c = a+b
}

```

} get contain list of instructions.

Instruction: An instruction is a command given to the computer to perform a specific operation.

⇒ An instruction is a group of bits (binary code) that instructions the computer to perform a specific operation.

⇒ The purpose of an instruction is to specify both operation to be performed by CPU and the set of operands (or data).

⇒ operation included add, sub, DIV, mul shift etc. operations are performed on data (operands).

M.S

⇒ Operands: included the input data and the results that are produced.

What is instruction format?

- ⇒ An instruction format is a binary format which specifies a computer instruction.
- ⇒ The bits of the instructions are divided into groups called field.

|      |        |                   |
|------|--------|-------------------|
| Mode | opcode | Address (operand) |
|------|--------|-------------------|

(Instruction format (with mode field))

The most common field in instruction format are.

① opcode (operation code) field specifies the operation to be performed such as add, subtract, multiply, shift, complement etc.

② Address: An Address field specifies a memory address or processor register where operand is stored.

③ mode: A mode field specifies the way to operand or the effective address of the operand is determined (or located).

M.S

## CPU organization, and Types of Instructions.

⇒ computers may have instructions of several different length containing varying number of addresses (address field may be 3, 2, 1, or 0).

The number of address field in the instruction format of a computer depends on the internal organization of its registers.

⇒ most computers fall into one of three types of CPU organization.

- ① Single Accumulator organization.
- ② General Register organization.
- ③ Stack organization.

## Types of Instructions format / Types of address instructions

- ① Three - address instruction format.
- ② Two - address instruction format
- ③ One - address instruction format
- ④ zero - address instruction format.

M-S

Three and two instruction format - general register organization

one-address - instruction format - single accumulator organization.

zero-address - instruction format - stack Based organization

Classification of Instructions Based On CPU organization:-

① Single Accumulator organization:-

⇒ All operations are performed on an implied accumulator (AC) register.

⇒ with one operand implicitly in the accumulator register minimizes the internal complexity of a machine and allow for short instructions.

⇒ The instruction format uses only one address field (i.e one address instruction format).

② General Register organization:-

⇒ It uses of general purpose registers ( $R_0, R_1, R_2, \dots$ ) one of the most widely accepted models for machine architecture today.

M.S

(5)

⇒ Specifies all operands explicitly.

⇒ The instruction format needs 2 or 3 address fields according to the operation (i.e. two or 3-address format)

ex ADD R<sub>1</sub>, R<sub>2</sub>, R<sub>3</sub> // R<sub>1</sub> ← R<sub>1</sub> + R<sub>2</sub>

This can be reduced from three or two address if the destination register is the same as one of the source registers.

ADD R<sub>1</sub>, R<sub>2</sub> // R<sub>1</sub> ← R<sub>1</sub> + R<sub>2</sub>

MOV R<sub>1</sub>, R<sub>2</sub> // R<sub>1</sub> ← R<sub>2</sub>

ADD R<sub>1</sub>, X // R<sub>1</sub> ← R<sub>1</sub> + M[X]

Each address field may specify a processor register or a memory operand.

Stack organization :-

⇒ The operands are put into the stack and the operations are carried out on the top of the stack (TOS). The operands are implicitly specified on (TOS)

M.S

- ⇒ It does not use an address field for the instructions like ADD, MUL, SUB DIV etc. (i.e zero - address instruction format)
- ⇒ It does not use an address field for
- ⇒ PUSH and POP instructions are used to communicate with Stack which require an address field.

Note:- No Address field is required.

Ex

|      |   |
|------|---|
| PUSH | A |
| PUSH | B |
| ADD  |   |
| POP  | C |

Here ADD consists of any opcode and no address field. It has the effect of popping the top two numbers from the stack thus all the operands are implied to be in the stack.

Types of instruction format / Types of Address instructions:-

- 
- ① Three - Address Instruction format
  - ② two - address - Instruction format
  - ③ zero - address - Instruction format

## Three-address instructions:-

| opcode | Destination Address | Source Address 1 | Source Address-2 |
|--------|---------------------|------------------|------------------|
|--------|---------------------|------------------|------------------|

- ⇒ Three - Address instruction format has three address field.
- ⇒ one Address field is used for destination and two address fields for source.
- ⇒ Each Address field may specify a Processor register or memory operand.
- ⇒ It is also called general register organization.

Ex      ADD    R<sub>1</sub>, R<sub>2</sub>, R<sub>3</sub>    // R<sub>1</sub> ← R<sub>2</sub> + R<sub>3</sub>  
           ADD    R<sub>1</sub>, A, B      // R<sub>1</sub> ← M[A] + M[B]

Assembly languages  
notation (ALN)

Register transfer  
notation (RTN).

## Two - address instructions:-

| opcode | Destination Address | Source Address |
|--------|---------------------|----------------|
|--------|---------------------|----------------|

- ⇒ two address instructions are most common in commercial computers.

M.S

- ⇒ It has two address fields.
- ⇒ Each address field may specify a processor register or a memory operand.

|           |                                                                                     |                                                                                                    |
|-----------|-------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------|
| <u>Ex</u> | $\begin{array}{c} \downarrow \text{rs} \\ \text{R}_1, \quad \text{R}_2 \end{array}$ | $\begin{array}{c} \uparrow \text{rd} \\ \text{R}_1 \leftarrow \text{R}_1 + \text{R}_2 \end{array}$ |
|           | mov      R <sub>1</sub> R <sub>2</sub>                                              | // R                                                                                               |

### One Address Instructions:-

|        |                     |
|--------|---------------------|
| opcode | Operand Address S/D |
|--------|---------------------|

⇒ It has only one address field. The operand specified in the instruction may be source or destination depending on instruction. ~~on~~ instruction.

⇒ One address instructions use a implied Accumulator (AC) register for all data manipulation.

⇒ Implied means that CPU already know that one operand is in accumulator so there is no need to specify it.

⇒ All operations done between accumulator register and memory operand.

⇒ It is also called single accumulator organization.

Ex ADD X // AC  $\leftarrow$  AC + M[X]

Ex Evaluate C = A + B (using one-address format)

so LOAD A // AC  $\leftarrow$  M[A]

ADD B // AC  $\leftarrow$  AC + M[B]

STORE C // M[C]  $\leftarrow$  AC

LOAD and STORE instruction:- used for transfers to and from memory to AC register.

LOAD instruction:- operand (like A) specifies the source operand and the destination location is AC register.

Ex LOAD A // AC  $\leftarrow$  M[A]

STORE instruction:- operand like (C) specifies the destination location and the source is AC register.

e.g. STORE C // M[C]  $\leftarrow$  AC

## zero instruction - format:-

opcode

- ⇒ In zero Address instructions Stack is used. Also called stack based organization.
- ⇒ A stack based computer does not use an address field in the instructions. (like ADD, MUL).
- ⇒ PUSH and POP instructions, however need an Address field to specify the operand that communicate with stack.
- ⇒ To evaluate a arithmetic expression first it is converted to Reverse Polish notation i.e Postfix notation.
- ⇒ The name "zero-address" is given to this type of computer of the absence of a address field in instruction.

Ex

$$C = A + B$$

|      |   |                           |
|------|---|---------------------------|
| PUSH | A | // TOS $\leftarrow A$     |
| PUSH | B | // TOS $\leftarrow B$     |
| ADD  |   | // TOS $\leftarrow (A+B)$ |
| POP  | C | // $M[C] \leftarrow TOS$  |



NOTE :-

① Always use minimum number of instruction  
(So that the program length can be minimized)

② Always use minimum number of Registers  
(Registers can be reused)

$$\text{Ex} \quad x = x + y * z - t/u$$

so convert the expression into postfix

$$x + y * z - t/u$$

$$\Rightarrow x + y z * - t/u$$

$$\Rightarrow x + T - t/y$$

$$\Rightarrow x + T - Y$$

$$\Rightarrow x \underset{z}{\underset{\lceil}{+}} - Y$$

$$\Rightarrow Z - Y$$

$$\Rightarrow ZY -$$

but the value Z and Y and T

M.S

$xT + y -$

$xyz * + tu / -$  post-fix

$x = xyz * + tu / -$

zero Address instruction

PUSH  $x$

top of Stack(TOS)

TOS -  $x$

PUSH  $y$

TOS -  $y$

PUSH  $z$

TOS -  $z$

MUL

$TOS = (y * z)$

ADD

$TOS = \underbrace{x}_{M} + (y * z)$

PUSH  $t$

TOS =  $M$

PUSH  $u$

TOS -  $t$

DIV

TOS -  $u$

SUB

TOS  $\rightarrow t/u$

POP  $x$

NOTE:

(1) Infix (2) Prefix (3) Postfix

$A + B$

$+ A B$

$A B +$

① Precedence

$* / ()$

② Associativity

$A + B - C \rightarrow$

MS

Q<sub>2</sub>  $x = (A+B) * (C+D)$  using zero address instruction format.

Sol convert the expression into postfix

$$x = AB + CD + * \quad \text{postfix}$$

zero Address instruction format

PUSH A

TOS  $\leftarrow A$

PUSH B

TOS  $\leftarrow B$

ADD

TOS  $\leftarrow (A+B)$

PUSH C

TOS  $\leftarrow C$

PUSH D

TOS  $\leftarrow D$

ADD

TOS  $\leftarrow (C+D)$

MUL

TOS  $\leftarrow \underbrace{(C+D) * (A+B)}$   
M

POP X

// M(x)  $\leftarrow TOS$

Q<sub>3</sub>  $x = P + Q/R - S * T$  (using zero instruction format)

Sol convert the expression into postfix

$$x = P + Q R / - S T *$$

$$X = P \otimes R / + S * A - \quad \text{post-fix}$$

zero instruction format

PUSH P // TOS  $\leftarrow P$

PUSH Q // TOS  $\leftarrow Q$

PUSH R // TOS  $\leftarrow R$

DIV // TOS  $\leftarrow Q/R$

ADD // TOS  $\leftarrow P+Q/R$

PUSH S // TOS  $\leftarrow S$

PUSH T // TOS  $\leftarrow T$

MUL // TOS  $\leftarrow S * T$

SUB // TOS  $\leftarrow P+Q/R-S*T$

POP X // M[X]  $\leftarrow TOS$

Ex one-address instruction format:-

$$R = ((x + (y * z)) - (t / u))$$

M-S

Sof Using one-instruction format

|       |       |                                                                              |
|-------|-------|------------------------------------------------------------------------------|
| LOAD  | $y$   | $\text{AC} \leftarrow y$                                                     |
| MUL   | $z$   | $\text{AC} \leftarrow \underbrace{\text{AC} * z}_{y * z}$                    |
| ADD   | $x$   | $\text{AC} \leftarrow \text{AC} + x$<br>$\text{AC} \leftarrow y * z + x$     |
| STORE | $R_1$ | $R_1 \leftarrow x + (y * z)$                                                 |
| LOAD  | $t$   | $\text{AC} \leftarrow t$                                                     |
| DI V  | $u$   | $\text{AC} \leftarrow t/u$                                                   |
| STORE | $R_2$ | $R_2 \leftarrow t/u$                                                         |
| LOAD  | $R_1$ | $\text{AC} \leftarrow R_1$<br>$\text{AC} \leftarrow x + (y * z)$             |
| SUB   | $R_2$ | $\text{AC} \leftarrow R_1 - R_2$<br>$\text{AC} \leftarrow x + (y * z) - t/u$ |
| STORE | $R$   | $\text{M}[R] \leftarrow \text{AC}$                                           |

M-S

Ex 2 Evaluate  $C = A + B$  using one address instruction format

Sy  $C = A + B$

LOAD A //  $AC \leftarrow M[A]$

ADD B //  $AC \leftarrow AC + M[B]$

STORE C //  $M[C] \leftarrow AC$

Ex 3  $X = \frac{A + B * C}{D - E * F + G * H}$

using one-Address-Instruction format.

Sy  $X = \frac{A + B * C}{D - E * F + G * H}$

LOAD E  $AC \leftarrow M[E]$

MUL F  $AC \leftarrow AC * M[F]$

$$(AC \leftarrow E * F)$$

STORE T  $M[T] \leftarrow E * F$  (Temporary

LOAD D  $AC \leftarrow M[D]$  Store value in Register)

M.S

Note! Always use minimum number of instructions and minimum number of distinct registers.

example - two - instruction format

Q1

$$x = \frac{(A + B * C)}{(D - E * F + G * H)}$$

(using two - instruction format).

Sol

$$x = \frac{(A + B * C)}{\underbrace{C D - \underbrace{E * F}_{R_2} + \underbrace{(G * H)}_{R_3}}_{R_2}} \rightarrow R_1$$

|     |            |                             |
|-----|------------|-----------------------------|
| MOV | $R_1, B$   | $R_1 \leftarrow M[B]$       |
| MUL | $R_1, C$   | $R_1 \leftarrow R_1 * M[C]$ |
| ADD | $R_1, A$   | $R_1 \leftarrow R_1 + M[A]$ |
| MOV | $R_2, D$   | $R_2 \leftarrow M[D]$       |
| MOV | $R_3, E$   | $R_3 \leftarrow M[E]$       |
| MUL | $R_3, F$   | $R_3 \leftarrow R_3 * M[F]$ |
| SUB | $R_2, R_3$ | $R_2 \leftarrow R_2 - R_3$  |
| MOV | $R_3, G$   | $R_3 \leftarrow M[G]$       |

M-S

$$R \rightarrow ((X + (Y * Z)) - (D / 4))$$

SUB T  $AC \leftarrow D - E * F$

STORE T  $M[T] \leftarrow AC$

LOAD G  $AC \leftarrow M[G]$

MUL H  $AC \leftarrow AC * M[H]$   
 $(AC \leftarrow G * H)$

ADD T  $AC \leftarrow AC + M[T]$

STORE T  $M[T] \leftarrow D - E * F + G * H$

LOAD B  $AC \leftarrow M[B]$

MUL C  $AC \leftarrow AC * M[C]$

ADD A  $AC \leftarrow AC + M[A]$

DIV T  $AC \leftarrow AC / M[T]$

STORE X  $M[X] \leftarrow AC$

Ex-3  $x = (A+B) * (C+D) + Q$

Ex-4  $x = A - B + C * (D * E - F)$

Do yourself  
address instruction format

[M.S]

Ex-2

$$x = \frac{A + B * C}{D - E * F + G * H}$$

By using three address instruction format.

Sol

$$x = \frac{A + B * C}{D - E * F + G * H}$$

|     |                 |                                   |
|-----|-----------------|-----------------------------------|
| MUL | $R_1, B, C$     | $R_1 \leftarrow M[B] * M[C]$      |
| ADD | $R_1, R_1, A$   | $R_1 \leftarrow R_1 + M[A]$       |
| MUL | $R_2, E, F$     | $R_2 \leftarrow M[E] * M[F]$      |
| SUB | $R_2, D, R_2$   | $R_2 \leftarrow M[D] - R_2$       |
| MUL | $R_3, G, H$     | $R_3 \leftarrow M[G] * M[H]$      |
| ADD | $R_2, R_2, R_3$ | $R_2 \leftarrow R_2 + R_3$        |
| DIV | $x, R_1, R_2$   | $M[x] \leftarrow \frac{R_1}{R_2}$ |

Ex-3 two - Address instruction format.

$$x = \frac{A + B * C}{D - E * F + G * H}$$

by using two - address instruction format

M.S

$$X = \frac{A + B * C}{D - E * F + G * H}$$

|     |            |                             |
|-----|------------|-----------------------------|
| MOV | $R_1, B$   | $R_1 \leftarrow M[B]$       |
| MUL | $R_1, C$   | $R_1 \leftarrow R_1 * M[C]$ |
| ADD | $R_1, A$   | $R_1 \leftarrow R_1 + M[A]$ |
| MOV | $R_2, D$   | $R_2 \leftarrow M[D]$       |
| MOV | $R_3, E$   | $R_3 \leftarrow M[E]$       |
| MUL | $R_3, F$   | $R_3 \leftarrow R_3 * M[F]$ |
| SUB | $R_2, R_3$ | $R_2 \leftarrow R_2 - R_3$  |
| MOV | $R_3, G$   | $R_3 \leftarrow M[G]$       |
| MUL | $R_3, H$   | $R_3 \leftarrow R_3 * M[H]$ |
| ADD | $R_2, R_3$ | $R_2 \leftarrow R_2 + R_3$  |
| DIV | $R_1, R_2$ | $R_1 \leftarrow R_1 / R_2$  |
| MOV | $x, R_1$   | $M[x] \leftarrow R_1$       |

M.S.

|     |                                 |  |                                                  |
|-----|---------------------------------|--|--------------------------------------------------|
| MUL |                                 |  |                                                  |
| ADD | R <sub>1</sub> , A              |  | R <sub>1</sub> ← R <sub>1</sub> + m[A]           |
| MOV | R <sub>2</sub> , D              |  | R <sub>2</sub> ← m[D]                            |
| DIV | R <sub>2</sub> , E              |  | R <sub>2</sub> ← D/E                             |
| SUB | R <sub>1</sub> , R <sub>2</sub> |  | R <sub>1</sub> ← R <sub>1</sub> - R <sub>2</sub> |
| MOV | X, R <sub>1</sub>               |  | m[X] ← R <sub>1</sub>                            |

Ex-y   x = 
$$\frac{A - B + C * (D * E - F)}{G + H * I}$$

write code two - address instruction format. ~~Three~~  
do your self

Three - Address instruction format examples

Ex-y   A + B \* C - D/E

|     |                |                                 |                                     |
|-----|----------------|---------------------------------|-------------------------------------|
| MUL | R <sub>1</sub> | B, C                            | R <sub>1</sub> ← B * C              |
| ADD | R <sub>1</sub> | R <sub>1</sub> , A              | R <sub>1</sub> ← R <sub>1</sub> + A |
| DIV | R <sub>2</sub> | D, E                            | R <sub>2</sub> ← D/E                |
| SUB | X              | R <sub>1</sub> , R <sub>2</sub> | X ← R <sub>1</sub> - R <sub>2</sub> |

(Result)

[M-S]

|     |            |                             |
|-----|------------|-----------------------------|
| MUL | $R_3, H$   | $R_3 \leftarrow R_3 * M[H]$ |
| ADD | $R_2, R_3$ | $R_2 \leftarrow R_2 + R_3$  |
| DIV | $R_1, R_2$ | $R_1 \leftarrow R_1 / R_2$  |
| MOV | $x, R_1$   | $M[x] \leftarrow R_1$       |

Ex-2  $x = (A+B) * (C+D)$  using two  
Address instruction format.

Sol  $x = (A+B) * (C+D)$

|     |            |                             |
|-----|------------|-----------------------------|
| MOV | $R_1, A$   | $R_1 \leftarrow M[A]$       |
| ADD | $R_1, B$   | $R_1 \leftarrow R_1 + M[B]$ |
| MOV | $R_2, C$   | $R_2 \leftarrow M[C]$       |
| ADD | $R_2, D$   | $R_2 \leftarrow R_2 + M[D]$ |
| MUL | $R_1, R_2$ | $R_1 \leftarrow R_1 * R_2$  |
| MOV | $x, R_1$   | $M[x] \leftarrow R_1$       |

Ex-3  $A+B*C - D/E$  by using two  
instruction format?

Sol  $x = A + B * C - D / E$

|     |          |                             |
|-----|----------|-----------------------------|
| MOV | $R_1, B$ | $R_1 \leftarrow B$          |
| MUL | $R_1, C$ | $R_1 \leftarrow R_1 * M[C]$ |

M.S

## Computer Instruction:-

- ⇒ Memory reference
- ⇒ Register reference
- ⇒ I/O instruction

All three are 16-bit

## Memory reference:-



## Register reference:-



Register Reference }      {  
                    opcode = 111  
                    Addmode = 0

## I/O instruction:-



Add. mode = 1

Opcode = 111

→ I/O instruction

## Types of Instructions:-

- ⇒ Data transfer Instructions
- ⇒ Data manipulation Instructions
- ⇒ Program Control Instructions

### Data Transfer Instructions:-

Transfer data from one location to another without changing the binary information content.

### Data manipulation Instructions:-

Perform Arithmetic Logic, Shift operation.

### Program Control Instructions:-

It provides decision making capabilities & change the path taken by the program when executed in the computer.

### Data Transfer Instructions:-

| Name     | Mnemonic  |
|----------|-----------|
| Load     | LD / LOAD |
| Store    | ST        |
| Move     | MOV       |
| Exchange | XCH       |
| Input    | IN        |
| Output   | OUT       |
| Push     | PUSH      |
| Pop      | POP       |

## (2) Data manipulation Instructions:-

- ⇒ Arithmetic Instructions
- ⇒ Logical & bit manipulation
- ⇒ Shift instructions

| Name      | Mnemonic |
|-----------|----------|
| Increment | INC      |
| Decrement | DEC      |
| Add       | ADD      |
| Subtract  | SUB      |
| Multiply  | MUL      |
| Divide    | DIV      |

## Logical & Bit manipulation:-

| Name         | Mnemonic |
|--------------|----------|
| Clear        | CLR      |
| complement   | COM      |
| AND          | AND      |
| OR           | OR       |
| Exclusive OR | XOR      |

### Shift Instruction:

| Name                   | Mnemonic |
|------------------------|----------|
| Logical Shift right    | SHR      |
| Logical Shift Left     | SHL      |
| Arithmetic Shift right | SHRA     |
| Arithmetic Shift Left  | SHLA     |
| Rotate Left            | ROL      |
| Rotate Right           | ROR      |

### (3) Program Control Instruction:-

| Name   | Mnemonic |
|--------|----------|
| Branch | BR       |
| Jump   | JMP      |
| Skip   | SKP      |
| Call   | CALL     |
| Return | RET      |

MICRO OPERATIONS! - The operation executed on data stored in register is called micro-operations.  
ex shift, load, clear, count.

Types of micro operations! -

- (1) Register transfer micro operation! - transfer binary informations.
- (2) Arithmetic micro operation! - Perform arithmetic operation on numeric data stored in registers.
- (3) Logical micro operation! - Perform arithmetic operation on numeric data stored in registers.  
Perform bit manipulation operations on data stored in registers.
- (4) Shift micro operation! - Perform shift operations on data stored in registers

## INSTRUCTION CYCLES:-

Basic function of a computer is execution of program.

- \* A Program is a set of instructions to process an instruction.
- ① The Processor reads (fetches) an instruction from memory.
- ② The Processor executes the instruction.
- ③ Execution may involve several operations.

## Basic Instruction cycle:



⇒ execution cycle for a particular operation is dependent on the type of operation.

⇒ Reference to memory can be one/many.

⇒ Instead of reference of instruction can specify an I/O operation.

Interrupts and the instruction cycle:-

virtually, I/O devices may interrupt the normal processing of program.

On this account, execution is suspended control gets transferred.

To serve this purpose, an interrupt cycle is added to instruction cycle.

Instruction cycle with interrupt:-



M-S

## Instruction cycle :-



AR → Address Register

PC → Program Counter

IR → Instruction Register

D<sub>7</sub> → Decoder output

I → Instruction format

SC → Sequence controller

T<sub>0</sub>, T<sub>1</sub>, T<sub>2</sub>, T<sub>3</sub> → Clock cycle

Decoding Instruction format



opcode → 000 → D<sub>6</sub>



## RISC (Reduced instruction set computer)



(RISC ARCHITECTURE)

## Characteristics of RISC Processor

- ⇒ Ability to execute one instruction per clock cycle.
- ⇒ efficient pipelining.
- ⇒ RISC Processor consists mostly register-to-register operations.
- ⇒ It consists of relatively few instructions (< 50)

M.S

⇒ It consists of relatively few addressing modes.

⇒ It consists of fixed length instructions format and easily decoded instruction format.

⇒ RISC processor contains Hardwired control unit rather than microprogrammed control unit.

⇒ very few Instructions refer memory.  
⇒ memory access limited to load and store Instructions.

⇒ Complexity is there in the compiler.

⇒ It consists of multiple register set.

⇒ All operations done with in the registers of the CPU. Hence A relatively large number in the Processor.

⇒ Use of overlapped register windows to speed-up procedure call and return.

⇒ Compiler support for efficient translation of High-level language programs to machine-level language programs.

⇒ An Average CPI is less than 1.5 RISC Processors.

CISC (Complex instruction set compute):



(CISC architecture)

Characteristics of CISC architecture:-

- ⇒ variable length instruction format used by CISC architecture.
- ⇒ Large number of Instructions. typically from 100 to 250 instructions.
- ⇒ A large variety of Addressing modes, typically from 5 to 20 different Addressing modes.

M.S

⇒ Large number of Instructions, typically from 100 to 250 instructions.

⇒ A large variety of Addressing modes typically from 5 to 20 different Addressing modes.

⇒ Complex instructions take multiple cycles to execute. An average CPI is between 2 and 15.

⇒ most of the instructions may refer memory to manipulate operands in memory.

⇒ Complexity in the microProgram.

⇒ Execution time is more.

⇒ Instructions are larger than one word size.

⇒ It supports more data types.

⇒ Single register set.

⇒ Instructions decoding logic is complex.

⇒ CISC Architecture contains a microProgrammed control unit not a hardwired control unit.

M.S

## Microprogrammed control unit:-

A control unit whose binary control values are saved as words in memory is called a micro Programmed control Unit.

### Advantages:-

- ① micro - Program can be easily.
- ② flexible.
- ③ Better in terms of scalability than Hardwired.

### Disadvantages:-

- ① Hard ware cost is more because of control memory and its access circuitry
- ② slower than hardwired.

## Advantage and Disadvantage in Hardwired:-

### Advantage:-

- ① faster Processing.
- ② simple to implement or configure.

### Disadvantage:-

- ① It is not applicable to change the structure and Instruction set once it is developed.

- ② The design of the computer is complex.
- ③ The architecture and instruction set are not specified.
- ④ It is quick.
- ⑤ It has a processor to create signals to be executed in the right sequence.
- ⑥ It operates through the need for flip-flops, chips, and sequential circuits.

### micro Programmed:-

- ① It is applicable to make modification by changing the micro program saved in the control memory.
- ② The design of the computer is simplified.
- ③ The architecture and instruction set is specified.
- ④ It is moderate comparatively.
- ⑤ It facilitates the micro sequences from which instruction bits are decoded and executed.
- ⑥ It controls the sub-devices including ALU, register, bus, instruction register.

M.S

## HARDWIRED CONTROL UNIT:

A control unit can be implemented basically by two techniques.

- ① Hardwired Implementation
- ② micro - Programmed implementation.

⇒ Hardwired control unit is implemented as a sequential logic circuit or a finite state machine. That generates a specific sequence of control signals.

### Structure:



It uses fixed logic. Interprets and then generates corresponding control signals for design.

- ⇒ Amount of hardware.
- ⇒ Speed of operation
- ⇒ Cost.

M.S

These are four techniques for design of Hardwired control unit.

(1) State-table method.

(2) minimizes Hardware.

(3) constructs a state transition table.

(4) every generation of states has a set of control signals.

(2) Delay-element method:-

control signals follow a proper sequence.

A specific time delay between two groups of control signals.

To ensure synchronisation, D-flip flops are controlled by a common clock signal.

(3) Sequence - counter method:-

It uses counter for timing purposes.

(4) PLA method:- It uses Programmable logic array.

## PIPELINE:-

Pipelining is the process of arrangement of hardware elements of CPU such that its overall performance is increased.

Simultaneously execution of more than one instruction takes place in pipelined processor.

In pipelining multiple instruction are overlapped in execution.

## Sequential execution:-



## NON Pipelining Representation:-

Phase = 3      Instructions = 4

M.S



$$\text{Total clock cycle} = n \times k$$

$$= 3 \times 4$$

$$= 12$$

$$= P \times n \text{ or } k \times n.$$

where  $\Rightarrow k$  is number of Phase

$\Rightarrow n$  is number of Instruction.

Pipelining clock cycle:-





SpaceTime  
Diagram  
or  
PhaseTime  
Diagram

IF  
ID  
EX  
MEM  
WB

$$\begin{aligned}
 \text{total clock cycle} &= k + n - 1 \\
 &= 3 + 4 - 1 \\
 &= 6 \text{ sec}
 \end{aligned}$$

Factors in pipeline architecture:-

- ① Speed up Ratio - S<sub>k</sub>
- ② frequency - f
- ③ Throughput - H<sub>k</sub>
- ④ efficiency - E<sub>k</sub>



It is a Buffer/Registers  
that holds instructions  
temporarily.

$t_{in} \Rightarrow$  time for complete execution of  
instruction (NP) non pipeline.

$t_p \Rightarrow$  time taken in Phase.

$t_{cd} \Rightarrow$  delay time (time taken in  
buffer or/ Registers)

$$T = t_p + t_{cd}$$

Time taken by one Phase =  $T_{(tau)}$

① Speed up Ratio  $S_k \Rightarrow$

M.S

$$S_K = \frac{\text{time taken in } N_p}{\text{time taken in } P}$$

$$= \frac{n \times t_m}{(K+m-1) \times T}$$

$m \geq K$   
↳ much greater

$$n = 1000$$

$$K = 3$$

$$\Rightarrow 1000 - 1 + 3$$

$$\Rightarrow 1002$$

$$K + (m-1) \leq n$$

$$= \frac{x \times t_m}{x \times T}$$

$$S_K = \frac{t_m}{T}$$

$$\textcircled{2} \quad \text{Frequency} = \frac{1}{T}$$

$$f = \frac{1}{T}$$

$$\textcircled{3} \quad \text{Throughput } (H_K) \Rightarrow$$

Number of task completed per unit time.

$$H_K = \frac{n}{(K+(m-1)) \times T}$$

M.S

$$= \frac{n}{(k+n-1)} \times \frac{1}{T}$$

$$H_K = \frac{n}{k+n-1} \times f$$

$$H_K = \frac{n \times f}{k+n-1}$$

④ efficiency  $E_K \Rightarrow$

$$E_K = \frac{s_k}{k}$$

$$E_K = \frac{n \times t_m}{(k+n-1) \times T}$$

$$E_K = \frac{n \times k}{k+n-1}$$

$$\begin{aligned} &= \frac{n \times t_m}{\frac{s_k}{k} \times t_m} \\ &= \frac{n}{\frac{s_k}{k}} \\ &= \frac{n}{\frac{t_m}{K}} \end{aligned}$$

## Pipeline Performance:-

$n$ : Number of tasks to be performed  
conventional machine (Non-pipelined).

$t_m$ : clock cycle

M.S



4 clock cycle pipeline is full when one task completed.

$$\text{task} = 4 \quad \text{stage} = 4$$

### Linear Pipeline Processor:-

Linear Pipeline Processor is a cascade of processing stage which are linearly connected to perform a fixed function over a stream of data following from one end to another.

⇒ A Linear Pipeline Processor is constructed with  $k$  Processing Stages.

M.S

$T_1$  : Time required to complete  $n$  tasks.

$$T_1 = n \times t_m$$

pipelined machine ( $K$  stages)

$t_p$  : clock cycle (time to complete each sub operation)

$T_K$  : Time required to complete  $n$  tasks.

$$T_K = (K+n-1) \times t_p$$

Speed up

$S_K$  : speed up

$$S_K = \frac{n \times t_m}{(K+n-1)t_p}$$

$$\lim_{n \rightarrow \infty} S_K = \frac{t_m}{t_p} = (k \text{ if } t_m = k \cdot t_p)$$

M.S

- ⇒ External input is supplied to the pipeline from stage  $S_1$
- ⇒ The processed request is passed from stage  $S_{i+1}$ , for all  $i = 1, 2, 3, \dots, k-1$
- ⇒ The final request is produced from the last pipeline stage  $S_k$ .



$$1 \text{ Hz} = 1 \text{ cycle/second}$$

$$1 \text{ sec} = 1 \times 10^6 \text{ usec}$$

$$S_k = \frac{T_1}{T_2}$$

$$S_k = \frac{m \times d_m}{(k+m-1) d_p}$$

(NP)
(P)

if  $m$  values increased,  
maximum speed is  $= k$   
maximum Speed upratio  $S_k = k$

M.S

Ex-1 Determine the number of clock cycles that it takes to process 200 tasks in a 6 segment (Phase) pipeline.

Sol  $K = 6$   $m = 200$

$$\begin{aligned}\text{Number of clock cycle} &= K + m - 1 \\ &= 6 + 200 - 1 \\ &= 205 \text{ clock cycle}\end{aligned}$$

Af

Ex-2 A non-pipeline system takes 50 ns to process a task. The same task can be processed in a 6-segment pipeline with a clock cycle of 10 ns. Determine the speed-up ratio of 4x pipeline for 100 tasks. What is the maximum speed-up that can be achieved

Sol In Non-pipeline

$$t_n = 50 \text{ ns}$$

In pipelining

$$t_p = 10 \text{ ns} \quad K = 6$$

$$m = 100$$

$$\begin{aligned}
 \text{Speed-up Ratio } (S_K) &= \frac{n t_m}{(K+n-1) t_p} \\
 &= \frac{100 \times 50}{(6+100-1) \times 10} \\
 &= \frac{100 \times 50}{105 \times 10} \\
 &= \frac{100}{27}
 \end{aligned}$$

$$S_K = 4.76$$

then maximum speed up (S) can be achieved when no. of task increases is

$$S = K \quad \text{as } n = \infty$$

so the maximum speed-up will be 6 i.e

$$S = 6$$

$$M.S$$

Ex-3 Consider the execution of Program 15000 instructions by a linear (one line in execute) Pipeline Processor with a clock rate of 25 MHz (megahertz). Assume that the instruction pipeline has 5-stages and that 1 (one) instruction is issued per clock cycle. The penalties due to branch instructions and out-of-sequence execution are ignored.

(a) Calculate the speed-up factor in using this pipeline.

(b) calculate the efficiency and throughput.

Given

Number of Instruction

$$n = 15000$$

frequency of the clock

$$f = 25 \text{ MHz}$$

Number of Pipeline Storage

$$K = 5$$

M.S

Instruction is issued per clock cycle

$$1 \text{ Hz} = 1 \text{ Per Sec}$$

$$f = \frac{1}{T} \quad (\text{Clock Period})$$

① Speed-up  $S_K$

$$S_K = \frac{T_1(NP)}{T_2(P)}$$

$$S_K = \frac{nK\tau}{(K+n-1)\tau}$$

$$S_K = \frac{nK}{K+n-1}$$

$$S_K = \frac{15000 \times 5}{5 + 15000 - 1}$$

$$S_K = \frac{75000}{15004}$$

$$S_K = 4.999$$

② efficiency  $E_K$

$$E_K = \frac{S_K}{K} = \frac{n}{K+n-1}$$

M.S

$$E_K = \frac{4.999}{5}$$

$$\boxed{E_K = 0.999}$$

③ Throughout  $H_K$

$$H_K = \frac{n f}{k+n-1}$$

$$H_K = \frac{n}{(k+n-1)T}$$

$$= \frac{n f}{k+n-1}$$

$$H_K = \frac{E_K}{T}$$

$$= \frac{n}{(k+n-1)T} = \frac{n f}{k+n-1}$$

$$= \frac{15000 \times 25}{5 + (15000 - 1)}$$

$$= \frac{15000 \times 25}{15004}$$

$$= \frac{375000}{15004}$$

$$\boxed{H_K = 24.99 \text{ MIPS}}$$

(million instructions per sec)

**M-S**

Ex-4 Consider a non-pipelined Processor ( $P_1$ ) with a clock rate of 25 MHz and average cycles per instruction (CPI) of 4. The same processor ( $P_2$ ) with 5 stages but due to the internal pipeline delay, the clock speed is reduced to 20 MHz. Assume there are no stalls in the pipeline.

(a) if a program of 100 instructions is to be executed on both processor what is speed-up of  $P_2$  compared to that  $P_1$ ?

(b) find (MIPS) (throughput find out) rate of  $P_1$  as well  $P_2$  during execution of the program.

Sol given

$$1 \text{ meg Hz} = 10^6 \text{ Hz}$$

Clock  
Per Instruction  $\downarrow$

$$\text{CPI} = 4$$

$$f_1 = 25 \text{ MHz} \\ = 25 \times 10^6 \text{ Hz}$$

$$n = 100$$

$$K = 5$$

total time for non-pipeline Processor

M.S

$$\begin{aligned}
 T_1 &= \frac{n \times CPI}{f_1} \\
 &= \frac{100 \times 4}{25 \times 10^6 \text{ Hz}} \\
 &= \frac{400}{25 \times 10^6 \text{ cycle/sec}} \\
 &= \frac{400 \text{ sec}}{25 \times 10^6 \text{ cycle}} \\
 &= \frac{400 \times 10^6 \mu\text{sec}}{25 \times 10^6 \text{ cycle}} \\
 &= 16 \mu\text{sec}
 \end{aligned}$$

No of instruction  
 $\frac{32 \times 4}{f}$   
 $1 \text{ Hz} = 1 \text{ cycle}$   
 $1 \text{ sec} = 1 \times 10^6 \mu\text{sec}$

total time in pipeline Processor  $T_2$

$$\begin{aligned}
 T_2 &= (K+n-1) \tau \\
 T_2 &= \frac{K+n-1}{f_2} \\
 &= \frac{5+100-1}{20 \times 10^6 \text{ Hz}}
 \end{aligned}$$

$$\tau = \frac{1}{f}$$

$1 \text{ Hz} = 1 \text{ cycle/sec}$   
 $1 \text{ sec} = 1 \times 10^6 \mu\text{sec}$

M.S

Ex-5 The time delay of 4 stages are  $T_1 = 6\text{ ns}$ ,  $T_2 = 7\text{ ns}$ ,  $T_3 = 9\text{ ns}$  and  $T_4 = 8\text{ ns}$  respectively and the interface latch has a delay of  $10\text{ ns}$  calculate

(a) clock Period

(b) frequency

(c) speed-up

Sol Latch is the high-speed register

(a) clock Period

$$\tau = \max \times \left\{ \sum_{i=1}^K T_i \right\} + d$$

$$\tau = T_m + d$$

$$\tau = 9\text{ ns} + 10\text{ ns}$$

$$\tau = 100\text{ ns}$$

d is latch  
delay time

where d = latch delay

$T_m$  = maximum stage delay

M.S



② frequency

$$\begin{aligned}
 f &= \frac{1}{\tau} \\
 &= \frac{1}{100} \text{ ns} \\
 &= \frac{1}{100 \times 10^{-9} \text{ sec}} \\
 &= \frac{1}{10^{-7} \text{ s}}
 \end{aligned}$$

$$f = 10^7 \text{ Hz}$$

frequency  $\text{Hz}^{0.8}$   
MHz

$$1 \text{ ms} = 1 \times 10^{-3}$$

$$1 \text{ CPS} = 1 \text{ Hz}$$

Cycle Per Sec

$$1 \text{ MHz} = 1 \times 10^6 \text{ Hz}$$

③ speed up =  $\frac{\text{non pipeline cycle time}}{\text{pipeline cycle time}}$

$$= \frac{T_1 + T_2 + T_3 + T_4}{\tau}$$

M.S

$$= \frac{60 + 50 + 90 + 80}{100}$$

$$S_k = 2.8 \text{ ns}$$

Ex-6 consider a 4-segment floating point adder Pipeline. The time delay of the 4-segment in the Pipeline are  $T_1 = 50 \text{ ns}$ ,  $T_2 = 30 \text{ ns}$   $T_3 = 95 \text{ ns}$  and  $T_4 = 45 \text{ ns}$ . The interface latch is 5 ns.

(a) How long would it take to add 100 pairs of numbers in the Pipeline.

(b) How can we reduce the total time one half of the time calculated in Part (a).

Sol

(a) Clock Period

$$T = \max \times \left\{ T_i \right\}_{i=1}^k + d$$

All Stage  
maximum td

$$T = 100 + 5$$

$$T = 95 + 5$$

$$T = 100 \text{ ns}$$

M.S

for  $m = 100$   $K = 4$   $T = 100\text{ns}$   
 time to add 100 numbers in pipeline

$$\begin{aligned} T_K &= (K+m-1) T \\ &= (4 + 100 - 1) \times 100 \\ &= (4 + 99) \times 100 \\ &= 10300\text{ns} \\ &= 10.3\text{μs} \end{aligned}$$

(b) to reduced total time period divided  
 Stage 3 into 2 stage.

$$T_3 = 95\text{ns} \quad \begin{array}{l} \xrightarrow{1} T_3 = 50\text{ns} \\ \xrightarrow{2} T_3 = 45\text{ns} \end{array}$$

so now total no of stages are 5  
 such as

$$T_1 = 50\text{ns} \quad T_2 = 30\text{ns} \quad T_{1,3} = 50\text{ns}$$

$$T_{2,3} = 45\text{ns} \quad \text{and} \quad T_m = 45\text{ns}$$

$$\begin{aligned} \text{Clock Period } (T) &= T_m + d \\ &= 50\text{ns} + 5\text{ns} \\ &= 55\text{ns} \end{aligned}$$

M.S

$$\begin{aligned}
 &= \frac{100 \times 20 \text{ MHz}}{(5 + 100 - 1) \text{ sec}} \\
 &= \frac{100 \times 20 \times 10^6}{(5 + 99) \text{ sec}} \\
 &= \frac{2000}{104} \text{ MIPS} \\
 &= 19.23 \text{ MIPS}
 \end{aligned}$$

(million of  
instructions per  
second).

Throughput of Non pipelined Processor

P<sub>2</sub>

$$WP_2 = \frac{1}{CPI \times T}$$

cycle per sec  
 $T = \frac{1}{f}$

$$= \frac{1 \times f}{CPI}$$

$$= \frac{1 \times 25 \times 10^6}{4}$$

$$= 6.25 \times 10^6 \text{ instruction/sec}$$

$$= 6.25 \text{ MIPS} \quad \underline{\text{Ap}}$$

M.S

$$= \frac{5 + 99}{20 \times 10^6 \text{ Hz}}$$

$$= \frac{104}{20 \times 10^6 \text{ cycles/sec}}$$

$$= \frac{104 \times 10^6 \mu\text{sec}}{20 \times 10^6 \text{ cycles}}$$

$$= 5.2 \mu\text{sec}$$

$$\begin{aligned} \text{Speedup factor} &= \frac{T_1}{T_2} \\ &= \frac{16 \times 10}{5.2} \\ &= 3.08 \end{aligned}$$

$$S_K = 3.08$$

(2) Throughput or ( $\omega$ )

$H_K$  = No of instruction executed/sec

Throughput for pipelined Processor P<sub>1</sub>

$$\omega_{P_1} = \frac{m f}{K + m - 1}$$

M.S

$$\text{total time } (T_k) = (k+m-1) \tau$$
$$= (5 + 104 - 1) \times 55$$
$$= 5720 \text{ ns}$$
$$= 5.720 \mu\text{s}$$

Result