

## Unit-1 Basic Structure of Computers

Computer Architecture :— It is concerned with the structure and behavior of the computer as seen by the user. It includes the instruction formats, the instruction set and techniques for addressing memory.

Computer Organization :— It is concerned with how the hardware components operate and the way they are connected together to form the computer system.

### Computer types :—

→ computers are differ in size, cost, computational power and intended use.

① Personal Computer/ Desktop

② Portable/ notebook computer.

③ Workstation

④ Enterprise systems & servers

⑤ Super computer.

→ Enterprise system or mainframes are used for business data processing in medium to large operations that require much more computing power and storage capacity than workstations can provide.

→ Servers contain sizable database storage units and are capable of handling large volumes of requests to access the data.

→ Supercomputers are used for the large scale numerical calculations required in applications such as weather forecasting and aircraft design and simulation.

# Basic Functional Units of a Computer:



→ A computer consists of five units:

- \* input unit
- \* output unit
- \* memory unit
- \* Arithmetic and logic unit
- \* Control unit.

## Input Unit:

- It is a medium of communication between the user and the computer.
- It accepts information from human, electronic, mechanical device or from other computers over digital communication lines.

Example: keyboard, mouse,

## Memory Unit:

- The information is stored in memory and later the information is used for ALU processing.
- It stores user programs and data as well as system programs.
- Memory unit consists of two types of memories.
- \* Read only memory (ROM)
  - \* Random access memory (RAM).
- ROM stores permanent data. Only read is possible
- RAM stores user program from where read and write can be possible.

ALU :- (Arithmetic and logic unit.)  
→ All arithmetic and logical operations performed within this unit.

CU :- (Control Unit)

→ This unit generates necessary timing and control signals to activate different blocks/units in the computer to perform the given task.

→ ALU and CU together is called CPU.  
It is heart of any digital computer.

Output Unit :-

→ It is a medium of ~~computer~~ communication between computer and the user.

→ It takes output from the computer.

Example! : Printer, Video display unit (VDU).

Program :- A list of instruction that performs a task is called a program.

→ source program  
(High level language)

Object Program  
(Machine language)  
program

→



$$1 \text{ Byte} = 8 \text{ bits}$$

Word :-  $n$  bits.



→ One basic operation can store or retrieve  $n$  bits or 1 word.

$$1 \text{ word} = 8 \text{ bits}$$

$$1 \text{ word} = 16 \text{ bits}$$

$$1 \text{ word} = 32 \text{ bit}$$

$$1 \text{ word} = 64 \text{ bit}$$

Address :— is associated with each word location.  
→ for storing or retrieving, the memory can be accessed only using address.



Word length :— No. of bits in each word is called word length.

Memory access time :— Time required to access one word is called memory access time.

⇒ The operation of a computer can be summarized :—

- \* The computer accepts information in the form of programs and data through an input unit and stores it in the memory.
- \* Information stored in the memory is fetched, under program control, into an arithmetic and logic unit, where it is processed.
- \* Processed information leaves the computer through an output unit.
- \* All activities inside the machine are directed by the control unit.



(Connection between processor and memory)

### IR (Instruction Register) :-

→ It holds the instruction that is currently being executed.

### PC (Program Counter) :-

→ It keeps track of the execution of a program.

→ It contains the memory address of the next instruction to be fetched and executed.

→ The contents of PC is updated to address of the next instruction during the execution of instruction.

→ General purpose registers ( $R_0 \dots R_{n-1}$ )  
Present in the processor. The intermediate result are stored into the general purpose registers.

→ Two registers (MAR, MDR) provides the communication with the memory.

MAR (memory address register) :- It holds the address of the location to be accessed.

→ MDR (memory data register) :- It contains the data to be written into or read out of the addressed location.

### Execution of a program :-

- PC points to the 1<sup>st</sup> instruction of the program.
- The content of PC transferred to MAR.  
 $(MAR \leftarrow PC)$
- and a read signal is sent to the memory.
- The data (instruction) present in the address specified in MAR are read out and loaded into MDR.

$MDR \leftarrow$  Data present in address  
[MAR]

- The contents of MDR are transferred to IR.  
 $IR \leftarrow MDR$
- Now the instruction is present in the IR and ready for decode and execution.

### Interrupt:-

- The device raises an interrupt signal when it needs urgent servicing. The normal execution of programs is interrupted.
- The processor provides the requested service by executing an appropriate interrupt service routine.

Instruction :-

Add LOCA, R<sub>0</sub> : R<sub>0</sub>  $\leftarrow$  D[LOCA] + R<sub>0</sub>

OR

Load LOCA, R<sub>1</sub> : R<sub>1</sub>  $\leftarrow$  D[LOCA]

Add R<sub>1</sub>, R<sub>0</sub> : R<sub>0</sub>  $\leftarrow$  R<sub>1</sub> + R<sub>0</sub>

## VON - NEUMANN vs. HARVARD Concept

### VON - Neumann architecture :-



→ In von-Neumann architecture, the same memory and bus are used to store both data and instructions that run the program.

→ So we can't access program memory and data memory simultaneously. This affects the system performance.

### Harvard architecture :-



- The Harvard architecture stores machine instructions and data in separate memory units that are connected by different buses.
- Computers designed with the Harvard architecture are able to run a program and access data simultaneously.

## Bus Structure

- A group of lines that serves as a connecting path for several devices is called a bus.

→ Bus carry the data, address and control.



→ Data Bus:- carries the data between the processor and other components. The data bus is bidirectional.

→ Address Bus:- carries memory addresses from the processor to other components such as primary storage and input/output devices. The address bus is unidirectional.

→ Control Bus:- carries the control signals from the processor to other components. The control bus also carries the clock's pulse. The control bus is unidirectional.



### single Bus :-



- All units are connected to a single bus.
- The bus can be used for only one transfer at a time, only two units can actively use the bus at any given time.
- Low cost and its flexibility for attaching peripheral devices.

### Multiple Bus :-



- achieves more concurrency in operations by allowing two or more transfers to be carried out at the same time.
- This leads to better performance but an increased cost.

## Memory Unit :-

① bit

② nibble → 4 bits

③ Byte(B) → 8 bits.

④ 1KB →  $1024 \text{ bytes} = 2^{10} \text{ Bytes}$ .

⑤ 1MB →  $1024 \text{ KB} = 2^{20} \text{ Bytes}$

⑥ 1GB →  $1024 \text{ MB} = 2^{30} \text{ Bytes}$

⑦ 1TB →  $1024 \text{ GB} = 2^{40} \text{ Bytes}$

(Represented in  
the form of  
power 2)

## Address Bus width:-

① If memory contain 16 location,  
we require 4 address unit. ( $2^4 = 16$ )



1/0    1/0    1/0    1/0

Address bus.

= 4 bits.

|                                                             |    |
|-------------------------------------------------------------|----|
| A <sub>3</sub> A <sub>2</sub> A <sub>1</sub> A <sub>0</sub> | 0  |
| 0 0 0 0                                                     | 1  |
| 0 0 0 1                                                     | 2  |
| 0 0 1 0                                                     | 3  |
| 0 0 1 1                                                     | 4  |
| 0 1 0 0                                                     | 5  |
| 0 1 0 1                                                     | 6  |
| 0 1 1 0                                                     | 7  |
| 0 1 1 1                                                     | 8  |
| 1 0 0 0                                                     | 9  |
| 1 0 0 1                                                     | 10 |
| 1 0 1 0                                                     | 11 |
| 1 0 1 1                                                     | 12 |
| 1 1 0 0                                                     | 13 |
| 1 1 0 1                                                     | 14 |
| 1 1 1 0                                                     | 15 |
| 1 1 1 1                                                     |    |

② If memory size 64MB

$$= 2^6 \times 2^{20} \text{ byte}$$

$$= 2^{26} \text{ byte} \quad (1 \text{ byte} = 2 \text{ locations})$$

Width of address bus is 26 bits.

Or If width of address bus is 26 bits.

$$\text{Then memory size } = 2^{26} \text{ byte} = 64 \text{ MB.}$$

Data bus width: -

→ If word length = n bits, then the processor is treated as n bit processor because it accesses at a time n bits from the memory.

Size of data bus = n bits. (as at a time n bits is transferred)

③ 32 bit processor means

width of data bus = 32 bits.

④ 16 bit processor means

width of data bus = 16 bits.

⑤ 32 bit processor having 64 GB memory. Find out the width of the address and data bus.

Ans: Width of data bus = 32 bits., word length = 32 bits.

$$\text{memory size} = 64 \text{ GB} = 2^6 \times 2^{30} \text{ bytes} = 2^{36} \text{ bytes.}$$

width of Address bus = 36 bits.

⑥ 64 bit processor having 128 TB memory.

Find out the width of address and data bus.

Ans: width of data bus = 64 bits.

width of Address bus = 47 bits.

$$\begin{aligned} 128 \text{ TB} \\ = 2^7 \times 2^{40} \text{ bytes} \\ = 2^{47} \text{ bytes} \end{aligned}$$

## Performance:-

- The most important measure of the performance of a computer is how quickly it can execute program.
- Three factors affect performance:
  - \* Hardware design
  - \* instruction set
  - \* Compiler.



→ Processor time to execute a program depends on the hardware involved in the execution of individual machine instrn.

## CLOCK:-

- Processor circuits are controlled by a timing signal called a CLOCK.
- The clock defines regular time interval called CLOCK cycles.
- To execute a machine instruction, the processor divides the action to be performed into a sequence of basic steps, such that each step must be completed in one cycle.

$$\rightarrow \text{CLOCK Rate (R)} = \frac{1}{P}$$

P: Length of one CLOCK Cycle

UNIT: Hertz  
No. of cycles per second

## Basic Performance equation :-

$$T = \frac{N * S}{R}$$

where

T :- processor time required to execute a program that has been prepared in high-level language.

N :- Number of actual instructions needed to complete execution of program. Some instr<sup>n</sup> may be executed more than once (in loop).

S :- average number of basic steps needed to execute one machine instruction. Each step completes in one clock cycle.

R :- clock rate.

→ To improve T means reduce the values of T.

- \* reducing N (if source program is compiled into fewer instruction)
- \* reducing S (if instr<sup>n</sup> have smaller no. of basic steps to perform)
- \* increasing R (using a higher frequency clock increase the value of R.)

Numericals :-

① system has 8 MHz processor. 1 inst<sup>n</sup> takes 7 clock cycles to be executed. What is the time required to complete 1 instruction?

Ans:-

8 MHz processor means

8M clock cycles per sec.

8M clock cycles  $\rightarrow$  1 sec.

$\Rightarrow$  1 clock cycle  $\rightarrow \frac{1}{8M}$  sec

1 inst<sup>n</sup> take 7 clock cycles  $\rightarrow \frac{7}{8M}$  sec

$\Rightarrow 7 \times \frac{1}{8M}$  usec

$\Rightarrow \frac{7}{8} \mu\text{sec}$ .

② For 8 GHz processor

1 clock cycle  $\rightarrow \frac{1}{8G}$  sec =  $\frac{1}{8 \times 10^9}$  sec

$\Rightarrow \frac{1}{8 \times 10^9}$  sec =  $\frac{1}{8}$  nsec.

③ For 8 KHz processor

1 clock cycle  $\rightarrow \frac{1}{8K}$  sec =  $\frac{1}{8 \times 10^3}$  sec =  $\frac{1}{8}$  msec

Q) A system has 100MHz processor. A program contains 50 instructions. Each instruction takes 10 clock cycle. Find out the performance of system?

Ans: Performance =  $\frac{1}{\text{CPU-Time}}$

$$T = \frac{N \times S}{R} = \frac{50 \times 10}{10^8 \text{ M}} = 5 \mu\text{sec.}$$

$$\text{Performance} = \frac{1}{5 \mu} = 0.2 \text{ M} = 2 \times 10^5$$

Q. There is two system. Both are executing a program.

- One system has 100MHz processor. Program contains 100 instructions. Each instruction takes 10 CC.

Other system has 150MHz processor. Programs 50 instruction. Each instruction takes 25 CC.

Then which system performance is better?

| Computer - 1                                               | Computer - 2                                                    |
|------------------------------------------------------------|-----------------------------------------------------------------|
| $\text{CPU-time}_1 = \frac{N \times S}{R}$                 | $\text{CPU-time}_2 = \frac{N \times S}{R}$                      |
| $= \frac{100 \times 10}{100 \text{ M}} = 10 \mu\text{sec}$ | $= \frac{50 \times 25}{150 \text{ M}} = \frac{25}{3 \text{ M}}$ |
|                                                            | $= 8.3 \mu\text{sec.}$                                          |
|                                                            | $\text{CPU-time}_2 < \text{CPU-time}_1$                         |
|                                                            | So Computer 2 performance is better than Computer 1.            |

## Other terminology related to performance

$$\text{Performance ratio} = \frac{\text{Performance 1}}{\text{Performance 2}} = \frac{\text{CPU-time}_2}{\text{CPU-time}_1}$$

If ratio > 1 then

Computer 1 is better than Computer 2

Otherwise

Computer 2 is better than Computer 1.

$$\text{Performance ratio} = \frac{\frac{N_2 \times S_2}{R_2}}{\frac{N_1 \times S_1}{R_1}}$$

CPI : Clock cycles per instruction.

$$\text{Total } \underset{\text{execute a program}}{\cancel{\text{No. of cycles}}} = N \times \underset{\substack{\downarrow \\ \text{No. of instructions}}}{\text{CPI}}$$

Pipelining

Execution of  
Each instruction is passed through different  
stages in sequentially.

\* Fetching (IF)

\* Decoding (ID)

\* Executing (EX)

\* Write back (WB)

→ If 1<sup>st</sup> instruction will execute o  
start

→ If 2<sup>nd</sup> instruction will execute after  
the completion of 1<sup>st</sup> instruction.



Then to execute 3 instruction it will  
take 12 cycle.

→ A substantial improvement in performance  
can be achieved by overlapping the execution  
of successive instructions, using a technique  
called Pipelining.



so to execute 3 instruction, it will  
take 6 cycles. (which improves the performance)

Pipelining Defn: Overlapping execution of successive instruction are called pipelining.

### Superscalar Operation

- Multiple functional units are used, so the multiple instruction can be executed parallel.
- A higher degree of concurrency can be achieved if multiple instruction pipelines are implemented.

|                 | 1               | 2               | 3               | 4 |
|-----------------|-----------------|-----------------|-----------------|---|
| IF <sub>1</sub> | ID <sub>1</sub> | EX <sub>1</sub> | WB <sub>1</sub> |   |
| IF <sub>2</sub> | ID <sub>2</sub> | EX <sub>2</sub> | WB <sub>2</sub> | S |
| IF <sub>3</sub> | ID <sub>3</sub> | EX <sub>3</sub> | WB <sub>3</sub> |   |
| IF <sub>4</sub> | ID <sub>4</sub> | EX <sub>4</sub> | WB              |   |

(Two FU's are present).

### Instruction Set: RISC vs CISC

RISC: Reduced Instruction Set Computer

CISC: Complex Instruction Set Computer

#### RISC

- Code size is large. ( $N \uparrow$ )
- No. of basic steps of an instruction is less. ( $S \downarrow$ )
- It takes one clock cycle.
- It supports pipeline.
- Only load and store are memory reference.
- requires more no. of general purpose register.

#### CISC

- code size is small. ( $N \downarrow$ )
- No. of basic steps of an instruction is more ( $S \uparrow$ )
- It takes multiple clock cycles.
- It does not support pipeline.
- Many instruction are memory reference.
- requires less no. of general purpose registers.