

Name - Mukesh Singh  
Class - B.Ed ICT Sec - 3<sup>rd</sup> Semester  
Roll. No - 7  
Subject - Microprocessor and Computer Organization  
Subject Teacher -  
College - Sukuna Multiple Campus

Microprocessor and Computer Organization  
Notes  
ICT 3<sup>rd</sup> Semester  
by

Mukesh Singh  
ICT 5<sup>th</sup> batch 2073

Sukuna Multiple Campus  
Sundarharaincha, Morang

# Introduction

## 1.1 Introduction and History of Microprocessors

A microprocessor is a computer processor which incorporates the functions of a computer's central processing unit (CPU) on a single integrated circuit (IC) or at most a few integrated circuits.

### # History of Microprocessors:

- Microprocessor revolution began in 1969 by Ted Hoff (Intel Engineer).
- The first microprocessor was introduced in 1971 by Intel Corporation. It was named Intel 4004 as it was 4 bit processor.
- It was a processor on a single chip and can perform simple arithmetic and logic operations such as addition, subtraction, multiplication and Boolean AND and Boolean OR.
- The first microprocessor was quite a success in industry. Soon other microprocessors were also introduced.
- The first 8 bit microprocessor which could perform arithmetic and logical operations on 8 bit words was introduced in 1973 again by Intel.

- This was intel 8008 and was later followed by an improved version, Intel 808~~0~~. Some other 8 bit processors are Zilog-80 and Motorola M6800.
- The 8-bit microprocessors were followed by 16 bit processors. They are intel 8086 and 80286.
- The 32 bit microprocessors were introduced by several companies but the most popular one is Intel 80386.

| Microprocessors | Introduction | Data Bus | Address Bus |
|-----------------|--------------|----------|-------------|
| 4004            | 1971         | 4        | 8           |
| 8008            | 1972         | 8        | 8           |
| 8080            | 1974         | 8        | 16          |
| 8085            | 1977         | 8        | 16          |
| 8086            | 1978         | 16       | 20          |
| 80186           | 1982         | 16       | 20          |
| 80286           | 1983         | 16       | 24          |
| 80386           | 1986         | 32       | 32          |
| 80486           | 1989         | 32       | 32          |
| Pentium         | 1993 onwards | 32       |             |
| Core Solo       | 2006         | 32       |             |
| Dual Core       | 2006         | 32       |             |
| Core to Duo     | 2006         | 32       |             |
| Core to Quad    | 2008         | 32       |             |
| i3, i5, i7      | 2010         | 64       |             |

## 1.2 Basic Block Diagram of a Computer

The basic block diagram of a Computer is given below:



Fig: Basic block diagram of a Computer

## 1.3 Organization of Microprocessor Based System



## 1-4 Bus Organization

The diagram of Bus Organization is shown below:



Fig: Bus System Organization

- \* Data Bus (16 bit) :> responsible for transmission of data i.e. carry data.
- \* Address Bus (20 bit) :> information regarding devices communicating with CPU.
- \* Control Bus :> carry control instruction

## 1.5 Stored Program Concept and Von Neumann Machine

Stored program technology was discovered in 1945 AD by John Von Neumann. This became the fundamental program technology for the modern digital computer. This technology made computer programming and computing faster, more flexible and more efficient. Program can be electrically stored in binary number format in a memory device so that instructions could be modified by the computer as determined by intermediate computational results.

The basic structure proposed in the draft became known as the "Von Neumann machine". It has following parts:

- a memory, containing instructions and data
- a processing unit, for performing arithmetic and logical operations
- a control unit, for interpreting instructions

MAR → Memory Address Register

MDR → Memory Data Register

Stores both  
program instruc-  
tions & data



Fig: Abstraction of Von Neumann

## 1.6. Processing Cycle of a Stored Program Computer



- Instructions are collection of bits
- Programs are stored in memory

### Steps

1. fetch an instruction from memory to the instruction register
2. increment the program counter register
3. decide the instruction
4. fetch operands, if any, usually from registers
5. perform the operation; this may modify the PC register
6. Store the results, usually to register



### 1.7 SSI, MSI, LSI circuits:

An integrated circuit (IC) is a set of electronic circuits on one small flat piece (or chip) of semiconductor material, normally silicon.

- The first integrated circuits contained only a few (numbering in tens) transistors and so were called "Small-Scale Integration (SSI)".  
E.g. gates, flip-flops etc.

- The increased in number of transistors to hundreds on a chip and so were called Medium-Scale Integration (MSI). Eg: 4 bit microprocessors.
- Large Scale integration (LSI) includes tens of thousands of transistors integrated on a single chip. Eg: 8-bit microprocessors, RAM, ROM etc

### 1.8 VLSI Technology:

VLSI is a technology by which 10000-1 Million transistors can be fabricated on a single chip.

Eg: 16-32 bit microprocessors.

Advantages of VLSI:

1. Reduces the size of circuits.
2. Reduces the effective cost of the devices.
3. Increases the operating speed of circuits.
4. Requires less power than discrete components.
5. Higher Reliability.
6. Occupies a relatively smaller area.

### 1.9 Introduction to Register Transfer Language:-

The symbolic notation describing Micro-Operation(MO) transfers among registers is called a register transfer language.

The term "register transfer" means the availability of hardware logic circuits that can perform a stated micro-operation and transfer the result of operation to the same or another register. The word "language" is borrowed from programmers who apply this term to programming languages.

A register is a group of flip-flops which is capable of storing one bit of information. Registers are designated by capital letters followed by optional number. Eg. PC, IR, R1, R2 etc.

#### Basic Symbols for Register Transfers

| Symbol               | Description                     | Examples                                 |
|----------------------|---------------------------------|------------------------------------------|
| Letters and Numerals | Denotes a register.             | MAR, R2                                  |
| (Parenthesis)        | Denotes a part of a register.   | R2(0-7), R2(L)                           |
| Arrow $\leftarrow$   | Denotes transfer of information | $R_2 \leftarrow R_1$                     |
| Comma ,              | Separates two micro-operations  | $R_2 \leftarrow R_1, R_1 \leftarrow R_2$ |

Examples

① Read :  $DR \leftarrow M[AR]$

→ Read memory word M from address AR to data register DR.

② Write :  $M[AR] \leftarrow R_1$

→ Write data from  $R_1$  to a memory word M at address AR.

## # Arithmetic micro-operations (Mano 1993)

| Symbolic designation                      | Description                                          |
|-------------------------------------------|------------------------------------------------------|
| $R_3 \leftarrow R_1 + R_2$                | Contents of $R_1$ plus $R_2$ transferred to $R_3$ .  |
| $R_3 \leftarrow R_1 - R_2$                | Contents of $R_1$ minus $R_2$ transferred to $R_3$   |
| $R_2 \leftarrow \overline{R_2}$           | Complements the contents of $R_2$ (1's complement)   |
| $R_2 \leftarrow \overline{R_2} + 1$       | 2's Complements the contents of $R_2$ (negate)       |
| $R_3 \leftarrow R_1 + \overline{R_2} + 1$ | $R_1$ plus the 2's complement of $R_2$ (subtraction) |
| $R_1 \leftarrow R_1 + 1$                  | Increment the contents of $R_1$ by one.              |
| $R_1 \leftarrow R_1 - 1$                  | Decrement the contents of $R_1$ by one.              |

## 2.1 Internal Architecture &amp; features of 8086

Microprocessor:



Fig: Internal Architecture of 8086 microprocessor

## # Features of 8086 Microprocessor:

- i) 8086 is a 16-bit microprocessor.
- ii) 8086 has 16-bit data bus, read or write data from/to memory and ports either 16 bit or 8 bits at a time.
- iii) 8086 has 20 bit address bus, so it can access  $2^{20}$  (1 MB) memory locations.
- iv) 8086 can generate 16-bit I/O address, hence it can access  $2^{16}$  I/O ports.
- v) 8086 provides 14 16-bit registers.
- vi) It is possible to perform bit, byte, word and block operations.
- vii) 8086 is designed to operate in two modes; minimum and maximum.
- viii) It supports multiprogramming.

### 2.1.1. BIU and Components:

BIU stands for Bus Interface Unit. It handles all data and addresses on the bus for the execution unit.

#### Functions of BIU

- Sends Address of memory / I/O.
- Fetches instructions from memory.
- Read/write data to/ from port / memory.
- Supports instruction queue and address relocation facility.

Components or parts of BIU:

1. Instruction Queue
2. Segment Registers
3. Instruction Pointer (IP)

### 1. Instruction Queue:

To increase the execution speed, BIU fetches as many as 6 instruction bytes ahead of time from memory. The pre-fetched instruction bytes are held for the EU in a first in first out group of registers called a instruction queue. When the EU is ready for its next instruction, it simply reads the instruction from this instruction queue. Fetching the next instruction while the current instruction executes is called pipelining.

### 2. Segment Registers:

The BIU contains four 16-bit registers. They are: Extra segment (ES) registers, Code segment (CS) registers, data segment (DS) registers, and stack segment (SS) registers. It holds the addresses of instructions and data in memory, which are used by the processor to access memory locations.

The registers that specify the location of segments are called segment registers.

i) Code Segment (CS):

The CS register is used for addressing a memory location in the Code segment of the memory, where the executable program is stored.

ii) Data Segment (DS):

It stores the starting address of data segment.

iii) Stack Segment (SS):

It handles memory to store data and addresses during execution.

iv) Extra Segment (ES):

ES is additional data segment used by some of the string to hold the extra destination data.



Fig: 8086 Memory Segment  
16/127

### 3. Instruction Pointer (IP):

In the BIU, the next register, below the segment register is instruction pointer. It is a 16-bit register used to hold the address of the next instruction to be executed.

#### 2.1.2 EU and Components:

EU stands for Execution Unit. It gives instructions to BIU stating from where to fetch the data and then decode and execute those instructions.

Its function is to control operation on data using the instruction decoder & ALU. EU has no direct connection with system buses, it performs operations over data through BIU.

Components or parts of EU:

1. Control Circuitry or System
2. Instruction Decoder
3. ALU
4. Flag Registers
5. General Purpose Registers
6. Stack Pointer Registers
7. Pointer and Index Registers

1. Control Circuitry:

It performs various internal operations.

## 2. Instruction Decoder:

It translates instructions fetched from memory to generate different internal or external control signals that required performing the operation.

## 3. ALU:

The EV has a 16-bit ALU, which performs basic Arithmetic and logical operations such as: addition (+), subtraction (-), multiplication (\*), division (/), OR, AND, NOT etc.

## 4. Flag Registers:

| BIT                | 15 | 14 | 13 | 12 | 11 | 10 | 9  | 8  | 7  | 6  | 5 | 4  | 3 | 2  | 1 | 0  |
|--------------------|----|----|----|----|----|----|----|----|----|----|---|----|---|----|---|----|
| U                  | U  | U  | U  | U  | OF | DF | JF | TF | SF | ZF | V | AF | V | PF | U | CF |
| <u>U=undefined</u> |    |    |    |    |    |    |    |    |    |    |   |    |   |    |   |    |

Carry flag: Set by carry out of MSB

Parity flag: Set if result has even parity

Auxiliary carry flag for BCD

Zero flag: Set if result = 0

Sign flag = MSB of result

Single step trap

Interrupt enable

String direction

Overflow

A 16-bit flag register is a flip-flop which indicates some condition produced by the execution of an instruction or controls certain operation of the EV. It has 9 flags and they are divided into two categories:

1. Conditional Flags
2. Control Flags

### 1. Conditional Flags:

Conditional flags represent result of last arithmetic or logical instructions.

#### (i) Carry Flag (CF):

This flag will set to 1 if ~~there~~ the arithmetic operation produces the carry in MSB position.

#### (ii) Parity Flag (PF):

It is set to 1 if result of ALU is even.

#### (iii) Auxiliary Flag (AF):

It is set to 1 if carry from lower ~~to~~ nibble to upper in addition, and borrow from upper to lower nibble in subtraction. Not available to programmers used for BCD operations.

#### (iv) Zero Flag (ZF):

It is set to 1 if result of ALU is zero otherwise it is reset.

## (v) Sign flag (SF):

It is set ~~to 1~~ if MSB bit is 1, indicates negative result.

## (vi) Overflow flag (OF):

This flag represents the result when the system capacity is exceeded.

## 2. Control flags:

Control flags control the operation of the execution unit (EU). Following is the list of control flags:-

## (i) Trap flag (TF):

It is used for debugging.

## (ii) Interrupt flag (IF):

It is an interrupt enable/~~flag~~ disable flag, i.e. used to allow/prohibit the interruption of program. It is set to 1 for interrupt enabled condition and set to 0 for interrupt disabled condition.

## (iii) Direction flag:

It is used in string operation. As the name suggests, when it is set then string bytes are accessed from the higher memory address to the lower memory address and vice-versa.

## 5. General Purpose Registers:

The 8086 has eight general purpose registers namely: AH, AL, BH, BL, CH, CL, DH and DL. These registers can be used individually for temporary storage of 8-bit data. The AL register is also called the accumulator. Certain pairs of these general purpose registers can be used together to store 16-bit data. The valid register pairs are: AH & AL, BH & BL, CH & CL and DH & DL. These register pairs are referred to the Ax, Bx, Cx and Dx respectively.

### (i) AX Register:

For 16-bit operations, Ax is called the accumulator register that stores operands for arithmetic operations.

### (ii) BX register:

This register is mainly used as a base register. It holds the starting base location of a memory region within a data segment.

### (iii) CX Register:

It is defined as a counter. It is primarily used in loop instruction to store loop counter.

#### (iv) DX Register:

This register is used to hold I/O port address for I/O instruction.

#### 6. Stack Pointer Register:

It is a 16-bit register, which holds the address from the start of the segment to the memory location, where a word was most recently stored on the stack. This most recently stored memory location is called the top of stack.

#### 7. Pointer and Index Registers:

The EU also contains a 16-bit source index (SI) register, base pointer (BP) register, and Destination Index (DI) registers. These three registers can be ~~primarily~~ mainly used for temporary storage of 16-bit data just like a general purpose registers.

### 2.5.3 EU and BIU Operations :

The BIU provides hardware functions including generation of the memory and I/O addressed for the transfer of data between itself and the outside world.

The Operations of BIU are :

- It provides a full 16 bit bidirectional data bus and 20 bit address bus.
- It sends address of memory or I/O.
- It fetches instruction from memory.
- It reads data from port/memory.
- It makes 8086's interface to the outside world.

The EV receives program instruction codes and data from the BIU, executes these instructions, and stores the results in the general registers.

The Operations of EU are:

- The EU is responsible for decoding and executing all instructions.
- The EU extends instructions from the top of the queue in the BIU.
- If the queue is empty, the EU waits for the next instruction byte to be fetched and shifted to top of the queue.
- It tells from where to fetch instructions or data, decodes instructions & execute instructions.

### 2.1.4. Segment and Offset Address:

Addresses in 8086 are 20 bit long. Addresses are computed from a segment start and an offset, both are 16 bit. The 16-bit segment start is shifted left by 4, then offset is added. These two operations are performed in a 20 bit register, so the result is a 20 bit address.

Each logical address consists of two parts : a 16-bit segment and a 16-bit offset. Physical addresses are generated by adding the segment register (not shifted) and outputting the 20-bit result. Thus, segment 0000 [all numbers in hex] offset 0000 to FFFF represent physical addresses 00000 to 0FFFF. Adding those offsets to various other segments will yield other address ranges:

| <u>Segment</u> | <u>Physical addressing</u> | <u>Range</u>                                                 |
|----------------|----------------------------|--------------------------------------------------------------|
| 0000           | 00000                      | - 0FFFF                                                      |
| 0001           | 00010                      | - 1000F                                                      |
| 0002           | 00020                      | - 1001F                                                      |
| 0FFF           | 0FFF0                      | - 1FFFF                                                      |
| 1000           | 10000                      | - 1FFFF                                                      |
| FFFF           | FFFF0                      | - OFFEF (or 10FFEF if higher physical addressing bit exists) |

## 2.2 Addressing modes of 8086:

Addressing modes indicates the way of locating data as operands. The different ways in which a source operand is denoted in an instruction is known as addressing modes. There are 8 different addressing modes in 8086 microprocessor:

### 1. Immediate addressing mode:

The addressing mode in which data operand is a part of the instruction itself is known as immediate addressing mode.

Example: MOV Ax, Mov AL etc

### 2. Register addressing mode:

It means that the register is the source of an operand for an instruction

Example:- Mov CX, AX etc

### 3. Direct addressing mode:

The addressing mode in which the effective address of the memory location is written directly in the instruction is direct addressing mode.

Example: MOV Ax, [5000H] , MOV AL,[300H]

#### 4. Register indirect addressing mode:

This addressing mode allows data to be addressed at any memory location through an offset address held in any of the following registers: BP, BX, DI & SI.

Example: `Mov AX, [BX]`

#### 5. Based addressing mode:

In this addressing mode, the offset address of the operands is given by the sum of contents of the BX / BP registers and 8-bit / 16-bit displacement.

Example: `Mov ox, [BX + 4]`

#### 6. Index - addressing mode :

In this addressing mode, the operand offset address is found by adding the content of SI or DI register and 8-bit / 16-bit displacements.

Example: `Mov BX, FS + 16, ADD AL, [DI + 16]`

#### 7. Based-index addressing mode:

In this addressing mode, the offset address of the operand is computed by summing the base register to the contents of an index register.

Example: `ADD CX, [AX + SI], Mov AX, [AX + DS]`

### 8. Based-index with displacement mode:

In this addressing mode, the operands offset is computed by adding the base register contents, An index register contents and 8 or 16-bit displacement.

Example: - MOV AX, [BX + DS + 08],  
ADD CX, [BX + SI + 16]

### 2.3. Assembly Language Programming:

Assembly language is a special type of low level language which consists set of alphanumeric instructions, called mnemonics. It was developed by Bob Nelson, Jack Powell, Clement and Michael in IBM laboratory.

Programming could easily be done using alphanumeric symbols instead of 0s and 1s.

#### Advantages

- It is easier to understand & more standard form.
- It is easier to find errors because of translator program.

#### Disadvantages

- They are machine dependent.
- Program execution is slower and less efficient than machine language.

## 2.4. High level Versus Low Level Programming :

### High level Programming

1. High level programming languages are platform independent.
2. High level language programs are slower than low level language as they need to convert.
3. High level languages are easy to learn.
4. High level languages are near to human languages.
5. Programs in high-level languages are easy to modify.
6. Knowledge of hardware is not required to write programs.

### Low level Programming

1. Low level programming languages are platform dependent.
2. Low level language programs are faster than high level language programs as they don't need to convert.
3. Low level languages are difficult to learn.
4. Low-level languages are far from human languages.
5. Program in low-level language are difficult to modify.
6. Deep knowledge of hardware is required to write programs.

- |                                                                           |                                                                            |
|---------------------------------------------------------------------------|----------------------------------------------------------------------------|
| 7. These languages are normally used to write application programs.       | 7. These languages are normally used to write hardware programs.           |
| 8. It is divided into 3 types :- 3GL, 4GL and 5GL.                        | 8. It is divided into two types :- Machine language and Assembly language. |
| 9. Examples:- C, C++, Java, Cobol, FORTRAN, PASCAL, Perl, BASIC, PHP etc. | 9. Examples:- C, C++, machine language, assembly language etc.             |

## 2.5. Assembly Language Syntax:

The above topic describes the basic lexical elements of assembly language programming, and explains how those elements combine to form complete assembly language expressions. It also explains how sequences of expressions are put together to form the statements that make up an assembly language program.

The syntax of an assembly language

### 2.5.1 Comments

### ~~2.5.1.~~ Comments:

Assembly language comments begin with a semicolon (;). It may contain any printable character including blank. It can appear on a line by itself, like

; This program displays a message on screen  
Or, on the same line along with an instruction, like-

[add eax, ebx ; adds ebx to eax]

### 2.5.2 Reserved words:

Reserved word is a word that cannot be used as an identifier, such as the name of a variable, function, or label - it is "reserved from use". Reserved words are not case sensitive except for predefined symbols. The assembler generates an error if we use a reserved words as a variable.

Examples of reserved words are:-

Operands - \$, ?, @R, BASIC, DWORD, SYSCALL,  
PASCAL, FORTRAN.

Predefined symbols :- @CatStr\*, @Code, @Line,  
@FileName, @Stack\* etc

Registers :- AH, AL, AX, BH, BL, BX, CH, CL, CX,  
etc

### 2.5.3. Identifiers:

An identifier is a name that we invent and attach to a definition. Identifiers can be symbols representing variables, constants, procedure names, code labels, and user defined data types such as structure, unions, records and types defined with TYPEDEF. Identifiers longer than 247 characters generate an error.

- The first character of the identifier can be an alphabetic character (A-Z) or any of these four characters : @, -, \$, ?
- The other characters in the identifier can be any of the characters listed above or a decimal digit (0-9).

### 2.5.4. Statements:

Statements are the line-by-line components of source files. Each statement specifies an instruction or directive for the assembler.

Statements have up to four fields : [[name:]] [operation] [operands] [t; comment].

### 2.5.5. Directives:

~~Directive is optional and specifies the title of the program. Like a comment, it has no effect on the program. It is just~~

used to make the program easier to understand.

### 2.5.5 Directives:

Unlike instructions being compiled and written to chip program memory, directives are commands of assembly language itself and have no influence on the operation of the microcontroller. Some of them are obligatory part of every program while some are used only to facilitate or speed up the operation. Directives are written in the column reserved for instructions. There is a rule following only one directive per program line.

### 2.5.6 Operators:

The assembler supports the following operators for use in expressions. Operators have no assigned precedence. Expression can be grouped in square brackets ([ ]) to establish precedence.

Addition (+), Subtraction (-), multiplication (\*), division (/), Bitwise logical AND (&), Bitwise logical OR (|), Shift right (">>), shift left ("><) etc.

Note: → The asterisk (\*), slash (/), and percent sign (%) characters are overloaded. When used as operators in an expression, these characters must

be preceded by the backslash (\).

### 2.5.7. Instructions:

There exist around 150 instructions for the 8086 processor. These instructions may be classified into different classes. The following table summarizes the different classes of instructions, together with examples of each class.

| Instruction Type               | Definition                                                                     | Examples                                  |
|--------------------------------|--------------------------------------------------------------------------------|-------------------------------------------|
| Data transfer instructions     | Transfer information between registers & memory locations or I/O ports.        | Mov, LEA, Pop                             |
| Arithmetic instruction         | Perform arithmetic operations on binary or binary-coded-decimal (BCD) numbers. | ADD, SUB, INC, DEC                        |
| Bit manipulation instructions  | Perform shift, rotate & logical operations on memory locations & registers.    | SHL, SHR, SAR, ROL                        |
| Control transfer instructions  | Control sequence of program execution. Include jumps & procedure transfers.    | JL, JE, JNE, JGE                          |
| String handling instructions   | Move, compare & scan strings of data.                                          | MOVSB, CMPS, MOVSW                        |
| Processor control instructions | Set and clear status flags, and change the processor execution state.          | <del>INT, INTO, INT3, STC, STD, STJ</del> |

No P, W, D

2.

Miscellaneous  
instructionsInterrupt  
instructionsInterrupt processor to service  
a specific conditionINT, INTO,  
IRET

## 2.6. EXE and COM Programs:

### # .EXE files

- .EXE is executable file which is a main entry point for execution of a program.
- .EXE code files use RAM more efficiently.
- Data and Code occupy separate segments.
- The programmer is responsible for setting up the data and code segments properly.

### # .COM files:

- The .COM file format is a historic file format since the first version of MS-DOS.
- All code, data and stack occupy one 64k segment.
- The COM format is the original binary executable format used in Control Program for Microcomputers and MS-DOS.
- It is very simple.
- It has no header (with the exception of CP/M 3 files), and contains only code and data.

## 2.1. Assembling, Linking and Executing:

### 1. Assembling:

- Assembling converts source program into object program of syntactically correct and generates an intermediate .obj file or module.
- It calculates the offset address for every data item in data segment and every instruction in code segment.
- A header file created which contains the incomplete address in front of the generated obj module during the assembling.
- Assembler complains about the syntax error if any and does not generate the object module.
- Assembler creates .obj, .lst and .crf files and last two are optional files that can be created at run time.
- For short programs, assembling can be done manually where the programmer translates each mnemonic into machine language using lookup table.
- Assembler reads each assembly instruction of

a program as ASCII character and translate them into respective machine code.

### Assembler Types :

#### a) One pass assembler

It scans the assembly language program once and converts to object code at the same time.

#### b) Two pass assembler:

It scans the assembly language twice.

### 2. Linking:

→ This involves the converting of .OBJ module into .EXE (executable) module i.e. executable machine code.

→ It completes the address left by the assembler.

→ It combines separately assembled object files.

→ Linking creates .EXE, .LIB & .MAP files among which last two are optional files.

### 3. Executing:

→ It loads the program in memory for execution.

→ It reserves remaining address.

→ This process creates the program segment prefix (PSP) before loading.

→ It executes to generate the result.

Sample program  $\xrightarrow{\text{assembling}}$  Object program  $\xrightarrow{\text{Linking}}$  executable program

## 2.8 One pass and two pass assemblers:

### 1 One pass Assemblers

→ This assembler scans the assembly language program once and converts to object code at the same time.

→ This assembler has the program of defining forward references only.

→ This jump instruction uses an address that appears later in the program during scan, for that case, the programmer defines such addresses after the program is assembled.

### 2. Two pass assembler:

→ This type of assembler scans the assembly language twice.

→ First pass generates symbol table of names and labels used in the program and circulates their relative address.

→ The table can be seen at the end of the list file and here we need not define anything.

→ Second pass uses the table constructed

in the first pass and completes the object code creation.

→ The assembler is more efficient and easier than earlier.

## 2.9 Keyboard and Video Services:

The intel CPU uses two types of interrupt: hardware and software interrupt. The common interrupts are: 10H, is used for video services and 21H, is used for DOS.

### # INT 21H:

- It is called the DOS function call for keyboard operation. The different services under it are:

i) 00H :- It terminates the current program. However, it is generally not used instead 4CH is used.

ii) 01H :- It reads the character with echo. It waits for the character if the buffer is empty. The character that is read is returned in AL in ASCII value.

Eg:

- data

char DB ?

.code  
MOV AH, 01H  
INT 21H  
MOV char AL

iii) 02H : It is used to display the single character.  
The character is sent to DL for display.

Eg:

MOV AH, 02H  
MOV DL, 'B'; Returns the ASCII value of B To DL  
i.e 66  
INT 21H

iv) 03H and 04H :- used as auxiliary input output.  
However, INT 14H is preferred instead.

v) 05H :- Also called printer service as it sends the character in DL to printer.

Eg:

:data

.code

MOV AH, 05H ; select printer output

MOV DL, 65 ; character to be printed

vi) 0CH : Clears the keyboard buffer and invoke input function such as 03, 06, 07, 08 or 0A.  
AL will contain the input function.

## # INT 10H :

It is called video display control. It controls the screen format. It controls the screen format color, text style, making window scrolling etc. The control functions listed below are contained in AH register.

- i) 00H :- sets the video mode
- ii) 01H :- Set cursor lines
- iii) 02H :- Set cursor position
- iv) 03H :- Get cursor position and size
- v) 06H :- Scroll window up
- vi) 07H :- Scroll window down
- vii) 08H :- Read character and attribute
- viii) 09H :- Write character and attribute
- ix) 0AH :- Write character
- x) 10H (AL=03H) :- Toggle blinking/intensity bit
- xi) 0FH :- Get video mode

## 2.11. Pin Configuration of 8086 Microprocessor:

2.

# 1. Minimum Mode:

|    |                 |                    |    |
|----|-----------------|--------------------|----|
| 16 | AD <sub>0</sub> | AD <sub>8</sub>    | 8  |
| 15 | AD <sub>1</sub> | AD <sub>9</sub>    | 7  |
| 14 | AD <sub>2</sub> | AD <sub>10</sub>   | 6  |
| 13 | AD <sub>3</sub> | AD <sub>11</sub>   | 5  |
| 12 | AD <sub>4</sub> | AD <sub>12</sub>   | 4  |
| 11 | AD <sub>5</sub> | AD <sub>13</sub>   | 3  |
| 20 | AD <sub>6</sub> | AD <sub>14</sub>   | 2  |
| 9  | AD <sub>7</sub> | AD <sub>15</sub>   | 39 |
| 15 | CLK             | A <sub>16/53</sub> | 38 |
| 31 | HOLD            | A <sub>17/54</sub> | 37 |
| 18 | INTR            | A <sub>18/55</sub> | 36 |
| 33 | MN              | A <sub>19/56</sub> | 35 |
| 17 | NMS             | ALE                |    |
| 22 | READY           | BHE <sub>157</sub> |    |
| 21 | RST             | DEN                |    |
| 23 | TEST            | DT/R               |    |
|    |                 | HLDA               |    |
|    |                 | JNTA               |    |
|    |                 | M/IO               |    |
|    |                 | RD                 |    |
|    |                 | WR                 |    |

Fig (g) 8086 minimum mode

- In minimum mode, there can be only one processor i.e. 8086.
- $MN/M\bar{X}$  is 1 to indicate minimum mode.
- ALE for the latch is given by 8086 as it is the only processor in the circuit.
- $\overline{DEN}$  and  $DT/R$  for the trans-receivers are given by 8086 itself.
- Direct control signals  $M/I\bar{O}$ ,  $\overline{RD}$  and  $\overline{WR}$  are given by 8086.
- Control signals  $M/I\bar{O}$ ,  $\overline{RD}$  and  $\overline{WR}$  are decoded by a 3:8 decoder like 74138.
- $\overline{INTA}$  is given by 8086 in response to an interrupt on  $INTR$  line.
- HOLD and HLD signals are used for bus request with a DMA controller like 8237.
- The circuit is simpler.
- Multiprocessing cannot be performed, hence performance is lower.

## 2. Maximum Mode:

|    |                                  |                    |    |
|----|----------------------------------|--------------------|----|
| 26 | AD <sub>0</sub>                  | AD <sub>8</sub>    | 8  |
| 25 | AD <sub>1</sub>                  | AD <sub>9</sub>    | 1  |
| 24 | AD <sub>2</sub>                  | AD <sub>10</sub>   | 6  |
| 23 | AD <sub>3</sub>                  | AD <sub>11</sub>   | 5  |
| 22 | AD <sub>4</sub>                  | AD <sub>12</sub>   | 4  |
| 31 | AD <sub>5</sub>                  | AD <sub>13</sub>   | 3  |
| 10 | AD <sub>6</sub>                  | AD <sub>14</sub>   | 2  |
| 9  | AD <sub>7</sub>                  | AD <sub>15</sub>   | 31 |
| 29 | → CLK                            | A <sub>16/52</sub> | 38 |
| 18 | INTR                             | A <sub>17/54</sub> | 37 |
| 33 | MX                               | A <sub>18/55</sub> | 36 |
| 17 | NMI                              | A <sub>19/56</sub> | 35 |
| 22 | READY                            | S <sub>0</sub>     | 26 |
| 31 | R <sub>9/47</sub> T <sub>0</sub> | S <sub>1</sub>     | 27 |
| 30 | R <sub>9/47</sub> T <sub>1</sub> | S <sub>2</sub>     | 28 |
| 21 | RST                              | RHE/S <sub>7</sub> |    |
| 23 | TEST                             | LOCK               |    |
|    |                                  | QS <sub>0</sub>    |    |
|    |                                  | QS <sub>1</sub>    |    |
|    |                                  | RD                 |    |

Fig:(b) 8086 Maximum mode

- In maximum mode, there can be multiple processors with 8086, like 8087 & 8089.
- MN/ $\overline{R_X}$  is 0 to indicate maximum mode.
- ALE for the latch given by 8288 bus Controller.
  - a) There can be multiple processor in the circuit.
- DT/ $\overline{R}$  for the trans-receivers are given by 8288 bus status signals called  $\overline{S_2}$ ,  $\overline{S_1}$  and  $\overline{S_0}$ .
- Status signals  $\overline{S_2}$ ,  $\overline{S_1}$ ,  $\overline{S_0}$  are decoded by a bus controller like 8288 to produce control signals.
- $\overline{\text{INTA}}$  is given by 8288 bus controller in response to an interrupt on INTR line.
- $\overline{RD}$  /  $\overline{GT}$  lines are used for bus requests by other processor like 8087 or 8089.
- The circuit is more complex.
- As multiprocessing can be performed, it can give very high performance.

## 2.12 Bus Structures:

### 2.12.1. Synchronous Bus



Fig: Synchronous ~~Parity~~ Program Bus

- Transmitters and receivers are ~~not~~ synchronized by clock.
- Data bits are transmitted with synchronization of clock.
- Character is received at constant rate.
- Data transfer takes place in blocks.
- Start and stop bit are required to establish communication of each character.

- Used in high-speed transmission.

### 2.12.2 Asynchronous Bus



Fig: Asynchronous Bus

- Transmitters and receivers are not synchronized by clock.
- Bits of data are transmitted at constant rate.
- Character may arrive at any rate at receiver.
- Data transfer is character oriented.
- Start and stop bits are required to establish communication of each character.
- Use in low-speed transmission.

Central Processing Unit3. CPU Structure and function.

Central Processing Unit (CPU) is the portion of a computer system that carries out the instructions of a computer program, and is the primary element carrying out the computer's function. The CPU carries out each instructions of the program in sequence to perform basic arithmetical, logical and input/output operations of the system. A CPU on single chip is called Microprocessor.



Fig: Presenting CPU in Computer System

Primary Components of CPU are:

i) Registers:

Registers are primary memory of computer system. They are mainly used to store data during the time of processing inside ALU.

- They are fastest, less capacity and temporary memory.

ii) Control Unit:

The Control Unit is the brain of the CPU itself. It is situated inside the processor. It controls overall operations and devices of computer. It executes the computer instructions. It directs the flow of data in the computer i.e. from input device to primary memory and from primary memory to output device & so on.

Two types:

- Hardwired Control Unit
- Micro programmed Control Unit.

iii) Arithmetic & Logic Unit

- ALU performs basic arithmetical operations such as addition, subtraction, multiplication and division.
- It performs ~~basic~~ logical operations such as

as comparing greater than, equal to etc.

### 3.1 CPU Structure and Function



Fig: Internal Structure of CPU

Functions of CPU:

- Fetch Instructions
- Fetch Data
- Interpret Instructions
- Process Data
- Write Data

### 3.2 ~~ALU~~ Arithmetic and Logic Unit

An arithmetic and logic unit (ALU) is a digital electronic circuit that performs arithmetic and logical operations on data. All of the other elements of computer system - control unit, registers, memory, I/O are mainly to bring data into the ALU for it to process and then to take the result back out. Data are presented to ALU in register and the result of operation is stored in register.



Fig: Block diagram of ALU

Consider an ALU having four Arithmetic

Operations ~~and~~ :- Addition, subtraction, Multiplication and division. Also consider ALU having four logical operations : OR, AND, NOT, EX-OR.

We need three control lines to identify any one of these operations. Control lines are used to identify any one of the four operations in a group. One possible combination is given below:

| $C_1$ | $C_0$ | Arithmetic ( $C_2 = 0$ ) | Logical ( $C_2 = 1$ ) |
|-------|-------|--------------------------|-----------------------|
| 0     | 0     | Addition                 | OR                    |
| 0     | 1     | Subtraction              | AND                   |
| 1     | 0     | Multiplication           | NOT                   |
| 1     | 1     | Division                 | EX-OR                 |

A 3-to-1 decoder is used to decode the instructions as given above in the block diagram.

### 3.3. Stack

The stack is a block of memory that may be used for temporarily storing the contents of the registers inside the CPU. It is top-down data structure whose elements are accessed using the stack pointer (SP) which gets decremented by 2 as we store a data word into the stack and gets incremented by +2 as we retrieve a data word from the stack back to the CPU register.

The process of ~~transferring~~ storing the data in the stack is called 'pushing into' the stack and its

The reverse process of transferring the data back from the stack to the CPU register is known as 'popping off' the stack.

The stack is essentially Last-In-first-Out (LIFO) data segment. This means that the data which is pushed into the stack last will be on top of stack and will be popped off the stack first.

The stack pointer (SP) is a 16-bit register that contains the offset address of the memory location in the stack segment.

The stack segment (SS) register contains the base address of the stack segment in the memory.

The stack segment register (SS) and Stack Pointer (SP) register together address the stack-top as explained below:

$$SS \rightarrow 5000H$$

$$SP \rightarrow 2050H$$

### 3.4. Processor Organization:

Things CPU must do :

- Fetch instruction:  
The processor reads an instruction from memory (register, cache, main memory).
- Interpret instruction:  
The instruction is decoded to determine what action is required.
- fetch data:  
The execution of an instruction may require reading data from memory or an I/O module.
- Process data:   
The execution of an instruction may require performing some arithmetic or logical operation on data.
- Write data:  
The results of an execution may require

writing data to memory on I/O module



Fig: Simplified view  
of Processor

### Components of Processor:

- ALU does the actual computation of processing of data.
- The Control Unit controls the movement of data and instructions into and out of the processor and controls the operations of the ALU.
- Register consists of set of storage locations.

### 3.6 Instruction formats:

Instruction format of a computer instruction usually contains 3 fields: operation code(opcode), address field and mode field. The number of address fields in the instruction formats depends on the internal organization of CPU. On the basis of no. of address field, we can categorize the instruction as below:

#### i) Three address instructions:

Computers with three address instruction can use each address field to specify either a processor register or a memory operand.

Assembly language program to evaluate

$$X = (A+B) * (C+D)$$

ADD R<sub>3</sub>, A, B

format: Op X, Y, Z ; X  $\leftarrow$  Y op Z

Example: ADD X, Y, Z ; X  $\leftarrow$  Y + Z

Advantage: It results in short programs when evaluating arithmetic expressions.

Disadvantage: The instructions requires too many bits to specify 3 addresses.

### ii) Two address instructions:

Two address instructions are most common in commercial computers. Here again each address field can specify either a processor register, or a memory word. One address must do double duty as both operand and result.

Format:  $Op\ X, Y; \quad X \leftarrow X \text{ op } Y$

Example:  $SUB\ X, Y; \quad X \leftarrow X - Y$

Advantage - Tries to minimize the size of instruction.

Disadvantage - Size of program is relatively larger.

### iii) One-address instructions:

One-address instructions uses an implied accumulator (AC) register for all data manipulation. All operations are done between AC and memory operand.

Format:  $Op\ X; \quad AC \leftarrow AC \text{ op } X$

Example:  $MUL\ X; \quad AC \leftarrow AC * X$

Advantage - Memory access is only limited to load and store.

Disadvantage:- Larger program size.

## iv) Zero-address instructions:

A stack organized computer uses this type of instructions. The name "zero-address" is given to this type of computer because of the absence of an address field in the computational instructions.

format :  $TOS \leftarrow TOS \oplus (TOS - 1)$

Example : DIV;  $TOS \leftarrow TOS \text{ DIV } (TOS - 1)$

Assembly programs to evaluate the following instructions.

$X = (A + B) * (C + D)$  using zero, one, two or three

## i) Three address instructions:

ADD R<sub>1</sub>, A, B ;  $R_1 \leftarrow M[A] + M[B]$

ADD R<sub>2</sub>, C, D ;  $R_2 \leftarrow M[C] + M[D]$

MUL X, R<sub>1</sub>, R<sub>2</sub>;  $M[X] \leftarrow R_1 * R_2$

## ii) Two address instructions:

MOV R<sub>1</sub>, A;  $R_1 \leftarrow M[A]$

ADD R<sub>1</sub>, B;  $R_1 \leftarrow R_1 + M[B]$

MOV R<sub>2</sub>, C;  $R_2 \leftarrow \cancel{R_2 + M[C]} \rightarrow M[C]$

ADD R<sub>2</sub>, D;  $R_2 \leftarrow R_2 + M[D]$

MUL R<sub>1</sub>, R<sub>2</sub>;  $R_1 \leftarrow R_1 * R_2$

MOV X, R<sub>1</sub>;  $M[X] \leftarrow R_1$

### iii) One address instructions:

LOAD A;  $A_c \leftarrow M[A]$   
ADD B;  $A_c \leftarrow A_c + M[B]$   
STORE T;  $M[T] \leftarrow A_c$   
LOAD C;  $A_c \leftarrow M[C]$   
ADD D;  $A_c \leftarrow A_c + M[D]$   
MULT;  $A_c \leftarrow A_c * M[T]$   
STORE X;  $M[X] \leftarrow A_c$

Here, T is temporary memory location required for storing the intermediate result.

### iv) Zero address instructions:

PUSH A;  $TOS \leftarrow A$   
PUSH B;  $TOS \leftarrow B$   
ADD ;  $TOS \leftarrow (A+B)$   
PUSH C;  $TOS \leftarrow C$   
PUSH D;  $TOS \leftarrow D$   
ADD ;  $TOS \leftarrow (C+D)$   
MUL;  $TOS \leftarrow (C+D) * (A+B)$   
POP X;  $M[X] \leftarrow TOS$

### 3.8 Data Transfer and Manipulation:

Computer gives extensive set of instructions to give the user the flexibility to carry out various computational tasks. The actual operations in the instruction set are not very different from one computer to another although binary encodings and symbol name (operation) may vary. So, most computer instructions can be classified into

3 categories:

1. Data transfer instructions
2. Data manipulation instructions
3. Program Control instructions

#### 1) Data transfer instructions:

Data transfer instructions cause transfer of data from one location to another without changing the binary information. The most common transfers are between the:

- Memory and Processor registers
- Processor registers and input output devices
- Processor registers themselves

Typical data transfer instructions:

| Name     | Mnemonic |
|----------|----------|
| Load     | LD       |
| Store    | ST       |
| Move     | MOV      |
| Exchange | XCH      |

|        |      |
|--------|------|
| Input  | IN   |
| Output | OUT  |
| Push   | PUSH |
| Pop    | POP  |

## 2) Data manipulation instructions:

Data manipulation instructions perform operations on data and provide the computational capabilities for the computer. These instructions perform arithmetic, logical and shift operations.

### Arithmetic instructions

| Name                    | Mnemonic |
|-------------------------|----------|
| Increment               | INC      |
| Decrement               | DEC      |
| Add                     | ADD      |
| Subtract                | SUB      |
| Multiply                | MUL      |
| Divide                  | DIV      |
| Add with carry          | ADC      |
| Subtract with borrow    | SUBB     |
| Negate (2's Complement) | NEG      |

### Logical and Bit manipulation Instructions:

| Name       | Mnemonic |
|------------|----------|
| Clear      | CLR      |
| Complement | COM      |
| AND        | AND      |
| OR         | OR       |

|                   |      |
|-------------------|------|
| Exclusive OR      | XOR  |
| Clear Carry       | CLRC |
| Set Carry         | SETC |
| Complement Carry  | COMC |
| Enable interrupt  | EI   |
| Disable interrupt | DI   |

### Shift instructions

| Name                       | Mnemonic |
|----------------------------|----------|
| Logical Shift Right        | SHR      |
| Logical Shift Left         | SHL      |
| Arithmetic Shift Right     | SHRA     |
| Arithmetic Shift Left      | SHLA     |
| Rotate right               | ROR      |
| Rotate left                | ROL      |
| Rotate right through Carry | RORC     |
| Rotate left through Carry  | ROLC     |

### 3) Program Control Instructions

The program control instructions provide decision making capabilities and change the path taken by the program when executed in computer. These instructions specify conditions for altering the content of the program counter. Some typical program control instructions are:

| Name                     | Mnemonic |
|--------------------------|----------|
| Branch                   | BR       |
| Jump                     | JMP      |
| Skip                     | SKP      |
| Call                     | CALL     |
| Return                   | RET      |
| Compare (by subtraction) | CMP      |
| Test (by Avering)        | TST      |

### 3.9. RISC and CISC:

#### A. RISC

RISC is acronym for Reduced Instruction Set Computer. In this architecture a microprocessor is designed to perform a smaller number of types of instructions so that it can operate at a higher speed ~~perform~~. Since each instruction type that a computer must perform requires additional transistors and circuitry, a larger list or set of computer instructions tends to make the microprocessor more complicated and slower in operation. That is why less instruction mean fast in ~~operations~~ operations. Pipelining is one of the unique feature of RISC. It is performed by overlapping the execution of several instructions in a pipeline.

fashion.

Example: Apple iPod and Nintendo DS etc.  
A RISC is a computer that can be divided into multiple instructions which perform low-level operation with single clock cycle, as its name suggest "Reduced Instruction Set".

## B. CISC

CISC is acronym for Complex Instruction Set Computer. CISC architecture are designed to decrease the memory cost. The large programs need more storage, thus increasing the memory cost and large memory becomes more expensive.

To solve these problems, in CISC architecture, the number of instructions per program is reduced by embedding the number of operations in each instruction. thereby the instructions are more complex.

Example: IBM 370/368, VAX 11/780, Intel 8086 etc

A CISC is a computer where single instructions can execute several low-level operations or are capable of multi-step operations or addressing modes within single instructions, as its name suggest "Complex Instruction Set".

## RISC

- |                                           |                                                 |
|-------------------------------------------|-------------------------------------------------|
| 1. Simple instructions taking one cycle.  | 1. Complex instructions taking multiple cycles. |
| 2. Very few instructions refer memory.    | 2. Most of instructions may refer memory.       |
| 3. Instructions are executed by hardware. | 3. Instructions are executed by micro-program.  |
| 4. Fixed format instructions.             | 4. Variable format instructions.                |
| 5. Few instructions.                      | 5. Many instructions.                           |
| 6. Multiple register sets.                | 6. Single register set.                         |
| 7. Highly pipelined.                      | 7. Not or less pipelined.                       |
| 8. Complexity is in compiler.             | 8. Complexity is in micro-program.              |
| 9. Eg: Apple ipod, Nintendo DS etc        | 9. Eg: IBM 370/368, Intel 80486                 |

Understanding RISC & CISC architecture with example:

Let us take an example of multiplying two numbers:

$$A = A \times B;$$

<<== This is C statement

The CISC Approach:

The primary goal of CISC architecture is to complete a task in as few lines of assembly as possible. This is achieved by building processor hardware that is capable of understanding & executing a set of operations, that's where our CISC architecture introduced.

for a particular task, a CISC processor would come prepared with a specific instruction (we'll call it "MULT"). When executed this instruction:

- 1.) Loads the two values into separate registers.
- 2.) Multiplies the operands in the execution unit.
- 3.) And finally third, stores the product in the appropriate register.

Thus, the entire task of multiplying two numbers can be completed with one instruction:

MULT A,B

<<== This is assembly statement.

MULT is what is known as a "Complex instruction". It operates directly on the computer's memory banks and does not require the programmer to explicitly call any loading or storing functions.

## The RISC Approach:

RISC processors only use simple instructions that can be executed within one clock cycle. True, the "MULT" command described above could be divided into three separate commands:

- 1) "LOAD" which moves data from the memory bank to a register.
- 2) "PROD" which finds the product of two operands located within the registers.
- 3) "STORE" which moves data from a register to the memory bank.

In order to perform the exact series of steps described in the CISC approach, a programmer would need to code four lines of assembly:

LOAD R1,A

LOAD R2,B

PROD A,B

STORE R3,A.

At first, this may seem like a much less efficient way of completing the operation. Because there are more lines of code, more RAM is needed to store the assembly level instructions. The compiler must also perform more work to convert a high-level language statement into code of this form.

### 3.10. 64-Bit Processor

A 64-bit processor is a microprocessor with a word size of 64 bits, a requirement for memory and data intensive applications such as computer-aided design (CAD) applications, database management systems (DBMS), technical and scientific applications, and high performance servers.

In simple words, 64-bit processors are computer chips that can support 64-bit computing. And by 64-bit computing, we mean software supports 64-bit virtual memory address.

Intel, IBM, Sun Microsystems, Hewlett Packard, and AMD currently develop or offer 64-bit processors.

4.1 Control Memory

Control memory is type of RAM which contains addressable storage registers.

The function of Control Unit is to initiate sequences of micro-operations.

When Control Signals

- When control signals are generated by hardware using conventional logic design techniques, the control unit is said to be hardwired.
- A control unit whose binary control variables are stored in memory is called a microprogrammed control unit.

Control memory:

A memory that is a part of a control unit is referred to as a control memory.

Each word in control memory contains within a microinstruction.

A sequence of microinstructions constitutes a microprogram.



Fig: Microprogrammed Control Unit organization

## 7.2 Addressing Sequencing:

Each computer instruction has its own microprogram routine in control memory to generate the microoperations that execute the instructions.

Process of finding address of next micro-instruction to be executed is called address sequencing. ~~Address sequencer~~

### Process of Address sequencing:

Address sequencer must have capability of finding address of next micro-instruction in following situations:

- 1) Incrementing of the Control address register
- 2) Unconditional or Conditional branch, depending on status bit conditions.
- 3) Mapping Process
- 4) A facility for subroutine call and return



Fig: Block diagram of Address Sequencer

OR - ~~address sequencer~~  
Selection of address for Control Memory

- Control address register receives address of next ~~to~~ micro-instruction from different sources.
- Incrementer simply increments the address by one.
- In case of branching, branch address is specified in one of the fields of microinstruction.
- In case of subroutine call, return address is stored in the register SBR which is used when returning from called Subroutine.

**Conditional Branch:**  
Controls the conditional branch decisions generated in the Branch logic.  
Tests the specified condition and branch to the indicated address if the condition is met, otherwise address register is simply incremented.

If condition is true, set the appropriate flag status register to 1. Conditions are tested for O(Overflow), N(Negative), Z(Zero) (Carry), etc.

#### \* Unconditional Branch

Fix the value of one status bit at the input of the multiplexer to 1 so that branching can always be done.

#### \* Mapping of instructions:

One simple mapping process that converts the 4-bit operation code to a 7-bit address for control memory is shown below:

Computer Instruction: [ ] 0 1 1 | Address

Mapping bits 0 | x x x x | 0 0

Microinstruction address: 0 1 0 1 1 0 0

4-bit opcode = specify up to 16 distinct instruction.

- g) Mapping Process: Converts the 4-bit opcode to a 7-bit control memory address
- Place '0' in the most significant bit of address.
  - Transfer the 4-bit operation code bits.
  - Clear the last two significant bits of the control address register (CAR).

Mapping function: Implemented by mapping ROM or PLD.

Control Memory Size: 128 words ( $= 2^{7 \times 8}$ )

f) Subroutines:

- Subroutines are programs that are used by another program to accomplish a particular task.
- Microinstructions can be saved by employing subroutines that use common sections of the micro-code.

#### 4.3 Computer Configuration:



Fig: Computer hardware Configuration

#### 4.1 Microinstruction format :

The microinstruction format for the control memory is shown in the figure below:

The 20 bits of the microinstruction are divided into four functional parts:

- The three fields  $F_1, F_2$  and  $F_3$  specify micro-operations for the computer.
- The CD field selects status bit conditions.
- The BR field specifies the type of branch.
- The AD field contains a branch address.



Fig: Microinstruction code format

$F_1, F_2, F_3$  : Micro-operation fields

CD : Condition for branching

BR : Branch - field

AD : Address field

#### Micro-instructions:

It is a single instruction in micro-code.

Each microinstruction below is defined using register transfer statements and is assigned a symbol for use in symbolic microprogram.

Instructions that makes microprogrammed are called micro-instructions.

## Description of CD:

| CD Condition  | Symbol | Comments             |
|---------------|--------|----------------------|
| 00 Always = 1 | U      | Unconditional branch |
| 01 DR(15)     | I      | Indirect address bit |
| 10 AC(15)     | S      | Sign bit of AC       |
| 11 AC = 0     | Z      | Zero value in AC     |

## Description of BR

| BR Symbol | function                                                                                                    |
|-----------|-------------------------------------------------------------------------------------------------------------|
| 00 JMP    | CAR $\leftarrow$ AD if condition = 1<br>CAR $\leftarrow$ CAR + 1 if condition = 0                           |
| 01 CALL   | CAR $\leftarrow$ AD, SBR $\leftarrow$ CAR + 1 if condition = 1<br>CAR $\leftarrow$ CAR + 3 if condition = 0 |
| 10 RET    | CAR $\leftarrow$ SBR (Return from subroutine)                                                               |
| 11 MAP    | CAR(2-5) $\leftarrow$ PR(13-14), CAR(0,1,6) $\leftarrow$ 0.                                                 |

CD (Condition field) consists of two bits representing 4 status bits and BR (Branch) field (2-bits) used together with address field AD, to choose the address of the next micro instruction.

## Microinstruction Fields ( $f_1, f_2, f_3$ )

| $f_1$ | Microoperation           | Symbol |
|-------|--------------------------|--------|
| 000   | None                     | NOP    |
| 001   | $AC \leftarrow AC + DR$  | ADD    |
| 010   | $AC \leftarrow 0$        | CLEAR  |
| 011   | $AC \leftarrow AC + 1$   | INCAC  |
| 100   | $AC \leftarrow DR$       | DRTAC  |
| 101   | $AR \leftarrow DR(0-10)$ | DRTAR  |
| 110   | $AR \leftarrow PC$       | PCTAR  |
| 111   | $M[AR] \leftarrow DR$    | WRITE  |

| $f_2$ | Microoperation               | Symbol |
|-------|------------------------------|--------|
| 000   | None                         | NOP    |
| 001   | $AC \leftarrow AC - DR$      | SUB    |
| 010   | $AC \leftarrow AC \vee DR$   | OR     |
| 011   | $AC \leftarrow AC \wedge DR$ | AND    |
| 100   | $DR \leftarrow M[AR]$        | READ   |
| 101   | $DR \leftarrow AC$           | ACTDIR |
| 110   | $DR \leftarrow DR + 1$       | INCDR  |
| 111   | $DR(0-10) \leftarrow PC$     | PCTDR  |

| F3  | Microoperation           | Symbol |
|-----|--------------------------|--------|
| 000 | None                     | NOP    |
| 001 | $Ac \leftarrow Ac + DR$  | XOR    |
| 010 | $Ac \leftarrow \bar{Ac}$ | COM    |
| 011 | $Ac \leftarrow shl\ Ac$  | SHL    |
| 100 | $Ac \leftarrow Shr\ Ac$  | SHR    |
| 101 | $Pc \leftarrow Pc + 1$   | INCPC  |
| 110 | $Pc \leftarrow AR$       | ARTPC  |
| 111 | Reserved                 |        |

Here micro-operations are subdivided into three fields of 3-bits each. These 3 bits are used to encode 7 different micro-operations. No more than 3 micro-operations can be chosen for a micro-instruction, one for each field. If fewer than 3 microoperations are used, one or more fields will contain 000 for no operation.

## 4.5 Symbolic Microinstruction:

Symbols are used in microinstructions as in assembly language. A symbolic microprogram can be translated into its binary equivalent by a microprogram assembler.

### Format of microinstruction:

Contains five fields → Label; micro-ops; CD; BR; AD

Label: → May be empty or may specify a ~~value~~ symbolic address terminated with a colon.

Micro-ops: → Consists of one, two or three symbols separated by commas.

CD: → One of {U, I, S, Z}

Where, U: Unconditional Branch

I: Indirect address bit

S: Sign of AC

Z: Zero value in AC

BR →

BR → One of {JMP, CALL, RET, MAP}

AD → One of {Symbolic address, NEXT, empty (in case of MAP and RET)}

## 4.6 Symbolic Microprogram:

### Fetch Routine:

- Read instruction from memory.
- Decode instruction and update PC.

Sequence of micro-operations in fetch routine / cycle:

AR  $\leftarrow$  PC  
DR  $\leftarrow M[AR]$ , PC  $\leftarrow PC + 1$   
AR  $\leftarrow DR(0-10)$ , CAR(2-5)  $\leftarrow DR(11-14)$ , CAR(0,1,6)  $\leftarrow 0$

Symbolic microprogram for fetch routine / cycle:

|                |   |     |      |  |
|----------------|---|-----|------|--|
| ORI 64         |   |     |      |  |
| FETCH: PC TAR  | U | JMP | NEXT |  |
| · READ, INC PC | U | JMP | NEXT |  |
| DRTAR          | U | MAP |      |  |

Binary microprogram for fetch routine:

| Binary address | F1  | F2  | F3     | C  | BR | AD       |
|----------------|-----|-----|--------|----|----|----------|
| 10000000       | 110 | 000 | 000    | 00 | 00 | 10000001 |
| 10000001       | 000 | 100 | 100101 | 00 | 00 | 10000110 |
| 10000010       | 101 | 000 | 000    | 00 | 11 | 00000000 |

## # Symbolic Microprogram

- Control Memory  $\rightarrow$  128 20-bit words
- first 64 words  $\rightarrow$  Routines for 16 machine instructions
- Last 64 words  $\rightarrow$  Used for other purpose (e.g. fetch buffer & other subroutines)
- Mapping  $\rightarrow$  OP-Code XXXX into 0XXXXDD, the first address for 16 routines are 0(0000 00), 4(0 0001 00), 8, 12, 16, 20, ... 60.

## Partial Symbolic Micro program:

| Label | Microoperation | D | BR   | AD     |
|-------|----------------|---|------|--------|
| Add   | ORG 0          |   |      |        |
| Add:  | NOP            | I | CALL | INDRCT |
|       | READ           | V | JMP  | NEXT   |
|       | ADD            | V | JMP  | FETCH  |

ORU 4

|         |       |   |      |        |
|---------|-------|---|------|--------|
| BRANCH: | NOP   | S | JMP  | OVER   |
|         | NOP   | V | JMP  | FETCH  |
| OVER :  | NOP   | I | CALL | INDRCT |
|         | ARTPC | V | JMP  | FETCH  |

ORU 8

|       |       |   |      |        |
|-------|-------|---|------|--------|
| STORE | NOP   | I | CALL | INDRCT |
|       | ACTDR | V | JMP  | NEXT   |
|       | WRITE | V | JMP  | FETCH  |

EXCHANGE: ORU 12

|              |   |      |        |
|--------------|---|------|--------|
| NOP          | I | CALL | INDRCT |
| READ         | V | JMP  | NEXT   |
| ACTDR, DRTAC | V | JMP  | NEXT   |
| WRITE        | V | JMP  | FETCH  |

|         |                       |   |     |      |
|---------|-----------------------|---|-----|------|
|         | OR64                  |   |     |      |
| FETCH:  | PCTAR                 | U | JMP | NEXT |
|         | READINGPC             | U | JMP | NEXT |
| INDRCT: | <del>READ</del> DRTAR | V | MAP |      |
| INDRCT  | DRTAR READ            | V | JMP | NEXT |
|         | DRTAR                 | U | RET |      |

## Binary Microprogram:

| Micro<br>routine | Address<br>Decimal / Binary | Binary Microinstruction |     |     |    |    |          |  |  |
|------------------|-----------------------------|-------------------------|-----|-----|----|----|----------|--|--|
|                  |                             | F1                      | F2  | F3  | CD | BR | AP       |  |  |
| ADD              | 0 0000000                   | 000                     | 000 | 000 | 01 | 01 | 10000011 |  |  |
|                  | 1 0000001                   | 000                     | 100 | 000 | 00 | 00 | 00000101 |  |  |
|                  | 2 0000010                   | 001                     | 000 | 000 | 00 | 00 | 10000000 |  |  |
|                  | 3 0000011                   | 000                     | 000 | 000 | 00 | 00 | 10000011 |  |  |
| BRANCH           | 4 0000100                   | 000                     | 000 | 000 | 10 | 00 | 00000110 |  |  |
|                  | 5 0000101                   | 000                     | 000 | 000 | 00 | 00 | 10000000 |  |  |
|                  | 6 0000110                   | 000                     | 000 | 000 | 01 | 01 | 10000011 |  |  |
|                  | 7 0000111                   | 000                     | 000 | 110 | 00 | 01 | 10000011 |  |  |
| STORE            | 8 0001000                   | 000                     | 000 | 000 | 01 | 01 | 10000011 |  |  |
|                  | 9 0001001                   | 000                     | 101 | 000 | 00 | 00 | 00010010 |  |  |
|                  | 10 0001010                  | 111                     | 000 | 000 | 00 | 00 | 10000000 |  |  |
|                  | 11 0001011                  | 000                     | 000 | 000 | 00 | 00 | 10000000 |  |  |
| EXCHANGE         | 12 0001100                  | 000                     | 000 | 000 | 01 | 01 | 10000011 |  |  |
|                  | 13 0001101                  | 001                     | 000 | 000 | 00 | 00 | 0001110  |  |  |
|                  | 14 0001110                  | 101                     | 101 | 000 | 00 | 00 | 0001111  |  |  |
|                  | 15 0001111                  | 000                     | 000 | 000 | 00 | 00 | 10000000 |  |  |

|        |    |         |     |     |     |    |    |         |
|--------|----|---------|-----|-----|-----|----|----|---------|
| FETCH  | 64 | 1000000 | 110 | 000 | 000 | 00 | 00 | 1000001 |
|        | 65 | 1000001 | 000 | 000 | 101 | 00 | 00 | 1000010 |
|        | 66 | 1000010 | 101 | 000 | 000 | 00 | 11 | 0000000 |
| INDRCT | 67 | 1000011 | 000 | 100 | 000 | 00 | 00 | 1000100 |
|        | 68 | 1000100 | 101 | 000 | 000 | 00 | 10 | 0000000 |

u.7 Control Unit Operation:

Computer Arithmetic5.1 Addition Algorithm

Computer arithmetic is a branch of computer engineering that deals with methods of representing integers and real values (e.g. fixed- and floating-point numbers) in digital systems and efficient algorithms for manipulating such numbers by means of hardware circuits or software routines.

5.1. Addition Algorithm:

The standard algorithm for addition is a process that is normally used to solve multi-digit addition problems.

Algorithm

Step 1: Start the program.

Step 2: Initialize the contents of memory location ~~Step 2~~ in Ax & Bx registers

Step 3: Initialize the value 00 for CL register.

Step 4: Add the value of Ax & Bx by using ADD mnemonics and store the result in a memory location.

Step 5: If there is carry, increment the value of CL register by 1, else move the contents of Ax to a location for result.

Step 6: Move the value of CL to a location for output

Step 7: Stop the program.

## Program

Data Segment

a db 9h

b db 2h

c dw ?

data ends

Code Segment

assume CS: Code, DS: data

start:

mov ax, data

mov ds, ax

mov al, a

mov bl, b

add al, bl

mov c, ax

int 3

Code ends

end start

## 5.2 Subtraction Algorithm

The algorithm that is currently taught for subtraction requires "regrouping", which has previously been called "borrowing".

This algorithm is usually illustrated with manipulatives, in which 10 is exchanged for 10 ones and 100 is exchanged for 10 tens.

### Algorithm

Step 1: Start the program.

Step 2: Initialize the contents of memory location in Ax & Bx registers.

Step 3: Initialize the value 00 for CL register.

Step 4: Subtract the value of Ax & Bx by using SUB mnemonics and store the result in a memory location.

Step 5: If there is carry, increment the value of CL register by 1, else move the contents of Ax to a location for result.

Step 6: Move the value of CL to a location for output.

Step 7: Stop the program.

# Program for Subtraction

DATA Segment

a db 2Ah

b db 13h

c dw ?

DATA ends

Code Segment

assume CS:Code, DS:Data

Start:

Mov AX, DATA

Program to input character & print

Mov DS, AX

Mov AL, A

!

Mov BL, B

mode small

Sub AL, BL

.DATA

Mov C, AX

.CODE

Int 3

Mov AH, 1H

Code ends

INT 21H

end Start.

Mov AL, DL, AL

Mov AH, 2H

INT 21 H

End.

## 5.3 Multiplication Algorithm:

A multiplication algorithm is an algorithm (or method) to multiply two numbers. Depending on the size of the numbers, different algorithms are in use. Efficient multiplication algorithms have existed since the ~~development~~ advent of the decimal system.

### Algorithm

Step 1: Start the program.

Step 2: Initialize the contents for Ax & Bx from memory locations.

Step 3: Multiple the Ax & Bx value using MUL mnemonics and store result in memory location.

Step 4: Move the contents of Ax and Dx to memory location to check the result.

Step 5: Stop the program.

## Program.

Data Segment

a db 9h

b db 2h

c dw 9

code data ends

Code segment

assume cs:code, ds: data

start:

mov ax, data

mov ds, ax

mov ax, 000h

mov bx, 000h

mov al, a

mov bl, b

mul b

mov c, al

int 3

code ends

end start

## 5.4 Division Algorithm:

A division algorithm is an algorithm which, given two integers  $N$  &  $D$ , computes their quotient & or remainder, the result of division. Some are applied by hand, while others are employed by digital circuit designs and Software.

### Algorithm

Step 1: Start the program

Step 2: Initialize the contents for Ax & Rx from memory locations

Step 3: Multiple the Ax and Rx values using MUL mnemonics and store the result in a memory location.

Step 4: Move the content of Ax and Dx to memory location to check the result.

Step 5: Stop the program

# Program

Data Segment

a db 28h

b db 2h

c dw ?

data ends

Code Segment

Assume CS:code, DS:data

Start:

MOV AX, DATA

Mov DS, AX

Mov AX, 000h

Mov BX, 000h

Mov AL, 9

Mov BL, B

Div B

Mov C, AX

INT 3

Code ends

end Start

Hello World Program

Data Segment

MSG DB "Hello World\n", \$

Ends

Code Segment

Assume CS:Code, DS:Data

Start:

MOV AX, DATA

Mov DS, AX

LEA DX, MSG

INT 21H

Mov AH, 4CH

INT 21H

End

End Starts

## 5.5 Logical Operations:

### # Gate level Logical Components:

#### ii) AND

Symbol



VHDL Equation

$$X \leftarrow A \text{ and } B$$

or

$$X = A \cdot B$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 0 |
| 0 | 1 | 0 |
| 1 | 0 | 0 |
| 1 | 1 | 1 |

#### iii) OR

Symbol



VHDL Equation

$$X \leftarrow A \text{ or } B$$

or

$$X = A + B$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 0 |
| 0 | 1 | 1 |
| 1 | 0 | 1 |
| 1 | 1 | 1 |

iii) NOT  
Symbol



VHDL Equation

$$X \leftarrow \text{not } A$$

or

$$X = A'$$

Truth Tabl.

| A | B | X |
|---|---|---|
| 0 |   | 1 |
| 1 |   | 0 |

i) ~~Composite Logic Gates~~

iv) NAND

Symbol



VHDL Equation

$$X \leftarrow \text{not}(A \text{ and } B)$$

$$X \leftarrow \text{not}(A \text{ and } B)$$

or

$$X = (A \cdot B)'$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 1 |
| 0 | 1 | 1 |
| 1 | 0 | 1 |
| 1 | 1 | 0 |

v) NOR

Symbol



VHDL Equation

$$X \leftarrow \text{not}(A \text{ or } B)$$

or

$$X = (A + B)'$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 1 |
| 0 | 1 | 0 |
| 1 | 0 | 0 |
| 1 | 1 | 0 |

vii) XOR  
Symbol



VHDL Equation

$$X \leftarrow A \text{ xor } B$$

or

$$X = A \oplus B$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 0 |
| 0 | 1 | 1 |
| 1 | 0 | 1 |
| 1 | 1 | 0 |

viii) X-NOR  
Symbol



VHDL Equation

$$X \leftarrow A \text{ xnor } B$$

or

$$X = (A \oplus B)'$$

Truth Table

| A | B | X |
|---|---|---|
| 0 | 0 | 1 |
| 0 | 1 | 0 |
| 1 | 0 | 0 |
| 1 | 1 | 1 |

# Input/Output Organization

Rajendra

Date \_\_\_\_\_

## ~~Microcomputer Memory~~

### 6.1. Peripheral devices:

Peripheral devices are devices that are attached/connected to a computer system to enhance its capabilities. Peripheral devices includes input devices, output devices, storage devices and communication devices.

Examples - keyboard, mouse, printer, scanner, monitor, speaker, etc.



Fig: Block diagram of  
Peripheral device

## 6.2 I/O modules:

In computing, input/output or I/O module is the communication between an information processing system, such as a computer, and the outside world, possibly a human or another information processing system.

OR

I/O module is a technique for exchanging data between processor and I/O device.

The I/O module works as a mediator between I/O devices and the processor. It conveys the information from I/O devices (sometimes called as peripheral or external device) to processor and vice versa.

An I/O module at one end is connected to the system bus (information transmission cable) of processor and at the other end may be connected with a number of I/O devices.

Functions of I/O module:

i) Processor Communication - This involves the following tasks:

a. exchange of data between processor and I/O modules.

b.

b. Command decoding - I/O module accepts

Commands sent from the processor.

c) Status reporting → The device must be able to perform device communication to report its status to the processor.

d) Address recognition → Each I/O device has a unique address and the I/O module must recognize the address.

ii) Device Communication - The I/O device must be able to perform device communication such as status reporting

iii) Control & Timing - The I/O module must be able to co-ordinate the flow of data between the internal resources and external devices.

iv) Data buffering - This is necessary as there is a speed mismatch between the data buffered in the I/O module and then sent to the peripheral device at its rate.

v) Error detection - The I/O module must also be able to detect errors and report them to the processor.



Fig: Model of I/O module



Fig: I/O module Diagram

### 6.3. Input/Output interface:

Input/Output interface provides a method for framing information between internal storage and external I/O devices. Peripherals connected to a computer need special communication links for interfacing them with the central processing unit. The purpose of the communication link is to resolve the differences that exist between the central computer and each peripheral.

OR

The method that is used to transfer information between internal storage and external I/O devices is known as I/O interface. The CPU is interfaced using special communication links by the peripherals connected to any computer system. These communication links are used to resolve the differences between CPU and peripheral.

### 6.4. Modes of transfer:

The binary information that is received from an external device is usually stored in the memory unit. The information that is transferred from the CPU to the external device is originated from the memory unit. CPU merely processes the information but the source and target is always the memory unit. Data transfer between CPU and the I/O devices may be done in different modes.

AUGUST 2018  
CCE

Data transfer to and from the peripherals may be done in any of the three possible ways:

1. Programmed I/O
2. Interrupt-initiated I/O
3. Direct memory access (DMA).

## 6.5. Programmed I/O.

Using this technique, data transfer takes place under the direct control of the processor. The processor must continuously check on I/O device and hence it cannot do another task. This method is hence inefficient (Slow).



DB - Data Bus      DAB - Device Address Bus

DC - Device Control signals      MAB - Memory Address Bus

CB - Control Bus

## Characteristics of Programmed I/O:

- i) In programmed I/O, the I/O operations are completely controlled by the CPU.
- ii) Used in real time and embedded systems.
- iii) Used in CPUs ~~have~~ which have a single input and a single output instruction.

## 6.6 Interrupt-Driven Input/Output (I/O):

In the interrupt driven I/O, the processor issues a READ/ WRITE instruction to the device and then continues doing its task. When the interface buffer is full, and after ready to send data to the processor, the interface sends a signal to the processor informing it that data is ready. The signal is called as the interrupt signal. When the processor receives the interrupt signal, it knows that the data is ready; it suspends its current job and transfers data from buffer to its own register.

### Role of CPU in interrupt-driven I/O:

- i) The device issues an interrupt signal to the processor.
- ii) The processor finishes the execution of the current instruction. It then responds to the interrupt signal.
- iii) The processor sends an acknowledgement signal to the device that sent the interrupt. The device then removes its interrupt signal.
- iv) The processor then attends to the device ~~and~~ that issued to the interrupt signal.

## 6.7. Direct Memory Access:

DMA is a method/technique of transferring data from main memory (RAM) to a device without passing it through the CPU. It is a technique to speed up data transfer without using the CPU.

While most data that is input or output from your computer is processed by the CPU, some data does not require processing, or can be processed by another device. In this situation, DMA can save processing time and is a more efficient way to move data from the computer's memory to other devices.

This means computers having DMA channels can transfer data to and from devices much more quickly than computers without a DMA channel can. This is useful for making quick backups and for real-time applications.

For example, a sound card, video cards that support DMA can also access the system memory and process graphics without needing the CPU. Ultra DMA hard drives use DMA to transfer data faster than previous hard drives that required the data to first be run through the CPU.



Fig: CPU Bus Signals for DMA Transfer



Fig: Block diagram DMA

## 6.9 Data Communication Processor:

A data communication processor is an I/O processor that distributes and collects data from numerous remote terminals connected through telephone and other communication lines to the computer. It is a specialized I/O processor designed to communicate with data communication networks.

Such a communication network consists of variety of devices such as printer, display devices, digital sensors etc. serving many users at once. The data communication processor communicates with each terminal through a single pair of wires. It also communicates with CPU and memory in the same manner as any I/O processor does.

Q.

Memory is the storage device of computer that stores data either temporarily or permanently.

Memory organization is a classification of memory or a class presentation of memory in which the memory is categorized or sub divided according to their work.

### 7.1. Microcomputer Memory:

A microcomputer is a small sized personal computer (PC) that is designed for an individual having a microprocessor inside it. A microcomputer contains a microprocessor (a CPU on microchip), memory in the form of read-only memory (ROM) and random access memory (RAM), I/O ports and a bus or system of interconnecting wires housed in a unit that is usually called a motherboard.

- Memory is essential component of micro computer system.
- It stores binary instructions and datum for the microcomputer.
- The memory is the place where the computer holds current programs and data that are in use.
- None technology is optimal in satisfying the memory requirements for a computer system.

7.2 Characteristics of memory systems  
The memory system can be characterized with their following properties:

1. Location
2. Capacity
3. Unit of Transfer
4. Access Method
5. Performance
6. Physical Type
7. Physical Characteristics
8. Organization

### 1. Location:

It deals with the location of memory device in the computer system. There are three possible locations:

- CPU: This is often in the form of CPU register and small amount of cache.
- ~~ROM~~
- Internal memory: This is main memory like RAM or ROM.
- External Memory: It consists of secondary storage devices like hard disk, magnetic tape, that are accessible to processor through controllers.

## 2 Capacity:

The capacity of any memory device is expressed in terms of:

- Word Size: Words are expressed in bytes (8 bits). Commonly used word sizes are: 1 byte, 2 bytes and 4 bytes.

Number of Words: - They specifies number of words available in the particular memory device. Common word lengths are 8, 16, 32 bits etc.

## 3. Unit of Transfer:

It is the maximum number of bits that can be read or written into the memory at a time.

- Internal Memory: Unit of transfer is equal to the word size.

- External Memory: They are transferred in blocks which is larger than a word.

## 4. Access Methods:

It is a fundamental characteristic of memory devices. It is the sequence or order in which memory can be accessed.

- Sequential Access: Access must be made in specific linear sequence.

Eg: Magnetic tape

• Direct Access: Access is accomplished by jumping (direct access) to general vicinity plus a sequential search to reach the final location.  
Eg: magnetic disk

• Random Access: Any location can be selected out randomly and directly addressed and accessed.  
e.g. RAM

• Associative Access: A word is retrieved based on a portion of its contents rather than its address.

5. Performance: The performance of the memory system is determined using three parameters:

• Access time: for random access memory, it is time taken to perform read/write operation.

For non-random access memory, it is time taken to position read/write mechanism at desired location.

- Time between presenting the address and getting the valid data.

• Memory Cycle time: It is the total time that is required to store next memory access operation from the previous memory access operation.

• Transfer rate: It is defined as the rate which data can be transferred into or out of ~~commerse~~ memory unit.

#### 6 Physical Type:

- Semiconductor - RAM
- Magnetic - Dr Hard disk
- Optical - CD & DVD.

#### 7 Physical Characteristics:

- Volatile - Information ~~as~~ decays or is lost when power is off.
- Non-volatile - Information ~~once~~ is not lost when power is off.
- Erasable - Data in memory can be erased or called erasable
- Non-erasable - Data once programmed cannot be erased or modified.

#### 8 Organization:

Organization refers to the physical arrangement of ~~most~~ bits to form words.  
It is not always obvious.

### 7.3 The Memory Hierarchy:

Memory hierarchy is a concept that is necessary for the CPU to be able to manipulate data. This is because it is only able to get instructions from cache memory.

Computer memory hierarchy is a program structure that is commonly used to illustrate the significant differences among memory types.

The memory hierarchy system consists of all storage devices employed in a computer system from slow but high capacity auxiliary memory up to a relatively faster cache memory accessible to high speed processing logic.

The figure in the next page illustrates memory hierarchy:



Fig: The Memory Hierarchy

### i) Registers:

Registers are temporary memory of computer system that stores data during the time of processing inside ALU.

Registers within CPU are special purpose temporary storage locations. There are two types of registers:

- (a) General Purpose register
- (b) Specific purpose register. The important register within CPU is Program Counter (PC).

- They are fastest memory of Computer.
- Storage capacity of register is small.
- They are temporary memory of computer.

### ii) Cache:

Cache memory is extremely fast memory that is built into the computer's CPU, or located next to it on a separate chip.

- It is a temporary memory of Computer.
- It increases the performance & speed of computer.
- It lies between processor and main memory.

The advantage of cache memory is that the CPU does not have to use the motherboard's system bus for data transfer.

Two processes in Cache:

\* Cache Hit  $\Rightarrow$  At first, processor send address of desired data to cache, if data is available then event is called Cache Hit.

\* Cache Miss: → When processor sent address & it is not present in cache, event is called Cache miss. After cache miss, data of main memory is copied to cache for next cache hit.

Two main levels of cache:

- \* L<sub>1</sub> Cache: → L<sub>1</sub> is "level-1" cache memory, usually built onto the microprocessor chip itself. For example, the Intel microprocessor comes with 32 thousand bytes of L<sub>1</sub>.
  - It is accessed without any delay.
- \* L<sub>2</sub> Cache → L<sub>2</sub> is "Level-2" cache memory that is on a separate chip that can be accessed more quickly than the larger "main" memory. A popular L<sub>2</sub> cache memory size is in terms of megabytes.
  - It takes more clock cycle to access than L<sub>1</sub> cache.

### iii) Main Memory:

Main memory is principle internal memory of system of computer. It is fast and less capacity memory of computer.

- It is directly connected to the CPU.
- It is semiconductor based memory.
- It is composed mainly of RAM and small amount of ROM.
- These memories are located on motherboard.

IV) Magnetic Disk and Optical Disk:

Magnetic disks are flat circular plate, of metal or plastic, coated with iron oxide on both sides.

A magnetic disk offers high storage capacity, reliability and fast to access data. It access data randomly. It is slower since it is not directly connected to CPU.

The types of magnetic disks are hard disk, floppy disk.

### Optical Disk

An optical disk is data storage medium that can be written to and read using a low powered laser beam. It is mainly used for music, movies, and software programs. It is compact, lightweight, durable. Optical disks are: CD-ROM, CD-RW, DVD, Blu-Ray disc etc.

V) Magnetic Tape: magnetic

A memory tape is sequential storage device of computer. It is not common storage device now a days. It is mainly used in mainframe and super computers. It has slow and sequential flow of data. It is very useful for storing huge volume of data economically.

A magnetic tape is a continuous strip of plastic

## Advantages

- It is cheaper.
- It can be reused.
- Data from magnetic tape can be directly transferred into hard disk.

## Disadvantages

- It is sequential storage device.
- Data access time of magnetic tape is slow.

→ As we go down in the hierarchy:

- Cost per bit decreases.
- Capacity of memory increases.
- Access time increases.
- Frequency of access of memory by processor also decreases.

## 7.4 Internal and External Memory:

### A. Internal Memory:

Internal memory is also called primary memory or main memory of computer. It has fast and less capacity of computer. They are mainly used to store data during time of processing. These memories have great effect on processing speed of computer.

There are mainly two types of primary or internal memory:

- (1) RAM
- (2) ROM

#### 1. RAM

RAM stands for Random Access Memory. It stores data only when computer is on. It takes the form of integrated circuits that allow stored data to be accessed in any order (that is random). The advantage of using RAM is to store whatever you are working on at any moment and RAM is very fast memory.

There are two types of RAM:

- (i) SRAM
- (ii) DRAM

## SRAM

## DRAM

- |                                                                                                                                                                                                                                                                    |                                                                                                                                                                                                                                                                |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1. Mostly, it is formed by using transistors.<br>2. It is expensive.<br>3. It has low density of data.<br>4. It is faster than DRAM.<br>5. It does not need periodic refreshing.<br>6. It is mainly used as cache memory.<br>7. It consumes less electrical power. | 1. Mostly, it is formed by using capacitors.<br>2. It is less expensive.<br>3. It has high density of data.<br>4. It is slower than SRAM.<br>5. It needs periodic refreshing.<br>6. It is mainly used as main memory.<br>7. It consumes more electrical power. |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|

## 2. ROM:

ROM stands for Random Access Memory. It is a class of storage media used in computers. Data stored in ROM cannot be modified; or can be modified only slowly or with difficulty, so it is mainly in distributive firmware. The manufacturing company already installed program in ROM.

## Types of ROM:

MAY

### i) PROM:

PROM stands for Programmable Read Only Memory. It is the memory chip on which data can be written only once. It retains their contents when the computer is turned off. The difference between PROM and ROM is that a PROM is manufactured as blank memory whereas ROM is programmed during the manufacturing process.

### ii) EEPROM:

EEPROM stands for Erasable Programmable Read Only Memory. It is a special type of memory that retains its contents until it is exposed to ultraviolet light. The ultraviolet light clears its contents making it possible to reprogram the memory.

### iii) EEPROM:

EEPROM stands for Electrically Erasable Programmable Read Only Memory. It is a special type of ~~memory~~ PROM that can be erased by exposing it to an electrical signal instead of UV-light. It can be erased and rewritten at byte level.

## RAM

1. It stands for Random Access Memory.

2. Contents will be erased when power is off.

3. It usually has higher memory space than ROM.

4. It is less expensive.

5. It is faster than ROM.

6. Its types are SRAM & DRAM.

## ROM

1. It stands for Read Only memory.

2. Contents won't be erased when power is off.

3. It usually has less memory space than RAM.

4. It is more expensive.

5. It is slower than RAM.

6. Its types are PROM, EEPROM and EEPROM.

## B External Memory

External memory is also known as secondary memory or auxiliary memory. It is slower and cheaper form of memory. CPU does not access the secondary memory directly. The contents in it must be copied into the primary storage, RAM for CPU to process. Secondary memory device includes hard drives, floppy disk, CDs, DVDs.

## Type of External Memory:

### 1) Magnetic Tape:

A magnetic tape is sequential storage device of computer. A magnetic tape is the strip of plastic coated with magnetic recording medium (i.e. magnetic oxide). Data can be recorded and read as a sequence of character through read/write head.

It is not common storage nowadays. It is mainly used in mainframe and super computers. It is slow and sequential flow of data.

### 2) Magnetic Disk:

Magnetic disk are flat circular plates of metal or plastic, coated with on both side with iron oxide.

Magnetic disk offers high storage capacity, reliability and fast to access data. It access data randomly. It is slower since it is not directly connected to the CPU.

Type of hard disk: magnetic disk.

### 3) Hard Disk:

The hard disk is non-volatile and random memory of a computer system. The hard disk has enormous storage capacity compared to

main memory. The hard disk is usually contained in the system unit of a computer. The hard disk is used for long term storage of programs and data. Hard disk has a storage capacity in terms of gigabytes such as 20GB, 40GB, 80GB, 320GB etc.

### iii) floppy disk:

Floppy disks are portable magnetic storage devices. Floppy disks are not as fast as hard disk, nor they store much information. They are mostly used for transferring software between computers. They are small, removable media storage devices. They record data on a thin, circular magnetic field enclosed in a flat, square plastic jacket.

### iv. Optical Disk:

An optical disk is a data storage medium that can be written to and read using a low powered laser beam. It is mainly used for music, movies and software programs. It is compact, lightweight, durable. Optical disks are: CD-ROM, CD-RW, DVD-ROM, DVD-RW, Blu-ray disks etc.

### 4. Flash memory:

Flash memory is non-volatile data storage and thus is capable of retaining its data even

When its power source has been turned off, flash memory is small, fast, lightweight and makes no noise. Flash memory is reliable and allows us to specify which data we want to keep. Examples of flash memory are pendrive, memory-card etc.

## 7.5 Cache Memory Principles:

### Principles

- ~~Faster~~ Cache memory is intended to give memory speed approaching that of faster memory available but with large size at the price of less expensive semiconductor memory.
- Cache is checked first for all memory references.
- If not found, the entire block in which that reference resides in Main memory is stored in a cache slot, called a line.
- Each line includes a tag which identifies which particular block is being stored.
- Locality of reference implies that future reference will likely come from the block of memory, so that cache line will probably be utilized.

repeatedly.

- The proportion of memory references, which are found already stored in cache, is called the hit ratio.

Cache memory is based on the principle of locality or locality of reference.

~~The locality of ref~~

According to locality of reference, if programs and data which are accessed frequently are placed in fast memory, the average access time can be reduced.

Locality of reference is divided into two categories:

i) Spatial Locality:

It reflects tendency of a program to access data locations sequentially, such as when processing a table of data.

ii) Temporal Locality:

It refers to the tendency of a processor to access memory locations that have been used frequently.

## 7.6. Elements of Cache Design:

### 7.6.1. Cache Size

The size of cache should be small enough so that the overall average cost per bit is close to that of main memory alone and large enough so that overall average access time is close to that of the cache alone.

- Large cache are ~~expensive~~ than smaller ones.
- The available chip and board also limit cache size.

### 7.6.2 Mapping function:

The transformation of data from main memory to cache memory is referred to as memory mapping process.

As there are fewer cache lines than main memory blocks, an algorithm is needed for mapping main memory blocks into cache lines.

There are three different types of mapping functions:

#### i) Direct Mapping:

The simplest technique, known as direct mapping, maps each block of main memory into only one possible cache line.

This technique is simple and inexpensive to use.

### ii) Associate Mapping:

Associate mapping overcomes the disadvantage of direct mapping by permitting each main memory block to be loaded into any line of the cache.

### iii) Set-Associative Mapping

Set-associative mapping is a compromise that exhibits the strength of both direct and associative approaches. With set associative mapping, block can be mapped into any of the lines of set  $j$ .

### 7.6.3. Replacement Algorithm:

Once the cache has been filled, when a block is brought into the cache, one of the existing blocks must be replaced.

- for direct mapping, there is only one possible line for any particular block, and no choice is possible.
- for the associative and set associative techniques, a replacement algorithm is needed.

To achieve high speed, such an algorithm must be implemented in hardware.

Last Recently Used (LRU), Least frequently Used (LFU), first in first out (FIFO) are some replacement algorithms.

#### 7.6.4. Write Policy

- Write Through:

→ The simplest policy is called write through. Using this technique, all write operations are made to main memory as well as to the cache, ensuring that main memory is always valid.

- Write back:

An alternative technique, known as write back, minimizes memory writes. With write back updates are made only in the cache. When an update occurs, a dirty bit, or use bit associated with the line is set. Then, when a block is replaced, it is written back to main memory if and only if the dirty bit is set.

#### 7.6.5 Number of Caches:

##### #. Multilevel Caches: L1 and L2 cache

Most contemporary design includes both on-chip and external caches. The simplest such organization is known as two-level caches with the internal cache (on-chip) designed as level 1 (L1) and the external cache (off-chip) designed as level 2 (L2). There can also be 3 or more levels of cache. This helps in reducing main memory access.

## B. Unified and Split Cache:

- Earlier on-chip cache designs consisted of a single cache used to store both instructions and data. This is the unified approach.

More recently, it has become common to split the cache into two: one ~~dedicated~~ dedicated to instructions and other dedicated to data. These two caches exist at the same level. This is the split cache.

Using ~~split~~ a unified cache or a split cache is another design issue.