

Memory Organization# Memory HierarchyAuxiliary memory:

provides backup storage. ← .. Programs that are not being used

Main memory:

memory unit that directly connects to the CPU. ← .. Programs that are currently needed are stored in main memory.

# Cache Memory

Cache Memory: It is very special; v-high-speed memory.

Used to increase the speed of processing by making current programs and data available to the CPU at a rapid rate.

→ Since CPU is faster than main memory, thus limit of processing speed is due to main memory.

→ The cache stores segment of programs currently being executed in the CPU & temporary data required frequently in present calculations.

Good Write

## Multiprogramming

refers to existence of 2 or more programs in different parts of memory hierarchy @ the same time. When one program is waiting for input or output transfer, another program is ready to utilize the CPU.

The task of maintaining information's part in the main memory i.e. currently active. The part of Computer System that supervises the flow of info b/w auxiliary & main memory is "Memory Management System."

## MAIN MEMORY

#

are based on Semiconductor integrated ckt.

Ram chips can be Static or dynamic

- Static RAMs consist of internal flip-flops that store the binary information. This remains static as long as power is applied to it. [Easy to use]
- Dynamic RAMs stores info in form of electric charges applied to the capacitors, this stored charge tends to discharge with time thus it needs to be periodically recharged by refreshing the d-memory by cycling thru the words every few milli sec to restore charge. [Less consumption]

# Main memory has both:

RAM is used to store bulk of programs which are subject to change.

ROM stores the permanently resident programs, tables of constants which won't change.

Bootstrap ladder: The initial program is stored in the ROM portion; it is called Bootstrap loader.

Its function is to start the computer system when power is turned ON.

It consists of turning power on & starting to execute the initial program. Thus when turned on; PC  $\leftarrow$  first address of the Bootstrap ladder.

It loads a portion of OS from disk to main memory & the control is transferred to the OS. (for general use)



A 8-bit data bus (bi-directional) connects it with the memory.

$\{$  CS1 = 1 & CS2 = 0 for it to work.

$\swarrow$  Write (WR)  
 $\searrow$  Read (RD)

when  $WR = 1$   $CS_1 = 1$   $CS_2 = 0$

the Rom accepts the data from the 8 bit data bus to the address given by  $AD_{0-6}$

when  $RD = 1$   $CS_1 = 1$   $CS_2 = 0$

the Ram sends the data bus with the data given by the address  $AD_{0-6}$



### Rom chip

for the same size Rom has more bits as its cells occupy less space



single directional, as Rom can only send data or only RD operation can be used here.

when  $CS_1 = 1$  &  $CS_2 = 0$  then data bus is given data stored by address given by  $AD_{0-6}$

### Memory Address Map

| Component                | Hex. address | Address Bus [AD <sub>16</sub> ] |
|--------------------------|--------------|---------------------------------|
| 128 Bg RAM 1             | 0000 - 007F  | 0 0 0 × × × × × × × ×           |
| 128 RAM 2                | 0080 - 00FF  | 0 0 1 × × × × × × ×             |
| 128 RAM 3                | 0100 - 017F  | 0 1 0 × × × × × × ×             |
| 128 RAM 4                | 0180 - 01FF  | 0 1 1 × × × × × × ×             |
| 512 Rom GoodQ8000 = 03FF |              | 1 × × × × × × × ×               |

Note:



If  $AD_{10} = 0 \rightarrow AD_{9-8}$

|    |       |
|----|-------|
| 00 | RAM,  |
| 01 | RAM 2 |
| 10 | RAM 3 |
| 11 | RAM 4 |



#

## Auxiliary Memory

[These are magnetic devices]

The avg time req to reach the storage location in memory and obtain its content : Access time

Seek time → Transfer time req to put the read-write head at the right place. → time req to transfer data.

\* To minimize Seek time the memory is organized in records or blocks.

Set of characters

\* The transfer rate = number of characters/word the device can transfer in a second after positioning at the tip of the beginning of record.

⇒ Bits are recorded as magnetic spots on the writer head surface; which are detected by the change in magnetic field by read head.

→ Magnetic Disk is a circular plate of metal/plastic coated with magnetized material.

There are concentric circles on the disk all 'tracks' where data is stored. These tracks are then broken into 'sectors'

They can be of 2 types:

- > A single read/write head for each disk surface. The track address bits are used by a mech. assembly to move the head into the specified track position.
  - > Separate read/write heads are provided for each track, the address bits can electronically select a particular track via a decoder. (Expensive)
- In a sector the track at more distance from center will have more circumference. Thus to balance out the number of bits in 1 track sector the density has to vary. (of bits)

Disk which are permanently attached to the unit assembly & can't be removed are called while removable disks are called 'hard disk' 'floppy disk'

- Magnetic Tapes is a strip of plastic coated with magnetic medium. Bits are recorded as magnetic spots on tape. Read/write heads are mounted one in each track.

Adv: It can be stopped, started, move forward/backward.  
Dis Adv: Can't be started or stopped fast enough.

- > Gaps are left when tape is stopped, at the time of starting again, these gaps provide runup length to get on the right speed again.

Each record on tape has an identification pattern at begin & end.

\* The pattern @ beg^n : tape control identifies the record number.

\* The pattern @ end : It realises beginning of a "gap"

## # Associative Memory or Content addressable memory

Concept  
Problem: The time req. to find an item stored in memory can be reduced if stored data can be identified for access by the contents of the data itself. rather than the address.

Working: When a word is written in associative memory, no address is given. The memory finds the empty location to store the word.

Write: When a word is to be written in an ass. memory, the content of the word / part of the word is specified. The memory locates all the words matching the specified content & marks them for reading.

Disadvantage: Since every memory location will need logic to do write it is thus expensive & complex.

## Hardware:



### > Argument register (A)

Each word in memory is compared in parallel with the content of the argument register.

### > Key register (K) [Provides a mask for choosing]

The part of the word from associative memory which needs to be compared to the memory is set 1 in K & rest is set to 0.

### > Match register (M)

The words that match a corresponding bits of argument register sets the match bit to 1.

Eg:      A      101 111 100  
           K      110 100 000  
 req address:      10X1XXX X X    [X : Dont care]

Good Write

## > One cell of associative memory



## >



say  $C_{ij} = f_{ij}$  &  $x_j = \text{Match of } j^{\text{th}} \text{ bit}$

then  $x_j = \underbrace{A_j f_{ij}}_{\text{either both } 1 \text{ or both } 0.} + \underbrace{A_j' f_{ij'}}_{\text{either both } 1 \text{ or both } 0.}$

$$[M_i = x_1 \cdot x_2 \cdots x_n]$$

Considering Key register

$$(x_j') = x_j + k_j' = \begin{cases} x_j & \text{if } k_j = 1 \\ 1 & \text{if } k_j = 0 \end{cases}$$

$$M_i' = \prod_{j=1}^n (x_j + k_j') \quad \text{Good Write} = \prod_{j=1}^n [A_j f_{ij} + A_j' f_{ij'} + k_j']$$

## ~~CACHE~~ ~~INTERFACe~~

→ Match logic



→ Read Operation :



Good Write

# Cache Memory

The reference to memory at a given moment / intervals of time tend to be combined within a few localized areas in memory. This is "locality of reference"

Thus we tend to place those frequently used data / memory in the fast small memory called cache memory.

**Basic working:** If CPU needs to access a memory, first the cache is examined. If found in the cache it is read from the fast memory. If not, then the main memory is accessed, this block is then transferred to the cache memory.

**Hit ratio:** The ratio of hits (match) / total computer ref. is called hit - ratio.

Transforming data from main-mem to cache : mapping process

# Associative Mapping

$2^5$  bit or [4 words]  $\leftarrow 2^5$  bit or [128 words]  $\rightarrow 128$  bytes  
 $2^2 \times 2^5$  bytes

Any block ~~can~~ go anywhere in any line, a full block goes in a line.

(say)



Initial address:  $\eta$  bits  $\rightarrow$



When in Cache:  $\eta$  bits  $\rightarrow$



DisAdv: Comparisons increased.

Tag to find which block was transferred.

Adv: Hit ratio increased.

### # Direct Mapping

8 options

$L_0$  can only have  $B_0 / B_4 / B_8 \dots$

$L_1$  " " "  $B_1 / B_5 / B_9 \dots$

$L_2$  " " "  $B_2 / B_6 / B_{10} \dots$

$L_3$  " " "  $B_3 / B_7 / B_{12} \dots$

Initial address:  $\eta$  bits:  $\boxed{5 \ 2}$

When in Cache:  $\eta$  bits:  $\boxed{3 \ 2 \ 2}$

Good Write say  $[B_0, B_4, \dots, B_{28}]$

Line no.  $\eta$  bits

## Disadvantages of Direct Mapping:

2 words with same <sup>line no.</sup> tag but with different tag value  
 Can't reside in cache memory at the same time.  
 Then we will have to change the block as the same line has to accessed.

## # Set - Associative Mapping

k-way set associative

→ d-way set associative



Associative inside the set

Number of lines = No of sets

k

You can put block (0, 1)(2, 3)

in either L<sub>2</sub>, L<sub>3</sub>

L<sub>0</sub> or L<sub>1</sub> (S<sub>0</sub>) (S<sub>1</sub>)

Before: 7 bits:



After:



To choose b/w To choose b/w S<sub>0</sub> & S<sub>1</sub>

[B<sub>0</sub>, B<sub>1</sub>, B<sub>4</sub>, B<sub>5</sub>, B<sub>8</sub>, B<sub>9</sub> ...]

Good Write

## > Replacement Algo (only for Associatives)

When cache is full then if there is a fail, then we do replacement to replace the tag-data.

These Algos are: Random replacement @ random

FIFO: first in first out

LRU: Replaces the item least recently used by CPU.

## > Writing to the Cache:

(WRITE  
THROUGH)

Method 1: The cache memory when incur changes → parallelly make changes in the main memory also.

Adv: The main memory is always updated in sync with the cache memory.  
So main-memory data is always valid.

(WRITE  
BACK)

Method 2: The cache loc<sup>n</sup> is updated only during write opr<sup>n</sup>. The loc<sup>n</sup> is the marked by a flag so that when word is removed from cache it is copied to main memory.

Ans As long as the word is in a cache, memory doesn't need to be accessed. So it may be changed several times

## Cache Initialisation

The cache is initialised when power is applied or when main memory is loaded with a complete set of programs from auxiliary memory. It is initialised with some invalid data.

A valid bit indicates if the data is valid or not. When cache is initialised all valid bits are 0, which is set 1 when when first word is loaded to it.

If valid bit = 0 → replace directly as data is invalid.

Unit 11# Peripheral Devices →

Devices that are under the direct control of the computers are said to be online. They are designed to read info in or out of the memory unit. Input - output devices attached to the computer are called peripherals.

ASCII I/O: devices communicate with user & computer using ASCII codes for normal characters.



The control chars. are used for routing data and arranging the printed text into a prescribed format.

There are 3 of control chars. types?

1. format effector: Control layout of printing  
e.g. Backspace, carriage return, etc.
2. information separator: Separate data into pages & page  
= Record Separator (RS), file Separator (FS)

### 3. Communication Control Characters:

Transmission of text b/w "n" remote terminals.  
eg: STX (start of text) & ETX (end of text)

→ ASCII is a 7 bit code. thus the 8th bit is free. we use it for other features  
eg:

Some printers takes MSB = 0 normal  
MSB = 1 to specify Greek  
signals

In communication, 8th bit is used for parity  
of binary-coded characters.

## # Input Output Interface

The CPU can't communicate directly with the I/O due to the major differences:

1. Peripherals are EMech. or EM devices and their manner of operation is different from the op's of the CPU.
2. The rate is v. slow in peripherals than CPU.
3. Data codes in peripheral differ from CPU.
4. Operating modes of peripherals are different & must be controlled so not to distract others.

Good Write

## &gt; Connecting via I/O bus.



- > Each interface decodes the address & control received from the bus, interprets them for the peripheral & provides them signals for controller.
- > Each peripheral has its own controller
- > I/O bus is connected to all the I/O interfaces. To communicate with a particular I/O device the device address is placed on the address line.
- > When interface detects its own address on the line it activates the path b/w bus lines & the devices.
- > I/O Commands: These are commands of an instruction.
- a) Control Command: Issued to activate the peripheral and to inform it what to do.
- b) Status Command: Used to test various status conditions in the interface and peripheral. The errors may be detected & bits in status register is set 0. (Good Write)

c) Output data command: This command makes the system respond by transferring data from bus into one of its register.

d) Input data command: This command makes the system respond by receiving data from register to the data bus.

## # I/O Bus vs Memory Bus

3 ways:

### 1. "Use 2 separate bus"

The computer has independent set of data, address, control buses. This is done in computers with separate I/O with the CPU.

- > Memory communicates with both CPU & I/O using memory bus
- > The I/O communicates with I/O devices with separate I/O bus with its own address, data, control lines.

This provides an independent pathway for info transfer.

## 2. "Isolated I/O"

CPU has distinct I/O instructions.

- When CPU fetches the operation code of an I/O instruction, it puts the address associated with the instn in the common address bus
- At same time I/O read or I/O write control lines are enabled. [Informing that its an I/O oper.]  
↳ Vice-versa for memory instruction.

Adv: It isolates the memory & I/O addresses thus one not affecting the other.

## 3. "Memory-mapped"

Use the same address space for both the memory and the I/O.

CPU don't have distinct I/O instn. Thus CPU can manipulate both I/O data & memory words with same instructions.

Adv: To load / store an instruction the codes are same for either reg. or memory words.

## eg of I/O interface



#

# A synchronous Data Transfer

Synchronous: if registers in the interface share a common clock with the CPU registers.

A-synchronous: if internal timing in each unit is independent from the other in that each uses its own pri. 'clock' for internal registers.

## CONTROLS

↓ Strobe Control

↓ Handshaking

i) originated by source

i) originated by source



to tell that data in the data bus is valid.

= Dig:



Timing: Dig:



Valid:



ii) originated by destn



to tell that Destn is ready to accept data.

ii) originated by destn



Ready :



Bus :



Valid :



## # Time Out:

In Handshaking if one device is faulty, the data transfer won't be completed. This error is detected by the means of time out as it produces an alarm if the data transfer isn't complete.

It is implemented by a means of an internal clock that starts counting when one of the handshake controls is enabled. If the return signal doesn't occur in a given time, it assumes an error.

## → Async - Serial - Transfer

In async transmission, binary info is only transmitted when it is available & the line remains idle when there is no transmission.

for this purpose we use a data transmission technique used in many ~~interface~~ interactive terminals by employing special bits at both ends of character code.

The first start bit = 0

The 8 character bits

& then 1 or 2 stop bits.

e.g. Send (C5)H. C5: 11000101

Good Write



Baud Rate: is defined as the rate at which serial info is transmitted and is equivalent to the data transfer in bits per second.

e.g. 10 characters/sec in a 11-bit format.

→ 11 \* 10 bits/sec or 110 baud.

→ The receiver receives a 11-bit message from the line & forwards the 8-bit chars to the computer.

Integrated chips are available which are specifically designed to provide an interface b/w computers & similar interactive terminals called UART.



|                 | CS | RD | WR | RS | Bus |
|-----------------|----|----|----|----|-----|
| Good Write      | 1  | 0  | 1  | 0  | -   |
| Receiver reg    | 1  | 1  | 0  | X  | X   |
| Status reg      | 1  | 1  | X  | 1  | X   |
| Transmitter reg | 1  | 0  | 1  | 0  | 1   |
| Control reg     | 1  | 0  | 1  | 1  | 2   |

### Transmitter :

The CPU reads the status register & checks the flag to see if the transmitter register is empty. If yes the CPU transfers a character to the transmitter ~~shift~~ reg. & interface clears the flag to make the register full. Then data is transferred (in parallel) to the Shift register & stop bits are appended into the Shift register.

### Receiver :

The receive data input is in "1-state" when line is idle. The receiver control monitors for the line for a 0 signal. Once the Start bit is detected, the incoming bits are taken as character bits then they are shifted to "Shift register". After receiving it checks the parity & stop bits.

errors may occur

→ Parity error: if the number of 1's received is not the correct parity.

Framing error: if right number of stop bits isn't detected.

Overrun error: if CPU doesn't read the character from the receiver register before the next one is available in Shift register.

(Read fifo buffer)

Good Write

## # Modes of Transfer

1. "Programmed I/O"

Each data transfer is initiated by an instruction in the program.

This requires constant monitoring of the peripheral by CPU; once done there is a need to check if next transfer can be made.



here CPU stays busy if there is no interrupt/  
input/output thus using the CPU unsafely.

## 2. "Interrupt Initiated I/O"

Whenever any I/O actions occur an interrupt is sent to the CPU to tell it that the I/O action exists by setting a flag.



## # Software Considerations

## # Priority Interrupted I/O

Priority interrupt is a system that establishes a priority over the various sources to determine which condition is to be serviced first when 2 or more requests arrive simultaneously.

→ Polling

We provide one common branch address for all interrupts. The program that takes care of interrupt begins at the branch address & polls them interrupt sources in sequence. They are tested from priority (high to low), if the control signal of the current interrupt is on it gets the branching, otherwise the next is tested.

Disadvantage:

If there are many interrupts, the time required to poll them can exceed the available time to service I/O devices.

→ Daisy - Chaining Priority

Good Write

- > first when an interrupt is incurred, then an interrupt request is sent to the INT.
- > If the CPU accepts it, this is acknowledged by sending back JNT ACK which goes to PI.

### Device working

- > if any device has  $PI = 1 \wedge$  Interr. request = 1 then VAD is sent to the CPU.  
 ↳ Vector address  $\therefore PO = \overline{INT.PI}$   
 & it sets  $PO = 0$       Grabble =  $INT.PI$
- > if  $PI = 0$ ,  $PO = 0$
- > if  $PI = 1$ , Interr. request = 0 then  $PO = 1$ .  
 → finally set Interr req = 0.
- > One stage of daisy chain arrangement



## → Parallel Priority Interrupts



- > here priority of  $0 > 1 > 2 > 3 > 4$ .
- > Maskreg. is used to mask any specific interrupt on will.
- >  $I_i = [Device_i . Mask_i]$
- > This priority encoder thus finally gets the right interrupt

\* IST: if any interrupt has occurred it will turn 1.

Steps →

- \* IST turns 1
- \* if  $LEN = 0$  { Nothing works } ; if  $LEN = 1$  then Interrupt sent to CPU, which sends back INT Ack.
- \* Thus finally enabling the VAD.