

Cache memory is to help CPU to access data faster. It is part of the program which is running on the CPU.

→ we know that CPU is faster than main memory and the main memory is faster than secondary memory.

But if CPU executes instruction in  $10^{-12}$  sec, but main memory speed is  $10^{-9}$  sec then the speed of CPU falls down to speed of main memory.

So we introduced registers, they are having small size compared to main memory but faster than main memory. But they are very costly so we introduced cache memory which is cheaper than registers and faster than main memory.

CPU is having different levels of memory.

i.e. Registers as level one memory  
Cache as level two memory  
Main memory as level three memory  
Secondary memory as level four memory.

We know the concept of Paging in that process is present

in Secondary memory and a part of it is present in Main memory and if Cache is having some part of the process then CPU can fetch the instructions from the Cache and execute the process.

If the CPU gets what it needs in Cache then it is called Cache hit and if not present then it is called Cache miss.

Cache miss then CPU must fetch from Main memory and service the Cache (replace with newly fetched instructions) and execute them is called the "miss latency".

If the part of the process which CPU is looking for is not present in main memory then it is called the "Page fault".

Like page table which shows what is present in page in cache also "Tag directory" is present which shows what are the elements present in cache.

As we know that the size of the cache is small compared to main memory now the question is which part in the main memory will be stored in the cache.

For this temporal locality says that whatever we need now in the page will be present near to the recently used.

This is needed by CPU but CPU will fetch the entire page into cache instead of that bit because according to temporal locality the data needed in the future will be present near to the bit.

Direct mapping: like in paging the main memory is divided into frames and process is divided into pages and each page can be fit into one frame.



So, like wise the main memory is divided into blocks and cache is divided into lines, and

each block can fit into one line



"word" is the unit that we use here instead of "Byte"

Let main memory size = 64 words, block size = 4 words

and cache size = 16 words

no of blocks =  $\frac{64}{4} = 16$

no of lines =  $\frac{16}{4} = 4$



totally 64 words so  $2^6 = 64$

we can represent each word with 6 bits

Suppose CPU wants word '5'

000101 then 0001[01]

↳ Block offset

first check block number i.e '1' [0001] 01  
↳ Block number

in block '1' check '01' i.e '5' so this is how CPU

gets what it wants if CPU wants word '10'

1st of comp result (10) i.e. 000110 [10]  
↳ block '2' element '3'

now we will see how the blocks are stored in

the cache memory

We will fill the cache in round robin manner

|   |   |   |    |    |
|---|---|---|----|----|
| 0 | 0 | 4 | 8  | 12 |
| 1 | 1 | 5 | 9  | 13 |
| 2 | 2 | 6 | 10 | 14 |
| 3 | 3 | 7 | 11 | 15 |

|   |   |   |   |   |   |   |   |   |    |    |    |    |    |    |
|---|---|---|---|---|---|---|---|---|----|----|----|----|----|----|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
|---|---|---|---|---|---|---|---|---|----|----|----|----|----|----|

i.e. the blocks 0, 4, 8, 12 should always goto line '0'

the blocks 1, 5, 9, 13 should always goto line '1'

the blocks 2, 6, 10, 14 should always goto line '2'

the blocks 3, 7, 11, 15 should always goto line '3'



$$P = \frac{31}{4} = 20.75 \text{ go to } 0.75$$

Ex:

0 0 0 1 0 0

↳

Block number

↳ Block offset

0 0 0 1

↳

line no 2 block no 11010

Tag

|    |   |
|----|---|
| 00 | 0 |
| 01 | 1 |
| 10 | 2 |
| 11 | 3 |

the msb two bits represents the tag number

if CPU wants 01 01 01 then first it checks the cache memory if it is present or not

in cache it checks first line no i.e. (01) and goes to it and checks tag no i.e. (01) then goes to the

block offset this is how the CPU checks in the Cache.

10 01 11

cpu goes to line (01) and checks the tag

it is not (10) so it thinks that required block number is not present in cache memory

so cache miss

Direct mapping Problem:

Here conversions are very important

if byte addressable then

128 KB

=  $2^{17}$  bytes

so physical address = 17 bits



if word addressable,  $1w = 4$  bytes

128 KB  $\approx$  32 kW

=  $2^{15}$  w

so physical address = 15 bits

So based upon the conversion the no of bits representation

changes.

| Q) | mm size | Cache size | Blocksize | tag bits | tag directory size | ? |
|----|---------|------------|-----------|----------|--------------------|---|
| 1) | 128 KB  | 16 KB      | 256 B     | ?        | ?                  | ? |
| 2) | 32 GB   | 32 KB      | 1 KB      | ?        | ?                  | ? |
| 3) | ?       | 512 KB     | 1 KB      | 7        | ?                  | ? |
| 4) | 16 GB   | ?          | 4 KB      | 10       | ?                  | ? |
| 5) | 64 MB   | ?          | ?         | 10       | ?                  | ? |
| 6) | ?       | 512 KB     | ?         | 7        | ?                  | ? |

Assume that memory is byte addressable

Soln: ① Given mm = 128 KB =  $2^{17}$  B

Cache = 16 KB =  $2^{14}$  B

Block size = 256 B =  $2^8$  B

We know



line (or) Block offset =  $2^8$  B = 8 bits

tag bits = 3 bits

tag directory size =  $3 \times 2^6$

② Given mm =  $2^{35}$  bytes

Cache =  $2^{15}$  bytes



no of lines =  $\frac{2^{15}}{2^{10}} = 2^5 = 32$  bits

so tag bits = 20 bits, tag directory =  $2^5 \times 20$

③ Let mm =  $x$  bits

tag size = 7

Cache =  $2^{19}$  B

no of lines =  $\frac{2^{19}}{2^{10}} = 2^9$



tag directory =  $7 \times 2^9$

④ Given mm =  $2^{34}$  bytes

Block size =  $2^{12}$  bytes

Cache =  $x$

tag bits = 10      34



$x = 34 - 22 = 12$  bits

so Cache size =  $2^{12} \cdot 2^{12} \cdot 2^{24}$  bytes

⑤ Given mm =  $2^{26}$  Bytes

tag bits = 10



$x + y = 16$

So cache size be  $2^{16}$  bytes

Block size, tag directory size cannot be guessed

e) Same blocksize, tag directory size cannot be guessed.

Direct mapping Hardware implementation:

the time taken to check if required data is present in

the cache is known as hit latency

the time taken to service the cache with required data  
is known as miss latency.

Now we will see how to find that required data  
is present in the cache or not

Suppose physical address is generated by CPU

and no of tag bits = 1

now a multiplexer is connected to the tag bits

and selection lines are given from line number



If the line no is given as selection lines then the

multiplexer will process the output with is a tag number

now the tag number of the cache is compared with

tag no generated by CPU

then if both are same then it is a hit

and if not there is a miss

So the time taken = latency of mux + latency of Comparator  
 if the tag bits are 'two' then we need two mux's  
 if there are  $k$  tag bits then we need  
 $k$  multiplexers and  
 $1 k$  bit comparator  
 even if  $k$  mux's are present all will work in parallel so the latency of all the mux's will be taken as single mux latency.

a) Given word length of word = 32 bits  
 main memory = 1 GB =  $2^{30}$  Bytes  
 Cache memory = 1 MB =  $2^{20}$  Bytes  
 Comparator latency = 10 Kns  
 then comparator delay = ?  
 we know that no of tag bits =  $\frac{m.m}{cache} = \frac{2^{30}}{2^{20}} = 2^{10}$  bits

$= 10$  bits  
 so we need 1 '10' bit magnitude Comparator  
 $\Rightarrow 10 \times (10)$  nsec  
 $= 100$  nsec

if we want hit latency it is same as the comparator delay so hit latency = 100 nsec  
 if multiplexer delay is ignored then we also ignore it when we solve the problems.

\*\* in direct mapping only one comparator is used.

\* Disadvantage of direct mapping:

Let  $n$  be the line no of the cache

Also there are 4 lines present in the cache if m is the block number (or) present in the main memory then the line number at which the block number will be present is

$$n = m \bmod 4 \text{ (or)} n = m / 4$$

Let 5, 6, 8, 9, 12, 15, 20 are the block numbers then in Cache

|   |    |         |
|---|----|---------|
| 0 | 5  | 8 12 20 |
| 1 | 8  | 9       |
| 2 | 6  |         |
| 3 | 15 |         |

even though line '3' is free for a long time the block numbers 8, 12 did not go there because at starting we discussed that 0, 4, 8, 12 should goto only line '0' so this is the disadvantage of direct mapping.

This is called Conflict miss

Introduction to associative mapping:

in this associative mapping we cleared the

disadvantage of conflict miss

i.e. any block can be placed in any line

which is available in the cache so due to this the

Conflict miss donot occur

but tag bits need to be increased because in case of direct mapping only few are able to fill in one line so we need less bits to represent what is present

but now any block can place anywhere so more no of tag bits are required

in direct mapping;



in associative mapping:  Block offset  
tag bits.

Due to the increase in tag bits the load on the comparators increases so latency will increase.

for this we use more comparators.



with increase in no of lines the no of comparators also increase

Q) Given main memory = 32 GB

Block size = 32 KB

Tag bits =  $K$

so  $\text{Delay in Comparator} = 10 \times K \text{ ns}$

Delay in orgate =  $10 \text{ ns}$

then latency = ? physical address

we know



35 bits

so total no of tags = 20 bits

Delay in Comparator =  $10 \times 20 = 200 \text{ ns}$

total latency =  $200 + 10 = 210 \text{ ns}$

Problems on associative mapping!

- | Q) | Main memory | Cache  | Block size | Tag size | Tag directory size |
|----|-------------|--------|------------|----------|--------------------|
| ①  | 128 KB      | 16 KB  | 256 B      | 9 bits   | ?                  |
| ②  | 32 GB       | 32 KB  | 1 KB       | ?        | ?                  |
| ③  | —           | 512 KB | 1 KB       | 17       | —                  |
| ④  | 16 GB       | —      | 4 KB       | —        | —                  |
| ⑤  | 64 MB       | —      | —          | 10       | —                  |
| ⑥  | —           | 512 KB | —          | 7        | —                  |

① Given  $mm = 128 \text{ KB} = 2^{17} \text{ B}$   
 $\text{Cache} = 16 \text{ kB} = 2^{14} \text{ B}$



$$\text{no of lines} = \frac{2^{14}}{2^8} = 2^6$$

So tag bits = 9

Tag directory size =  $9 \times 64$

② Given  $mm = 2^{35} \text{ B}$

$$\text{Cache} = 2^{15} \text{ B}$$

So Tag bits =  $2^{20} = 20$  bits

$$\text{no of lines} = \frac{2^{15}}{2^{10}} = 2^5$$



Tag directory size =  $25 \times 2^5$



③ Given Cache =  $512 \text{ KB} = 2^{19} \text{ B}$

$$\text{no of lines} = \frac{2^{19}}{2^{10}} = 2^9$$

so main memory =  $2^{27} = 128 \text{ MB}$

$$\text{tag directory} = 17 \times 2^9$$

④ Given main memory =  $2^{34} \text{ B}$



so no of bits in tag = 22

Cache size, tag directory size cannot be guessed.

⑤ Cache size, tag directory size cannot be guessed

Set Associative mapping:

In set associative mapping we divide the

no of lines in the Cache into sets

If set size = 2 lines then it is called two way

Set associative

Let Main memory = 64 B

Cache size = 32 B

Block size = 4 bytes  
Set size = 2 lines

Total number of lines =  $\frac{C \cdot S}{B \cdot S}$  if  $\frac{32}{4} = 8$

Sets =  $\frac{\text{no of lines}}{\text{set size}} = \frac{8}{2} = 4$



in Set Associative mapping



If Cache is k-way Set Associative then we need k-comparators.

The main intention of introducing the set associative mapping is to decrease the number of comparators.

Problems on Set Associative mapping

|   | Main memory | Cache size | Block size | Tag bits | Tag directory size | Set associative |
|---|-------------|------------|------------|----------|--------------------|-----------------|
| ① | 128 KB      | 16 KB      | 256 B      | —        | —                  | 2 way           |
| ② | 32 GB       | 32 KB      | 1 KB       | —        | —                  | 4 way           |
| ③ | —           | 512 KB     | 1 KB       | 7        | —                  | 8 way           |
| ④ | 16 GB       | —          | 4 KB       | 10       | —                  | 4 way           |
| ⑤ | 64 MB       | —          | —          | 10       | —                  | 4 way           |
| ⑥ | —           | 512 KB     | —          | 7        | —                  | 8 way           |

① Given main memory =  $128\text{ KB} = 2^{17}\text{ B}$

Cache size =  $16\text{ KB} = 2^{14}\text{ B}$

$$\text{no of lines} = \frac{2^{14}}{2^8} = 2^6$$

it is 2 way set associative

$$\text{so no of sets} = \frac{2^6}{2} = 2^5$$



$$\text{Tag directory size} = 4 \times 2^6 \text{ bits}$$

② Given main memory =  $2^{35}\text{ B}$

Cache size =  $2^{15}\text{ B}$

$$\text{no of lines} = \frac{2^{35}}{2^{10}} = 2^5$$

$$\text{no of sets} = \frac{2^5}{4} = 2^3$$



$$\text{so tag directory size} = 2^5 \times 2^2$$

③ Given cache memory =  $2^{19}\text{ B}$

$$\text{no of lines} = \frac{2^{19}}{2^{10}} = 2^9$$

it is 8 way set associative

$$\text{so no of sets} = \frac{2^9}{8} = 2^6$$



$$\text{Tag directory size} = 2^9 \times 7$$

④ Given main memory =  $16\text{ GB} = 2^{34}\text{ B}$



$$\text{so no of sets} = 12$$

$$\text{no of lines} = 12 \times 4 = 48$$

Cache size = no of lines  $\times$  block size

$$= 48 \times 2^{12} = 2^{26}$$

$$\text{Tag directory size} = 2^{14} \times 10$$

⑤ we know that set Associative mapping

$$\text{Cache size} = \text{no of sets} \times \frac{\text{no of lines}}{\text{set}} \times \text{line size}$$

so

Cache size, Tag directory size cannot be guessed.

but

|    |   |   |
|----|---|---|
| 10 | x | y |
|----|---|---|

$$\text{so } 2^6 = 10 + x + y$$

$$\text{so } x + y = 16$$

$$\text{cache size} = 2^x \times 4 \times 2^y$$

$$= 2^{x+y+2}$$

$$= 2^{16+2} = 2^{18} \text{ Bytes}$$

$$\text{Cache size} = 256 \text{ KB}$$

⑥ we know

$$\text{Cache size} = \text{Sets} \times \frac{\text{no of lines}}{\text{set}} \times \text{line size}$$

$$= 2^x \times 8 \times 2^y$$

$$2^{19} = 2^{x+y+3}$$

$$\Rightarrow x + y = 16$$

so

|   |   |   |
|---|---|---|
| 7 | x | y |
|---|---|---|

$$\text{Physical address size} = 7 + 16 = 23 \text{ bits} = 2^{23}$$

Comparing all the mappings

Given main memory = 4 GB

Cache memory = 4 MB

Block size = 1 KB

Direct:

|    |    |    |
|----|----|----|
| 10 | 12 | 10 |
|----|----|----|

↓  
tag bits  
→ line no

associative:

22 10

Tag

Block offset

4 way set associative:

Block offset = 10

$$\text{no of sets} = \frac{2^{12}}{4} = 2^{10}$$

|    |    |    |
|----|----|----|
| 12 | 10 | 10 |
|----|----|----|

So

for k-way set associative

$k=1$  it is direct mapping

$k=N$  it is associative mapping.

Tag Comparators Tag directory size

|        |    |                  |
|--------|----|------------------|
| Direct | 10 | $10 \times 2^12$ |
|--------|----|------------------|

|             |    |          |
|-------------|----|----------|
| Associative | 22 | $2^{12}$ |
|-------------|----|----------|

|                 |    |   |                    |
|-----------------|----|---|--------------------|
| set Associative | 12 | 4 | $12 \times 2^{12}$ |
|-----------------|----|---|--------------------|

Ques 99:

→ main memory has " $2^m$ " blocks and cache has " $2^n$ "

blocks and "2 way" set associative is used then  
 $k$ th block of main memory maps to one set

Soln: we know that if it is a 2 way set mapping then

there will be ' $c$ ' block sets so  $c$  will be present so

$k$ th element will go in " $k \bmod c$ " set

Ques 96:

→ more than one word are put in one block to exploit locality in a program

Soln: if we want element into cache then we would load the surrounding bits also this is called locality of reference.

it is of two types, 1) spatial locality and 2) temporal

locality.

spatial locality says that if we need some data

now then in future we will need the data closest to what we picked

Temporal locality says that the data that we need now will be again needed in future also

so spatial locality will be used.

Gate 95:

→ Cache size = 4K words

block size = 64 words

set size = 4 blocks

the no of bits present in "set" and "word" field of main memory address are

$$\text{soln: } \text{no of lines} = \frac{2^{12}}{2^6} = 2^6$$

$$\text{no of sets} = \frac{2^6}{4} = 2^4$$

so

|   |   |
|---|---|
| 4 | 6 |
|---|---|

↑ word field  
↓ set field.

Gate 07:

→ 4 way set associative

Cache lines = 128

line size = 64 words

Physical address = 20

tag, set, word fields are

$$\text{soln: } \text{no of lines} = 128$$

$$\text{no of sets} = \frac{128}{4} = 2^5$$

line size =  $2^6$

|   |   |   |
|---|---|---|
| 9 | 5 | 6 |
|---|---|---|

20

Gate 13

$\rightarrow$  K way set associative cache contains 'v' sets  
 the main memory block numbered 'j' must be mapped  
 to any of the cache lines from

a)  $(j \bmod v) * k$  to  $(j \bmod v) * k + (k-1)$

b)  $(j \bmod v)$  to  $(j \bmod v) + (k-1)$

c)  $(j \bmod v)$  to  $(j \bmod v) + (v-1)$

d)  $(j \bmod k) * v$  to  $(j \bmod k) * v + (v-1)$

Soln: let  $k=2$

$v=2$

we know

set number =  $j \bmod v$



if we want to goto line 'k' then

$$(j \bmod v) * k$$

we know no of lines =  $k$

so  $(j \bmod v) * k$  to  $(j \bmod v) * k + (k-1)$

Gate 14

$\rightarrow$  An access sequence of cache block address is of length 'N' and containing 'n' unique block addresses  
 the number of unique addresses between two consecutive addresses access to the same block

address is bounded above by 'k' what is the miss ratio if the access sequence is passed through a

Cache of associativity  $A \geq k$ , exercising least recently used replacement policy

- a)  $n/N$  b)  $\frac{1}{N}$  c)  $\frac{1}{A}$  d)  $\frac{k}{n}$



Soln: Let the access sequence be

abcdefc which have 'n' unique block numbers  
 now, no of block numbers which are unique between  
 two consecutive repetitive numbers be 'k'

a b c d e f c a

$$k = 3 = 4 + (\text{v.bom}) \text{ of } 2 + (\text{v.bom})$$

$$N = 8 = 6 + 1 + (\text{v.bom}) \text{ of } (\text{v.bom})$$

$$n = 6$$

Given Associativity  $\geq k = 4 + (\text{v.bom}) \text{ of } (\text{v.bom})$

so



$$\text{So miss rate} = \frac{6}{8} = \frac{n}{N}$$

Ques 90:

→ Blocks in cache = 128

4 way set associative

main memory contains  $2^{14}$  blocks

Block size is 256 eight bit words

- (i) How many bits are required for addressing MM
- (ii) How many bits are needed to represent the TAG set and word fields.

Soln: no of lines = 128

$$\text{no of sets} = \frac{128}{4} = 2^5$$

$$\text{Block size} = 2^8$$

main memory size = no of blocks  $\times$  block size

$$= 2^{14} \times 2^8 = 2^{22}$$

|   |   |   |
|---|---|---|
| 9 | 5 | 8 |
|---|---|---|

Ques 05:

→ Direct mapped Cache

Cache size = 32 kB

Block size = 32 B

P.A = 32 bits

the no of bits needed for "cache indexing" and tag bits are respectively

Soln:

$$\text{no of lines} = \frac{32 \text{ kB}}{32 \text{ B}} = 2^{10}$$

$$\text{Block size} = 2^5 \quad 32$$



Ques 06:

→ Consider two cache organisations the first one is 32 kB 2 way set associative with 32 byte block size. the second one is of same size but directly mapped. the size of an address is 32 bits in both cases. A 2 to 1 multiplexer has latency of 0.6 ns while a k bit comparator has latency of  $(k/10)$  nsec. the hit latency of the set associative organisation is  $h_1$ , while that of direct mapped is  $h_2$ .

Soln: Given in direct mapping

$$\text{no of lines} = \frac{32 \text{ kB}}{32} = 2^{10}$$

$$P.A = 32$$

$$\text{Block size} = 2^5 \quad 32$$



we know each multiplexer can read only

1 bit so for 10 bits

$2^{10} \times 1$  mux is needed

but latency for  $2 \times 1$  is given not for  $2^{10} \times 1$

so we ignore it

now in direct mapping latency is only due to

K bit magnitude Comparator

so K depends on no of tag bits

$$\text{so } K = 17$$

so magnitude Comparator latency =  $\frac{K}{10} \approx 1.7 \text{nsec}$

so in direct mapping latency =  $1.7 \text{nsec}$

in set Associative mapping



so given  $T = 17$  and  $S = 9$  (from  $2^9 = 512$ )

$$S = \frac{2^{10}}{2} = 2^9$$

$$\text{so } S = 9$$

$$T = 18$$

totally  $2^9 \times 1$  mux is required but it is not

given so it is ignored

2 K bit magnitude Comparators are required

but they are connected in parallel so one comparato

time is enough

$$\text{so magnitude Comparator latency} = \frac{K}{10} = \frac{18}{10} = 1.8 \text{nsec}$$

now OR gate is implemented using 2x1 mux so

$$\text{latency} = 1.8 + 0.6 = 2.4 \text{ ns}$$

Gate 11

→ Direct mapping

cache size = 8 KB

block size = 32 bytes

physical Address = 32 bits

the Cache Controller maintains tag information for each cache block comprising of the following

1 valid bit, 1 modified bit and as many bits needed to identify the memory block mapped in the cache what is the total size of the memory needed at the

Cache Controller to store meta-data (tags) for the cache

- a) 4864 bits b) 6144 bits c) 6656 bits d) 5376 bits

Soln: no of lines =  $\frac{2^{13}}{2^5} = 2^8$   
So  $\overbrace{19 \ 8 \ 5}^{32}$

19 + 1 valid bit + 1 modified bit

$$= 21 \text{ bits}$$

$$\text{tag directory size} = 21 \times 2^8 = 5376 \text{ bits}$$

Gate 12

→ Cache Size = 256 KB

4 way set associative

Block size = 32 B, P.A = 32 bits

Each Cache tag directory entry contains in addition

to address tag 2 valid bits, 1 modified bit and replacement

bit

Q) the no of bits in the tag field of an address is

- a) 11 b) 14 c) 16 d) 27

Q) the size of the cache tag directory is

a) 160 k bits b) 136 k bits c) 40 k bits d) 32 k bits

Soln:

$$\text{Given no of lines} = \frac{256 \text{ kB}}{32 \text{ B}} = \frac{2^{18}}{2^5} = 2^{13}$$

$$\text{no of sets} = \frac{2^{13}}{32} = 2^4$$

|    |    |   |
|----|----|---|
| 16 | 11 | 5 |
|----|----|---|

so tag field has 16 bits

$$\begin{aligned}\text{cache tag directory} &= (16 + 2 + 1 + 1) \text{ no of lines} \\ &= 20 \times 2^{13} \\ &= 160 \text{ K Bits}\end{aligned}$$

Ques 14

→ In designing a computer cache system, the cache block (or cache line) size is an important parameter which of the following is correct.

- A smaller block size implies better spatial locality
- A smaller block size implies a smaller cache tag and hence lower cache tag overhead
- A smaller block size implies a larger cache tag and hence lower cache hit time
- A smaller block size incurs a lower cache miss penalty

Soln: According to spatial locality taking more data into cache helps us to reduce the miss latency but if block size is low then more data cannot be in the cache so it is not true.

So option a is correct.

option b:

direct mapping:



if block offset decreases

$$\text{no of lines} = \frac{\text{cache size}}{\text{Block offset}} = \text{increases}$$

which don't effect tag size

in set associative mapping:



if block offset decreases no of sets increases

which don't effect tag size

in associative mapping:



there is no tags present here separately

so option b is false.

option c:

lower blocksize will increase cache hit time

so it is false

option d:

lower block size means we can replace the miss lines easily into cache so lower miss penalty

so it is true.

Gate 14

→ if the associativity of a processor cache is doubled while keeping the capacity and block size unchanged which one of the following is guaranteed not to be effected.

- width of a tag comparator
- width of set index decoder
- width of way selection multiplexer
- width of processor to main memory data bus

Soln:



as the B0's

↓ is double the associativity  
so '1' is constant

as the half the set size

Tag must increase

So as tag size increase width of tag comparator increase

as no of sets decreases the width of set index decoder decrease

as we know in set associative mapping no of multiplexers used is  $(K \times 1)$

since K is doubled

no of multiplexers ( $2K \times 1$ )

in option 'd' as the word length donot change the

the width of processor to main memory databus will not change.

Gate 14

→ 4 way set Associativity

Cache size = 16 KB

Block size = 8 words

P.A.S = 4 GB

Tag bits

Soln: if any information is given in words then convert all the other information into words

Cache size = 4 KB

Block size = 8 w

P.A.S = 1 G w

So no of lines =  $\frac{2^{12}}{2^3} = 2^9$  Lines no of sets =  $\frac{2^9}{4} = 2^7$

|    |    |   |   |
|----|----|---|---|
| 30 | 20 | 7 | 3 |
|----|----|---|---|

## Memory interfacing

### Introduction:

it comes under the computer organisation, the main difference between organisation and architecture is in organisation we study of how different parts are connected in computer whereas in architecture we study the effective use of the connected parts.

Generally CPU speed is calculated is "MIPS" (Million instructions per second)

But we cannot give input to CPU at that speed

so we better use some high speed memory



the speed of the secondary memory is very low

so in terms of Speed

Cache > main memory > secondary memory

(volatile) (Volatile) (non volatile)  
very costly cheap  
Costly

when we want to load program into CPU then

we transfer it from secondary memory to main memory and main memory to CPU (or) main memory to Cache and Cache to CPU

we don't load program directly within the

secondary memory because the speed of the secondary memory is very low.



- Cache levels, main memory have random access i.e. we can go to a bit anywhere anytime in cache, main memory
- In case of magnetic disks we got semi random access i.e. upto some part we get random access and after that we get sequential access.
- Magnetic tapes got sequential access.



Access time:  
for registers access time is less, which means they can be accessed very fast compared to cache, main memory etc

frequency:  
Generally CPU uses registers, cache more often than the others because they are fast

size:  
Registers, cache are not having much sizes because they are very cost to implement

Cost:  
As we know registers are having very high cost.

the purpose of memory hierarchy is to bridge the speed mismatch between fastest processor to slow memory at reasonable cost.

2-level memory:

information in  $i^{\text{th}}$  is subset of  $(i+1)^{\text{th}}$  level



all the information present in computer is present in secondary memory and some part of it is present in main memory  
some part of information in main memory is present in cache

Some part of information present in cache is present in CPU

→ if processor refers to  $i^{\text{th}}$  level memory is found then "Hit" otherwise "miss(or) fault"

i.e. if processor is looking for data in cache then if it found then Hit else fail (or) miss

\* if CPU searches for 100 times and if  $x$  times we found data then

$$\text{Hit rate} = \frac{x}{100}$$

$$\text{miss rate} = \frac{100-x}{100} = 1 - \frac{x}{100}$$

→ there are two ways in which the processor is connected to various levels of memory

Case - 1:



Also access time of  $L_1$  & access time of  $L_2$

Let  $T_1$  = access time of  $L_1$

$$S_1 = \text{size of } L_1 (T_1 + T_2) + H(H-1) + T_2 H = \text{per byte}$$

$C_1$  = cost per bit of  $L_1$  (or) cost per byte of  $L_1$

$H_1$  = Hit rate of  $L_1$

$$\text{now } T_{\text{avg}} = H_1 T_1 + (1 - H_1) T_2$$

ALSO

$$C_{avg} = \frac{C_1 S_1 + C_2 S_2}{S_1 + S_2}$$

Case-2:



P if fail in L1

$$So T_{avg} = H_1 T_1 + (1-H_1) (T_1 + T_2)$$

Ls if we found in L1

$$C_{avg} = \frac{C_1 S_1 + C_2 S_2}{S_1 + S_2}$$

if in the question if type of connection is not mentioned i.e case 1 (or) case 2 then by default go take case-2

3-level memory:

Case-1:



$$T_1 < T_2 < T_3$$

addr phab (mem)

$$\frac{x}{601} = \text{start fill}$$

$$so T_{avg} = H_1 T_1 + (1-H_1) H_2 T_2 + (1-H_1) (1-H_2) T_3$$

$$C_{avg} = \frac{C_1 S_1 + C_2 S_2 + C_3 S_3}{S_1 + S_2 + S_3}$$

case-2:



$$T_{avg} = H_1 T_1 + (1-H_1) H_2 (T_1 + T_2) + (1-H_1) (1-H_2) (T_1 + T_2 + T_3)$$

$$C_{avg} = \frac{C_1 S_1 + C_2 S_2 + C_3 S_3}{S_1 + S_2 + S_3}$$

→ the average memory access time for a machine with a cache hit rate of 80% where the cache access time is 5 nsec and main memory access time is 100 ns is

Soln: Given  $H_c = 0.8$   
 $T_c = 5 \text{ nsec}$   
 $T_m = 100 \text{ nsec}$   
but case(1) (or) case(2) is not given so we do both the cases

Case-1:



$$T_{avg} = 0.8(5) + 0.2(100)$$

$$= 24 \text{ nsec}$$

case-2:



$$T_{avg} = 0.8(5) + 0.2(100+5)$$

$$= 25 \text{ nsec}$$

Cache replacement Algorithms:

→ Replacement policy is required for Associative mapping and set associative mapping but not for direct mapping

→ Replacement policies are aimed to minimize miss penalty for future references

Various replacement policies are:

Random:

no specific criteria to replace the block.

FIFO:

the block that entered first is the candidate for replacement

LRU:

the block which is not referred for the longest time is the candidate for replacement.

LRU:

the block with fewer references is replaced.

→ Consider a direct mapped cache with 8 cache blocks (0-7) if the memory block requests are in the order 3, 5, 2, 8, 0, 6, 3, 9, 16, 20, 17, 25, 18, 24, 2, 63, 5, 82, 17, 24 which one of the memory blocks are not present in cache at the end of the sequence

- a) 3 b) 18 c) 20 d) 30

Soln:  $i = j \bmod 8$

where  $j$  is block no in main memory

$i$  is line no in cache

|   |    |    |    |
|---|----|----|----|
| 0 | 8  | 9  | 24 |
| 1 | X  | 17 | 25 |
| 2 | X  | 18 | 2  |
| 3 | 3  |    |    |
| 4 | 20 |    |    |
| 5 | 5  |    |    |
| 6 | 6  |    | 30 |
| 7 | 63 |    |    |

so 18 is not present.

→ Consider a 4 way set associative mapping with 16 cache blocks the memory block requests are in the order (0, 255, 1, 4, 3, 8, 133, 159, 216, 219, 63, 8, 48, 32, 73, 92, 155) which one of the following memory block will not be in cache if LRU is used

- a) 3, b) 8, c) 129 d) 216

Soln: Given 4 way set associative  
so each set has 4 lines

|   |                           |
|---|---------------------------|
| 0 | 0, 1, 8, 216              |
| 1 | 1, 133, 73                |
| 2 |                           |
| 3 | 255, 3, 159, 219, 63, 155 |

in set associative mapping the blocks will go into sets ranging from  $(0 \oplus (k-1))$

now the block number will go to set no

$$i = j \bmod k$$

k-set associative

$j \rightarrow$  is main memory block number

$i$  = set no which  $j$  will go

$$\text{So } i = j \bmod 4$$

Given LRU replacement policy is given

in set 0, 3 4 lines are filled so now we must replace them using 'LRU'

so 255 is replaced for 63

8 is replaced because 8 is already present in the cache and since we are using 'LRU' if we leave like it in future we may think that 216 is least recently used but '8' is recently used

so we replace '8' with '8' for clarity

so only '8' is present in the cache.

→ Consider a small 2-way set associative mapping with a total of 4 blocks, for choosing the block to be replaced use LRU scheme the number of cache misses for the following block addresses 8, 12, 0, 12, 8 is —

Soln:

Given

|   |       |          |
|---|-------|----------|
| 0 | 8, 12 | 0, 12, 8 |
| 1 |       |          |

8 - miss

12 - miss

0 - miss

12 - hit

$$\text{So hit rate} = \frac{1}{5} \times 100 = 20\% \quad 8 - \text{miss}$$

$$\text{miss rate} = \frac{4}{5} \times 100 = 80\%$$

→ Consider a 2 way set associative mapping consisting of  $2^c$  memory blocks and  $2^c$  cache blocks, the cache location for the memory block 'k' is

- a)  $k \bmod 2^c$
- b)  $k \bmod 2^c$
- c)  $k \bmod c$
- d)  $2^c \bmod k$

Soln: Cache Size =  $2^c$  blocks      Set size =  $2^c$  blocks  
main memory =  $2^c$  blocks

$$\text{no of sets} = \frac{\text{Cache size}}{\text{Set size}} = \frac{2^c}{2} = c$$

so  $k \bmod c$

- Consider the cache has 4 blocks for the memory references 5, 12, 13, 17, 4, 12, 13, 17, 2, 13, 19, 13, 43, 61.
- 19 what is the hit ratio of following cache replacement policy.

- 1) FIFO 2) LRU 3) Direct mapping 4) 2 way set associative

Soln:

FIFO

|    |    |
|----|----|
| 8  | 43 |
| 42 | 61 |
| 13 | 19 |
| 17 | 13 |

$$\text{Hit ratio} = \frac{5}{15} \times 100 = 33.33$$

LRU

|    |    |
|----|----|
| 8  | 42 |
| 42 | 43 |
| 13 | 17 |
| 17 | 61 |

(or)

8, 42, 13, 17, 43, 12, 13, 17, 2, 13, 19, 13, 43,

12, 13, 61, 19

12-m

13-m

17-m

$$\text{Hit ratio} = \frac{6}{15} \times 100 = 40\%$$

Direct mapping:

|    |    |
|----|----|
| 12 | 42 |
| 8  | 17 |
| 2  | 13 |
| 14 | 43 |

$$= \text{star first}$$

$$= \text{star second}$$

$$= \text{star third}$$

$$= \text{star fourth}$$

$$= \text{star fifth}$$

$$= \text{star sixth}$$

$$= \text{star seventh}$$

$$= \text{star eighth}$$

$$= \text{star ninth}$$

$$= \text{star tenth}$$

$$= \text{star eleventh}$$

$$= \text{star twelfth}$$

$$= \text{star thirteenth}$$

$$= \text{star fourteenth}$$

2 way set associative (LRU)

|                                   |
|-----------------------------------|
| 8, 12, 13                         |
| 8, 15, 17, 15, 19, 15, 13, 61, 19 |

|                                   |
|-----------------------------------|
| 8, 12, 13                         |
| 8, 15, 17, 15, 19, 15, 13, 61, 19 |

$$\text{So hit ratio} = \frac{5}{15} \times 100$$

→ A hierarchical memory system has the following specifications  
 20 MB main storage with access time of 350 ns and word size is 4B, page size is 8 words.  
 what will be the hit ratio if the page address trace of a program has the pattern 0, 1, 2, 3, 0, 1, 2, 4 and following LRU page replacement technique.

$$\text{word size} = 4B$$

$$\text{Page size} = 8W$$

now we will convert words to bytes

$$\text{so page size} = 8 \times 4B = 32B$$

$$\text{no of Cache pages} = \frac{\text{Cache size}}{\text{Page size}} = \frac{256}{32} = 8$$

LRU is used

0, 1, 2, 3, 0, 1, 2, 4

$$\text{so hit ratio} = \frac{3}{8} \times 100$$

→ Consider an array has 100 elements and each element occupies 4 words. A 32 bit word cache is used and divided into a block of 8 words then what is the hit ratio?

$$A[i][j] = A[i][j] + 10;$$

Soln: we can store the array in two orders

Row major order, column major order

[00, 01, 02, ...]

[00, 10, 20, ...]

Given



In cache, each block can hold two elements

if we call 00 then

if we call 00 then along with 00, 01 will also be called

with 00, 01 will also be called into cache as each block can store two elements

read 00 read 01 so read 00 - miss write 00 write 01 write 00 - hit

read 01 - hit write 01 - hit

$$\text{So in row major order hit rate} = \frac{3}{4} \times 100 = \frac{3}{4} \times 100$$

so in case of column major order

$m \quad H$   
 00 00  
 read write

$m \quad H$   
 01 01  
 read write

so hit rate =  $\frac{2}{4} \times 100$

→ Consider an array  $A[100]$  and each element occupies 4 words, A 32 word cache is used and divided into 8 word blocks

what is the hit ratio for the statement.

for ( $i=0$ ,  $i<100$ ;  $i++$ )

$A[i] = A[i] + 10$

Soln: Given

Cache size = 32 W

Block size = 8 W

Blocks =  $\frac{32}{8} = 4$



so hit ratio =  $\frac{3}{4} \times 100$

Cache Coherence problem: multiple copies of same data can exist in different caches simultaneously and if processors are allowed to update their own copies freely an inconsistency can result

View of memory



if 3 processors  $P_1, P_2, P_3$  are having cache memories

$C_1, C_2, C_3$  then if all the processors having command

main memory then if all of them want to operate

on same variable 'A'

Let  $A = 5$

$P_1$  wants to increment A so it loads A into cache and increment it to '6'

$P_2$  wants to decrement 'A' but before  $P_1$  writes the result of 'A' in main memory  $P_2$  loads A into its cache and decrement it to '4'

so Actually  $A = 5 \rightarrow 6 \rightarrow 5$

But  $A = 5 \rightarrow 6$   
 $P_1$   
 $= 5 \rightarrow 4$   
 $P_2$

So inconsistency occurs

this is called cache coherence problem.

there are four methods to avoid cache coherence

problem

1) write update - write through

2) write update - write back

3) write invalidate - write through

4) write invalidate - write back.



Let  $A = 5$

if  $P_1$  takes A into its cache and decrement it and immediately update it in main memory then it is called write through

if  $P_1$  takes  $A = 5$  and decrement it and the new value is kept in Cache and if it wants to clear the cache then it update the value in main memory then

it is called write back

if  $P_1, P_2, P_3$  are accessing 'A'

$P_1$  changed  $A = 5$  to '4' and then it immediately updated the value of 'A' in  $P_2, P_3$  then it is called write update.

if  $P_1, P_2, P_3$  are accessing A

$P_1$  changed  $A = 5$  to '4' and then it warned  $P_2, P_3$  to not to use value of 'A' then it is called write invalidate.

memory interleaving:

we know when a program is being run then CPU asks for more than one word at a time for execution.

So it means that



1, 2, 3, 4 are the words present in a program accessing one after the other is waste of time i.e. access time for each word is 100ns then for accessing all the 4 words takes 400ns.

if all the words are parallelly read then



1, 2, 3, 4 can be accessed in 100 nsec so this is the concept of memory interleaving.

memory interleaving will reduce the average access time and improves data transfer rate.

the accessing time for all these four words

will be reduced if we access them parallelly.

So due to memory interleaving the access time of memory decreases.



Suppose CPU generated an address then it is divided into module reference i.e. a decoder will decode this reference and says the required data is present in which module ( $m_0, m_1, \dots, m_k$ )

for decoder, it takes 4 ns to decode the module no word reference will find the selected word in the particular module

if access time of each module is 100 ns  
then to find a word



so for accessing  $K$  words  
 $(4 \times K + 100 \text{ ns})$

If memory interleaving is not used then  
 $(K \times 100 \text{ ns})$  will be the access time for  
K words.

This is how memory interleaving will reduce the average access time and improves data transfer rate.

→ Consider a memory system that contains 8 memory modules they are attached to the system bus which is one word length when a write request is made the bus is occupied by 100ns with address data and control information, there after addressed memory module takes 500ns for starting the way if modules internal operations are overlapped what will be the maximum no. of requests can be generated in 1ms time.

Soln: we know 1ms =  $K \times (100\text{ns}) + 500\text{ns}$

the decoder takes 100ns  
memory module takes 500ns

$$1\text{ms} = K \times 10^7 + 5 \times 10^7$$

$$10^3 = 10^7 (K+5)$$

$$K+5 = 10^4$$

$$K \approx 10^4$$

Rate of:

→ A CPU has a cache of 64 bytes, the main memory has  $K$  banks each bank has  $c$  bytes of data. The consecutive  $c$  byte chunks are mapped to consecutive banks with wrap around manner all  $K$  banks can't be accessed in parallel. However the two access for the same bank have to be serialized (one after the other). Let "K" = 24 and  $c = 2$ . Each iteration requires decoding the bank numbers to be accessed in parallel and this takes

$\frac{K}{2}$  nsec, latency of each addressing bank is 80 ns  
 How much time is required for transferring initial block  
 of cache.

Soln: Given  $K = 24$

$$C = 2$$



for 24 banks 48 bytes of data is present

but to transfer 64 bytes we need two iterations  
 because we only transfer integral no of times as  
 time for one iteration is given

$$\text{so time for one iteration} = 12 + 80 \text{ nsec}$$

$$= 92 \text{ nsec}$$

$$\text{for two iterations } 2 \times 92 = 184 \text{ nsec.}$$

Gate 16

→ A file system uses an in memory cache to cache disk blocks. the miss rate of the cache is shown in fig  
 the latency to read a block from cache is 1ms and  
 to read a block from disk is 10ms Assume that cost  
 of checking whether a block exists in the cache  
 is negligible Available cache sizes are in multiples of  
 10 MB



A smallest cache size required to ensure an average  
 read latency of less than 1ms is — mb?

Sol<sup>n</sup>: we should get miss rate in order to find the Cache size

We know

Read latency  $\leq 6 \text{ ms}$

$$H_1 T_1 + (1-H_1) T_2 \leq 6 \text{ ms}$$

$$H_1 (1\text{ms}) + (1-H_1) (10\text{ms}) \leq 6 \text{ ms}$$

$$H_1 + 10 - 10H_1 \leq 6$$

$$-9H_1 \leq -4$$

$$H_1 \geq \frac{4}{9}$$

$$-H_1 \leq -\frac{4}{9}$$

$$(1-H_1) \leq 1 - \frac{4}{9}$$

miss rate

$$m \leq \frac{5}{9} \Rightarrow m \leq 0.55$$

So nearest value is 40% which has 30 MB as

Cache size from memory to 220 misses

Grade 16

→ A processor can support a maximum memory of 4 GB, where the memory is word addressable (a word consists of two bytes) the size of address bus of the processor is atleast 31 bits.

Sol<sup>n</sup>:

$$\text{memory} = 4 \text{ GB}$$

$$= 2^32 \text{ (2B)}$$

$$= 2^{32} \text{ W}$$

$$= 2^{31} \text{ W}$$

So the size of address bus of the processor is atleast 31 bits.

Grade 16  
 → the width of the physical address on a machine is 40 bits. the width of tag field in a 512 KB 8 way set associative cache is — bits.

Soln:



$$\text{Cache size} = 512 \text{ KB} = 2^{19} \text{ B}$$

$$\text{Let Block size} = 2^x$$

$$\text{So no of lines} = \frac{2^{19}}{2^x} = 2^{19-x}$$

$$\text{no of sets} = \frac{2^{19-x}}{8} = 2^{19-x-3} = 2^{16-x}$$

So

$$y + 16 - x + x = 40$$

$$y = 14$$

Grade - 14:  
 → the memory access time is 1 nsec for a read operation with a hit in cache, 5 ns for a read operation for a miss in cache, 2 ns for a write operation with a hit in cache and 10 nsec for a write operation with a miss in cache. execution of a sequence of instructions

involves 100 instruction fetch operations, 60 memory read operations and 40 memory write operations. the cache hit ratio is 0.9 the average memory access time (in ns) in executing the sequence of instructions is —

Soln:

$$T_R = (0.9)(1n) + (0.1)(5n)$$

$$= 1.4 \text{ nsec}$$

$$T_W = (0.9)(2n) + (0.1)(10n)$$

$$= 2.8 \text{ nsec}$$

Given 100 instruction fetch, 60 instruction read

so instruction fetch is also instruction read

## Secondary memory



- Almost all the hard disks have sector size as 512 bytes
  - Total size of disk = no of tracks  $\times$  sectors  $\times$  no of Bytes / tracks / sectors
  - Format is used to save sector number, track number, etc.
  - Read write head is used to read or write information to the hard disk.
  - so for reading or writing we need to move the read/write head from one sector to other sector(s) from one track to another track.
  - \* the time required to move read/write head from one track to another track is called seek time
  - \* the time taken to wait for sector to come under the read write head is called rotational delay
- $T_{avg} = \text{seek time} + \text{rotational delay} + \text{Time taken to transfer data}$
- $$= T_s + T_r + \overline{T}_{\text{transfer}}$$
- if  $T_s$  is not given take it '0'
- if  $T_r$  is not given take  $\frac{1}{2} ( \text{rotation time} )$

- Consider a disk pack with the following specifications
- 16 surfaces, 128 tracks/surface, 256 sectors/track and 512 bytes/sector
- a) what is the capacity of disk pack
- b) the no of bits required to address the sector

$$\text{SOLN: } 16 \text{ sptr} \times \frac{128 \text{ tracks}}{\text{sptr}} \times \frac{256 \text{ sectors}}{\text{track}} \times \frac{512 \text{ bytes}}{\text{sector}}$$

$$= 2^4 \times 2^9 \times 2^7 \times 2^9$$

a) = 256 MB

$$\text{b) } 16 \text{ sptr} \times \frac{128 \text{ tracks}}{\text{sptr}} \times \frac{256 \text{ sectors}}{\text{track}}$$

$$= 2^4 \times 2^7 \times 2^8 \text{ sectors}$$

$$= 2^{19}$$

So 19 bits are required.

c) in the above disk pack the format overhead 32 bytes/sector then what is the formatted disk space

$$\text{SOLN: no of sectors} = 2^{19}$$

$$32 \text{ bytes/sector}$$

$$\text{so } 2^{19} \times 32 = 2^{24} \text{ B}$$

$$= 16 \text{ MB}$$

$$\text{Total Usable Space} = 256 - 16 \text{ MB}$$

$$= 240 \text{ MB}$$

d) in the above disk space the format overhead is 64 bytes/sector

How much amount of memory is wasted due to formatting.

$$\text{SOLN: Formatting overhead} = 2^{19} \times 64 = 2^{25} \text{ B} = 32 \text{ MB}$$

e) Let the diameter of the innermost track is 21cm what is the maximum recording density.

  
SOLN: in both ①, ② no of bytes present are same  
but circumference of '1' is greater than '2'

due to density difference in ①, ② no of bytes are same  
density of ② > density of ①

$$\text{Track capacity} = \frac{256 \text{ sectors/track} \times 512 \text{ bytes}}{\text{sector}}$$

$$= 2^{17} \text{ bytes/track}$$

$$\Rightarrow \text{Circumference} = 2\pi r = \pi d = \frac{22}{7} \times 21 = 66 \text{ cm}$$

$$\text{so for } 66 \text{ cm} - 2^{17} \text{ bytes}$$

$$- 1 \text{ cm} \rightarrow ?$$

$$\Rightarrow \frac{2^{17}}{66} = \frac{128 \text{ KB}}{66} = 1.9 \text{ KB/cm}$$

f) let the diameter of the innermost track is 21cm, with 2 KB/cm what is the capacity of the track.

Soln: track capacity = circumference  $\times$  density

$$= 66 \text{ cm} \times 2 \frac{\text{KB}}{\text{cm}}$$
$$= 132 \text{ KB}$$

g) the disk is rotating at 3600 rpm what is the data transfer rate

Soln: there are 16 surfaces so all the surfaces can be read at a time

so

$$3600 \text{ Rotations} - 60 \text{ sec}$$

$$1 \text{ rotation} \rightarrow x$$

$$x = \frac{1}{60} \text{ s}$$

we know track capacity = 128 KB

so after one rotation we can read 128 KB data

$$\frac{1}{60} \text{ s} \rightarrow 128 \text{ KB}$$

$$1 \text{ sec} \rightarrow 128 \times 60$$

$$\text{total data rate} = 16 \times 128 \times 60$$

$$= 120 \text{ mbps}$$

h) Using one read/write head disk is rotating at 6000 rpm what is the data transfer rate

Soln: 6000 R  $\rightarrow$  60 sec

$$1 \text{ sec} \rightarrow \frac{1}{6000} \text{ R}$$

$$\frac{1}{6000} \text{ s} = 128 \text{ KB}$$

$$1 \text{ s} = 128 \text{ KB} \times 6000$$

$$1 \text{ s} = 13 \text{ mbps}$$

i) if the disk system has rotational speed of 3000 rpm what is the average access time, with a seek time of 14.5 msec

Sol<sup>n</sup>: we know that

$$T_{avg} = Sk + \frac{1}{2}(RT) + transfer\ rate$$

since transfer rate is not given take it as '0'

now rotating time = 3000 rpm

$$3000R - 60sec$$

$$1track = \frac{1}{50} sec$$

$$\frac{1}{2}R = \frac{1}{100} sec = 10 m sec$$

$$so T_{avg} = 11.5 + 10$$

$$= 21.5 msec$$

Q) what is the average access time for transferring 512 bytes of data with following specs

$$Average seek time = 5 msec$$

$$disk rotation = 6000 rpm$$

$$Data rate = 40 KB/sec$$

$$Controller overhead = 0.1 msec$$

$$Sol^n: T_{avg} = S \cdot T + \frac{1}{2}(RT) + D \cdot T + C \cdot O$$

$$6000 R - 60sec$$

$$1R = \frac{1}{100} sec$$

$$\frac{1}{2}R = 5 msec$$

$$D \cdot T \rightarrow 40 KB - 1 sec$$

$$512 b \rightarrow$$

$$\frac{512}{40 \times 1024} = 12.5 msec$$

$$so T_{avg} = 5m + 5m + 12.5m + 0.1m$$

$$= 22.6 msec$$

→ A certain moving arm disk storage with one head has the following specifications

$$Number of tracks/surface = 200$$

$$disk rotation speed = 2400 rpm$$

$$tracks storage capacity = 62,500 bits$$

$$average latency = p msec$$

$$data transfer rate = q bits/sec$$

what is the value of P and Q

Soln:  $2400 \text{ R} = 60 \text{ sec}$

$$1R \rightarrow \frac{1}{40} \text{ sec}$$

$$\frac{1}{2}R \rightarrow 12.5 \text{ msec}$$

$$\text{so } P = 12.5 \text{ msec}$$

$$25 \text{ msec} - 1 \text{ rot} = 62,500 \text{ bits}$$

$$1 \text{ sec} = \frac{62500}{25} \times 1000 = 2.5 \times 10^6 \text{ bps}$$

Note

→ A hard disk has 63 sectors/track, 10 platters each with 2 recording surfaces and 1000 cylinders. The address of the sector is given as

$\langle c, h, s \rangle$  where c = cylinder number, h = surface number

s = sector number then the 8<sup>th</sup> sector is addressed

$\langle 0, 0, 8 \rangle$  1<sup>st</sup> sector is addressed as  $\langle 0, 0, 1 \rangle$  and so

Q) the address  $\langle 400, 16, 29 \rangle$  corresponds to sector number

- a) 505035 b) 505036 c) 505037 d) 505038

Soln: if a numbering is starting from '0' then

if x is where we are which means we should travel x numbers to reach our place

$\overline{0} \quad \overline{1} \quad \overline{2} \quad \overline{3}$  if we are '3' in order to reach

'3' we should cross 3 numbers

So in question

1 cylinder has 10 platters, 2 surfaces, 63 sectors, no of tracks per surface is not given so take 1 track per surface

$$\text{So, } \langle 400, 16, 29 \rangle = 400(10 \times 2 \times 63) + 16(63) + 29 \\ = 505037$$

- a) the address of 2039 sector is  
a)  $(0, 15, 25)$  b)  $(0, 16, 30)$  c)  $(0, 16, 31)$ , d)  $(0, 17, 31)$
- Sol<sup>n</sup>: ~~out of~~ by verification of given diagram base address = 1000  
 $(C) = 1000 + 16(63) + 31$  base address = 1000  
= 2039  
so option 'c'