

# CS & IT ENGINEERING

COMPUTER ORGANIZATION  
AND ARCHITECTURE



## Cache Organization

Lecture No.- 06

By- Vishvadeep Gothi sir



# Recap of Previous Lecture



**Topic**

Block Replacement

**Topic**

Miss Penalty

**Topic**

Types of Cache Miss

# Topics to be Covered



**Topic**

Cache Mapping Hardware

**Topic**

Array Access with Cache

#Q. Consider a 2-way set associative cache with 256 blocks and uses LRU replacement, Initially the cache is empty. Conflict misses are those misses which occur due the contention of multiple blocks for the same cache set. Compulsory misses occur due to first time access to the block. The following sequence of accesses to memory blocks

(0,128,256,128,0,128,256,128,1,129,257,129,1,129,257,129)

is repeated 10 times. The number of conflict misses experienced by the cache is 76?

$$\frac{1^{st}}{4} \quad \frac{2^{nd} - 9^{th}}{8 * 9}$$



## Topic : Checking Hit/Miss in Direct Mapping



mm add.





# Topic : Hardware Implementation in Direct Mapping





# Topic : Hardware Implementation in Direct Mapping





## Topic : Hardware Requirements

No. of MUX needed to extract tag = no. of bits in tag

size of MUX — 11 ————— = No. of blocks in Cache : 1

No. of comparators needed = 1

size of — 11 ————— = (No. of bits in tag) - bits comparator



## Topic : Hit Latency in Direct Mapping

↓  
Time to check hit or miss

$$= \text{mux delay} + \text{comparator delay}$$



# Topic : Checking Hit/Miss in Set Associative Mapping



mm add.

2-way set





## Topic : Checking Hit/Miss in Set Associative Mapping





# Topic : Hardware Implementation in Set Associative Mapping





# Topic : Hardware Implementation in Set Associative Mapping





# Topic : Hardware Implementation in Set Associative Mapping





## Topic : Hardware Requirements

for k-way set ass. cache

No. of mux for tag selection =  $k * (\text{no. of bits in tag})$

Size of ——— = no. of sets in cache ; 1

No. of comparators needed = k

Size of ——— = (no. of bits in tag) bits comparator

no. of OR gate = 1

no. of inputs = k



## Topic : Hit Latency in Set Associative Mapping

= mux delay + comparator + OR gate delay  
for tag selection



# Topic : Checking Hit/Miss in Fully Associative Mapping





## Topic : Hardware Requirements

No. of comparators = no. of blocks in cache

size — " — = (no. of bits in tag) - bits comparator

No. of OR gates = 1

No. of inputs in OR gate = no. of blocks in cache



Fully ass. Cache is implemented using associative memory -





## Topic : Hit Latency in Fully Associative Mapping



= comparator delay + OR gate

#Q. Consider a 4-way set associative cache with size 256Kbytes and block size of 16 bytes. The main memory address size is 30 bits. Find the following:

1. Tag bits in main memory address 14 bits
2. Number of MUX needed to extract the tag  $= 4 * 14 = 56$
3. MUX size to extract the tag  $2^{12}:1$
4. Number of comparators needed 4
5. Comparator size 14 bits Comparator



#Q. Consider two cache organizations. First one is 32 KB 2-way set associative with 32-bytes block size, the second is of same size but direct mapped. The size of an address is 32 bits in both cases. A 2-to-1 multiplexer has latency of 0.6 ns while a k-bit comparator has latency of  $\frac{k}{10}$  ns. The hit latency of the set associative organization is  $h_1$  while that of direct mapped is  $h_2$ .

The value of  $h_1$  is:

- A 2.4 ns
- B 2.3 ns
- C 1.8 ns
- D 1.7 ns

#Q. Consider two cache organizations. First one is 32 KB 2-way set associative with 32-bytes block size, the second is of same size but direct mapped. The size of an address is 32 bits in both cases. A 2-to-1 multiplexer has latency of 0.6 ns while a k-bit comparator has latency of  $\frac{k}{10}$  ns. The hit latency of the set associative organization is  $h_1$  while that of direct mapped is  $h_2$ .

The value of  $h_2$  is:

- A 2.4 ns
- B 2.3 ns
- C 1.8 ns
- D 1.7 ns



## Topic : Questions on Cache and Array

- Cache Size = 32 bytes
- Block size = 4 bytes
- Array in main memory      A[40], each element is 1 bytes
- Direct mapping

$$\text{array size} = 40 * 1B = 40B$$

$$\text{no. of blocks needed to store array in mm} = \frac{40B}{4B} = 10$$

$$\text{no. of Array elements per block} = \frac{4B}{1B} = 4$$

Cache

|    |            |
|----|------------|
| 0  | A[20]      |
| 1  | A[23]      |
| 2  | A[24]      |
| 3  | A[27]      |
| 4  | A[28]      |
| 5  | A[31]      |
| 6  | A[0] A[32] |
| 7  | A[3] A[35] |
| 8  | A[4] A[36] |
| 9  | A[7] A[39] |
| 10 | A[8]       |
| 11 | A[11]      |
| 12 | A[12]      |
| 13 | A[15]      |
| 14 | A[16]      |
| 15 | A[19]      |



| cpu access    | hit / miss | 2nd time access |
|---------------|------------|-----------------|
| A[0]          | miss       |                 |
| A[1]          | hit        |                 |
| A[2]          | hit        |                 |
| A[3]          | hit        |                 |
| A[4]          | miss       |                 |
| A[5]          | hit        |                 |
| A[6]          |            |                 |
| A[7]          |            |                 |
| :             |            |                 |
| A[32]         | miss       |                 |
| A[33] - A[35] | hit        |                 |
| A[36]         | miss       |                 |
| A[37] - A[39] | hit        |                 |

$$\text{no. of extra blocks in array} = 10 - 8 = 2$$

1<sup>st</sup> time access of array

$$\begin{aligned}\text{no. of miss} &= \text{no. of blocks to} \\ &\quad \text{store array} \\ &= 10\end{aligned}$$

$$\begin{aligned}\text{no. of hits} &= \text{array elements} - 10 \\ &= 40 - 10 \\ &= 30\end{aligned}$$

2<sup>nd</sup> time access  
of array

$$\begin{aligned}&= 2 * 2 \\ &= 4\end{aligned}$$

$$\begin{aligned}\text{hits} &= 40 - 4 \\ &= 36\end{aligned}$$

3<sup>rd</sup> time  
access

$$\begin{aligned}&= 4 \\ &= 36\end{aligned}$$

Ques) no. of blocks to store array = 21

no. of blocks in cache = 16

$$\begin{aligned}\text{no. of extra blocks in array} &= 21 - 16 \\ &= 5\end{aligned}$$

| no. of miss | first access of array | 2 <sup>nd</sup> access | 3 <sup>rd</sup> access | ----- |
|-------------|-----------------------|------------------------|------------------------|-------|
| 21          |                       | $5 * 2 = 10$           | $5 * 2 = 10$           |       |



## Topic : Questions on Cache and Array

- Direct mapping

- Cache Size = 64 bytes
- Block size = 4 bytes
- Array in main memory int A[40], each element is 2 bytes

$$\text{no. of hits} = \frac{84}{8}$$

$$\text{no. of miss} = \frac{36}{8}$$

If array is accessed 3 times

$$\text{hit ratio} = \frac{84}{(36+84)} = 0.7 = 70\%$$

$$\text{miss ratio} = 30\%$$

$$\text{array size} = 40 * 2 = 80B$$

$$\text{no. of blocks in array} = \frac{80B}{4B} = 20$$

$$\text{extra blocks in array} = 20 - 16 = 4$$

$$\text{no. of miss} = 20 + (4*2) + (4*2) = 36$$

$$\text{no. of hits} = (40-20) + (40-8) + (40-8) = 84$$

#Q. Consider a machine with a byte addressable main memory of  $2^{16}$  bytes. Assume that a direct mapped data cache consisting of 32 lines of 64 bytes each is used in the system. A  $50 \times 50$  two-dimensional array of bytes is stored in the main memory starting from memory location 1100H. Assume that the data cache is initially empty. The complete array is accessed twice. Assume that the contents of the data cache do not change in between the two accesses.

How many data cache misses will occur in total?

$$\begin{aligned} \text{array size} &= 50 * 50 = 2500 \text{ elements} = 2500 * 1B = 2500B \\ \text{no. of blocks to store array} &= \left\lceil \frac{2500B}{64B} \right\rceil = 40 \end{aligned}$$

no. of extra blocks in array = 40 - 32 = 8

$$\begin{aligned} \text{no. of miss} &= 40 + (8 * 2) \\ &= 56 \end{aligned}$$

#Q. Which of the following lines of the data cache will be replaced by new blocks in accessing the array for the second time?

- A line 4 to line 11
- B line 4 to line 12
- C line 0 to line 7
- D line 0 to line 8



#Q. Consider a direct mapped write back data cache of size 2KB with block size of 128 bytes. The cache is considered to be empty initially. The byte addressable main memory has size 1Mbytes. Further consider that there is an array A[30][25] with each element occupies 4 bytes. The base address of array is  $(1A300)_{16}$ . The array is accessed 4 times. And between the accesses, there is no any data cache changes happen. Hit ratio of cache for the array access is \_\_\_\_ %. ?



## 2 mins Summary



**Topic**

Cache Mapping Hardware

**Topic**

Array Access with Cache



# Happy Learning

## THANK - YOU