



# Cache Management - Part I

Complete Course on Computer Architecture for GATE

## Direct Mapping

- In direct mapping scheme the main memory blocks are directly mapped onto a particular cache memory line.
- It is also known as many to one mapping.



|      |
|------|
| CL-0 |
| CL-1 |
| CL-2 |
| CL-3 |

|      |
|------|
| B-0  |
| B-1  |
| B-2  |
| B-3  |
| B-4  |
| B-5  |
| B-6  |
| B-7  |
| B-8  |
| B-9  |
| B-10 |
| B-11 |
| B-12 |
| B-13 |
| B-14 |
| B-15 |

**Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount**



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount



Use Referral Code **KGYT** for Unacademy Plus to

Discount

*Block No*

*No of Cache Line* = Remainder is the Address of block in Cache



Use Referral Code **KGYT** for Unacademy Plus to

Discount

## Cache

|             |     |      |     |      |      |      |      |      |
|-------------|-----|------|-----|------|------|------|------|------|
| <b>CL-0</b> | B-0 | W-0  | B-4 | W-16 | B-8  | W-32 | B-12 | W-48 |
|             |     | W-1  |     | W-17 |      | W-33 |      | W-49 |
|             |     | W-2  |     | W-18 |      | W-34 |      | W-50 |
|             |     | W-3  |     | W-19 |      | W-35 |      | W-51 |
| <b>CL-1</b> | B-1 | W-4  | B-5 | W-20 | B-9  | W-36 | B-13 | W-52 |
|             |     | W-5  |     | W-21 |      | W-37 |      | W-53 |
|             |     | W-6  |     | W-22 |      | W-38 |      | W-54 |
|             |     | W-7  |     | W-23 |      | W-39 |      | W-55 |
| <b>CL-2</b> | B-2 | W-8  | B-6 | W-24 | B-10 | W-40 | B-14 | W-56 |
|             |     | W-9  |     | W-25 |      | W-41 |      | W-57 |
|             |     | W-10 |     | W-26 |      | W-42 |      | W-58 |
|             |     | W-11 |     | W-27 |      | W-43 |      | W-59 |
| <b>CL-3</b> | B-3 | W-12 | B-7 | W-28 | B-11 | W-44 | B-15 | W-60 |
|             |     | W-13 |     | W-29 |      | W-45 |      | W-61 |
|             |     | W-14 |     | W-30 |      | W-46 |      | W-62 |
|             |     | W-15 |     | W-31 |      | W-47 |      | W-63 |

## Cache Memory

|             |                         |
|-------------|-------------------------|
| <b>CL-0</b> | B-0 / B-4 / B-8 / B-12  |
| <b>CL-1</b> | B-1 / B-5 / B-9 / B-13  |
| <b>CL-2</b> | B-2 / B-6 / B-10 / B-14 |
| <b>CL-3</b> | B-3 / B-7 / B-11 / B-15 |

## Cache Memory

|             |     |    |
|-------------|-----|----|
| <b>CL-0</b> | TAG | W- |
| W-          |     |    |
| W-          |     |    |
| W-          |     |    |
| <b>CL-1</b> |     |    |
| <b>CL-1</b> | TAG | W- |
|             |     | W  |
|             |     | W- |
|             |     | W- |
| <b>CL-2</b> |     |    |
| <b>CL-2</b> | TAG | W- |
|             |     | W- |
|             |     | W- |
|             |     | W- |
| <b>CL-3</b> |     |    |
| <b>CL-3</b> | TAG | W- |
|             |     | W- |
|             |     | W- |
|             |     | W- |

Use Referral Code **KGYT** for Unacademy Plus to Get



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

Tag      line no.      Block offset



| MM Size | Cache Size | Block Size | No of bits in Tag | Tag Directory Size |
|---------|------------|------------|-------------------|--------------------|
| 16 GB   | 32 MB      | 4 KB       |                   |                    |
| 128 MB  | 256 KB     | 512 B      |                   |                    |
| 32 GB   | 128 MB     | 1 KB       |                   |                    |
| 256 MB  | 16 KB      | 1 KB       |                   |                    |
| 4 GB    | 8 MB       | 2 KB       |                   |                    |
| 512 KB  | 2 KB       | 128 B      |                   |                    |



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

| MM Size    | Cache Size | Block Size | No of bits in Tag | Tag Directory Size |
|------------|------------|------------|-------------------|--------------------|
| 128 KB     | 16 KB      | 256 B      | 3                 | $3 * 2^6$          |
| 32 GB      | 32 KB      | 1 KB       | 20                | $20 * 2^5$         |
| $2^{26}$ B | 512 KB     | 1 KB       | 7                 | $7 * 2^9$          |
| 16 GB      | $2^{24}$ B | 4 KB       | 10                | $10 * 2^{12}$      |
| 64 MB      | $2^{16}$ B | ?          | 10                | ?                  |
| $2^{26}$ B | 512 KB     | ?          | 7                 | ?                  |

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q.41** Consider a computer system with a byte-addressable primary memory of size  $2^{32}$  bytes. Assume the computer system has a direct-mapped cache of size 32 KB (1 KB =  $2^{10}$  bytes), and each cache block is of size 64 bytes.

**(GATE-2021)**

The size of the tag field is \_\_\_\_\_ bits.



**Q** Consider a machine with byte addressable memory of  $2^{32}$  bytes divided into blocks of size 32 bytes. Assume a direct mapped cache having 512 cache lines is used with this machine. The size of tag field in bits is \_\_\_\_\_ (Gate-2017) (2 Marks)



**Q** Consider a direct mapped cache of size 32 KB with block size 32 bytes. The CPU generates 32 bit addresses. The number of bits needed for cache indexing and the number of tag bits are respectively (Gate-2005) (2 Marks)

(A) 10, 17

(B) 10, 22

(C) 15, 17

(D) 5, 17



**Q** Consider a machine with a byte addressable main memory of  $2^{20}$  bytes, block size of 16 bytes and a direct mapped cache having  $2^{12}$  cache lines. Let the addresses of two consecutive bytes in main memory be  $(E201F)_{16}$  and  $(E2020)_{16}$ . What are the tag and cache line address (in hex) for main memory address  $(E201F)_{16}$ ? **(Gate-2015) (1 Marks)**

- (A)** E, 201      **(B)** F, 201      **(C)** E, E20      **(D)** 2, 01F



**Q** An 8KB direct-mapped write-back cache is organized as multiple blocks, each of size 32-bytes. The processor generates 32-bit addresses. The cache controller maintains the tag information for each cache block comprising of the following.

1 Valid bit

1 Modified bit

As many bits as the minimum needed to identify the memory block mapped in the cache. What is the total size of memory needed at the cache controller to store meta-data (tags) for the cache? **(Gate-2011) (2 Marks)**

**(A)** 4864 bits

**(B)** 6144 bits

**(C)** 6656 bits

**(D)** 5376 bits



**Use Referral Code KGYT for Unacademy Plus to Get minimum 10% Discount**

# Break

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount



**Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount**

## Associative Mapping

- To overcome the problem of conflict-miss in direct mapping we have Associative Mapping.
- A block of main memory can be mapped to any freely available cache line. This makes fully associative mapping more flexible than direct mapping.
- It is also known as many to many mappings.

|             |
|-------------|
| <b>CL-0</b> |
| <b>CL-1</b> |
| <b>CL-2</b> |
| <b>CL-3</b> |

|      |
|------|
| B-0  |
| B-1  |
| B-2  |
| B-3  |
| B-4  |
| B-5  |
| B-6  |
| B-7  |
| B-8  |
| B-9  |
| B-10 |
| B-11 |
| B-12 |
| B-13 |
| B-14 |
| B-15 |



Use Referral Code **KGYT** for Unacademy Plus to

Discount

| Cache Memory |                |    |    |    |
|--------------|----------------|----|----|----|
| CL-0         | TAG = Block no | W- | W- | W- |
| CL-1         | TAG = Block no | W- | W  | W- |
| CL-2         | TAG = Block no | W- | W  | W- |
| CL-3         | TAG = Block no | W- | W  | W- |

| Main Memory |      |      |      |      |
|-------------|------|------|------|------|
| B-0         | W-0  | W-1  | W-2  | W-3  |
| B-1         | W-4  | W-5  | W-6  | W-7  |
| B-2         | W-8  | W-9  | W-10 | W-11 |
| B-3         | W-12 | W-13 | W-14 | W-15 |
| B-4         | W-16 | W-17 | W-18 | W-19 |
| B-5         | W-20 | W-21 | W-22 | W-23 |
| B-6         | W-24 | W-25 | W-26 | W-27 |
| B-7         | W-28 | W-29 | W-30 | W-31 |
| B-8         | W-32 | W-33 | W-34 | W-35 |
| B-9         | W-36 | W-37 | W-38 | W-39 |
| B-10        | W-40 | W-41 | W-42 | W-43 |
| B-11        | W-44 | W-45 | W-46 | W-47 |
| B-12        | W-48 | W-49 | W-50 | W-51 |
| B-13        | W-52 | W-53 | W-54 | W-55 |
| B-14        | W-56 | W-57 | W-58 | W-59 |
| B-15        | W-60 | W-61 | W-62 | W-63 |

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

## Cache

|             |     |      |     |      |      |      |      |      |
|-------------|-----|------|-----|------|------|------|------|------|
| <b>CL-0</b> | B-0 | W-0  | B-4 | W-16 | B-8  | W-32 | B-12 | W-48 |
|             |     | W-1  |     | W-17 |      | W-33 |      | W-49 |
|             |     | W-2  |     | W-18 |      | W-34 |      | W-50 |
|             |     | W-3  |     | W-19 |      | W-35 |      | W-51 |
| <b>CL-1</b> | B-1 | W-4  | B-5 | W-20 | B-9  | W-36 | B-13 | W-52 |
|             |     | W-5  |     | W-21 |      | W-37 |      | W-53 |
|             |     | W-6  |     | W-22 |      | W-38 |      | W-54 |
|             |     | W-7  |     | W-23 |      | W-39 |      | W-55 |
| <b>CL-2</b> | B-2 | W-8  | B-6 | W-24 | B-10 | W-40 | B-14 | W-56 |
|             |     | W-9  |     | W-25 |      | W-41 |      | W-57 |
|             |     | W-10 |     | W-26 |      | W-42 |      | W-58 |
|             |     | W-11 |     | W-27 |      | W-43 |      | W-59 |
| <b>CL-3</b> | B-3 | W-12 | B-7 | W-28 | B-11 | W-44 | B-15 | W-60 |
|             |     | W-13 |     | W-29 |      | W-45 |      | W-61 |
|             |     | W-14 |     | W-30 |      | W-46 |      | W-62 |
|             |     | W-15 |     | W-31 |      | W-47 |      | W-63 |

## Cache Memory

|             |                         |
|-------------|-------------------------|
| <b>CL-0</b> | B-0 / B-4 / B-8 / B-12  |
| <b>CL-1</b> | B-1 / B-5 / B-9 / B-13  |
| <b>CL-2</b> | B-2 / B-6 / B-10 / B-14 |
| <b>CL-3</b> | B-3 / B-7 / B-11 / B-15 |

## Cache Memory

|             |     |    |
|-------------|-----|----|
| <b>CL-0</b> | TAG | W- |
| W-          |     |    |
| W-          |     |    |
| W-          |     |    |
| <b>CL-1</b> |     |    |
| <b>CL-1</b> | TAG | W- |
|             |     | W  |
|             |     | W- |
|             |     | W- |
| <b>CL-2</b> |     |    |
| <b>CL-2</b> | TAG | W- |
|             |     | W- |
|             |     | W- |
|             |     | W- |
| <b>CL-3</b> |     |    |
| <b>CL-3</b> | TAG | W- |
|             |     | W- |
|             |     | W- |
|             |     | W- |

Use Referral Code **KGYT** for Unacademy Plus to Get

- A replacement algorithm is needed to replace a block if the cache is full.
- In fully associative mapping we only have two fields: Tag/Block Number field and a Block offset field.
- Here the number of bits in tag = number of bits to require to represent block number

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

Tag      line no.      Block offset



Cache



Use Referral Code **KCYT** for Unacademy Plus to Get minimum 10% Discount

## Hardware Architecture

- If we have 'n' lines in cache then 'n' number of comparators are required.
- Size of comparator = Size of Tag
- If we have 'n' bit tag then we require 'n' bit comparator.





Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q** Consider a fully associative mapped cache of size 16 KB with block size 256 bytes. The size of main memory is 128 KB. Find out the: Number of bits in tag and Tag directory size?



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

| MM Size | Cache Size | Block Size | No of bits in Tag | Tag Directory Size | Comp |
|---------|------------|------------|-------------------|--------------------|------|
| 128 KB  | 16 KB      | 256 B      |                   |                    |      |
| 32 GB   | 32 KB      | 1 KB       |                   |                    |      |
|         | 512 KB     | 1 KB       | 17                |                    |      |
| 16 GB   |            | 4 KB       | 22                |                    |      |
| 64MB    |            |            | 10                |                    |      |
|         | 512 KB     |            | 7                 |                    |      |



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

| MM Size | Cache Size | Block Size | No of bits in Tag | Tag Directory Size | Comp |
|---------|------------|------------|-------------------|--------------------|------|
| 128 KB  | 16 KB      | 256 B      | 9                 | 9 * 64             | 64   |
| 32 GB   | 32 KB      | 1 KB       | 25                | 25 * 32            | 32   |
| 128 MB  | 512 KB     | 1 KB       | 17                | 512 * 17           | 512  |
| 16 GB   | ?          | 4 KB       | 22                | ?                  | ?    |
| 64MB    | ?          | 64 KB      | 10                | ?                  | ?    |
| ?       | 512 KB     |            | 7                 | ?                  | ?    |

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Hit latency = Time taken by one comparator (Comparators are in Parallel) + time taken by OR Gate**

**Example:** Let us assume that main memory size is 32GB, block size is 32KB and the propagation delay for the comparators and OR gate is 10 K ns and 10 ns respectively. Calculate the tag size and hit latency?

Main Memory = 32 GB =  $2^{30} * 2^5 = 2^{35}$  = 35 bits Physical Address.

Block Size = 32 KB =  $2^{15}$  = 15 bits block size.

So, the block number =  $35 - 15 = 20$  bits



Propagation Delay =  $10 * 20 = 200$  ns.

Hit latency:  $200$  ns +  $10$  ns (Propagation Delay of OR gate) =  $210$  ns.

**Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount**

## **Disadvantage**

- Hardware cost is high as compared to direct mapping.
- Tag directory size is more as compared to direct mapping.

# Break

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

## Set Associative Mapping

- In k-way set associative mapping, cache lines are grouped into sets where each set contains “k” number of lines.
- A particular block of main memory can map to only one particular set of the cache.
- However, within that set, the memory block can map to any freely available cache line.



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount



## Formulas

- Number of Sets = No of Lines in Cache / No of Cache line in a set( $k$ )( $k$ -way set associative)



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

| Main Memory |      |      |      |      |
|-------------|------|------|------|------|
| B-0         | W-0  | W-1  | W-2  | W-3  |
| B-1         | W-4  | W-5  | W-6  | W-7  |
| B-2         | W-8  | W-9  | W-10 | W-11 |
| B-3         | W-12 | W-13 | W-14 | W-15 |
| B-4         | W-16 | W-17 | W-18 | W-19 |
| B-5         | W-20 | W-21 | W-22 | W-23 |
| B-6         | W-24 | W-25 | W-26 | W-27 |
| B-7         | W-28 | W-29 | W-30 | W-31 |
| B-8         | W-32 | W-33 | W-34 | W-35 |
| B-9         | W-36 | W-37 | W-38 | W-39 |
| B-10        | W-40 | W-41 | W-42 | W-43 |
| B-11        | W-44 | W-45 | W-46 | W-47 |
| B-12        | W-48 | W-49 | W-50 | W-51 |
| B-13        | W-52 | W-53 | W-54 | W-55 |
| B-14        | W-56 | W-57 | W-58 | W-59 |
| B-15        | W-60 | W-61 | W-62 | W-63 |

| Cache           |      |                     |  |    |
|-----------------|------|---------------------|--|----|
| Set<br>Number-0 | CL-0 | TAG = Block no – CL |  | W- |
|                 | CL-1 | TAG = Block no – CL |  | W- |
| Set<br>Number-1 | CL-2 | TAG = Block no – CL |  | W- |
|                 | CL-3 | TAG = Block no - CL |  | W- |

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

# Cache

|                 |      | Cache |     |      |      |      |      |      |      |      |  |
|-----------------|------|-------|-----|------|------|------|------|------|------|------|--|
| Set<br>Number-0 | CL-0 | Tag   | B-0 | W-0  | B-2  | W-8  | B-4  | W-16 | B-6  | W-24 |  |
|                 |      |       |     | W-1  |      | W-9  |      | W-17 |      | W-25 |  |
|                 |      | Tag   |     | W-2  |      | W-10 |      | W-18 |      | W-26 |  |
|                 |      |       |     | W-3  |      | W-11 |      | W-19 |      | W-27 |  |
|                 |      |       |     |      |      |      |      |      |      |      |  |
|                 | CL-1 | Tag   | B-8 | W-32 | B-10 | W-40 | B-12 | W-48 | B-14 | W-56 |  |
|                 |      |       |     | W-33 |      | W-41 |      | W-49 |      | W-57 |  |
|                 |      | Tag   |     | W-34 |      | W-42 |      | W-50 |      | W-58 |  |
|                 |      |       |     | W-35 |      | W-43 |      | W-51 |      | W-59 |  |
|                 |      |       |     |      |      |      |      |      |      |      |  |
| Set<br>Number-1 | CL-2 | Tag   | B-1 | W-4  | B-3  | W-12 | B-5  | W-20 | B-7  | W-28 |  |
|                 |      |       |     | W-5  |      | W-13 |      | W-21 |      | W-29 |  |
|                 |      | Tag   |     | W-6  |      | W-14 |      | W-22 |      | W-30 |  |
|                 |      |       |     | W-7  |      | W-15 |      | W-23 |      | W-31 |  |
|                 | CL-3 | Tag   | B-9 | W-36 | B-11 | W-44 | B-13 | W-52 | B-15 | W-60 |  |
|                 |      |       |     | W-37 |      | W-45 |      | W-53 |      | W-61 |  |
|                 |      | Tag   |     | W-38 |      | W-46 |      | W-54 |      | W-62 |  |
|                 |      |       |     | W-39 |      | W-47 |      | W-55 |      | W-63 |  |

## Hardware Architecture



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q** Consider the main memory size is of 128 KB, the cache size is of 16 KB, the block size is of 256 B, the set size is 2. Find Tag.



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q Main Memory = 32 GB, Cache Size = 32 KB, Block Size = 1 KB and it is a 4-way set associative cache?**



**Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount**

| MM Size | Cache Size | Block Size | No of bits in Tag | Tag Directory Size | Set Associative |
|---------|------------|------------|-------------------|--------------------|-----------------|
| 128 KB  | 16 KB      | 256 B      |                   |                    | 2-way           |
| 32 GB   | 32 KB      | 1 KB       |                   |                    | 4-way           |
|         | 512 KB     | 1 KB       | 7                 |                    | 8-way           |
| 16 GB   |            | 4 KB       | 10                |                    | 4-way           |
| 64MB    |            |            | 10                |                    | 4-way           |
|         | 512 KB     |            | 7                 |                    | 8-way           |



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

| MM Size    | Cache Size | Block Size | No of bits in Tag | Tag Directory Size | Set Associative |
|------------|------------|------------|-------------------|--------------------|-----------------|
| 128 KB     | 16 KB      | 256 B      | 4                 | $4 * 2^6$          | 2-way           |
| 32 GB      | 32 KB      | 1 KB       | 22                | $22 * 32$          | 4-way           |
| $2^{23}$ B | 512 KB     | 1 KB       | 7                 | $7 * 2^9$          | 8-way           |
| 16 GB      | $2^{26}$ B | 4 KB       | 10                | $10 * 2^{14}$      | 4-way           |
| 64MB       | ?          | ?          | 10                | ?                  | 4-way           |
| ?          | 512 KB     | ?          | 7                 | ?                  | 8-way           |

Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q.55** Consider a set associative cache of size ~~2 KB~~ ( $1 \text{ KB} = 2^{10}$  bytes) with cache block size of 64 bytes. Assume that the ~~cache~~ is byte-addressable and a 32-bit address is used for accessing the cache. If the width of the tag field is ~~22~~ bits, the associativity of the cache is 2.

**(GATE- 2021)**

**Ans.** (2)



**Q** A computer system with a word length of 32 bits has a 16 MB byte-addressable main memory and a 64 KB, 4-way set associative cache memory with a block size of 256 bytes. Consider the following four physical addresses represented in hexadecimal notation. (Gate-2020) (2 Marks)

$$A_1 = 0x42C8A4,$$

$$A_2 = 0x546888,$$

$$A_3 = 0x6A289C,$$

$$A_4 = 0x5E4880$$

- (A)  $A_1$  and  $A_4$  are mapped to different cache sets.
- (B)  $A_2$  and  $A_3$  are mapped to the same cache set.
- (C)  $A_3$  and  $A_4$  are mapped to the same cache set.
- (D)  $A_1$  and  $A_3$  are mapped to the same cache set.



**Q** The size of the physical address space of a processor is  $2^P$  bytes. The word length is  $2^W$  bytes. The capacity of cache memory is  $2^N$  bytes. The size of each cache block is  $2^M$  words. For a K-way set-associative cache memory, the length (in number of bits) of the tag field is (Gate-2018) (2 Marks)

(A)  $P - N - \log_2 K - 7$

(C)  $P - N - M - W - \log_2 K - 4$

(B)  $P - N + \log_2 K - 76$

(D)  $P - N - M - W + \log_2 K - 14$



**Q** A cache memory unit with capacity of N words and block size of B words is to be designed. If it is designed as direct mapped cache, the length of the TAG field is 10 bits. If the cache unit is now designed as a 16-way set-associative cache, the length of the TAG field is 14 bits. (Gate-2017) (1 Marks)

9 → a) 16

75 → b) 14

7 → c) 4

9 → d) none of them



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q** The width of the physical address on a machine is 40 bits. The width of the tag field in a 512 KB 8-way set associative cache is 24 bits (Gate-2016) (2 Marks)

3 q) 40

→ 43 b) < 4

25 q) 21

23 q) no justification



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

~~Q A 4-way set associative cache memory unit with a capacity of 16 KB is built using a block size of 8 words. The word length is 32 bits. The size of the physical address space is 4 GB. The number of bits for the TAG field is \_\_\_\_\_~~ (Gate-2014) (1 Marks)

$$\frac{16\text{KB}}{4\text{B}} = 4\text{K}$$

$$\frac{4\text{GB}}{4\text{B}}$$



1 → a) 16

19 → b) 18

7 → c) 20

1 → d) no of bits

3 L

| Main Memory (Physical Address) <del>30</del> |                     |                                 |                                 |
|----------------------------------------------|---------------------|---------------------------------|---------------------------------|
| Block Number                                 |                     | Block Offset                    |                                 |
| <----->                                      |                     | <----->                         |                                 |
| Tag<br><---- 18 + L ---->                    | Set<br><----->      | Block Offset<br><----- 3 -----> | Block Offset<br><----- 3 -----> |
| 2<br><----->                                 | Cache <del>12</del> | 14<br><----->                   | 12<br><----->                   |

~~Q In a k-way set associative cache, the cache is divided into v sets, each of which consists of k lines. The lines of a set are placed in sequence one after another. The lines in set s are sequenced before the lines in set (s+1). The main memory blocks are numbered 0 onwards. The main memory block numbered j must be mapped to any one of the cache lines from.~~ (Gate-2013) (1 Marks)

~~1 x 1 2 L S~~

(A)  $(j \bmod v) * k$  to  $(j \bmod v) * k + (k-1)$  - 50

(C)  $(j \bmod k)$  to  $(j \bmod k) + (v-1)$  - 1

(B)  $(j \bmod v)$  to  $(j \bmod v) + (k-1)$  - 33

(D)  $(j \bmod k) * v$  to  $(j \bmod k) * v + (v-1)$  - 16



**Q** A computer has a 256 KByte, 4-way set associative, write back data cache with block size of 32 Bytes. The processor sends 32-bit addresses to the cache controller. Each cache tag directory entry contains, in addition to address tag, 2 valid bits, 1 modified bit and 1 replacement bit. The number of bits in the tag field of an address is (Gate-2012) (2 Marks)

- (A) 11  $\rightarrow$  7      (B) 14  $\rightarrow$  11      (C) 16  $\rightarrow$  78      (D) 27  $\rightarrow$  4

$$2^0 \times 2^{13} \\ 8K = 16\text{Ku'h}$$



**Q** The size of the cache tag directory is (Gate-2012) (2 Marks)

a) 160 Kbits

51

b) 136 Kbits

9

c) 40 Kbits

15

d) 32 Kbits

20



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

~~Q Consider a computer with a 4-ways set-associative mapped cache of the following characteristics: a total of 1 MB of main memory, a word size of 1 byte, a block size of 128 words and a cache size of 8 KB. The number of bits in the TAG, SET and WORD fields, respectively are:~~ (Gate-2008) (2 Marks)

(A) 7, 6, 7

(B) 8, 5, 7

(C) 8, 6, 6

(D) 9, 4, 7

8  
9, 6, 7

9, 9, 7



**Q** Consider a computer with a 4-way set-associative mapped cache of the following characteristics: a total of 1 MB of main memory, a word size of 1 byte, a block size of 128 words and a cache size of 8 KB. While accessing the memory location 0C795H by the CPU, the contents of the TAG field of the corresponding cache line is: (Gate-2008) (2 Marks)

a) 000011000

b) 110001111

c) 00011000

d) 110010101



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q** Consider a 4-way set associative cache consisting of 128 lines with a line size of 64 words. The CPU generates a 20-bit address of a word in main memory. The number of bits in the TAG, LINE and WORD fields are respectively: **(Gate-2007) (1 Marks)**



**Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount**

**Q** A computer system has a level-1 instruction cache (1-cache), a level-1 data cache (D-cache) and a level-2 cache (L2-cache) with the following specifications:

The length of the physical address of a word in the main memory is 30 bits. The capacity of the tag memory in the I-cache, D-cache and L2-cache is, respectively,

**(Gate-2006) (2 Marks)**

- (A) 1 K x 18-bit, 1 K x 19-bit, 4 K x 16-bit
- (B) 1 K x 16-bit, 1 K x 19-bit, 4 K x 18-bit
- (C) 1 K x 16-bit, 512 x 18-bit, 1 K x 16-bit
- (D) 1 K x 18-bit, 512 x 18-bit, 1 K x 18-bit

|          | Capacity  | Mapping Method                | Block size |
|----------|-----------|-------------------------------|------------|
| I-cache  | 4K words  | Direct mapping                | 4 Words    |
| D-cache  | 4K words  | 2-way set associative mapping | 4 Words    |
| L2-cache | 64K words | 4-way set associative mapping | 16 Words   |



Use Referral Code **KGYT** for Unacademy Plus to Get minimum 10% Discount

**Q** The main memory of a computer has  $2^m$  blocks while the cache has  $2^c$  blocks. If the cache uses the set associative mapping scheme with 2 blocks per set, then the block  $k$  of main memory maps to the set: **(Gate-1999) (1 Marks)**

- (A)  $(k \bmod m)$  of the cache  
(C)  $(k \bmod 2^c)$  of the cache

- (B)  $(k \bmod c)$  of the cache  
(D)  $(k \bmod 2^{cm})$  of the cache



**Q** A block-set associative cache memory consists of 128 blocks divided into four block sets. The main memory consists of 16,384 blocks and each block contains 256 eight-bit words. **(Gate-1990) (2 Marks)**

- i) How many bits are required for addressing the main memory?
- ii) How many bits are needed to represent the TAG, SET and WORD fields?



**Q** In designing a computer's cache system, the cache block (or cache line) size is an important parameter. Which one of the following statements is correct in this context? **(Gate-2014) (1 Marks)**

- (A) A smaller block size implies better spatial locality
- (B) A smaller block size implies a smaller cache tag and hence lower cache tag overhead
- (C) A smaller block size implies a larger cache tag and hence lower cache hit time
- (D) A smaller block size incurs a lower cache miss penalty

## Cache Replacement Policies

- In direct mapped cache, the position of each block is predetermined hence no replacement policy exists.
- In fully associative and set associative caches there exists policies.
- When a new block is brought into the cache and all the positions that it may occupy are full, then the controller needs to decide which of the old blocks it can overwrite.





→ A.M. ← ===

SAM

## FIFO Policy — A-m

- The block which have entered first in the memory will be replaced first.
- This can lead to a problem known as “**Belady’s Anomaly**”, it states that if we increase the number of lines in cache memory the cache miss will increase.

**Example:** Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



## FIFO Policy

- The block which have entered first in the memory will be replaced first.
- This can lead to a problem known as “**Belady’s Anomaly**”, it states that if we increase the number of lines in cache memory the cache miss will increase.

**Example:** Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



## LRU (Least Recently Used)

- The page which was not used for the longest period of time in the past will get replaced first.
- Example: Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



### LRU (Least Recently Used)

- The page which was not used for the longest period of time in the past will get replaced first.
- Example: Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



## Most Recently Used (MRU)

- The page which was used recently will be replaced first.

Example: Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.

7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1



## Most Recently Used (MRU)

- The page which was used recently will be replaced first.

Example: Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



### Optimal Algorithm

- The page which will not be used for the longest period of time in future references will be replaced first.
- The optimal algorithm will provide the best performance but it is difficult to implement as it requires the future knowledge of pages which is not possible.
- It is used as a benchmark for cache replacement algorithms.

**Example:** Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.

7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1



### Optimal Algorithm

- The page which will not be used for the longest period of time in future references will be replaced first.
- The optimal algorithm will provide the best performance but it is difficult to implement as it requires the future knowledge of pages which is not possible.
- It is used as a benchmark for cache replacement algorithms.

**Example:** Let the blocks be in the sequence: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1 and the cache memory has 4 lines.



**Break**

## Types of Miss

- Compulsory Miss
  - When CPU demands for any block for the first time then definitely a miss is going to occur as the block needs to be brought into the cache, it is known as Compulsory miss.
- Capacity Miss — A-M
  - Occur because blocks are being discarded from cache because cache cannot contain all blocks needed for program execution.
- Conflict Miss
  - In the case of set associative or direct mapped block placement strategies, conflict misses occur when several blocks are mapped to the same set or block frame; also called collision misses or interference misses.

**Q** Consider a 2-way set associative cache with 256 blocks and uses LRU replacement. Initially the cache is empty. Conflict misses are those misses which occur due to the contention of multiple blocks for the same cache set. Compulsory misses occur due to first time access to the block. The following sequence of accesses to memory blocks (0, 128, 256, 128, 0, 128, 256, 128, 1, 129, 257, 129, 1, 129, 257, 129) is repeated 10 times. The number of conflict misses experienced by the cache is \_\_\_\_\_. (Gate-2017) (2 Marks)

~~Q Consider a 4-way set associative cache (initially empty) with total 16 cache blocks. The main memory consists of 256 blocks and the request for memory blocks is in the following order: 0, 255, 1, 4, 3, 8, 133, 159, 216, 129, 63, 8, 48, 32, 73, 92, 155. Which one of the following memory blocks will NOT be in cache if LRU replacement policy is used? (Gate-2009) (2 Marks)~~

~~(A) 3~~

~~(B) 8~~

~~(C) 129~~

~~(D) 216~~



$$\begin{array}{r} 155 \\ \times 3 \\ \hline 33 \\ -1 \\ \hline 4 \end{array}$$

**Q** Consider a Direct Mapped Cache with 8 cache blocks (numbered 0-7). If the memory block requests are in the following order 3, 5, 2, 8, 0, 63, 9, 16, 20, 17, 25, 18, 30, 24, 2, 63, 5, 82, 17, 24. Which of the following memory blocks will not be in the cache at the end of the sequence? (GATE-2007) (2 Marks)

Q Consider a Direct Mapped Cache with 8 cache blocks (numbered 0-7). If the memory block requests are in the following order 3, 5, 2, 8, 0, 63, 9, 16, 20, 17, 25, 18, 30, 24, 2, 63, 5, 82, 17, 24. Which of the following memory blocks will not be in the cache at the end of the sequence? (GATE-2007) (2 Marks)

(A) 3 - 23

(B) 18

(C) 20 - 18

(D) 30 - 7

~~3, 5, 2, 8, 0, 63, 9, 16, 20, 17, 25, 18, 30, 24, 2, 63, 5, 82, 17, 24~~



**Q** Consider a small two-way set-associative cache memory, consisting of four blocks. For choosing the block to be replaced, use the least recently used (LRU) scheme. The number of cache misses for the following sequence of block addresses is 8, 12, 0, 12, 8 (Gate-2004) (2 Marks)

- (A) 2  
7
- (B) 3  
37

- (C) 4  
S2
- (D) 5  
1



8, 12, 0, 12, 8  
M, M, M, M, M

**Break**

## Memory Organization

- Memory is organized at different levels.
- CPU may try to access different levels of memory in different ways.
- On this basis, the memory organization is broadly divided into two types



## Hierarchical Access Memory Organization

- In this memory organization, memory levels are organized as
  - Level-1 is directly connected to the CPU.
  - Level-2 is directly connected to level-1.
  - Level-3 is directly connected to level-2 and so on
- Whenever CPU requires any word,
  - It first searches for the word in level-1.
  - If the required word is not found in level-1, it searches for the word in level-2.
  - If the required word is not found in level-2, it searches for the word in level-3 and so on.



- |                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                                                                                                                  |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul style="list-style-type: none"> <li>• <math>T_1</math> = Access time of level <math>L_1</math></li> <li>• <math>S_1</math> = Size of level <math>L_1</math></li> <li>• <math>C_1</math> = Cost per byte of level <math>L_1</math></li> <li>• <math>H_1</math> = Hit rate of level <math>L_1</math></li> </ul> | <ul style="list-style-type: none"> <li>• <math>T_2</math> = Access time of level <math>L_2</math></li> <li>• <math>S_2</math> = Size of level <math>L_2</math></li> <li>• <math>C_2</math> = Cost per byte of level <math>L_2</math></li> <li>• <math>H_2</math> = Hit rate of level <math>L_2</math></li> </ul> | <ul style="list-style-type: none"> <li>• <math>T_3</math> = Access time of level <math>L_3</math></li> <li>• <math>S_3</math> = Size of level <math>L_3</math></li> <li>• <math>C_3</math> = Cost per byte of level <math>L_3</math></li> <li>• <math>H_3</math> = Hit rate of level <math>L_3</math></li> </ul> |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|



### Effective Memory Access Time

Average time required to access memory per operation =

$$H_1 * T_1 + (1 - H_1) * H_2 * (T_1 + T_2) + (1 - H_1) (1 - H_2) * H_3 * (T_1 + T_2 + T_3)$$

$$H_1 * T_1 + (1 - H_1) * H_2 * (T_1 + T_2) + (1 - H_1) (1 - H_2) * (T_1 + T_2 + T_3)$$

## Average Cost per Byte

- Average cost per byte of the memory =

$$\frac{C_1 S_1 + C_2 S_2 + C_3 S_3}{S_1 + S_2 + S_3}$$



## Simultaneous Access Memory Organization

- In simultaneous access all the levels of memory are directly connected to the CPU, whenever CPU requires any word, it starts searching for it in all the levels simultaneously.



|                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                                                                                                                  |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul style="list-style-type: none"> <li>• <math>T_1</math> = Access time of level <math>L_1</math></li> <li>• <math>S_1</math> = Size of level <math>L_1</math></li> <li>• <math>C_1</math> = Cost per byte of level <math>L_1</math></li> <li>• <math>H_1</math> = Hit rate of level <math>L_1</math></li> </ul> | <ul style="list-style-type: none"> <li>• <math>T_2</math> = Access time of level <math>L_2</math></li> <li>• <math>S_2</math> = Size of level <math>L_2</math></li> <li>• <math>C_2</math> = Cost per byte of level <math>L_2</math></li> <li>• <math>H_2</math> = Hit rate of level <math>L_2</math></li> </ul> | <ul style="list-style-type: none"> <li>• <math>T_3</math> = Access time of level <math>L_3</math></li> <li>• <math>S_3</math> = Size of level <math>L_3</math></li> <li>• <math>C_3</math> = Cost per byte of level <math>L_3</math></li> <li>• <math>H_3</math> = Hit rate of level <math>L_3</math></li> </ul> |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|

Effective Memory Access Time (EMAT) =

$$H_1 * T_1 + (1 - H_1) * H_2 * T_2 + (1 - H_1) (1 - H_2) * H_3 * T_3$$

In any memory organization, the data item being searched will definitely be present in the last level (or secondary memory).

Thus, hit rate for the last level is always 1. So,

$$H_1 * T_1 + (1 - H_1) * H_2 * T_2 + (1 - H_1) (1 - H_2) * T_3$$



## Average Cost per Byte

- Average cost per byte of the memory =

$$\frac{C_1 S_1 + C_2 S_2 + C_3 S_3}{S_1 + S_2 + S_3}$$



**Example:** Calculate the EMAT for a machine with a cache hit rate of 80% where cache access time is 5ns and main memory access time is 100ns, both for simultaneous and hierarchical access.

**Use Referral Code KGYT for Unacademy Plus to Get minimum 10% Discount**

**Q** Assume that for a certain processor, a read request takes 50 nanoseconds on a cache miss and 5 nanoseconds on a cache hit. Suppose while running a program, it was observed that 80% of the processor's read requests result in a cache hit. The average read access time in nanoseconds is \_\_\_\_\_ . **(GATE-2015) (2 Marks)**

**Q** Consider a system with 2 level cache. Access times of Level 1 cache, Level 2 cache and main memory are 0.5 ns, 5 ns and 100 ns respectively. The hit rates of Level 1 and Level 2 caches are 0.7 and 0.8 respectively. What is the average access time of the system ignoring the search time within the cache? **(NET-DEC-2018)**

a) 35.20 ns

b) 7.55 ns

c) 20.75 ns

d) 24.35 ns

## Cache Coherence Problem

- If multiple copy of same data is maintained at different level of memories then inconsistency may occur, this problem is known as cache coherence problem.
- Cache coherence problem can be resolved using the following techniques:
  - Write Through
  - Write Back

## Write Through

- Write through is used to maintain the consistency between the cache and main memory.
- According to it if the cache copy is updated, at the same time main memory is also updated.
- **Advantages**
  - It provides the highest level of consistency.
- **Disadvantages**
  - It requires more number of memory access.

## Write Back

- Write back is also used to maintain the consistency between the cache and main memory.
- According to it all the changes performed on cache are reflected back to the main memory in the end.
- **Advantage**
  - Less number of memory accesses and less write operations.
- **Disadvantage**
  - Inconsistency may occur.

**Q** In \_\_\_\_\_ method, the word is written to the block in both the cache and main memory, in parallel. **(NET-JULY-2016)**

- (a) Write through
- (b) Write back
- (c) Write protected
- (d) Direct mapping

**Q** A hierarchical memory system that uses cache memory has cache access time of 50 nano seconds, main memory access time of 300 nano seconds, 75% of memory requests are for read, hit ratio of 0.8 for read access and the write-through scheme is used. What will be the average access time of the system both for read and write requests? (NET-DEC-2014)

- (A) 157.5 n.sec
- (B) 110 n.sec
- (C) 75 n.sec
- (D) 82.5 n.sec

**Q** For inclusion to hold between two cache levels  $L_1$  and  $L_2$  in a multi-level cache hierarchy, which of the following are necessary?  
**(Gate-2008) (2 Marks)**

- I.  $L_1$  must be a write-through cache
- II.  $L_2$  must be a write-through cache
- III. The associativity of  $L_2$  must be greater than that of  $L_1$
- IV. The  $L_2$  cache must be at least as large as the  $L_1$  cache

**(A) IV only**  
**(C) I, III and IV only**

**(B) I and IV only**  
**(D) I, II, III and IV**

**Q** In a two-level cache system, the access times of  $L_1$  and  $L_2$  is 1 and 8 clock cycles, respectively. The miss penalty from the  $L_2$  cache to main memory is 18 clock cycles. The miss rate of  $L_1$  cache is twice that of  $L_2$ . The average memory access time (AMAT) of this cache system is 2 cycles. The miss rates of  $L_1$  and  $L_2$  respectively are: (Gate-2017) (2 Marks)

- (A) 0.111 and 0.056
- (B) 0.056 and 0.111
- (C) 0.0892 and 0.1784
- (D) 0.1784 and 0.0892

**Q** The read access times and the hit ratios for different caches in a memory hierarchy are as given below:

| Cache    | Read access time<br>(in nanoseconds) | Hit ratio |
|----------|--------------------------------------|-----------|
| I-cache  | 2                                    | 0.8       |
| D-cache  | 2                                    | 0.9       |
| L2-cache | 8                                    | 0.9       |

The read access time of main memory is 90 nanoseconds. Assume that the caches use the referred-word-first read policy and the writeback policy. Assume that all the caches are direct mapped caches. Assume that the dirty bit is always 0 for all the blocks in the caches. In execution of a program, 60% of memory reads are for instruction fetch and 40% are for memory operand fetch. The average read access time in nanoseconds (up to 2 decimal places) is \_\_\_\_\_ (Gate-2017) (2 Marks)

**Q** Consider a two-level cache hierarchy  $L_1$  and  $L_2$  caches. An application incurs 1.4 memory accesses per instruction on average. For this application, the miss rate of  $L_1$  cache 0.1, the  $L_2$  cache experience on average. 7 misses per 1000 instructions. The miss rate of  $L_2$  expressed correct to two decimal places is \_\_\_\_\_ . (Gate-2017) (1 Marks)

**Q** A file system uses an in-memory cache to cache disk blocks. The miss rate of the cache is shown in the figure. The latency to read a block from the cache is 1 ms and to read a block from the disk is 10 ms. Assume that the cost of checking whether a block exists in the cache is negligible. Available cache sizes are in multiples of 10 MB.



The smallest cache size required to ensure an average read latency of less than 6 ms is \_\_\_\_\_ MB. (Gate-2016) (2 Marks)

**Q** The memory access time is 1 nanosecond for a read operation with a hit in cache, 5 nanoseconds for a read operation with a miss in cache, 2 nanoseconds for a write operation with a hit in cache and 10 nanoseconds for a write operation with a miss in cache. Execution of a sequence of instructions involves 100 instruction fetch operations, 60 memory operand read operations and 40 memory operands write operations. The cache hit-ratio is 0.9. The average memory access time (in nanoseconds) in executing the sequence of instructions is \_\_\_\_\_. **(Gate-2014) (2 Marks)**

**Q** A computer system has an L<sub>1</sub> cache, an L<sub>2</sub> cache, and a main memory unit connected as shown below. The block size in L<sub>1</sub> cache is 4 words. The block size in L<sub>2</sub> cache is 16 words. The memory access times are 2 nanoseconds, 20 nanoseconds and 200 nanoseconds for L<sub>1</sub> cache, L<sub>2</sub> cache and main memory unit respectively.



When there is a miss in L<sub>1</sub> cache and a hit in L<sub>2</sub> cache, a block is transferred from L<sub>2</sub> cache to L<sub>1</sub> cache. What is the time taken for this transfer? (Gate-2010) (2 Marks)

- (A) 2 nanoseconds
- (B) 20 nanoseconds
- (C) 22 nanoseconds
- (D) 88 nanoseconds

**Q** A computer system has an L<sub>1</sub> cache, an L<sub>2</sub> cache, and a main memory unit connected as shown below. The block size in L<sub>1</sub> cache is 4 words. The block size in L<sub>2</sub> cache is 16 words. The memory access times are 2 nanoseconds, 20 nanoseconds and 200 nanoseconds for L<sub>1</sub> cache, L<sub>2</sub> cache and main memory unit respectively.



**Q** When there is a miss in both L<sub>1</sub> cache and L<sub>2</sub> cache, first a block is transferred from main memory to L<sub>2</sub> cache, and then a block is transferred from L<sub>2</sub> cache to L<sub>1</sub> cache. What is the total time taken for these transfers? (Gate-2010) (2 Marks)

- (A) 222 nanoseconds
- (C) 902 nanoseconds

- (B) 888 nanoseconds
- (D) 968 nanoseconds

**Q** Consider a system with 2 level caches. Access times of Level<sub>1</sub> cache, Level<sub>2</sub> cache and main memory are 1 ns, 10ns, and 500 ns, respectively. The hit rates of Level<sub>1</sub> and Level<sub>2</sub> caches are 0.8 and 0.9, respectively. What is the average access time of the system ignoring the search time within the cache? (Gate-2004) (1 Marks)

**Q** A dynamic RAM has a memory cycle time of 64 nsec. It has to be refreshed 100 times per msec and each refresh takes 100 nsec. What percentage of the memory cycle time is used for refreshing? **(Gate-2005) (1 Marks)**

**(A) 10**

**(B) 6.4**

**(C) 1**

**(D) .64**

**Q** A cache line is 64 bytes. The main memory has latency 32ns and bandwidth 1G.Bytes/s. The time required to fetch the entire cache line from the main memory is **(Gate-2006) (2 Marks)**

- (A) 32 ns**      **(B) 64 ns**      **(C) 96 ns**      **(D) 128 ns**

**Q.29** Assume a two-level inclusive cache hierarchy. L1 and L2, where L2 is the larger of the two. Consider the following statements.

**S<sub>1</sub>** : Read misses in a write through L1 cache do not result in writebacks of dirty lines to the L2.

**S<sub>2</sub>** : Write allocate policy must be used in conjunction with write through caches and no-write allocate policy is “used with writeback caches.”

**(GATE- 2021)**

Which of the following statements is correct?

- |                                                         |                                                          |
|---------------------------------------------------------|----------------------------------------------------------|
| (a) S <sub>1</sub> is true and S <sub>2</sub> is true.  | (b) S <sub>1</sub> is true and S <sub>2</sub> is false.  |
| (c) S <sub>1</sub> is false and S <sub>2</sub> is true. | (d) S <sub>1</sub> is false and S <sub>2</sub> is false. |