

# Operating Systems

## Unit - 3

### Memory Management

#### \* Registers vs. Main memory access

- Registers built into the CPU are accessible within one clock cycle.
- However, the main memory is accessed via transactions on the memory bus, which may take several clock cycles.
- In such cases, the processor normally needs to stall, since it does not have the data required to complete the instruction that it is executing.

Remedy : Introduce cache between main memory & CPU registers

#### \* Base and limit Registers

- Each process must have its own memory space.
- To separate memory spaces, the range of legal addresses for that process must be determined.
- Done using 2 registers - base, and limit register

Base Register : holds smallest legal physical memory address

Limit Register : specifies range

#### \* Hardware Address Protection

- Protection of memory space is accomplished by having the CPU hardware compare every address generated in user mode w/ the registers.



- An attempt by a user program to access OS memory or other user's memory results in a trap to the OS.
- This scheme prevents the user program from accidentally & deliberately modifying the code or data structures of either the OS or other users.
- The base and limit registers can be accessed only using a special privileged instruction  $\Rightarrow$  only the OS can load the base & limit registers



### \* Address Binding

- Programs on the disk are brought into the memory to execute from an input queue.
- A user program goes through several steps, before being executed. Addresses are represented in different ways during these steps.
- Initially, addresses in the source program are symbolic.
- A compiler binds symbolic addresses to relocatable addresses
- The linkage editor or Loader binds the relocatable addresses to the absolute address.
- The binding of instructions is a mapping from one address space to another.

→ Address binding of instructions and data to memory

Addresses can happen at 3 different stages.

(i) Compile Time : If the memory location is known a priori, absolute code can be generated, must recompile code if starting location changes.

(ii) Load Time: If it is not known at compile time where the process will reside in the memory, then the compiler must generate relocatable code. In this case, final binding is delayed until load time.

(iii) Execution Time: Binding delayed until runtime if the process can be moved during its execution from one memory segment to another  
(Need special hardware for this scheme to work)



## \* Logical vs. Physical Address Space

Logical address - address generated by the CPU

Physical address - address loaded into the memory address register

Logical address space - set of all logical addresses generated by a program

Physical address space - set of all physical addresses generated by a program

## \* Memory Management Unit (MMU)

- run-time mapping from virtual to physical addresses done using MMU.
- base register now called a relocation register
- value in the relocation register added to every address generated by a user process.



$$\text{eq. base address} = 14000$$

$$\text{logical address} = 346$$

$$\text{physical address} = 14346$$

→ In general : Logical address = 0 to max

physical addresses :  $R+0$  to  $\max+R$   
for base  $R$ .

## \* Dynamic loading

- a routine is not loaded until it is called
  - all routines are kept on the disk in a relocatable load format.
  - The main program is loaded into the memory & executed.
  - When a routine needs to call another routine, the calling routine checks to see whether the other routine has been loaded.
  - If it has not, the relocatable linking loader loads the desired routine & updates the program's address.
  - does not require special support from the OS.
- Advantage: a routine is loaded only when it is needed
- useful when large amounts of code are needed to handle infrequently occurring cases.

## \* Dynamic linking

static linking: system libraries and program code combined by the loader into the binary program image

dynamic linking: linking postponed until execution time

- a small piece of code - stub - used to locate the appropriate memory resident library routine
- stub checks if the routine is already in memory or otherwise loads it
- stub replaces itself w/ the address of the routine & executes the routine
- useful for using latest library versions - called shared libraries

## \* Swapping

- A process can be swapped temporarily out of the memory to a backing store, and then be brought back into the memory.
- Swapping makes it possible for the total physical address space of all processes to exceed the real physical memory of the system.

### a. Standard Swapping

- move processes between the main memory & backing store (backing store = a fast disk, large enough to accommodate copies of all memory images for all users).
- system maintains a ready queue w/ all processes whose memory images are in the backing store.
- context-switch time in standard swapping is very high
- can reduce if the size of memory swapped is less (by knowing how much memory really is being used)



system calls to inform  
OS of memory use  
request-memory() &  
release-memory()

### \* Constraints on Swapping

- can swap only completely idle processes
- must be careful when one wants to swap a process that is pending I/O (can't swap as I/O would occur to wrong process)

Q Solutions : (i) never swap process w/ pending I/O

(ii) execute I/O operations only on to operating system buffers

Transfers between OS buffers & process memory happen when the process is to be swapped in.

→ called double buffering → adds overhead, since the data has to be copied from the kernel memory to user memory, before the user process can access it.

## \* Swapping in Modern Operating Systems

methods used:

### (i) disable swapping

- starts only if the amount of free memory falls below a certain threshold.
- halted once free memory increased

(ii) Swap portions of processes - rather than entire processes, to decrease swap time.

## \* Swapping on Mobile Systems

- Mobile systems do not support swapping in any form.
- They use flash memory rather than spacious hard disks.
- When free memory falls below a certain threshold, iOS asks applications to voluntarily relinquish allocated memory.
- The read only data is removed from the system & later reloaded
- Failure to free memory may result in application termination.

Android : adopts a strategy similar to iOS.

- before terminating a process, Android writes its application state to the flash memory so that it can be quickly restarted

## \* Contiguous Memory Allocation

- Main memory must accommodate both the OS and the various user processes.
- Main memory must be allocated as efficiently as possible.
- In contiguous memory allocation, the memory is divided into partitions.
  - (i) The resident OS, is usually held in low memory along w/ the interrupt vector
  - (ii) User processes are held in high memory.
- Each process is contained in a single contiguous section of memory.

## \* Memory Protection

- use both a relocation register and limit registers.
- Relocation register contains the value of the smallest physical address.
- The limit register contains the range of logical addresses.
- The MMU dynamically maps the logical address.
- This allows the size of the OS to change dynamically.
- The OS contains code & buffer space for drivers.
- If a device driver is not commonly used, the space can be used for other purposes.

Such code = Transient operating system code



## \* Multiple- Partition Allocation

### A. Fixed-Size Partitioning

- memory divided into several fixed-size partitions
- each partition may contain exactly one process
- degree of multiprogramming bound by the no. of partitions

### B. <sup>Variable</sup>Multiple-Partitioning

- The OS keeps a table indicating which parts of memory are available and which are occupied.
- holes: a block of available memory, holes of various sizes are scattered throughout memory.
- When a process arrives, and needs memory, the system searches in the set for a hole that is large enough for this process.
- If the hole is too large, it is split again.
- Adjacent holes are merged to form one larger whole.
- The system continually checks for processes waiting for memory and whether the free/recombined memory could satisfy the demands of any of the waiting processes.

## \* Dynamic Storage-Allocation Problem

- concerns how to satisfy a request of size n from a list of free holes

### Methods

- First Fit - allocate first hole that is big enough
- Best Fit - allocate the smallest hole that is big enough
- Worst-fit - allocate the largest list, search entire list

next-fit-like  
first fit, but  
check from last allocated  
spot

Example : Allocate the following requests using first fit, best fit, worst fit and next fit



$A = 6K$   
 $B = 3K$   
 $C = 1K$   
 $D = 5K$   
 $E = 8K$

### Ans First Fit



$E$  not allocated

### Best Fit



$E$  not allocated

### Worst Fit



$D \geq E$  not allocated

### Next fit



$E$  not allocated.

### \* Fragmentation

- A. External Fragmentation - happens when there is enough total memory to satisfy a request, but the available spaces are not contiguous.

\* Internal Fragmentation - allocated memory may be slightly <sup>larger</sup> than requested memory, this size diff. is internal to a partition

50% Rule: Analysis of first fit reveals that even col some optimisation, given N allocated blocks, another 0.5N blocks will be lost to fragmentation.  $\frac{1}{3}$ rd of memory be unusable. This property is known as the 50% rule.

### \* Compaction

- can be used to reduce external fragmentation
- shuffle memory contents to place all free memory together in one large block
- compaction is possible only if relocation is dynamic, and is done at execution time.
- If allowed, then move program and data, and change the base register to reflect the new address
- simplest compaction algorithm - move all processes to one end of the memory, and all holes to the other end, producing one large hole of memory. This scheme is expensive

### \* Segmentation

- a memory management scheme that supports the programmer view of memory
- A logical address space is a collection of segments. Each segment has a name and a length

- The addresses specify the segment name and the offset
- The logical address is a tuple with 2 entries  
(segment - no., offset)

## Segmentation Architecture

- \* segment table - maps two-dimensional physical addresses. Each table entry has:
  - base - contains the starting physical address where the segments reside in the memory
  - limit - specifies the length of the segment
- \* segment table base register (STBR) : points to the segment table's location in memory
- \* segment table length register (STR) : indicates number of segments used by a program

## protection

with each entry in the segment table associate:

- validation bit = 0  $\Rightarrow$  illegal segment
- read | write | execute privilege

→ Protection bits are associated with segments



Example: Consider the following segments along with their base and length values.

| Segment | Base | Length |
|---------|------|--------|
| 0       | 219  | 600    |
| 1       | 2300 | 14     |
| 2       | 90   | 100    |
| 3       | 1327 | 580    |
| 4       | 1952 | 96     |

What are the physical addresses for the following logical addresses?

(a) 0, 430

$$\begin{aligned}\text{physical addr} &= 219 + 430 \\ &= 649\end{aligned}$$

(b) 1, 10

$$\begin{aligned}\text{physical addr} &= 10 + 2300 \\ &= 2310\end{aligned}$$

(c) 2, 500

$$\begin{aligned}\text{physical addr} &= 90 + 500 \\ &= 590 > \text{length} + \text{base} \\ \Rightarrow &\text{segmentation fault}\end{aligned}$$

(d) 3, 400

$$\text{physical address} = 1327 + 400 = 1727$$

(e) 4, 112

$$\begin{aligned}\text{physical address} &= 1952 + 112 \geq \text{length} + \text{base} \\ \Rightarrow &\text{segmentation fault}\end{aligned}$$

## \* Paging

- a method of non-contiguous memory allocation
- The process is divided into a no. of partitions called pages.  
The size of each page must be the same in the logical memory.
- In the main memory (physical memory), the division is in the form of frames . size of each frame.
- Page table helps link the logical memory and physical memory.  
→ has 2 parts <page no., frame no>

### Example

|        |
|--------|
| page 0 |
| page 1 |
| page 2 |
| page 3 |

Logical memory

|   |   |
|---|---|
| 0 | 1 |
| 1 | 4 |
| 2 | 3 |
| 3 | 7 |

page table



### Address Translation Scheme

The logical address generated by the CPU is divided into:

page no (p) - indexes into a page table, which contains base address of each page in physical memory

page offset (d) - combined with base address to define the physical memory address that is sent to the memory unit



for  
logical address space:  $2^m$   
Pagesize =  $2^n$

## \* Paging Hardware

- CPU generates a logical address for the instruction to be executed.
- logical address is divided  $\boxed{P \mid d}$ , where  
 $P$  = page no.  
 $d$  = displacement
- The page no is given as input to the page table. The page table produces the corresponding frame no
- If the frame no is combined with the offset  $d$ , it gives the exact physical address



## \* Implementation of Page Table

- The page table is kept in the main memory,
- The page-table base register (PTBR) points to the page table
- The page-table length register (PTRR) indicates the size of the page table
- In this scheme, every data/instruction access requires 2 memory accesses - one for the page table and one for data/instruction.

→ The two memory access problem can be solved by the use of  
a special fast - lookup hardware cache called associative memory  
or translation look-aside buffers (TLBs).

- Some TLBs store address - space identifiers (ASIDs) in each  
TLB entry - uniquely identifies each process to provide address-  
space protection for that process
- TLBs are very small, on a TLB miss, a value is loaded into the  
TLB for faster access next time.

### Associative memory

- The TLB has 2 parts : page # & frame #
- Address translation (p.d) : If p is in associative register  
get the frame # out.
- Otherwise get frame # from page table in memory.

### Hardware



## \* Effective Access Time

(n)

→ associative lookup =  $\epsilon_p$  time unit

→ hit ratio =  $\alpha$

↳ defines the percentage of times that a page no. is found in the associative registers

→ To find the effective memory access time, weight each case by its probability.

$$EAT = \frac{Hit(TLB + MM) + miss(TLB + PT + MM)}{HIM}$$

Example :  $\alpha = 80\%$ ,  $\epsilon_p = 20\text{ns}$  for TLB search, 100 ns for memory access.

$$\begin{aligned} EAT &= 0.8 * (20 + 100) + 0.2 * (100 + 20 + 100) \\ &= 0.8 * 120 + 0.2 * 220 \\ &= \underline{\underline{140\text{ns}}} \end{aligned}$$

Example 2: Consider a hit ratio of 90%, 20ns for TLB search and 100ns for memory access

$$\begin{aligned} EAT &= Hit(TLB + MM) + miss(TLB + PT + MM) \\ &= 0.9 * (20 + 100) + 0.1 * (20 + 100 + 100) \\ &= 0.9 * 120 + 0.1 * 220 \\ &= \underline{\underline{121\text{ns}}} \end{aligned}$$

Example 3: Consider a paging system with the page table stored in the memory. If a memory reference takes 50 nanoseconds how long does a paged memory reference take? If we add TLBs and

75). If all page-table references are found in the TLBs, what is the effective memory reference time? Assume that finding a page-table entry in the TLBs takes 2ns, if the entry is present.

### Solution

$$\begin{aligned}\text{Time for paged memory reference} &= \text{time to access page table} \\ &\quad + \text{time to access main memory} \\ &= 50 + 50 = 100 \text{ ns}\end{aligned}$$

$$\alpha = 75\%.$$

$$t_p = 2 \text{ ns}$$

$$\begin{aligned}EAT &= \text{hit}(\text{TLB} + \text{MM}) + \text{miss}(\text{TLB} + \text{PT} + \text{MM}) \\ &= 0.75(2 + 50) + 0.25(2 + 100) \\ &= 0.75 * 52 + 0.25 * 102 \\ &= 64.5 \text{ nanoseconds}\end{aligned}$$

### \* Memory Protection

- Is implemented by associating a protection bit with each frame to indicate if read-only or ~~or~~ read-write access is allowed
- a valid-invalid bit is attached to each entry in the page table
  - (i) 'valid' indicates that the associated page is in the process' logical address space, and is thus a legal page
  - (ii) 'invalid' indicates that the page is not in the process' logical address space.

|        |
|--------|
| Page 0 |
| Page 1 |
| Page 2 |
| Page 3 |
| Page 4 |
| Page 5 |

valid - invalid bit

|   |   |   |
|---|---|---|
| 0 | 2 | v |
| 1 | 3 | v |
| 2 | 4 | v |
| 3 | 7 | v |
| 4 | 8 | v |
| 5 | 9 | v |
| 6 | 0 | i |
| 7 | 0 | i |

## \* Shared Pages

- Paging can help share common code.
- Reentrant / pure code can be shared
  - ↓ non-self-modifying code - it never changes during execution.
- Thus, 2 or more processes, can execute the same code at the same time.
- Each process has its own copy of registers & data storage to hold data for the process' execution.
- The pages for the private code and data can appear anywhere in the logical address space

## Example

|       |
|-------|
| ed1   |
| ed2   |
| ed3   |
| data1 |

|   |   |
|---|---|
| 0 | 3 |
| 1 | 4 |
| 2 | 6 |
| 3 | 1 |

Process P1

pagetable<sub>P1</sub>

|       |
|-------|
| ed1   |
| ed2   |
| ed3   |
| data3 |

|   |
|---|
| 3 |
| 4 |
| 6 |
| 2 |

pagetable<sub>P3</sub>

|   |       |
|---|-------|
| 0 | d'    |
| 1 | data1 |
| 2 | data3 |
| 3 | ed1   |
| 4 | ed2   |
| 5 | .     |
| 6 | ed3   |
| 7 | .     |

## \* Structuring the page table

- memory structures for paging can get huge using straight forward method.
- It is not desirable to allocate such large blocks contiguously in the main memory.
- Common techniques used:
  - (i) Hierarchical paging
  - (ii) Hashed page tables
  - (iii) Inverted page tables

### A. Hierarchical Paging

- The logical address space is broken up into multiple page tables.
- The page table itself is paged.

#### Two-level paging



- $P_1$  is an index to the outer page no
- $P_2$  is the displacement within the page of the inner page table
- Known as forward-mapped page table.

#### Three-level paging



## \* Address Translation Scheme for Hierarchical Paging

(Q1)



### B. Hashed Page Tables

- CPU generates a logical address - w/ page no. & offset
- The page no. is given as i/p to the hash function
- The hash fn. produces a hash value, that is mapped to a location in the hashtable.
- Each entry in hash table is a linked list (separate chaining)
- Each entry of the linked list has the following structure:

|          |          |          |
|----------|----------|----------|
| page no. | frame no | nextaddr |
|----------|----------|----------|

- The virtual page no. is compared with field1 in the first element of the linked list.
- If there is a match, the corresponding page frame (field2) is to form the desired physical address.
- If there is no match, subsequent entries in the linked list are searched for a matching virtual page no.



#### \* clustered page tables

- a variation of hashed page tables
- each entry in the hash table refers to several pages rather than a single page.
- ∴, a single page-table entry can store the mappings for multiple physical page-frames.
- Clustered page tables are useful for sparse address spaces where memory references are non-contiguous & scattered throughout the address space

#### \* Inverted Page Tables

- Usually, each process has an associated page table.
- A drawback of this method is that each page table may consist of millions of entries - may consume large amounts of physical memory
- Inverted page table - has one entry for each real page / frame
- Each entry consists of the virtual address of the page, along w/ the information about the process that owns the page.

→ Inverted page tables have an address-space identifier. This allows for a logical page & a particular process to be mapped to the corresponding physical page frame.

→ The virtual address is in the form of:

$\langle \text{process-id}, \text{page-number}, \text{offset} \rangle$

- Each inverted page table entry is a pair  $\langle \text{process-id}, \text{page-no} \rangle$
- When a memory reference occurs, part of the virtual address is presented to the memory sub system.
- The inverted page table is then searched for a match, then the corresponding physical address  $\langle i, \text{offset} \rangle$  is generated.

### Disadvantages

- increased time to search table (use hash tables)
- difficult to implement shared memory (have one mapping of virtual address to shared physical address)



## Numericals

### Points to Note:-

- m-bit processor  $\Rightarrow$  logical address = m-bits longs
- size of logical address space =  $2^m$   
page size =  $2^n$  (bytes or words)
- page no = m-n
- page offset = n

Q1. Consider the logical address space of 8 pages of  $1024$  words mapped onto the physical memory of 32 frames

- (i) How many bits are there in the physical address?
- (ii) How many bits are there in the logical address?

Ans: page size = 1024 words

$$= 2^{10}$$

$$\boxed{n=10}$$

logical address space = 8 pages  $\times$  1024 words

$$= 2^3 \times 2^{10}$$

$$= 2^{13}$$

$$\boxed{2^m = 2^{13}}$$



$$\boxed{m=13}$$

size of physical address space:

$$= 32 \times 2^{10}$$

$$= 2^5 \times 2^{10}$$

$$= 2^{15}$$

$\Rightarrow$  logical address space = 13 bits

physical address space = 15 bits

Ques Given a 32 bit processor, with a page size of 1024 bytes. Find

(i) size of logical address

(ii) no. of bits to represent page no. & offset

(iii) max. size of logical address space

(iv) max. no. of pages in logical address space

(v) max length of page table of a process

Ans: (i) size of logical address

32 bit processor = size of logical address space =  $2^m$

where  $m = 32$

(ii) no. of bits used to represent page no. and offset

page size = 1024 bytes

$$= 2^{10} \text{ bytes} \rightarrow$$

$$n = 10$$



bits to represent pg. no. =  $m-n = 22$  bits

bits to represent offset =  $n = 10$  bits

(iii) max. size of logical address space

$$= 2^{32} \text{ bytes}$$

$$2^{10} = 1 \text{ kB}$$

$$= 2^2 \times 2^{30}$$

$$2^{20} = 1 \text{ MB}$$

$$= \underline{4 \text{ Gi bytes}}$$

$$2^{30} = 1 \text{ GB}$$

$$= \underline{\underline{4 \text{ GB}}}$$

(iv) max no. of pages in logical address space =

$$2^{m-n}$$

$$= 2^{32-10} = 2^{22}$$

$$= 2^2 \times 2^{20}$$

$$= 4 \text{ MB}$$

(v) max length of page table

$$= \text{max no. of pages} = 4 \text{ M entries}$$

$$= \underline{4 \text{ million entries}}$$

Q3 Consider a machine with 64 MB physical memory & a 32 bit virtual memory. If the pagesize is 4 kB, what is the approximate size of the page table?

Ans: physical memory = 64 MB

logical address = 32 bits long

page size = 4 kB

Convert into bytes first

$$\text{physical memory} = 64 \text{ MB} = 2^{20} \times 2^6 \\ = 2^{26} \text{ bytes}$$

$$\text{page size} = 2^2 \times 2^{10} \text{ bytes} \\ = 2^{12} \text{ bytes} \quad | \boxed{n=12}$$

$$\text{virtual address space} = 32 \text{ bits long} \Rightarrow 2^{32} \text{ bytes}$$



$$m = 32$$

| size of logical address space = pages  $\times$  words (pagesize) |

$$2^{32} = \text{pages} \times 2^{12}$$

$$\text{pages} = \frac{2^{32}}{2^{12}} = \boxed{2^{20} \text{ entries}}$$

Total no. of frames = page size  $\times$

| physical memory size = page size  $\times$  no. of frames |

$$\text{frame size} = \frac{2^{26}}{2^{12}} = 2^{14} \text{ frames}$$

$$\text{page table size} = 2^{20} \text{ entries} \times 14 \text{ bytes}$$

$$\approx 2^{20} \times 2 \text{ bytes} = 2 \text{ MB}$$

Q4 Consider a system which has logical address = 7 bits  
 physical page address = 6 bits, page size = 8 words. Calculate the no. of pages and no. of frames

Ans:

logical address = 7 bits

page size  
 page offset are  
 the same thing

logical address size =  $2^7$

$$m = 7$$



$m-n$  page no.       $n$  pagesize/offset

pagesize = 8 words

= 8 bytes

=  $2^3$  bytes

$$n = 3$$

no. of bits used to represent page no = 4 bits

$$\text{no. of pages} = 2^4 = 16$$

physical address = 6 bits

$$\text{total size} = 2^6 = 64 \text{ bytes}$$

physical  $\xrightarrow{6}$



frame no.      frame size/offset

frame size = page size

bits used to represent frames = 3

$$\text{total no. of frames} = 2^3 = 8$$

## Consolidated Formulae

logical address space

physical address space



page no } page size /  
offset  
represented in  
words somehow }

$$\text{page size / offset} = \text{frame size / offset}$$

logical address = m bits  
is represented in

Total logical address size =  $2^m$

no. of words = no. of bits used to represent a page = n

size of page =  $2^n$

size of logical address = pages × page size

size of physical address = frames × no. of frame size

Q5. Consider a system in which the logical address is 27 bits and the physical address requires 21 bits. The page size is 4K words. Calculate the no. of pages and no. of ~~frames~~ <sup>frames</sup>

Ans logical address = 27 bits

$$m = 27$$



page size = 4K words

$$= 2^2 \times 2^{10} = 2^{12} \text{ bytes}$$

$$1 n = 12$$

page no      page size/  
no. of words

$$\frac{\text{no. of bits used to rep pages}}{\text{Total size no. of pages}} = m \cdot n = 15$$

no. of frames = ?

Physical memory



frame no.      frame size / offset

representation of physical address is in 21 bits

$$\text{no. of frames} = 21 - 12 = 9 \text{ bits}$$

$$\text{Total frame size} = 2^9$$

$$= 512 \text{ frames}$$

.....

Q6 Consider a system having a page table w/ 4k entries, logical address of 29 bits. How many bits does the physical address require if the system is of 512 frames?

Logical address



page no      page size /  
                  offset

no. of bits used to represent logical address = 29 bits

$$m = 29$$

$$\text{no. of entries} = \frac{\text{no. of words}}{\text{no. of pages}} = \text{no. of pages} = 4k = 2^2 \times 2^{10} \\ = 2^{12}$$

$$\frac{m}{n} = 12$$

$$\text{Page size } n = 29 - 12 = 17$$

## Physical address



frame no.      frame size

$$\text{no. of frames} = 512 = 2^9$$

$\Rightarrow$  9 frames

$$\text{frame size} = \text{page size} = 17$$



$$\Rightarrow \text{Total bits in physical address} = \underline{\underline{2^6 \text{ bits}}}$$

(Q7) Consider a logical address space of 64 pages of  $1024$  words each mapped onto a physical memory of 32 frames

How many bits are there in the logical address & physical address?

## Logical address



no. of pages      page size / offset

$$m-n = \underline{\underline{64}} \quad \text{no. of pages}$$

$$m-n = 2^6$$

∴

$$\boxed{m-n=6}$$

$$\text{no. of words} = \cancel{\text{no. of pages}} \cdot \text{page size}$$

$$= 1024 \text{ words}$$

$$= 2^{10}$$

$$\Rightarrow 10 \text{ bits}$$

$$\boxed{n=10}$$

$\Rightarrow$  Total logical address ~~bits~~

$$\boxed{\text{words} = \text{page size}}$$



$$\boxed{16 \text{ bits}}$$

## Physical memory

|   |    |
|---|----|
| 6 | 10 |
|---|----|

frame no. frame size

physical frames = 32

=  $2^5$  frames

no. of bits to represent frames = 5

Total no. of bits in physical address = 15

Q8 Consider a memory system w/ 128 kB of logical address

space and 512 kB of physical address, the page size is 16 kB

Calculate the no. of bits needed to represent the logical address  
and the no. of bits needed to represent the physical address

Calculate the no. of pages and the no. of frames

logical address

|          |           |
|----------|-----------|
| P        | d         |
| page no. | page size |
|          | offset    |

no. of bits in logical addr = m

$$= 128 \text{ kB} = 2^7 \times 2^{10} \\ = 2^{17}$$

$m = n$

no. of bits needed to represent logical address = 17

$$\text{page size} = 16 \text{ kB} = 2^4 \times 2^{10} \\ = 2^{14}$$

n = 14

|    |    |
|----|----|
| LA | 17 |
|    |    |

physical address =  $\frac{512}{16 \text{ kB}} = 2^9 \times 2^{10} = 2^{19}$

Total = 19 bits

|    |    |
|----|----|
| PA | 19 |
|    |    |

no. of bits needed to represent physical address = 19

$$\text{no. of pages} = 2^3 = 8 \text{ pages}$$

(33)

$$\text{no. of frames} = 2^5 = 32 \text{ frames}$$

Q9 Consider the page table where

Page 0 → frame 7

Page 1 → frame 1

Page 2 → frame 15

Page 3 → frame 31

The remaining pages are not loaded in the memory.

Convert the following memory locations from the logical address space into the physical address space

(i) 1024

page size = 16kB

(ii) 11000

Formula

$$P = \text{mem. addr} / \text{page size}$$

$$d = \text{mem. addr} \% \text{page size}$$

$$PA = \text{frame no} \times \text{framesize} + d$$

Ans. (i) 1024

$$P = 1024 / 16 \times 1024 = 1 / 16 = 0 \Rightarrow \text{frame 7}$$

$$d = 1024 \% (16 \times 1024) = 1024$$

$$\begin{aligned} PA &= 7 \times (16 \times 1024) + 1024 \\ &= \underline{\underline{115712}} \end{aligned}$$

(ii) 11000

$$p = 11000 / (16 \times 1024) = 0$$

$$d = 11000 \% (16 \times 1024) = 11000$$

$$PA = 7 \times (16 \times 1024) + 11000$$

$$= \underline{\underline{125688}}$$

(iii) 123458

$$p = 123458 / (16 \times 1024) = 7$$

↳ not in page table  
⇒ page fault

(iv)  $\overset{31588}{p} = 31588 / (16 \times 1024) = 1 \rightarrow \text{frame 1}$

$$d = 15204$$

$$\begin{aligned} PA &= 1 \times (16 \times 1024) + 15204 \\ &= 31588 \end{aligned}$$

(v) 50,000

$$p = 50000 / (16 \times 1024) = 3 \Rightarrow \text{frame 3}$$

$$d = 848$$

$$31 \times (16 \times 1024) + 848$$

$$= \underline{\underline{508752}}$$

## Virtual Memory

→ Virtual memory - separation of user logical memory from the physical memory

→ only a part of the program needs to be in memory for execution

→ logical address space can be much larger than physical address space.

Benefits: (i) Address spaces shared by multiple processes

- (i) more programs running concurrently
- (ii) less I/O to load or swap processes

→ virtual address space

→ logical view of how a process is stored in the memory

→ usually starts at address 0, contiguous addresses until the end of space

→ physical memory organized in page frames

→ MMU maps logical to physical addr.

• Virtual memory can be implemented as:

- (i) demand paging
- (ii) demand segmentation



→ The heap in the virtual address space grows upwards in memory, as it is used for dynamic memory allocation.

→ The stack grows downwards in memory.

→ The large blank space or hole between the heap & the

stack is part of the virtual address space, but will require actual physical pages only if the heap or stack grows.

→ Virtual address spaces that include holes are called sparse address spaces.

→ Sparse address spaces are beneficial because the holes can be filled as the stack / heap segments grow & if we wish to dynamically link libraries during program execution

### \* Demand Paging

→ load only pages only when needed, rather than loading the entire program into the physical memory at program execution time.

→ Pages that are never accessed are thus never loaded into the physical memory.

→ demand paging system similar to a paging system with swapping.

→ A lazy swapper is used ⇒ never swaps a page into memory unless that page will be needed.

→ A swapper that deals w/ pages = pager, in the context of demand paging.

## Working

(27)

- with swapping, the pager guesses which pages will be used before swapping out again
  - require hardware support to distinguish between pages in memory & pages on the disk - use the valid, invalid bit
  - pages which are already present in the main memory are memory resident - there is no difference between these
- non-demand paging
- ⇒ memory resident

## \* Page Faults

- when the process tries to access a page that was not brought into memory.
- causes a trap to the operating system.

## Handling Page Faults

1. check an internal table - determine whether the reference does a valid or invalid memory access.
2. If an invalid reference, terminate the process
3. If valid, but page not brought in, page it in.
4. Find a free frame
5. Schedule a disk operation to read the desired page into the newly allocated frame.

6. When the disk read is complete, modify to internal table to indicate that the page is now in memory.

7. Restart the instruction that was interrupted by trap.

Picks address page as though it had always been in memory



### \* Pure Demand Paging

→ In extreme cases, can start executing a process w/ no pages in the memory

→ When the OS sets the instruction pointer to the first instruction of the process, the process immediately faults for the page.

→ After this page is brought into the memory, the process continues to execute, faulting as necessary.

→ This is called pure-demand paging: never bring a page into memory until it is required.

### \* Locality of reference

- some programs would access several new pages of memory w/ each instruction execution
- would cause multiple page faults per instruction
- However, this behaviour is highly unlikely, since programs tend to have locality of reference.

### \* Hardware support for demand paging

- (i) page table - w/ valid / invalid bit
- (ii) secondary memory - holds pages not in main memory  
usually a high-speed disk, called the swap device  
section of disk used for this purpose = swap space

### \* Performance of Demand Paging

$$\text{Effective access time} = (1-p) \times ma \times \text{page fault time}$$

$p$  = probability of a page fault

$ma$  = memory-access time

wire about steps taken by computer in case of page fault pg. 37

### \* Demand Paging Optimizations

- (i) Disk I/O to swap space is generally faster than using the file system.
- (ii) Copy entire file image into the swap space at process startup and then perform demand paging from the swap space.

(iii) Some systems attempt to limit the amount of swap space used through demand paging of binary files. Demand pages for such files are brought directly from the file system. When page replacement is called for, these frames can simply be overwritten. Swap spaces must be used for pages not associated with a file - known as anonymous memory

#### \* Swapping on Mobile OS

- do not support swapping
- demand page from the file system and reclaim read-only pages, such as code from applications

#### Page Replacement

- prevent over-allocation of memory
- swap out a process, freeing all of its frames & reducing the level of multiprogramming.

#### \* Basic Page Replacement

1. Find the location of the desired page on the disk
2. Find a free frame:
  - a. If there is a free frame, use it
  - b. If there is no free frame, use a page-replacement algorithm, to select a victim frame.
  - c. Write the victim frame to disk, change the page & frame tables accordingly
3. Read the desired page into the newly freed frame, change the page & frame table
4. Continue user process from where page fault occurred.

## \* Reducing overhead

(41)

→ If no frames are free, 2 page faults are required  
⇒ increases access time

→ solution: use a modify bit.

The modify bit for a page is set by the hardware whenever any ~~page~~<sup>bit</sup> in the page is written into, indicating that the page has been modified.

→ When a page is selected for replacement, examine the modify bit - if the bit is set, we know that the page has been modified since it was read into the memory.

→ If the memory bit is not set, the page has not been modified;

need not write page into disk, it is already there



## \* Page and Frame Replacement Algorithms

### ① Frame-allocation algorithm

- determines

→ how many frames to give each process

→ which frames to replace

### ② Page-Replacement algorithm

→ want lowest page-fault rate on both first access & re-access

→ evaluate algorithm by running it on a particular program

memory references, (reference string), and computing the no. of page faults on that string.

### \* FIFO Page - Replacement

→ associates each page the time when that page was brought into memory

→ when a page must be replaced, the oldest page is chosen

Example: Reference string: 701 203 042 303 032 120 170 1  
no. of frames = 3

|       |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |
|-------|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|       | 7 | 0 | 1 | 2 | 0 | 3 | 0 | 4 | 2 | 1 | 3 | 0 | 3 | 2 | 1 | 2 | 0 | 1 | 7 |
| $f_1$ | 7 | 7 | 7 | 2 | 2 | 2 | 2 | 4 | 4 | 4 | 6 |   |   |   | 0 | 0 | 0 | 7 |   |
| $f_2$ | 0 | 0 | 0 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 |   |   |   | 1 | 1 | 1 | 0 | 0 |
| $f_3$ | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 3 | 3 | 3 | 3 |   |   |   | 3 | 2 | 2 | 2 | 1 |

PF PF

Total = 15 page faults

Example 2: Reference string: 123 412 512 345  
no. of frames = (a) 3  
(b) 4

3 frames

|       |   |   |   |   |   |   |   |   |   |   |   |   |
|-------|---|---|---|---|---|---|---|---|---|---|---|---|
|       | 1 | 2 | 3 | 4 | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| $f_1$ | 1 | 1 | 1 | 4 | 4 | 4 | 5 |   |   | 5 | 5 |   |
| $f_2$ | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 3 | 3 | 3 |
| $f_3$ | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 | 4 | 4 |

Total = 9 page faults

PF PF

4 frames

|       |   |   |   |   |   |   |   |   |   |   |   |   |
|-------|---|---|---|---|---|---|---|---|---|---|---|---|
|       | 1 | 2 | 3 | 4 | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| $f_1$ | 1 | 1 | 1 | 1 | 5 | 5 | 5 | 5 | 4 | 4 | 4 | 4 |
| $f_2$ | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 5 | 5 |
| $f_3$ | 3 | 3 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 2 | 2 | 2 |
| $f_4$ | 4 | 4 | 4 | 4 | 4 | 4 | 3 | 3 | 3 | 3 | 3 | 3 |

Total = 10 page faults

PF PF

### \* Belady's Anomaly

→ The no. of page faults may increase with an increase in the no. of pages.

### \* Optimal Algorithm

- replace the page that will not be used for the longest period of time.
- not practical - since, one will not know digits in future references.
- used only for comparative purposes.

Example: 701 203 042 303 032 126 170 1

$$\text{no. of frames} = \text{(a) } 3$$

$$\text{(b) } 4$$

3 frames

|       |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |
|-------|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|       | 7 | 6 | 1 | 2 | 0 | 3 | 0 | 4 | 2 | 3 | 0 | 3 | 0 | 8 | 2 | 0 | 2 | 0 | 0 | 1 |
| $f_1$ | 7 | 7 | 7 | 2 | 2 | 2 | 2 | 4 | 2 | 2 | 0 | 3 | 0 | 8 | 2 | 0 | 2 | 0 | 0 | 1 |
| $f_2$ | 0 | 0 | 0 | ✓ | 0 | ✓ | ✓ | 4 | ✓ | 0 | ✓ | ✓ | ✓ | ✓ | 0 | ✓ | ✓ | ✓ | ✓ | ✓ |
| $f_3$ | 1 | 1 | 1 | 3 | 3 | 3 | 3 | 2 | 2 | 3 | 3 | 3 | 3 | 2 | 2 | 1 | 1 | 1 | 1 | 1 |

$$\boxed{\text{Page faults} = 9}$$

4 frames

|       |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |
|-------|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|       | 7 | 0 | 1 | 2 | 0 | 3 | 0 | 4 | 2 | 3 | 0 | 3 | 0 | 3 | 2 | 1 | 2 | 0 | 1 | 0 |
| $f_1$ | 7 | 7 | 7 | 7 | 3 | 3 | 3 | 0 | 2 | 2 | 0 | 3 | 0 | 3 | 2 | 1 | 0 | 1 | 0 | 1 |
| $f_2$ | 0 | 0 | 0 | ✓ | 0 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 0 | ✓ | ✓ | ✓ | ✓ |
| $f_3$ | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 4 | 2 | 2 | 3 | 3 | 3 | 3 | 2 | 1 | 2 | 1 | 2 | 1 |
| $f_4$ | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |

$$\boxed{\text{Page faults} = 8}$$

Example 2: Reference string: 123 412 512 345

no. of frames: (i) 3  
(ii) 4

| 3 frames |       |       |   |   |   |   |   |   |   |   |   |
|----------|-------|-------|---|---|---|---|---|---|---|---|---|
| $f_1$    | $f_2$ | $f_3$ |   | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| 1        | 2     | 3     | 4 | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| 1        | 2     | 2     | 2 | 1 | 2 | 2 | 1 | 2 | 2 | 4 | 5 |
| 1        | 3     | 4     |   | 1 | 2 | 5 | 1 | 2 | 5 | 3 | 5 |

page faults = 7

4 frames

| 4 frames |       |       |       |   |   |   |   |   |   |   |   |
|----------|-------|-------|-------|---|---|---|---|---|---|---|---|
| $f_1$    | $f_2$ | $f_3$ | $f_4$ | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| 1        | 2     | 3     | 4     | 1 | 2 | 5 | 1 | 2 | 3 | 4 | 5 |
| 1        | 2     | 2     | 2     | 1 | 2 | 2 | 1 | 2 | 2 | 3 | 5 |
| 1        | 3     | 3     | 4     | 1 | 2 | 3 | 1 | 2 | 3 | 2 | 5 |
| 1        | 4     |       |       | 1 | 2 | 5 | 1 | 2 | 3 | 3 | 5 |

page faults = 6

\* Least Recently Used Algorithm

→ replace page which has not been used for the longest period of time

→ like optimal algo, but looking backwards

Reference string: 701 203 042 303 032 100 170 1

Page Faults?

frames : (i) 2

(ii) 4

| 2 frames |       |       |   |   |   |   |   |   |   |   |   |   |   |
|----------|-------|-------|---|---|---|---|---|---|---|---|---|---|---|
| $f_1$    | $f_2$ | $f_3$ |   | 1 | 2 | 3 | 0 | 3 | 2 | 1 | 2 | 0 | 1 |
| 7        | 7     | 2     |   | 2 | 4 | 4 | 0 | 0 | 1 | 1 | 1 | 0 | 7 |
| 0        | 0     | 0     | ✓ | 0 | 0 | 3 | 3 | 1 | 1 | 1 | 2 | 0 | 0 |
| 1        | 1     | 3     |   | 3 | 2 | 2 | 2 | 1 | 1 | 1 | 2 | 1 | 1 |

### w/ 4 frames

45

|                | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 |
|----------------|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| p <sub>1</sub> | 7 | 7 | 7 | 7 | 3 | 3 | 3 | 3 | 0 | 0 | 0 | 0 | 3 | 3 | 3 | 3 |
| p <sub>2</sub> | 0 | 0 | 0 | - | 0 | - | 0 | - | - | - | - | - | 0 | - | 7 | 7 |
| p <sub>3</sub> | 1 | 1 | 1 | 1 | 4 | 4 | 4 | 4 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 2 |
| p <sub>4</sub> | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |

page faults = 9 page faults

### \* Implementation of LRU

#### ① Counters:

- Every page entry has a counter, every time a page is referenced through this entry, copy the clock into the counter
- When a page needs to be changed, look at the counters to find the smallest value.

#### ② Stack Implementation

- Keep a stack of page numbers
- Whenever a page is referenced, it is removed from the stack and put on the top.
- The most recently used page is always at the top of the stack.
- Should be implemented using a doubly linked list with a head pointer and a tail pointer.
- Each update requires 6 pointer changes, but no need to search for replacement, as the tail pointer points to the bottom of the stack.

## \* Stack Algorithms

- the set of pages in memory for n frames is always a subset of the set of pages that would be in memory with  $n+1$  frames
- For LRU replacement, the set of pages in memory would be the  $n$  most recently used pages.
- If the no. of frames is increased, those  $n$  pages will still be the most recently referenced, so they would still be in memory
- Stack Algorithms can never exhibit Belady's algorithm.

## \* LRU Approximation Algorithms

- Very few computer systems provide hardware support for LRU page replacement.
- LRU approximation algos are used.

## \* Reference Bit

- A reference bit for a page is set by the hardware whenever that page is referenced.
- Initially, all bits are cleared to 0 by the OS.
- As a user process executes, the bit associated w/ each page referenced is set to 1 by the hardware.
- can determine which pages have been used & which have not been used, by examining the reference bits
- However, order of use is not known.

## \* Additional-Reference-Bits Algorithm

- order info. by recording the reference bits at regular intervals
  - Those bits are kept in shift registers - contains history of page used.
  - 0000 0000  $\Rightarrow$  page has not been used at all for eight time period
  - 1111 1111  $\Rightarrow$  page has been used at least once in eight time period
- A                    B
- 1110 0100  $\Rightarrow$  vs 0111 1111  $\Rightarrow$  A has been more recently used than B.
  - interpret 8-bit bytes as unsigned integers, page w/ lowest no. is the FRU page.

## \* Second-Chance Algorithm (clock algorithm)

- inspect reference bit of selected bit
- if 0  $\Rightarrow$  replace page
- if 1  $\Rightarrow$  gives page a second chance, move on to select the next FIFO page.
- When the page gets a second chance, its reference bit is cleared, arrival time is set to the current time.
- $\therefore$  A page that is given a second chance, will not be replaced until all other pages have been replaced.
- Implement using a circular queue
- Note: If all bits are set  $\Rightarrow$  degenerates to FIFO replacement.

## \* Enhanced Second - Chance Algorithm

→ modify second-chance algo by considering the reference bit and the modify bit as an ordered pair

4 possible cases:

1. (0,0) ⇒ neither recently used nor modified ⇒ best page to replace

2. (0,1) ⇒ not recently used but modified ⇒ not quite as good, because the page will need to be written out before replacement

3. (1,0) ⇒ recently used but clean ⇒ probably will be used again soon

4. (1,1) ⇒ recently used & modified ⇒ probably will be used again and the page will need to be written out to the disk before it can be replaced

## \* Counting - Based Page Replacement

### 1. Least Frequently Used (LFU)

→ page with the smallest reference count is replaced

→ rationale is that an actively used page must have a large reference count

→ problem: page may be heavily used in the initial phase of a process and then never used again.

Example: Find the no. of page faults for the following reference string, when there are 3 frames

(49)

7 0 1 2 0 3 0 4 2 3 0 3 2

|                | 7 | 0 | 1 | 2 | 0 | 3 | 0 | 4 | 2 | 3 | 0 | 3 | 2 |
|----------------|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F <sub>1</sub> | 7 | 7 | 7 | 2 | 0 | 3 | 1 | 0 | 4 | 2 | 3 | 0 | 3 |
| F <sub>2</sub> | 0 | 0 | 0 | - |   |   |   |   |   |   |   |   |   |
| F <sub>3</sub> | F | F | F | F |   |   |   |   |   |   |   |   |   |

|   |   |   |   |   |   |
|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 7 |
| ↓ | ↓ | ↓ | ↓ | ↓ | ↓ |
| ∅ | ∅ | ∅ | 0 | 0 | ∅ |
| 1 | 1 | 1 |   |   | 1 |

follow FIFO as 2<sup>nd</sup> rule

## \* Page Buffering Algorithms

- keep a pool of free frames
- when a page fault occurs, a victim frame is chosen.
- The desired page is written into a free frame from the pool before the victim is written out.
- This procedure allows the process to restart as soon as possible without waiting for the victim page to be written out.

(or) → maintain a list of modified pages

- when paging device is idle - write pages that are set to non-dirty

(or) → keep a pool of free frames, remember which page was in each frame.

- If referenced again, no need to load contents again from disk

## \* Applications & Page Replacement

- In some cases, applications accessing data through the operating system's virtual memory perform worse than if the OS provided no buffering at all.
- e.g. a database, which provides its own memory management & I/O buffering.
- double buffering - OS is buffering I/O as well as application
- OS gives direct access to disk, as a large sequential array of logical blocks ⇒ raw disk

## \* Allocation of Frames

→ to determine the fixed amount of free memory among the various processes

### Minimum no. of frames

→ as the no. of frames allocated to each process decreases, the page fault rate increases → process execution slows

→ determined by architecture

### Maximum no. of frames

→ defined by amount of available physical memory

## \* Allocation Algorithms

### ① Fixed Allocation

A. equal allocation → divide frames equally among processes

B. proportional allocation ⇒ allocate according to size of process

$$s_i = \text{size of process } p_i$$

$$S = \sum s_i$$

$$m = \text{total no. of frames}$$

$$a_i = \text{allocation for } p_i = \frac{s_i}{S} \times m$$

$$\text{eg. } m = 62$$

$$s_1 = 10$$

$$s_2 = 127$$

$$a_1 = \frac{10}{127} \times 62 \approx 4$$

$$a_2 = \frac{127}{137} \times 62 \approx 57$$

### B. Priority Allocation

- use a proportional allocation scheme using priorities rather than size
- if process  $P_i$  generates a page fault, → select a frame for replacement from a process w/ lower priority number

### C. Global & Local Allocation

Global replacement - process selects a replacement frame from the set of all frames, one process can take a frame from another

- execution time may vary greatly
- commonly used because of greater throughput

Local replacement - each process selects from only its own set of allocated frames

- more consistent per-process performance
- possible underutilized memory

### D. NUMA - Non-Uniform Memory Access

- A given CPU can access some sections of main memory faster than other sections
- Those performance differences are caused by how CPU & memory are interconnected in the system.
- CPUs on a particular board can access the memory on that board w/ less delay than they can access memory on other boards.
- called NUMA systems.

## Managing NUMA Systems

- memory frames should be allocated "as close as possible" to the CPU on which the process is running
- close  $\Rightarrow$  minimum latency
- in Solaris, Lgroups (Latency groups) are created, where each Lgroup gathers together close CPUs and memory.

## \* Thrashing

- If a process doesn't have enough pages, the page fault rate is very high
- If there are insufficient frames, there will be a page fault.
- At this point, it must replace some page
- Since all the pages are in active use, it must replace a frame that will be needed again right away
- Consequently, it faults again and again, replacing pages it must bring back in immediately.
- This high paging activity is called thrashing. A process is thrashing if it is spending more time paging than executing.

## \* Causes of Thrashing

- If CPU utilization is low  $\Rightarrow$  increase degree of multiprogramming by introducing a new process
- If a page fault occurs, the process waits for the paging device & CPU utilization decreases.

→ CPU scheduler sees decreasing CPU utilization  $\Rightarrow$

increases the degree of multiprogramming.

→ new process tries to take frames from running processes,  
causes more page faults.

→ system throughput drops

→ To stop thrashing - decrease degree of multiprogramming



#### \* Limiting effects of thrashing

A. Local replacement algorithm | priority replacement algorithm

→ If one process starts thrashing, it cannot steal frames from another process and cause the latter to thrash as well.

#### B. Locality Model

→ As a process executes, it moves from locality to locality

→ A locality is a set of pages that are actively used together

→ A program has different localities, which may overlap

→ Thrashing occurs if  $\sum \text{size of locality} > \frac{\text{total memory size}}{2}$

→ limit effects by using local & priority page replacement