

## Memory Management

Memory - consists large array of words or bytes, each having its own unique address.

Program generates sequence of memory address during its execution.

⇒ CPU only access data from memory or registers so if data is not present in these direct access storage devices then data should moved there before CPU access.

⇒ Memory is slow than register access so to speed up a fast memory is added between memory & CPU called Cache.

⇒ <sup>(OS)</sup> we have to protect user programs to access each other address space.

OS have two registers for each process -

① base register - Smallest physical address

② limit register - Specifies size of range

base address  $\leq$  CPU generated  $<$  base + limit address

e.g. 30004 (base) & limit is 12030 means address can range from 30004 to 42034.

Note - Only OS can <sup>load</sup> access & change the value of these two registers

## Address Binding -

disk → Program as binary executable file.  
(Name \* exc)

Input queue | Job queue - waiting processes  
on the disk.

they are loaded into main memory &  
form ready queue.

⇒ Addresses may be represented ~~during~~<sup>in</sup> different ways during loading to execution.



① Compile time address binding - if you know exact base address at compile time then absolute code can be generated.

" if base changes, recompilation necessary

(2) Load time - Compiler generates relocation code. at load time absolute address is generated.  
∴ if base changes, need to reload process.

(3) Execution time - if process can move during its execution, then binding must be delayed until run time.

Dynamic loading - A routine is not loaded into memory until it is called. main program is loaded & executed when a routine call another routine it checks whether it is in memory or not. If not then relocatable linker is called to load relocatable code of that routine. Then By Program address table is updated & control passed to newly loaded routine.

Unused routine is never loaded.

## Static linking -

In static linking all the library files are also combined with binary image of program at loading time. but

in this linking is postponed until execution in static linking memory is wasted because all programs have their own copy of library files in .exe.

Stub - In dynamic linking in the place of library reference a stub is included in program image.

Stub is a small piece of code that indicates how that library routine is located into memory or how to load if not present in memory.

When Stub is executed Stub is replaced by the address of that routine.

Next time if same code runs ~~not~~ no stub is executed direct address is found.

# in this scheme, all processes that use a language library execute only one copy of library code.

# If library replaced by new version all the programs automatically use new version.

overlays - A program to be executed can't be in memory. If program is larger than allocated memory, then how a program will execute?

The idea of overlays is to keep only those instructions and data that are needed at any given time. When other instructions are needed, they are loaded into space occupied by previously loaded instructions that are no longer needed.

example - two Pass assembler

Pass 1 70K

Pass 2 80K

Symbol Table 20K

Common Routine 30K

200K



If only 150K is available then what?  
define two overlays -

Overlay A - ST + CR + Pass 1 = 120K

Overlay B - ST + CR + Pass 2 = 130K

# Speed becomes slow

# Code on disk is kept as absolute memory image.

# It requires complete knowledge of program

## Logical Versus Physical address space - (4)

Logical Address - Address generated by CPU.

Physical Address - Address seen by memory unit.

# Set of logical addresses - logical address space.

# for execution time binding these both are different.

# A hardware device, Memory management unit (MMU) used to translate logical address to physical address.

# User program deals /subtles logical address



### Contiguous Allocation -

operating system & Interrupt Vector resides

in low memory & user processes 0

resides in high memory

Transient OS Code - it comes into  
memory & goes as needed.



# it changes the size of operating system by execution.



# limit register used to protect processes to access each other code section.

# limit register contains the range of logical addresses ie. 0 to 255

D Single Partition Scheme - Initially all memory is available to user processes as one large block called a hole.

Process arrives, large enough hole available is allocated to process or it have to wait to free memory.

# When process terminates its freed memory is added into hole list or merged with adjacent hole.

# OS maintains a list indicating which part of memory are available and which are occupied.

allocate hole to process -

first fit - Allocate the first hole that is big enough.  
Best fit - Allocate the smallest hole that is big enough. This produce smallest leftover hole.

⑤ Worst-fit - Allocate largest hole. this produces largest leftover hole more useful than smallest.

⑥ Multiple Partition Scheme - divide memory

into a number of fixed size partitions.  
each partition may contain one process. No. of partitions define degree of multiprogramming.

Drawback - Internal Fragmentation -

Allocated memory may be slightly larger than requested memory. difference between these two no is internal fragmentation.

↳ it is memory that is free but internal to a partition & can not be used.

External fragmentation - in single partition

scheme, external fragmentation exists when enough total memory space exists to satisfy a request, but it is not contiguous. Storage is fragmented into a large number of small holes.

50-Percent rule - First Fit analysis never that

## Swapping -

- (1) in case of round robin scheduling quantum expired process is swap out
  - (2) in case priority scheduling low priority process is roll out & roll in
- ⇒ if address binding is done at the compile time or load time then swapped out process should be swap in at the same location.
- ⇒ Swapping requires a backing store i.e. a fast disk.
- # Swapped process must be completely idle.

## \* Continue -

Given  $N$  allocated blocks, another  $0.5N$  blocks will be lost due to fragmentation. That is  $1/3$  of memory is unusable.

Solution - Compaction → Shuffle the memory holes to place all free memory together in one large block.

# Only relocatable code can be compacted by changing base register value.

# Compaction can also be combined with swapping & Other ways may be used to compact memory.



Original  
allocation



Moved 600K  
 $P_3 + P_4$   
 200 400



Moved  
400K  
 Only  
 $P_4$  is moved



Moved 200K  
 Only  $P_3$  is  
 moved

Different ways to compact memory

Q - it is a memory management scheme that allows the physical address space of a process to be non-contiguous.

# No external fragmentation in main memory & disk.

Basic method -

Paging breaks physical memory into fixed size blocks called FRAMES. & disk into same size blocks called PAGES.

→ every address generated by CPU is divided into two parts.



|        |
|--------|
| Page 0 |
| 1      |
| 2      |
| 3      |

|   |   |
|---|---|
| 0 | 1 |
| 1 | 4 |
| 2 | 3 |
| 3 | 7 |

logical  
memory  
(disc)

Page table

|   |    |
|---|----|
| 0 |    |
| 1 | P0 |
| 2 |    |
| 3 | P2 |
| 4 | P1 |
| 5 |    |
| 6 |    |
| 7 | P3 |

# Page size is defined as power of 2,  
it makes translation of LA to PA  
easy.

if Size of logical address space  $\rightarrow 2^m$   
Page size  $\rightarrow 2^n$



eg. 32 bytes address space -  $2^{5=m}$   
4 bytes per page -  $2^2=n$   
(8 Pages)  
 $5-2 = 3$  bits for page No.  
2 bits for Page offset

external fragmentation but internal fragmentation may appear.

e.g. 4 bytes per page if process is 9 bytes long than 3rd page will have 3 bytes free.

# Small Page Size can reduce this problem but Page table entries increase & disk I/O will be less efficient.  
(smaller data transfer)

# A process to be executed of  $n$  pages should have  $m$  frames available in memory.

# User view of memory & actual physical memory view is separate

# frame table by OS is maintained in which frame is allocated or free, & to which process it is allocated is written.

# operating system maintains a copy of page table for each process. & this copy is also swapped & restored by CPU dispatcher when process swapped so it increases context switch time.

Hardware support - hardware implementation

of the page table can be done in several ways -

① Page table implemented as set of dedicated registers, these registers are reload by CPU dispatcher as other registers.

→ registers should be built with high speed logic to speed up translation process.

→ but costly if Page table is very large

② Page table is stored in main memory & Page table Base register (PTBR) points to the page table.

⇒ changing page table require to change contents of this register, reducing contention with time.

⇒ with this scheme two memory access are needed to access a byte (one for page table, other for byte itself)

→ Translation look-aside buffer (TLB) -

high Speed, Associative, Cache memory is used to speed up Page table look up time.

Key      Value      (Associative memory)      (89)

Page No.      Frame No.

Page No. from logical address is matched with key value of TLB if it found correspondingly value (frame no.) is returned

if not found (TLB Miss), memory reference of page table is generated & the entry is also made in cache (TLB).

if TLB is full LRU policy is generally used.

⇒ Wired down entries can not be removed from TLB. Ex TLB entry for Kernel code

⇒ Address space identifier (ASID) is just a unique no. for each process as process id.

⇒ With the page no. ASID value should also be matched to be correct frame no.

If ASID do not match means TLB miss

⇒ ASID allows the TLB to contain entries for several different processes simultaneously

| Page No.                           | ASID | Value (Frame No.) |
|------------------------------------|------|-------------------|
| 0                                  | 1    | 4                 |
| 1                                  | 1    | 5                 |
| 2                                  | 1    | 6                 |
| Page No./<br>are same<br>i.e. ASID | 2    | 1                 |
| 1                                  | 2    |                   |
| 2                                  | 2    |                   |

$\Rightarrow$  if TLB does not support ASID  
every time a new page table is selected,  
TLB must be flushed.

Hit ratio - The percentage of times that a particular page no. is found in the TLB is called hit ratio.

$$\text{Hit ratio} = 80\%$$

$$\text{TLB access time} = 20 \text{ ns}$$

$$\text{Memory access time} = 100 \text{ ns}$$

$$\begin{aligned}\text{effective access time} &= \\ &= 80 \times 120 + 20 \times 220 \\ &= 140 \text{ nsec.}\end{aligned}$$

Protection - A protection bit in page table is associated with each page like

- read-write }
- read only }
- execute only }
- valid } → Page is in the process's logical address space
- invalid } Not in process's logical address space

|   |
|---|
| 0 |
| 1 |
| 2 |
| 3 |

disk

|   |   |
|---|---|
| 0 | V |
| 1 | V |
| 2 | V |
| 3 | V |
| 4 | I |
| 5 | I |
| 6 | C |

Page table

for Process P1

$\leftarrow$  if page table is of fix size like 7.

Select D  
Table length register (PTLR) - (10)

define length of page table.

every logical address checked against PTLR to verify that address is in process range.

### Shared Pages -

If the code is reentrant code or pure code (it never changes during execution then it can be shared by many processes at a time.

Eg. Text editor code - 150 KB  
data - 50 KB

40 user want to run text editor then  
 $200 \times 40 = 8000$  KB space is needed

but if page size is 50 KB then  
page table of each user maps onto  
same physical copy of the editor.

Data is different for each user

Now

$$40 \times 50 + 150 = 2150 \text{ KB is needed.}$$

## Multilevel Paging -

If logical address space is very large then page table itself becomes excessively large.

Example - 32 bit logical address space  
Page size 4K ( $2^{12}$ )

Means

| Page No. | Page offset |
|----------|-------------|
| 20 bits  | 12 bits     |

# here Page table itself larger than one page.

# Page table also ~~is~~ may not be granted contiguous memory allocation.

Sol<sup>n</sup> - divide Page table into pieces  
(Paging of page table)

| Page no.       | Page offset |
|----------------|-------------|
| $P_1 \mid P_2$ | d           |

## Two Level Paging

$P_1 \rightarrow$  Index in outer Page table

it gives Page Index into inner Page table

→ base address of page containing frame No. for that address

$P_2 \rightarrow$  Index into inner Page table whose address

(11)



# Memory access increased.

### Inverted Page table -

# Page tables for all the processes consume more physical memory, that can be used in other processes.

# in this scheme, there is only one page table in the system, and it has only one entry for each page of physical memory.

Virtual address consist of triple-

$\langle \underbrace{\text{Process id, Page Number, offset}}_{\text{Page table entry}} \rangle$

if found at  $i$  location means +  
this Virtual Page is at  $i^{th}$  frame  
and physical address

$\langle i, d \rangle$  is generated.

∴ Search time increased because

Search is based on virtual address while table is sorted according to physical address.

hash table + associative look up may be used to fast search process

segmentation - it is a memory management scheme that supports this user view of memory.



logical address space is a collection of segments.

Each segment has a name and length.

### Logical address

⇒ The address specify both the segment name and offset within the segment.

⇒ Segments are numbered & user gives each address by two tuples -

$\langle$  Segment - Number , offset  $\rangle$

⇒ ① Compiler construct segments  
② Loader assigns segment numbers.

## Hardware -



User see two dimensional address but  
actual physical address is one dimensional  
to translate logical address to physical  
address Segment table is used.

Base  $\rightarrow$  starting address of segment

limit  $\rightarrow$  length of segment

if  $d < \text{limit}$  then  $S$  is used

to index into Segment table to  
find base address of that segment.

base address is added into offset

to generate physical address.

if  $d > \text{limit}$  then addressing error is  
generated.

## Implementation of Segment table -

(13)

① Register Implementation  $\rightarrow$  fast but expensive

② Memory -      STBR      } Registers  
                      STLR      } Maintained

$(S, d) \leftarrow$  logical address pair

if  $S < STBR$  then

$STBR + S$  gives base address of that segment

Protection & sharing - Protection bit associated  
to check if segment  
is read only or execute only.

Shared segment in all segment table

Point same physical copy of shared  
segment.

Fragmentation - segments are of variable  
length. Variable size partition scheme is  
used, and best fit or first fit also  
used to allocate memory.

External fragmentation may cause .