OS Module IV

Download as pdf or txt
Download as pdf or txt
You are on page 1of 73

Module-IV:

Memory Management

by:
Dr. Soumya Priyadarsini Panda
Assistant Professor
Dept. of CSE
Silicon Institute of Technology, Bhubaneswar
Background

 Memory consists of a large array of bytes, each with its own address.

 The CPU fetches instructions from memory according to the value of


the program counter.

 These instructions may cause additional loading from and storing to


specific memory addresses.
Cont…
 In order to execute a program it must be brought (from disk) into
memory.

 Memory unit only sees a stream of addresses.

 Main memory and registers are the only storage that CPU can access
directly.

 Register access in one CPU clock (or less)

 Main memory can take many cycles, causing a stall

 Cache Memory: fast memory between the CPU and main memory
Basic Hardware
 The objective is to ensure relative speed of accessing physical memory,
along with correct operation.

 For proper system operation, the operating system must be protected


from access by user processes.

 On multiuser systems, the user processes must be protected from one


another.

 This protection must be provided by the hardware.


Base and Limit Registers
 Each process has a separate memory space that protects the processes
from each other.

 To determine the range of legal addresses that the process may access:
base and limit registers are used.

 The base register holds the smallest legal physical memory address.

 The limit register specifies the size of the range

Example:
 If the base register holds 300040 and the limit register is 120900, then
the program can legally access all addresses from 300040 through
420939 (inclusive).
Cont…

 CPU must check every memory access generated in user mode to be


sure it is between base and limit for that user
Hardware Address Protection

 The base and limit registers can be loaded only by the operating
system, which uses a special privileged instruction.

 Since privileged instructions can be executed only in kernel mode, and


since only the operating system executes in kernel mode, only the
operating system can load the base and limit registers.
Address Binding
 Usually, a program resides on a disk as a binary executable file.
 To be executed, the program must be brought into memory and placed
within a process.

 The processes on the disk that are waiting to be brought into memory for
execution form the input queue.

 Addresses can be represented in different ways at different stages of a


program’s life

 Address binding of instructions and data to memory addresses can


happen at three different stages:
1. Compile time
2. Load time
3. Execution time
Cont…
Compile time:
 If memory location is known a priori at compile time, absolute code can
be generated .
 The generated compiler code will start at that location and extend up
from there.
 If, at some later time, the starting location changes, then it will be
necessary to recompile this code.
 The MS-DOS .COM-format programs are bound at compile time.

Load time:
 If it is not known at compile time where the process will reside in
memory, then the compiler must generate relocatable code.
 In this case, final binding is delayed until load time.
 If the starting address changes, reloading of the user code is required to
incorporate the changed value.
Cont…
Execution time:
 If the process can be moved during its execution from one memory
segment to another, then binding must be delayed until run time.
 Special hardware must be available for this scheme to work
Logical vs. Physical Address
Logical address:
 Generated by the CPU
 Also referred to as virtual address

Physical address:
 Address seen by the memory unit

 Logical and physical addresses are the same in compile-time and load-
time address-binding schemes.

 Logical and physical addresses differ in execution-time address-binding


scheme
Logical vs. Physical Address Space

Logical address space:


 The set of all logical addresses generated by a program

Physical address space:


 The set of all physical addresses generated by a program
Memory-Management Unit (MMU)
 The run-time mapping from virtual/logical to physical addresses is done
by a hardware device called the memory-management unit (MMU).

 The value in the relocation register is added to every address generated


by a user process at the time it is sent to memory

 The user program deals with logical addresses; it never sees the real
physical addresses.
Dynamic Loading
 With dynamic loading, a routine is not loaded until it is called.

 All routines are kept on disk in a relocatable load format.

 The main program is loaded into memory and executed first and
whenever other routines are required they are loaded.

 It does not require special support from the OS.

 It is the responsibility of the users to design their programs to take


advantage of such a method.

 However, OS provides the library routines to implement dynamic


loading.
Dynamic Linking
 Dynamically linked libraries are system libraries that are linked to user
programs when the programs are run.

 In dynamic linking, linking is postponed until execution time.

 With dynamic linking, a stub is included in the image for each library
routine reference.

 The stub is a small piece of code that indicates-


 how to locate the appropriate memory-resident library routine
or
 how to load the library if the routine is not already present.
Swapping
 A process can be swapped temporarily out of memory to a backing store
and then brought back into memory for continuing execution.

 The backing store is commonly a fast disk.

 Swapping makes it possible for the total physical address space of all
processes to exceed the real physical memory of the system
 Thus increases the degree of multiprogramming in a system.
Cont…
 The system maintains a ready queue consisting of all processes whose
memory images are on the backing store or in memory and are ready to
run.

 Whenever the CPU scheduler decides to execute a process, it calls the


dispatcher.

 The dispatcher checks whether the next process in the queue is in


memory.

 If it is not in the memory and there is no free memory region available,


the dispatcher swaps out a process currently in memory and swaps in the
desired process.

 The context-switch time in the swapping system is fairly high.


Example
If user process is 100 MB in size and the backing store is a standard hard
disk with a transfer rate of 50 MB per second.

The transfer of the 100 MB process to/from main memory will take:
100 MB/50 MB per second = 2 seconds

The total swap time is about 4,000 milliseconds (both swap out/in)

Note:
The major part of the swap time is transfer time.

The total transfer time is directly proportional to the amount of memory


swapped.
Contiguous Allocation
 The main memory must accommodate both the operating system and
user processes.

 The memory is usually divided into two partitions:


 One for the resident operating system and one for the user processes.

 The resident operating system, usually held in low memory and user
processes held in high memory.

 In contiguous memory allocation, each process is contained in a single


section of memory that is contiguous to the section containing the next
process.
Memory Allocation
Multiple partition method:
 Divide memory into several fixed-sized partitions. Each partition may
contain exactly one process.
 When a partition is free, a process is selected from the input queue and
is loaded into the free partition.
 When the process terminates, the partition becomes available for
another process.
 The degree of multiprogramming is bound by the number of partitions.

Variable-partition scheme:
 The operating system keeps a table indicating which parts of memory
are available and which are occupied.
 Initially, all memory is available for user processes and is considered
one large block of available memory, a hole
Dynamic Storage-Allocation Problem
 The memory blocks comprise a set of holes of various sizes scattered
throughout memory.

 When a process arrives and needs memory, the system searches the set
for a hole that is large enough for this process.

 When a process terminates, it releases its block of memory, which is


then placed back in the set of holes.

 Dynamic storage allocation problem concerns on how to satisfy a


request of size n from a list of free holes.

 There are three most commonly used solutions to select a free hole from
the set of available hole: first-fit, best-fit, and worst-fit
Cont…
First-fit:
 Allocate the first hole that is big enough

Best-fit:
 Allocate the smallest hole that is big enough
 Must search entire list, unless ordered by size
 Produces the smallest leftover hole

Worst-fit:
 Allocate the largest hole
 Must search entire list to find the largest hole
 Produces the largest leftover hole

 Note: First-fit and best-fit methods are better than worst-fit in terms of
speed and storage utilization
Fragmentation
 Both the first-fit and best-fit strategies for memory allocation suffer from
external fragmentation.

 As processes are loaded and removed from memory, the free memory
space is broken into small pieces.

 External fragmentation exists when there is enough total memory


space to satisfy a request but the available spaces are not contiguous:
 storage is fragmented into a large number of small holes.

 Memory fragmentation can be internal as well as external


External Vs. Internal Fragmentation
External Fragmentation:
 When there is enough total memory space to satisfy a request but the
available spaces are not contiguous

 Internal Fragmentation
 Allocated memory may be slightly larger than requested memory
 This size difference is memory internal to a partition, but not being used

 Statistical analysis of first fit, reveals that, even with some optimization,
given N allocated blocks, another 0.5 N blocks will be lost to
fragmentation.
 i,.e, one-third of memory may be unusable
 This property is known as the 50-percent rule.
Solutions to External Fragmentation
1. Compaction
 Shuffle memory contents to place all free memory together in one
large block.
 Compaction is possible only if relocation is dynamic, and is done at
execution time.

2. To permit the logical address space of the processes to be non-


contiguous
 Allowing a process to be allocated physical memory wherever
such memory is available.
 Two techniques are used for this purpose:
 Segmentation
 Paging
Segmentation
Basic Concepts
 A program is a collection of segments.

 A segment is a logical unit such as: main program, function, local


variables, global variables, common block, stack, symbol table, etc

User’s View of a Program:


Logical View of Segmentation

4
1

3 2
4

user space physical memory space


Segmentation
 Segmentation is a memory-management scheme that supports user view
of memory.

 Logical address consists of a two tuple: <segment-number, offset>

 The mapping from two-dimensional user-defined addresses into one-


dimensional physical address is done by a segment table.

 Each entry in the segment table has a segment base and a segment limit.

 The segment base contains the starting physical address where the
segment resides in memory.

 The segment limit specifies the length of the segment.


Segmentation Hardware
Cont…
 A logical address consists of two parts:
 a segment number ‘s’
 and an offset into that segment ‘d’

 The segment number is used as an index to the segment table.

 The offset d of the logical address must be between 0 and the segment
limit.

 When an offset is legal, it is added to the segment base to produce the


address in physical memory of the desired byte.

 The segment table is thus essentially an array of base–limit register pairs.


Example
Example-1
 If segment 2 is 400 bytes long and begins at location 4300.

 A reference to byte 53 of segment 2 is mapped onto location:

4300 + 53 = 4353.
Example-2
 Given the segment table. A reference to segment 3, byte 852, is
mapped to_____?

Answer:
3200 (Base value in segment table) + 852 = 4052
Example-3
 A reference to byte 1222 of segment 0 would result in a trap to the
operating system.

 As segment 0 is only 1,000 bytes long.


Cont…
Advantages of Segmentation:
 No Internal fragmentation.
 Segment table consumes less space in comparison to page table in
paging.

Disadvantage of Segmentation:
 As processes are loaded and removed from the memory, the free memory
space is broken into smaller pieces, causing external fragmentation.
Paging
Paging
 Paging is a memory-management scheme that permits the physical
address space of a process to be non-contiguous.

 Paging avoids external fragmentation and the need of compaction.

 It also solves the problem of fitting memory chunks of varying sizes


onto the backing store.

 Paging involves-
 dividing the physical memory into fixed-sized blocks called frames
and
 divide logical memory into blocks of the same size called pages
Paging Hardware
Cont…
 Every address generated by the CPU is divided into two parts:
 a page number (p)
 and a page offset (d).

 The page number is used as an index into a page table.

 The page table contains the base address of each page in physical
memory.

 This base address is combined with the page offset to define the physical
memory address that is sent to the memory unit.
Paging Model of Logical and Physical
Memory
Cont…
 The page size (like the frame size) is defined by the hardware.

 The size of a page is a power of 2, varying between 512 bytes and 1 GB


per page, depending on the computer architecture.

 If the size of the logical address space is 2m, and a page size is 2n bytes,
 then the high-order m−n bits of a logical address designate the page number
 and the n low-order bits designate the page offset.

 Where p is an index into the page table and d is the displacement within
the page.
page number page offset
p d
m -n n
Example

In the logical address, if n=2 and m=4

size of the logical address space =2m =24 =16

page size= 2n = 22 =4 bytes


Cont…
Page size=frame size
i.e. frame size = 4 bytes
Considering physical memory of 32-bytes:

No. of frames= physical address space/frame size


= 32/4 = 8

No. of pages=logical address space/page size=


16/4=4
Cont…

Logical address 0 indexing into the page table in frame 5


Logical address 0 (page 0, offset 0) maps to physical address : [ (5 × 4) + 0] = 20
Logical address 3 (page 0, offset 3) maps to physical address : [(5 × 4) + 3] = 23
Logical address 4 ( page 1, offset 0) maps to physical address : [ (6 × 4 ) + 0] = 24
Logical address 13 (page 3, offset 1) maps to physical address : [(2 × 4) + 1] = 9
Implementation of Page Table
 Page table is kept in main memory
 Page-table base register (PTBR) points to the page table

 Page-table length register (PTLR) indicates size of the page table

 In this scheme every data/instruction access requires two memory


accesses
 One for the page table and one for the data / instruction

 The two memory access problem can be solved by the use of a special
fast-lookup hardware cache called associative memory or translation
look-aside buffers (TLBs)
Cont…
 Translation look-aside buffers (TLBs) is a special, small, fast lookup
hardware cache.
 The TLB is associative, high-speed memory.
 Each entry in the TLB consists of two parts:
 a key (or tag) and a value.

 Some TLBs store address-space identifiers (ASIDs) in each TLB entry –


 uniquely identifies each process to provide address-space protection for
that process

 The percentage of times that the page number of interest is found in the
TLB is called the hit ratio.
Paging Hardware With TLB
Example-1
An 80-percent hit ratio means that, the desired page number is found in
the TLB 80 percent of the time.

If it takes 100 nanoseconds to access memory, then a mapped-memory


access takes 100 nanoseconds when the page number is in the TLB.

If we fail to find the page number in the TLB, then


• we must first access memory for the page table and frame number (100
nanoseconds) and
• then access the desired byte in memory (100 nanoseconds)

Resulting for a total of 200 nanoseconds


Effective memory-access time
 To find the effective memory-access time, we weight the case by its
probability:

effective access time = 0.80 × 100 + 0.20 × 200


= 120 nanoseconds

suffer a 20-percent slowdown in average memory-access time (from 100


to 120 nanoseconds).
Example-2
For a 99-percent hit ratio,
effective access time = 0.99 × 100 + 0.01 × 200
= 101 nanoseconds

 This increased hit rate produces only a 1 percent slowdown in access


time.
Memory Protection
 Memory protection implemented by associating protection bit with each
frame to indicate if read-only or read-write access is allowed

 Valid-invalid bit attached to each entry in the page table:


 “valid” indicates that the associated page is in the logical address
space, and is thus a legal page
 “invalid” indicates that the page is not in the logical address space

 Or use page-table length register (PTLR)


 Any violations result in a trap to the kernel
Valid (v) or Invalid (i) Bit in a Page Table
Virtual Memory
Virtual Memory
 Virtual memory is a technique that allows the execution of processes that
are not completely in memory.
 i.e. it runs programs that are partly loaded into memory

 As in many cases, the entire program is not needed to be loaded into


memory, virtual memory provides a separation between user logical
memory and physical memory.

 Virtual memory is implemented by demand paging.


Advantages
 Programs can be larger than physical memory

 Programmers don’t have to bother about the amount of physical memory


available.

 As each program takes less memory, more number of programs can be


executed at the same time
 increasing CPU utilization and throughput with no increase in
response time and turn around time

 It allows files and memory to be shared by several processes through


page sharing

 Less I/O is needed to load/swap user programs into memory leading to


faster execution of user programs.
Virtual memory that is larger than
physical memory

Note:
For further details on Virtual memory do self study from book
Demand Paging
 Demand Paging is similar to paging with swapping.

 In demand paging, process resides in secondary storage(disk)


 when the process is to be executed, it swap in into the memory.

 Instead of swapping the entire process into memory, lazy swapper is


used which loads only the necessary pages into memory.

 A lazy swapper never swaps a page into memory unless that page is
needed.
Cont…
 In demand paging a valid- invalid bit scheme is used.

 When the bit is valid, it indicates that-


 the associated page is legal and in main memory.

 If the bit is invalid, then that


 page is either invalid or
 valid but currently on disk not in memory.

 The page table entry for a page that is not currently in memory is set as
invalid.
Page Table when some pages are not in
main memory
Cont…
Page fault:
 When a process tries to access a page that is not present in the physical
memory page fault occurs.

Page replacement:
 Page replacement is the process of swapping out an existing page from
the main memory and replacing it with the required page.

 Page replacement is required when all the frames of the main memory
are already occupied. And some new page is needed to be loaded.
Steps in handling a page fault
Steps in handling a page fault cont…
1. Check an internal table (usually kept with the process control block)
for the process to determine whether the reference was a valid or an
invalid memory access.
2. If the reference was invalid, terminate the process. If it was valid but
not yet brought in that page, page it in.
3. Find a free frame (by taking one from the free-frame list).
4. Schedule a disk operation to read the desired page into the newly
allocated frame.
5. When the disk read is complete, modify the internal table kept with the
process and the page table to indicate that the page is now in memory.
6. Restart the instruction that was interrupted by the trap. The process can
then access the page as though it had always been in memory.
Page Replacement Algorithms

 FIFO (first-in first-out)

 OPT (Optimal)

 LRU (Least Recently used)

 Counting based page replacement algorithms:


 LFU (Least Frequently used)
 MFU (Most Frequently used)
FIFO Page Replacement Algorithm
 A FIFO (first-in, first out) page replacement algorithm associates with
each page the time when that page was brought into memory.

 When a page must be replaced, the oldest page is chosen.

Example:
If there are 3 frames and are initially empty.
Reference string:
7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1

Number of page faults=15


Cont…
 An active page may be replaced requiring to reload it immediately
causing an increased page fault rate and slows process execution.

Belady’s anomaly:
 For some page-replacement algorithms, the page-fault rate may increase
as the number of allocated frames increases.
OPT Page Replacement Algorithm
 The Optimal (OPT) algorithm has the lowest page-fault rate of all
algorithms.
 It never suffer from Belady’s anomaly.
 It replace the page that will not be used for the longest period of time.

Example:
If no. of frames=3
Search string: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1

Number of page faults=9


LRU Page Replacement Algorithm
 The Least Recently Used (LRU) page replacement algorithm replaces
the page that has not been used for the longest period of time.

Example:
No. of frames=3
Search string: 7, 0, 1, 2, 0, 3, 0, 4, 2, 3, 0, 3, 2, 1, 2, 0, 1, 7, 0, 1

Number of page faults=12


Counting-Based Page Replacement

 Least Frequently Used (LFU)

 Most Frequently Used (MFU)


LFU Page Replacement

 The Least Frequently Used (LFU) page-replacement algorithm requires


that the page with the smallest count be replaced.

Problem:
 When a page is used heavily during the initial phase of a process, but
then is never used again, it has a large count and remains in memory
even though it is no longer needed.
MFU Page Replacement
 The Most Frequently Used (MFU) page-replacement algorithm is based
on the argument that the page with the smallest count was probably just
brought in and has yet to be used.

 The implementation of MFU nor LFU replacement algorithms is


expensive, and they do not approximate OPT replacement.
Thrashing

 If a process doesn’t have enough frames it needs to execute it will


replace some actively used pages causing a high page fault rate.

 A process is said to be thrashing if it spends more time in paging than


executing.
Homework Question

1. Consider the following page reference string:


7, 2, 3, 1, 2, 5, 3, 4, 6, 7, 7, 1, 0, 5, 4, 6, 2, 3, 0 , 1.

Assuming demand paging with three frames, how many page faults would
occur for the following replacement algorithms?

 LRU replacement
 FIFO replacement
 Optimal replacement

You might also like