09 Memory

The document provides an overview of main memory, its organization, and memory management techniques, including memory protection, address binding, static and dynamic linking, and dynamic loading. It discusses the challenges of memory allocation, fragmentation, and the use of page tables for managing multiple processes. Additionally, it covers the role of the Memory Management Unit (MMU) in address translation and the benefits of paged memory allocation to reduce fragmentation.

Memory

Chapter 9
Background on Main Memory
• Memory is a large(ish) Linearly Addressable array of
bytes
– Typically, a byte is an 8-bit octet, but memory could be
organized in different units.
– Think of main memory as being big, but smaller than you’d
like it to be
• Registers and main memory are the only storage the
CPU can directly access
– So, a program must be brought into main memory
(typically from disk) before it can be run.
– Registers are fast, main memory is slower, cache helps to
make up the difference
Memory Layout
• Pretend main memory is divided into two
regions
– Space for a resident operating system
• Maybe in low memory addresses
• With structures like the interrupt vector table and
memory-mapped I/O
– Space for multiple user processes in the rest of
memory
• Want to fit in as many user processes as we
can
A Simple System for Memory
Protection
• The OS allocates a region of memory to each process.
• For now, a base and a limit register provide
memory protection
– Memory must be allocated
as one big, contiguous block for
each process.

Pretend (tiny)
memory addresses.
A Simple System for Memory
Protection
• Within each process, memory is organized into
different regions.
– A text section for executable code
– A data section for global variables
– A stack for local variables
– A heap for dynamic allocation
– Example program:
MemoryRegions.c
Address Binding
• Address Binding: how we choose the physical
memory addresses that program code uses.
– Really, the addresses for each program symbol
– Compile time: compiler chooses an address when it
builds the program
• This builds absolute code, must be run in a particular
location and rebuilt if it’s going to run elsewhere
– Load time: Choose memory address when the
program starts running
– Execution time: Can move after it’s started running
We’re probably going to need more help from the
hardware.
Building a User Program
Static Linking
• Static Linking
– Do all linking at build
time
– Application code and
libraries linked into a
single load image
– Ready to execute, just
copy into memory
and run.
Static Linking
• Executable includes a copy
of needed libraries, but
just the parts it needs
• So, somewhat reduced
memory footprint

$ ls -l libc.a
3153906 bytes
$ gcc -static Hello.c
$ ls -l a.out
615997 bytes
Linking at Program Start-Up
Dynamic Linking
• Dynamic Linking
– Link with library code
at load time
– Get the latest
(compatible) version of
every library
– Smaller executable
image

$ gcc Hello.c
$ ls -l a.out
7096 bytes
Dynamic Linking
• Can be done with no special help from the OS
– Initially, a small piece of code, a stub, gets called
instead of the subroutine
– On the first call, the stub finds and replaces itself
with the actual subroutine
• Particularly useful for libraries
Dynamic Linking
• Programs share
common code on
disk,
• But, not
necessarily in
memory.
Shared Libraries
• Shared Dynamically
Linked Libraries
– Just one copy of the
library on disk,
– … and in memory.
– Significantly smaller
memory footprint
– Definitely requires help
from the OS.
– And, we’re going to
need more than
base/limit registers
Dynamic Loading
Dynamic Loading
• We want the most out of a limited physical memory
• Dynamic Loading : Load parts of the program when
they are first needed
– e.g., load a subroutine when it's first called
– Overlays: same region of memory used for several
subroutines, one after another
• Useful if large amounts of code are needed later in
execution (or maybe not at all)
– Faster start-up times
– Reduced process memory footprint (maybe)
• No help from the OS required … but it might be nice.
Swapping
• When running lots of processes, memory may fill
up
– For example, my laptop is running 238 processes right
now (and another 152 inside a virtual machine)
– But, most of them are idle most of the time
• Solution, temporarily remove an entire process
from memory
– Backing store, fast, reasonably large storage (usually
disk) used to hold memory contents for processes that
don’t reside in main memory right now
– Bring process’ memory image back into physical
memory when there’s more room
What’s Swapping Look Like?
Swapping
• This is called swapping
– Process PCB stays in the process table
– Really, we are preempting a process’ memory
– So, this is a different way a process can block
• Major cost of swapping is transfer time
– Total transfer time is proportional to process
memory size
Making Room for Processes
• Address binding matters
• With compile-time address
binding
– We have to put a process in the
memory it was compiled for

• With load-time address binding


– We can load a process into any
region of memory
– As long as there’s enough
Load-Time Address Binding
• There are a few techniques for starting a
program anywhere
in memory
– Use only relocatable
(position-independent) code
machine instructions that
use relative addresses.
– Modify executable
code as it is loaded.
Making Room for Processes
• Multiple partition allocation
– The OS has to find room for each running process.
– Giving out and reclaiming space as they start and finish.
• With base and limit registers, this requires contiguous allocation
– Each new process needs one unbroken block.
Dynamic Storage Allocation
• This is a general problem
– The OS finds space for processes as they start and exit
– Inside each process, malloc()/new must manage heap space for
requests
– The OS manages all of physical memory
– malloc()/new manages the block of heap memory inside a
process

The OS manages all of memory. The standard library manages
heap space the same way.
Dynamic Storage Allocation
• To manage dynamically allocated storage:
– Maintain a list of allocated regions
– Maintain a list of free memory holes
– Find sufficiently large holes when new requests
arrive
– Re-claim memory when it’s no longer in use
– Efficiently coalesce adjacent holes into larger ones
Dynamic Storage-Allocation Problem
• We have a list of allocated regions and a list of holes (free
blocks between them)
• A request for a size-n block comes along
– First-fit : walk down the list, choose the first hole as large as n
– Best-fit : find the smallest hole that’s as large as n
• Maybe a good idea, less wasted space
• But, it could take a little longer
• ... and, the leftover hole will probably be useless
– Next-fit : get the first sufficiently large block after the last one
allocated
• May promote locality
– Worst-fit : find the largest hole and use it
• Maybe the left-over hole will be big enough to be useful
• Maybe more efficient to implement
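The placement strategies above can be sketched in a few lines of Python. This is a minimal illustration, not an OS implementation; it assumes free holes are tracked as (start, size) pairs, and omits next-fit, which also needs a cursor to the last allocation.

```python
# Free holes in memory, as (start address, size in bytes) pairs.
holes = [(100, 50), (300, 20), (500, 120), (700, 60)]

def first_fit(holes, n):
    """Return the start of the first hole at least n bytes long."""
    for start, size in holes:
        if size >= n:
            return start
    return None

def best_fit(holes, n):
    """Return the start of the smallest hole at least n bytes long."""
    fits = [(size, start) for start, size in holes if size >= n]
    return min(fits)[1] if fits else None

def worst_fit(holes, n):
    """Return the start of the largest hole, if it is big enough."""
    fits = [(size, start) for start, size in holes if size >= n]
    return max(fits)[1] if fits else None

print(first_fit(holes, 55))   # 500: first hole with >= 55 bytes
print(best_fit(holes, 55))    # 700: the smallest hole that still fits
print(worst_fit(holes, 55))   # 500: the largest hole overall
```

Note how best-fit picks the 60-byte hole, leaving only a 5-byte leftover, which illustrates the "leftover hole will probably be useless" point above.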
Memory Partitioning for User
Processes
• Variable partition allocation
– Find room for each new process based on its size.
– We can pack them into memory one after
another, without any gaps.
Memory Partitioning for User
Processes
• Variable partition allocation
– We can pack them into memory one after
another, without any gaps.
Memory Partitioning for User
Processes
• Processes will finish up
• … and new ones will start up and re-use their
memory.

The OS will clear this memory before it lets another process use it.
Memory Partitioning for User
Processes
• May get some external fragmentation

We ended up with a
small gap between
processes.
Fragmentation
• Some memory is going to be wasted
• External Fragmentation
– There may be some wasted memory between
allocated regions
– Lots of holes, but all too small to use
• 50-percent rule
– With first-fit, if a total of n bytes of memory have
been allocated, another ½ n will be lost to
fragmentation
Memory Partitioning for User
Processes
• If a hole is very close to the size of a memory
request
– It may not be practical to keep up with the left-over memory.

It may cost more memory to keep up with this block than the size of the block itself.
Memory Partitioning for User
Processes
• We may give a little more memory than was
needed
– to simplify memory bookkeeping.
– That’s called internal fragmentation

Extra space. More than what was requested.
Memory Partitioning for User
Processes
• Eventually, we may have a lot of memory
that's too fragmented to use.
Compaction
• Can we move allocated blocks around to squeeze them
together?
– That’s compaction, lots of little fragments become one big
fragment
– Can we do this with running programs? We can if we have
execution-time address binding
Making Programs Relocatable
• Moving a running program will be tricky
• The program uses memory addresses all over its code.
• All these addresses will be wrong if we move the
program elsewhere.
Making Programs Relocatable
• If we want to be able to move a process ...
we can’t let it use real memory addresses.
• We'll call these Physical Addresses

"Sorry. I can't let you use physical addresses."
Making Programs Relocatable
• Instead, we’ll only let processes use Logical Addresses
– A made-up system of addresses that we let a process use.
– These will stay the same, even when we need to move the
process.

"How about if we let you call this 'address 25' instead?"
"It's OK. You can keep calling that 'address 25'."
Address Translation
• Every time a process tries to access a logical address
• .. it needs to be quickly converted to a physical address.
• This is called address translation.

"You tried to access logical address 25."
"I know you really mean physical address 325."
Address Translation with Base / Limit
• We can create a simple address translation
scheme using just base & limit registers
– All logical addresses are relative to the base.
With the base register set to 300, logical address 25 translates to
physical address 325. Move the process and set the base register to
100, and the same logical address 25 now translates to physical
address 125.
Making Programs Relocatable
• We’re using the base register as a relocation
register
– Adding it to every logical address to get the
corresponding physical address
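The relocation-register scheme is just an add plus a limit check. A minimal sketch, with the base and limit values made up to match the earlier pictures:

```python
def translate(logical, base, limit):
    """MMU behavior with a relocation (base) register and a limit register."""
    if logical >= limit:
        # Out-of-range access: the hardware traps to the OS.
        raise MemoryError("addressing error: trap to the OS")
    return base + logical

# Process loaded with base register 300 and a 120-byte region:
print(translate(25, base=300, limit=120))   # physical address 325

# After the OS moves the process and resets the base register to 100,
# the same logical address still works -- only the register changed:
print(translate(25, base=100, limit=120))   # physical address 125
```

The process's code never changes when it moves; the OS just reloads the relocation register during the context switch.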
The Memory-Management Unit
(MMU)
• A hardware device that handles mapping
logical to physical addresses
• Typically, part of the CPU
• For example, we can use base as a relocation
register
– The MMU will add the base register to every
logical address
– Automatically, whenever the CPU is in user mode
– Fast, simple, easy to implement this kind of MMU
The Memory-Management Unit
(MMU)
• The user program only sees logical addresses
– It never sees the real physical addresses
– Really, why would you want to?
• Of course, the OS still has to think about
physical addresses (and each process' logical
address space).
– Consider a call like:
read( fd, buffer, 100 );
– The process gives us a logical address, but we'll need to
give the device controller a physical address.
Address Translation Requirements
• However, just a base and limit register won’t
be enough to let us
implement
– Shared memory
– Shared libraries
– Other things we’ll
talk about later
Paged Memory
• Memory allocation would be easier if process memory
didn't need to be contiguous
– Can we do this? Let processes use a little bit of physical
memory here, and a little bit there?
– Process still needs to be able to see its logical address space as
contiguous (why?)
– We certainly need this for shared libraries
• OK, we’ll allocate process memory as multiple fixed-sized
blocks, called pages
• Process memory = a sequence of pages
– Divide the process’ logical address space into fixed-sized pages
• Physical memory = a sequence of page-sized frames
– Divide physical memory into frames , each able to hold a page
– Size typically a power of 2, between 512 bytes and 8,192 bytes
Paged Memory
• To run a program of
size n pages, we just
need n free frames
(anywhere in memory)
– We’ll tolerate some
internal fragmentation
– But, we’ll virtually
eliminate external
fragmentation
Paged Memory
• But, if process pages
could be scattered all
over physical memory

– How do we know how
to access a particular
byte of a process’
memory?
– We need a table that
says where process
pages are.
– We need a page table
Page Table Organization
• Line 0 says what
frame holds page 0
• Line 1 says what
frame holds page 1
• And so on
• It’s an array of frame
numbers (indexed by
page)
Logical and Physical views of Memory
Multiple Processes and Page Tables
• Each process has its
own page table
• Fun with Address
Translation
– Byte 3 of process A
– Byte 8 of process B
– Byte 14 of process A

(In the figure, the high part of each logical address is the page
number, the page table entry gives the frame number, and the low
part is called the offset.)
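The translation game above depends on page tables drawn in figures that aren't reproduced here, so the sketch below uses made-up page tables for processes A and B, and assumes a tiny 4-byte page size. The mechanics are exactly the slide's recipe: split the address into page number and offset, look up the frame, and rebuild.

```python
PAGE_SIZE = 4                  # assumed tiny page size for illustration

page_table_A = [5, 2, 7, 1]    # page number -> frame number, process A
page_table_B = [3, 0, 6, 4]    # page number -> frame number, process B

def translate(page_table, logical):
    """Translate a logical byte address using the given page table."""
    page, offset = divmod(logical, PAGE_SIZE)
    return page_table[page] * PAGE_SIZE + offset

print(translate(page_table_A, 3))    # byte 3 of A: page 0, offset 3 -> 23
print(translate(page_table_B, 8))    # byte 8 of B: page 2, offset 0 -> 24
print(translate(page_table_A, 14))   # byte 14 of A: page 3, offset 2 -> 6
```

Because each process has its own table, logical address 3 in process A and logical address 3 in process B land in completely different frames.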
Paged Memory Address Translation
• Let psize be the length of a page (and a frame)
• Problem: given logical address al, find physical address ap
– First, figure out what page we need.
Page number: p = al div psize
– Then, figure out the offset within that page.
Offset: d = al mod psize
– Figure out the memory frame where that page resides.
Frame number: f = pageTable[ p ]
– Build a physical address, offset past the start of that frame.
Physical address: ap = f * psize + d
– But, is it realistic to expect the hardware to do all this math on
every memory access?
Address Translation Simplified
• That's why page size is a power of two
– The math on the previous slide works out nicely
– For a logical address space of 2^m words and a page
size of 2^n words
– Page number: p = al div 2^n
That's just the high-order m-n bits
– Page offset: d = al mod 2^n
That's just the low-order n bits
– Frame number: f = pageTable[ p ]
– Physical address: ap = f * 2^n + d
The multiply here is just an n-bit left shift
Thinking About Logical Addresses
Address Translation Simplified
• So, page number and offset are just bit fields
in the logical address
m – n bits n bits
Logical address p (page number) d (offset)

• Look up the frame number f on line p of the page


table
• Then, build the physical address:
m – n bits ? n bits
Physical address f (frame number) d (offset)
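The bit-field view above reduces translation to shifts and masks. A sketch, assuming n = 12 (4 KB pages) and a made-up two-entry page table:

```python
n = 12                       # page size is 2^n = 4096 bytes
page_table = {0: 9, 1: 27}   # page number -> frame number (made up)

def translate(a_l):
    p = a_l >> n                 # high-order bits: page number
    d = a_l & ((1 << n) - 1)     # low-order n bits: offset
    f = page_table[p]            # frame number from the page table
    return (f << n) | d          # the "multiply" is an n-bit left shift

print(hex(translate(0x1ABC)))    # page 1 -> frame 27 (0x1B): 0x1babc
```

Notice that the offset bits pass through unchanged; only the page-number field is rewritten.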
Paged Memory Hardware
Implementation of the Page Table
• We have to store the page table somewhere, how
about main memory
– That means we need a little more memory for each
process
• Memory translation is done in hardware
– We need to tell the hardware where the (current) page
table is located
– We need a Page Table Base Register (PTBR)
– OS will need to save/restore this during a context switch
– But, we don’t need base/limit registers anymore (why?)
Page Table Example
• Consider a byte-addressable computer system
with:
– A 32-bit logical address (i.e., 2^32 addresses)
– A 32-bit physical address
– A page size of 4 KB (2^12 bytes)
• So, a logical address would look like:
20 bits 12 bits
Logical address p (page number) d (offset)
• So, how many entries would the page table have?
– One for each page number, so 2^20
that's around 1 million
Storing The Page Table
• How should we store an entry of the page
table?
• It's like a pointer to a frame in memory.
– We could store it like a pointer to the start of
the frame
– 4 bytes on a 32-bit machine,
8 bytes on a 64-bit machine, etc.
– Typically, a page table entry is the same size
as a pointer.
On a 32-bit architecture with 4 KB
pages, that's 2^20 * 4 bytes = 4 MB
• But, we don't really need all those bits in
every page table entry.
– The number of frames is much less than the
number of bytes in memory.
Storing The Page Table
• Example:
– A 32-bit system with 2^32 physical
addresses
– 4 KB pages (2^12 bytes)
– Total number of frames:
2^32 / 2^12 = 2^20 This just requires 20 bits.
– With 32 bits for each entry, we would
have 32 - 20 = 12 bits left over.
– We could use those for other things
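The frame-count arithmetic above is easy to check directly:

```python
phys_bits = 32        # 2^32 physical addresses
page_bits = 12        # 4 KB pages

frames = 2 ** phys_bits // 2 ** page_bits
print(frames)                       # 2^20 = 1048576 frames
print(frames.bit_length() - 1)      # 20 bits needed to name a frame
print(phys_bits - 20)               # 12 bits left over in a 32-bit entry
```

Those 12 spare bits are exactly where the valid, read-only, modified, and reference bits described later can live.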
Implementing a Sparse Logical Address
Space
• Using the
valid/invalid bit:
– We can omit pages
the process doesn’t
need
– Just a few pages for a
small process
– Give the process
room to grow
Your Logical Address Space …
is a Very Big Place
Plenty of Extra Page Table Bits
• Typically, we’ll expect the hardware to provide
• Valid/invalid bit
• Read-Only bit
– True if process can’t write to this page
• Modified bit (dirty bit)
– Set by the hardware whenever the page is written to
• Reference bit
– Set by the hardware whenever the page is referenced (read
or written)
• No Execute bit
– True if a process can’t execute code on the page
– A security mechanism, your stack is typically not executable.
Be the MMU
• Let's try some memory references
• 8-bit logical addresses, 32-byte page size
• Read 01001010
• Write 11100111
• Read 00101110
• Write 10010011

Page table (V = valid, Rd = read-only, M = modified, Re = referenced):
Page  V  Rd  M  Re  Frame
0     1  0   0  1   11001
1     1  1   0  0   00101
2     0  0   0  0   01101
3     0  0   0  0   00000
4     1  0   0  0   11011
5     1  1   0  0   00001
6     0  0   0  0   10100
7     1  1   0  1   00010

• Notice, physical and logical address
spaces don't have to be the same size
• How much memory should you use to
store a line of this page table?
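A software model of this exercise, with the page table transcribed from the slide: a 32-byte page size means a 5-bit offset and a 3-bit page number in each 8-bit address.

```python
# Each entry: [valid, read_only, modified, referenced, frame]
table = [
    [1, 0, 0, 1, 0b11001],
    [1, 1, 0, 0, 0b00101],
    [0, 0, 0, 0, 0b01101],
    [0, 0, 0, 0, 0b00000],
    [1, 0, 0, 0, 0b11011],
    [1, 1, 0, 0, 0b00001],
    [0, 0, 0, 0, 0b10100],
    [1, 1, 0, 1, 0b00010],
]

def access(addr, write):
    """Translate an 8-bit logical address, updating status bits."""
    page, offset = addr >> 5, addr & 0b11111
    entry = table[page]
    if not entry[0]:
        return "invalid page: trap"
    if write and entry[1]:
        return "write to read-only page: trap"
    entry[3] = 1          # set the reference bit
    if write:
        entry[2] = 1      # set the modified (dirty) bit
    return (entry[4] << 5) | offset

print(access(0b01001010, write=False))       # page 2 is invalid: trap
print(access(0b11100111, write=True))        # page 7 is read-only: trap
print(bin(access(0b00101110, write=False)))  # frame 00101 -> 0b10101110
print(bin(access(0b10010011, write=True)))   # frame 11011 -> 0b1101110011
```

The last write also sets page 4's modified bit, which is exactly what the hardware would record for later use.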
Respecting the Valid and Read-Only
Bits
• What should we do if:
– A process tries to use an invalid page?
– A process tries to write to a read-only page?

• OS can do more interesting things than just kill


the process
Paged Memory Tricks
• We got to the idea of paged memory via concerns over
memory allocation and fragmentation
• But, paged memory is really about creating a nice
execution environment for processes
• We can use paged memory to implement:
– Sparsely populated logical address space, with room for
our programs to grow
– Shared memory
For IPC, for shared libraries or for shared program code
– Copy-on-write
For efficient POSIX fork()
Shared Memory via Paged Memory
• We just need to let
parts of the page
tables point to the
same frames
• In page-sized blocks,
any number of
processes can share
any regions of
memory
Copy On Write
• Imagine, we have a
nice happy process,
permitted to write to
any of its pages
Copy On Write
• The process calls fork,
now we have a child that
needs a copy of the
parent’s memory
• Quick, we need a copy of
all the parent’s memory.
• But, we’re lazy, let’s just
let it use the same
memory frames.
• But, mark them as read-
only (for now)
Copy On Write
• What if either
process tries to write
to its memory?
• It should be able to,
but this will trap to
the kernel
Copy On Write
• Great. The kernel can
make things right.
• We’ll copy the frame, and
then let both processes
write to their copy.
• This is like a system call,
but the process didn’t
really intend to make it.
• And, we only pay to copy
the frames we need to.
Frame Allocation
• Now, memory allocation just becomes frame
allocation
• When some process P1 starts
– OS must find enough frames for P1
– OS must build a page table for P1
• OS needs to keep up with used/available frames
– OS will maintain a frame table, keeps up with what’s
in each frame.
– OS will also need a free frame list, to quickly find
unused frames.
Free Frame Management

Before allocation After allocation


Page Table Performance
• Paged memory sounds expensive
• Every memory reference requires
– A read from the page table
– A read/write to the desired page
– That’s two main memory references for every
read/write
• Need to speed this up for typical programs
– Normally, we depend on a cache to speed
memory access
Speeding up Memory Access
• Let’s have a special cache, just to speed up
address translation
• The Translation Look-aside Buffer (TLB)
– A special cache for each CPU core
– Cache for a subset of the page table
– Maps page number straight to frame number
(really, from page number to a copy of the page
table line)
The TLB
• The TLB is an associative memory, with parallel search by
page number
Page Rd M Re Frame
7 1 0 1 27
3 0 1 1 6
15 0 0 0 3
0 0 1 1 9

• Address Translation (p, d)
– If p is in the TLB, get the frame number (a TLB hit)
– Otherwise, look up the frame number in the page table (a TLB
miss)
• But, add a new entry to the TLB (for the future)
• We'll need to remove something (normally a hardware choice)
Paging with the TLB
Be the New MMU
• As before, assume a 32-byte page size
• Say we have a 2-entry TLB
• Read 01000101
• Write 00000111
• Write 01000110
• Read 10010011

Page Table:
Page  V  Rd  M  Re  Frame
0     1  0   0  1   11001
1     1  1   0  0   00101
2     1  0   0  0   01101
3     0  0   0  0   00000
4     1  1   0  0   11011
5     1  0   0  0   00001
6     0  0   0  0   10100
7     0  0   0  0   00010

TLB:
Page  Rd  M  Re  Frame
Effective Access Time
• Assume TLB lookup is 10 ns
• Assume memory cycle time is 100 ns
– On a hit: 10 + 100 → 110 ns
– On a miss: 10 + 100 + 100 → 210 ns
• TLB Hit Ratio (α): fraction of memory accesses that
get TLB hits
• Effective Access Time: 110 α + 210 (1 - α)
• How large can we expect α to be?
– We'd like it very close to 1
– With some locality of reference, maybe it can be
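The effective-access-time formula, plugged into Python with the slide's numbers (10 ns TLB lookup, 100 ns memory cycle):

```python
def eat(alpha, tlb=10, mem=100):
    """Effective access time for a given TLB hit ratio alpha."""
    hit = tlb + mem           # 110 ns: TLB lookup + the data access
    miss = tlb + mem + mem    # 210 ns: add a page-table access
    return hit * alpha + miss * (1 - alpha)

print(eat(0.80))   # 80% hit ratio: about 130 ns
print(eat(0.99))   # 99% hit ratio: about 111 ns, near the 110 ns hit time
```

Even a modest miss rate hurts: going from 99% to 80% hits adds roughly 19 ns to every memory reference.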
TLB and the Context Switch
• So, the TLB lets the hardware bypass the page table
• A page table describes address translation for a
particular process
– OS resets the PTBR during context switch
• How about the TLB?
– Don’t want to use cached entries for the old process
– I guess we have to flush it on a context switch
– So, another reason a context switch is so expensive
– (and, a context switch among-user level threads in the
same process may be cheap)
• Or, some TLBs also store an address-space identifier for
each entry
That's a Big Page Table!
• A page table could be quite large
– Recall, a 32-bit logical/physical address space, with 4 KB pages
– With the valid/invalid bit, we don't need all the pages
– But, we still need 2^20 page table entries, 4 bytes each
That's 4 MB, just for the page table
With 640 processes on my laptop,
that would be about 15 percent of my
total memory, just for page tables.
– But, most processes will just be using a small fraction of their
page table.
• Can we save memory in storing the page table?
– Variable page table size
– Hierarchical paging
– Hashed page tables
– Inverted page tables
Variable Page Table Size
• Maybe we just need part of the page table
– We could have a Page Table Length Register
(PTLR)
– Tells the hardware the length of the (current) page
table
– So, we just need the first part of a potentially very
large page table
– Saved and restored at context switch time
Hierarchical Page Tables
• Normally, we expect
the page table to be …
• … one big table.
• This makes address
translation easy.
• Let’s look at a small
example.
– 6-bit logical address
– 4-byte pages
Address Translation
• Look up the page
number in the page
table.
Address Translation
• Look up the page
number in the page
table.
• Page table says what
frame the page lives in.
• Offset says what address
you need in that page.
Storing the Page Table
• The page table must be
stored in memory.
• Typically, it’s much
larger than a page.
• Storing it contiguously
makes it hard to find
room.
Storing the Page Table
• We can use the same
trick we used for
processes.
• Break the page table
into page-sized pieces.
Storing the Page Table
• This makes it easy to
store the page table, in
little page-sized pieces.
• Each piece of the page
table could go in any
available frame.
Hierarchical Page Tables
• Need another table to
keep up with where the
pages of the page table
are stored.
– The outer (2nd–level)
page table
– Original page table →
inner (1st–level) page
table
Hierarchical Page Tables
• Pages of the (inner)
page table say where
pieces of the process
are.
• The outer page table
says where pieces of the
page table are.
• This is hierarchical
paging.
Hierarchical Page Tables
• The outer page table
costs some extra
memory.
• But, it can save even
more memory.
– It typically includes a
valid / invalid bit.
– This lets us omit
completely invalid pages
of the (inner) page table.
Only invalid
pages.
Hierarchical Page Tables
• Most processes have a
very sparse address
space.
• This can let us save lots
of space if the page
table has big blocks of
invalid pages.
• OS can fill in pages as
the process grows.
Address Translation
• Address translation
requires a two-level
lookup.
• The page number says
what page table entry
we need …
• … but the page table is
split into multiple page-
sized pieces.
Address Translation
• The high-order bits of
the page number say
what part of the page
table we need.
• Here, we have 4
entries in the outer
page table
• … so we need the first
two bits of the page
number.
Address Translation
• The outer page table says
where that part of the
page table is stored.
• The next bits of the page
number determine the
index into that part of the
(inner) page table.
– Here, each part of the
page table has 4 entries
– That uses the remaining
2 bits of the page
number.
Address Translation
• The inner page table
entry gives the frame
containing the
needed page.
• The offset gives the
index within that
frame.
Storing the Page Table
• Pages of the (inner) page
table are stored in frames of
memory.
– So, this is a better picture of
how a hierarchical page table
is stored.
Storing the Page Table
• The outer page table is also
stored in a frame (or more)
of memory.
– So this picture is even better.
• The Page Table Base
Register points to the start
of the outer page table.
Address Translation
• Let’s see how
address translation
works.
• Logical address:
000110
• It starts with the
outer page table.
Address Translation
• High-order bits of the
page number
determine the entry
of the outer page
table.
• That says what frame
contains the piece of
the (inner) page table
we need.
Address Translation
• Remaining bits of the
page number
determine the index
into this piece of the
(inner) page table.
• That entry says what
frame contains the
page of the process
we need.
Address Translation
• Offset determines
which address within
that frame.
• Here we get 2 bits for
each part of the
address.
• … but different
architectures will
divide the address up
differently.
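The two-level lookup in this small example splits the 6-bit address as 2 bits of outer index, 2 bits of inner index, and 2 bits of offset. The tables below are made up, since the slides' figures aren't reproduced here; None marks an invalid entry.

```python
# Outer page table: outer index -> frame holding a piece of the inner table.
outer = [0, None, None, 3]

# Inner page-table pieces, keyed by the frame they are stored in.
inner_pieces = {
    0: [5, 2, None, None],
    3: [None, None, 7, 4],
}

def translate(addr):
    """Two-level translation of a 6-bit logical address (4-byte pages)."""
    hi = (addr >> 4) & 0b11    # outer page-table index
    mid = (addr >> 2) & 0b11   # index within the inner piece
    off = addr & 0b11          # offset within the page
    piece = outer[hi]
    if piece is None:
        return "invalid: fault in outer table"
    frame = inner_pieces[piece][mid]
    if frame is None:
        return "invalid: fault in inner table"
    return (frame << 2) | off

print(translate(0b000110))   # outer 00, inner 01 -> frame 2, offset 2 -> 10
print(translate(0b100100))   # outer index 10 is invalid
```

An invalid outer entry means a whole piece of the inner page table was never allocated, which is exactly where the space savings come from.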
More Address Translation
• Another example.
• Logical address:
111011
More Address Translation
• High-order bits of the
page number
determine an
element of the outer
page table.
• This tells where the
needed piece of the
(inner) page table
resides in memory.
More Address Translation
• Remaining bits in the
page number give an
index into this piece
of the page table.
• This tells where a
page of the process
resides in memory.
More Address Translation
• Within this frame,
the offset determines
what address we
need.
Translating Invalid Addresses
• Let’s try an invalid
address: 100100
Translating Invalid Addresses
• An invalid entry in
the outer page table:
– We don’t have any
valid addresses for
that piece of the page
table.
Translating Invalid Addresses
• Another invalid
address: 001011
Translating Invalid Addresses
• Here, we get past the
outer page table.
• But the address
references an invalid
entry of the inner
page table.
Address Translation in 32 Bits
• We could have a 3-level page table, or more
• We'd like to keep adding levels until …?
– The top-level page table fits in a page
• Consider a typical example:
– 32-bit logical/physical addresses
– 4 KB page size
1. How many bits for the offset?
Page size is 2^12 bytes,
so the offset is (the low-order) 12 bits
2. How many bits for the page number?
Logical address is 32 bits,
so 20 are left over for the page number.
Where Does It End?
• Same example continued
– 32-bit logical/physical addresses
– 4 KB page size
3. How many lines in the (inner) page table?
A process could have 2^20 pages,
so that’s 2^20 entries in the page table.
4. How many bytes for a line of the inner page table?
We need 20 bits just for the frame number,
so let’s round up to 4 bytes (32 bits).
5. How much total memory for the inner page table?
2^20 entries * 4 bytes per entry
That’s 2^22 bytes.
That’s too big to fit in a page.
A Typical Hierarchical Paging Example
• Same example continued
6. How many pages to store the inner page table?
2^22 bytes / 2^12 bytes per page = 2^10 pages
7. How many lines in the outer page table?
One for each page of the page table, so 2^10 lines
8. So, how many total bytes for the outer table?
2^10 lines * 4 bytes per line = 2^12 bytes
• So, for this example, you can think of a logical
address as:
10 bits 10 bits 12 bits
Logical address p1 p2 d
Hierarchical Address Translation
Memory Access Time
• Now, a read/write in a user process could take three
memory accesses
– Read from the outer page table
– Read from the inner page table
– Read/Write main memory
• Good thing we have a TLB
– It will go all the way from the page number to a line of the
innermost page table
– Now, a TLB hit can bypass two levels of lookup
[Figure: the logical address split into p1 (10 bits), p2 (10 bits), and d (12 bits), beside a TLB whose entries map a full 20-bit page number, plus status bits, directly to a frame number]
A Multi-Level Page Table
• Consider a system with:
– 64-bit logical/physical addresses
– 64 KB page size
1. How many bits for the offset in a logical address?
2. How many bits for the page number?
3. How many lines in the (inner, level-1) page table?
4. Total size of the (inner) page table?
5. How many lines of the page table fit on a page?
6. So, how many lines of a 2nd-level page table?
7. How many lines for a 3rd-level page table?
8. How many levels?
A Multi-Level Page Table
1. How many bits for the offset in a logical address?
That’s the low-order 16 bits.
2. How many bits for the page number?
That’s the high-order 48 bits.
3. How many lines in the (inner, level-1) page table?
One for each page, so 2^48
4. Total size of the (inner) page table?
If it’s 8 bytes per entry, that’s 2^48 * 8 = 2^51 bytes
5. How many lines of the page table fit on a page?
2^16 bytes per page / 8 bytes per entry = 2^13
6. So, how many lines of a 2nd-level page table?
1/2^13 times as many as the (inner) page table, so 2^35
7. How many lines for a 3rd-level page table?
1/2^13 times as many as the level-2 page table, so 2^22
8. How many levels?
The level-4 page table has 2^9 entries. At 8 bytes each, this fits in a page.
9 bits 13 bits 13 bits 13 bits 16 bits
p1 p2 p3 p4 d
The Real World
• AMD64 paged memory
$ ./MemoryRange
heap: 0x5639c8e742a0
stack: 0x7ffe4a86a99c
– 4 KB page size
(or 2 MB or 1 GB, your choice)
– A sneaky trick:
just use the low-order 48 bits of
the logical address
– That explains something we saw earlier:
8 bytes per page table entry.
– We can figure out the structure of this paging
system … let’s do that
AMD64 Illustrated
• Here’s what this paged memory hierarchy looks like:
• I’m sure glad I have a TLB.
Hashed Page Tables
• More common for large address spaces,
greater than 2^32 words
• Store a sparse mapping from page number to
frame number as a hash table
– Hash the page number to some small integer i
– Store a record with the frame number at line i of
the hash table
– You may have collisions; each hash table line holds
a chain of records, one for each page that hashes there
Hashed Page Tables
• On lookup:
– Hash the page number, then look for a matching record
• The point: if the address space is sparse, you don’t need a
big table.
Inverted Page Table
• Consider this
– Every process has a page table, but …
– … there are only so many frames of memory
• Let’s store address translation by memory frame
instead of by process page
– Just keep one entry for each frame, keeping up with
what process and which of its pages occupies it
– Compared to a regular page table, this is backward
– But, we can still use it
Inverted Page Table
• Decreased storage overhead
• More complicated page lookup
• Shared pages?
Page Table Practice
• For this figure, draw:
– A single-level page table for each process
– A hashed page table for each, with hash function h(p) = p % 3
– An inverted page table
Ordinary, Single-Level Page Table for
each Process
Hashed Page Table for each Process
Inverted Page Table
What We’ve Learned
• Load images and memory structure
– Building load images, getting them into memory,
dynamic loading, address binding, etc.
• Dynamic memory allocation and fragmentation
• Paged memory
– A flexible mechanism for execution-time address
binding
– Structure and use of the page table
– Paged memory tricks (shared memory, etc.)
– Complex (but necessary) techniques for representing
the page table.