TLB & Caches
- Direct mapped cache: use low address bits to index cache
Why partial residency?
- Assumption we made to simplify things:
  - All of the process's data is in memory
  - Load entire process into memory before it can run

Expanding physical memory
- Virtual address translated to:
  - Physical memory ($1/meg): very fast, but small
  - Disk ($.01/meg): very large, but very slow (millis vs nanos)
- OS software, on a page fault (see the sketch after this section):
  - choose an old page to replace
  - ...
  - set mapping
  - continue thread

- Key questions:
  - what to eject?
    - physical memory is always too small: which page to replace?
    - may need to write the evicted page back to the disk
  - how many pages for each process?
  - what to do when not enough memory?
  - how to deal with thrashing?

Restart the faulting instruction
- Key constraint: don't want the user process to be aware that a page fault happened (just like context switching)
- Can we skip the faulting instruction? No.
- Can we restart the instruction from the beginning?
  - Not if it has partial side effects.
- Can we inspect the instruction to figure out what to do?
  - May be ambiguous where it was.
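To make the flow above concrete, here is a minimal user-space sketch of those steps (choose a victim, write it back if dirty, set the new mapping, continue so the faulting instruction re-executes). All names and data structures are toy stand-ins, not a real kernel API.

    /* Toy model of the OS-software steps above; disk_read/disk_write just print. */
    #include <stdio.h>
    #include <stdbool.h>

    #define NFRAMES 4
    #define NPAGES  16

    struct pte { bool valid, modified; int frame; };

    static struct pte page_table[NPAGES];
    static int frame_owner[NFRAMES];        /* which virtual page holds each frame */
    static int next_victim = 0;             /* FIFO victim pointer */

    static void disk_write(int page) { printf("  write page %d back to disk\n", page); }
    static void disk_read(int page)  { printf("  read  page %d from disk\n", page); }

    static void handle_fault(int page)
    {
        int f = next_victim;                         /* choose an old page to replace */
        next_victim = (next_victim + 1) % NFRAMES;

        int old = frame_owner[f];
        if (old >= 0) {                              /* evict the current occupant */
            if (page_table[old].modified)            /* may need to write it back */
                disk_write(old);
            page_table[old].valid = false;
        }
        disk_read(page);                             /* bring in the missing page */
        frame_owner[f] = page;
        page_table[page] = (struct pte){ .valid = true, .modified = false, .frame = f };
        /* return from the fault: the faulting instruction is re-executed */
    }

    static void touch(int page, bool write)
    {
        if (!page_table[page].valid) {               /* page fault */
            printf("fault on page %d\n", page);
            handle_fault(page);
        }
        if (write) page_table[page].modified = true;
    }

    int main(void)
    {
        for (int i = 0; i < NFRAMES; i++) frame_owner[i] = -1;
        touch(1, false); touch(2, true); touch(3, false);
        touch(4, false); touch(5, false);            /* fifth page forces an eviction */
        touch(2, false);                             /* still resident: no fault */
        return 0;
    }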
Solution: hardware support
- RISC machines are pretty simple:
  - instructions tend to have 1 memory ref & 1 side effect
  - thus, only need the faulting address and the faulting PC
- Example: MIPS

      0xffdcc: add r1,r2,r3
      0xffdd0: ld  r1, 0(sp)      <- Fault: epc = 0xffdd0, bad va = 0x0ef80
      (fault handler runs, then: jump 0xffdd0 to re-execute the load)

- CISC harder:
  - multiple memory references and side effects; interpret the instruction?

Deciding what page(s) to fetch
- Page selection: when to bring pages into memory
- Like all caches: we need to know the future
  - Doesn't the user know? Not reliably
  - How to communicate that to the OS?
- Easy load-time hack: demand paging (sketched below)
  - Load initial page(s). Run. Load others on fault.
  [Timeline: ld init pages -> run -> fault: ld page -> run -> fault: ld page -> ...]
- When will startup be slower? Memory less utilized?
- Most systems do some sort of variant of this
- Tweak: pre-paging. Get the page & its neighbors.
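A toy model of the load-time hack and the pre-paging tweak, only to show the fault-count difference; the page count, the sequential access pattern, and the one-page "neighbor" radius are assumptions for illustration.

    /* Demand paging: load only the initial page, fetch the rest on fault.
     * With prepage on, a fault also fetches the neighbors of the faulting page. */
    #include <stdio.h>
    #include <stdbool.h>

    #define NPAGES 16

    static bool resident[NPAGES];
    static int faults;

    static void fetch(int page)
    {
        if (page >= 0 && page < NPAGES && !resident[page])
            resident[page] = true;          /* "read it from the executable" */
    }

    static void access_page(int page, bool prepage)
    {
        if (!resident[page]) {
            faults++;
            fetch(page);
            if (prepage) {                  /* pre-paging: grab the neighbors too */
                fetch(page - 1);
                fetch(page + 1);
            }
        }
    }

    static int run(bool prepage)
    {
        for (int i = 0; i < NPAGES; i++) resident[i] = false;
        faults = 0;
        resident[0] = true;                 /* load initial page(s), then run */
        for (int p = 0; p < NPAGES; p++)    /* sequential access pattern */
            access_page(p, prepage);
        return faults;
    }

    int main(void)
    {
        printf("demand paging : %d faults\n", run(false));  /* 15 */
        printf("pre-paging    : %d faults\n", run(true));   /* 8  */
        return 0;
    }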
- Replacement based on knowing future references
  - Cons: no on-line implementation

FIFO
- Algorithm
  - Throw out the oldest page
- Pros
  - Low-overhead implementation
- Cons
  - May replace heavily used pages
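A minimal sketch of the FIFO mechanism above: frames are kept in load order and the victim is always the head of the queue, however heavily that page is being used. The array shifting is just to keep the toy code short.

    #include <stdio.h>

    #define NFRAMES 3

    static int fifo[NFRAMES];   /* page held by each queue slot, oldest first */
    static int used = 0;

    /* Returns the evicted page, or -1 if a free frame was available. */
    static int fifo_load(int page)
    {
        if (used < NFRAMES) { fifo[used++] = page; return -1; }
        int victim = fifo[0];                       /* throw out the oldest page */
        for (int i = 1; i < NFRAMES; i++) fifo[i - 1] = fifo[i];
        fifo[NFRAMES - 1] = page;                   /* newcomer goes to the tail */
        return victim;
    }

    int main(void)
    {
        int pages[] = { 1, 2, 3, 4, 5 };
        for (int i = 0; i < 5; i++)
            printf("load %d, evict %d\n", pages[i], fifo_load(pages[i]));
        return 0;
    }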
Least Recently Used (LRU)
- Algorithm
  - Replace the page that hasn't been used for the longest time
- Question
  - What hardware mechanisms are required to implement LRU?
  [Figure: pages ordered from most recently used to least recently used: 5 3 4 7 9 11 2 1 15]

Implementing LRU
- Perfect
  - Use a timestamp on each reference
  - Keep a list of pages ordered by time of reference
  - Is this practical?
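A sketch of the "perfect" scheme for a handful of frames, assuming a global counter serves as the timestamp. It works in a simulator, but stamping and re-ordering on every memory reference is exactly what makes the scheme impractical in hardware.

    #include <stdio.h>

    #define NFRAMES 3

    static int frame_page[NFRAMES];       /* which page each frame holds (-1 = free) */
    static long frame_stamp[NFRAMES];     /* time of that page's last reference */
    static long now = 0;

    static int lru_reference(int page)    /* returns the evicted page, or -1 */
    {
        now++;
        int oldest = 0, free_slot = -1;
        for (int i = 0; i < NFRAMES; i++) {
            if (frame_page[i] == page) { frame_stamp[i] = now; return -1; }  /* hit */
            if (frame_page[i] == -1) free_slot = i;
            if (frame_stamp[i] < frame_stamp[oldest]) oldest = i;
        }
        int slot = (free_slot != -1) ? free_slot : oldest;
        int victim = frame_page[slot];
        frame_page[slot] = page;          /* replace the least recently used page */
        frame_stamp[slot] = now;
        return victim;
    }

    int main(void)
    {
        for (int i = 0; i < NFRAMES; i++) { frame_page[i] = -1; frame_stamp[i] = 0; }
        int refs[] = { 1, 2, 3, 1, 4 };   /* the 4 should evict 2, the LRU page */
        for (int i = 0; i < 5; i++)
            printf("ref %d -> evict %d\n", refs[i], lru_reference(refs[i]));
        return 0;
    }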
Belady's anomaly
- Consider the same reference string with 3 page frames
  - FIFO replacement
  - 1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5
  - 9 page faults! (with 4 frames the same string causes 10 faults, so adding memory increased the fault count)
- This is called Belady's anomaly
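A short FIFO simulation that reproduces the count above and shows the anomaly: 9 faults with 3 frames, 10 with 4.

    #include <stdio.h>

    static int fifo_faults(const int *refs, int n, int nframes)
    {
        int frames[16], used = 0, next = 0, faults = 0;
        for (int i = 0; i < n; i++) {
            int hit = 0;
            for (int j = 0; j < used; j++)
                if (frames[j] == refs[i]) { hit = 1; break; }
            if (hit) continue;
            faults++;
            if (used < nframes) frames[used++] = refs[i];
            else { frames[next] = refs[i]; next = (next + 1) % nframes; }  /* evict oldest */
        }
        return faults;
    }

    int main(void)
    {
        int refs[] = { 1, 2, 3, 4, 1, 2, 5, 1, 2, 3, 4, 5 };
        printf("3 frames: %d faults\n", fifo_faults(refs, 12, 3));  /* 9  */
        printf("4 frames: %d faults\n", fifo_faults(refs, 12, 4));  /* 10 */
        return 0;
    }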
FIFO with 2nd chance (clock)
- Main idea: add a "reference bit" (or use bit) per PTE
- Check the reference bit of the oldest page
  - If it is 0, then replace it
  - If it is 1, clear the reference bit, put the page at the end of the list, and continue searching
- Pros
  - Fast, and does not replace a heavily used page
- Cons
  - The worst case may take a long time
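A sketch of that search, assuming the hardware has already set the reference bits and the frames are visited in load order (the classic clock hand over a circular list).

    #include <stdio.h>

    #define NFRAMES 4

    static int page_in[NFRAMES] = { 10, 11, 12, 13 };  /* resident pages (example) */
    static int ref_bit[NFRAMES] = {  1,  0,  1,  1 };  /* set by hardware on access */
    static int hand = 0;                               /* oldest frame / clock hand */

    static int clock_evict(int new_page)
    {
        for (;;) {
            if (ref_bit[hand] == 0) {                  /* not recently used: take it */
                int victim = page_in[hand];
                page_in[hand] = new_page;
                ref_bit[hand] = 1;
                hand = (hand + 1) % NFRAMES;
                return victim;
            }
            ref_bit[hand] = 0;                         /* second chance */
            hand = (hand + 1) % NFRAMES;
        }
    }

    int main(void)
    {
        printf("evicted page %d\n", clock_evict(20));  /* skips page 10, evicts 11 */
        return 0;
    }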
Enhanced FIFO with 2nd-chance
- Same as the basic FIFO with 2nd chance, except that this method considers both the reference bit and the modified bit:
  - (0,0): neither recently used nor modified
  - (0,1): not recently used but modified
  - (1,0): recently used but clean
  - (1,1): recently used and modified
- Pros
  - Avoids write-backs (prefers to evict clean pages)
- Cons
  - More complicated

State per page table entry
- Many machines maintain four bits per page table entry:
  - use (aka reference): set when page is referenced, cleared by "clock algorithm"
  - modified (aka dirty): set when page is modified, cleared when page is written to disk
  - valid (aka present): ok for program to reference this page
  - read-only: ok for program to read the page, but not to modify it
- Hardware sets the use bit in the TLB; when a TLB entry is replaced, software copies the use bit back to the page table
- Software manages TLB entries as a FIFO list; everything not ...
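A small sketch tying the two lists together: a PTE carrying those four bits, plus a helper that computes the (use, modified) class the enhanced 2nd-chance scheme uses to prefer clean, not-recently-used victims. The bit-field layout is invented for illustration; real page-table formats are fixed by the MMU.

    #include <stdio.h>

    struct pte {
        unsigned valid     : 1;   /* aka present: ok to reference this page          */
        unsigned read_only : 1;   /* ok to read, but not to modify                   */
        unsigned use       : 1;   /* aka reference: set on access, cleared by clock  */
        unsigned modified  : 1;   /* aka dirty: set on write, cleared on write-back  */
        unsigned frame     : 20;  /* physical frame number                           */
    };

    /* 0 = (0,0) best victim ... 3 = (1,1) worst victim (recently used and dirty). */
    static int victim_class(const struct pte *p)
    {
        return (p->use << 1) | p->modified;
    }

    int main(void)
    {
        struct pte clean_old = { .valid = 1, .use = 0, .modified = 0, .frame = 42 };
        struct pte dirty_hot = { .valid = 1, .use = 1, .modified = 1, .frame = 7  };
        printf("clean_old class %d, dirty_hot class %d\n",
               victim_class(&clean_old), victim_class(&dirty_hot));   /* 0 and 3 */
        return 0;
    }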
- Example: IBM 370 needs 6 pages to handle the SS MOVE instruction:
  - the instruction is 6 bytes, might span 2 pages
  - 2 pages to handle the "from" operand
  - 2 pages to handle the "to" operand
Proportional allocation of frames
- s_i = size of process p_i
- S = ∑ s_i over all processes
- m = total number of frames
- a_i = allocation for p_i = (s_i / S) × m

  Example: m = 64, s1 = 10, s2 = 127
           a1 = (10 / 137) × 64 ≈ 5
           a2 = (127 / 137) × 64 ≈ 59

- Global replacement: a process selects a replacement frame from the set of all frames; one process can take a frame from another
- Local replacement: each process selects from only its own set of allocated frames
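The same arithmetic in a few lines of C, confirming the ≈5 / ≈59 split for the example sizes above.

    #include <stdio.h>

    int main(void)
    {
        int s[] = { 10, 127 };                        /* process sizes */
        int m = 64;                                   /* total frames  */
        int S = 0;
        for (int i = 0; i < 2; i++) S += s[i];        /* S = 137 */
        for (int i = 0; i < 2; i++) {
            double a = (double)s[i] / S * m;          /* a_i = (s_i / S) * m */
            printf("a%d = %d/%d * %d = %.1f (about %d frames)\n",
                   i + 1, s[i], S, m, a, (int)(a + 0.5));
        }
        return 0;
    }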