Lec 5

This document discusses block replacement techniques and write strategies for cache memory. It describes common block replacement algorithms like random, FIFO, LRU, LFU, and NRU. It also discusses implementations of pseudo-LRU and techniques like RRIP. For write strategies, it explains write-through versus write-back approaches and whether to use write allocation on a write miss. It categorizes cache misses as compulsory, capacity, or conflict misses.


Multicore Computer Architecture - Storage and Interconnects

Lecture 5
Block Replacement Techniques & Write Strategy

Dr. John Jose


Assistant Professor
Department of Computer Science & Engineering
Indian Institute of Technology Guwahati, Assam.
Processor Memory Performance Gap
Memory Hierarchy
Four cache memory design choices
• Where can a block be placed in the cache? – Block Placement
• How is a block found if it is in cache memory? – Block Identification
• Which block should be replaced on a miss? – Block Replacement
• What happens on a write? – Write Strategy
Block Replacement
• The cache has finite size. What do we do when it is full?
• Direct mapped is easy: each block can live in only one line, so there is no choice to make
• In a set-associative cache: which block in the selected set should be evicted?
Block Replacement Algorithms
• Random
• First In First Out (FIFO)
• Least Recently Used (LRU), pseudo-LRU
• Last In First Out (LIFO)
• Not Recently Used (NRU)
• Least Frequently Used (LFU)
• Re-Reference Interval Prediction (RRIP)
• Optimal
Random Replacement Policy
• The random policy needs a pseudo-random number generator
• Overhead is an O(1) amount of work per block replacement
• Makes no attempt to take advantage of temporal or spatial locality
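As a minimal sketch (the helper name `random_victim` is hypothetical), random victim selection is a single draw:

```python
import random

def random_victim(num_ways: int) -> int:
    # O(1) work per replacement: draw one way index uniformly.
    # No recency or frequency state is kept, so locality is not exploited.
    return random.randrange(num_ways)
```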
FIFO Replacement Policy
• The first-in, first-out (FIFO) policy evicts the block that has been in the cache the longest
• It requires a queue Q of resident blocks
• Blocks are enqueued in Q when filled; a dequeue operation on Q determines which block to evict
• Overhead is an O(1) amount of work per block replacement
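A sketch of one FIFO-managed set (the class name `FIFOSet` is hypothetical); note that, unlike LRU, a hit does not reorder the queue:

```python
from collections import deque

class FIFOSet:
    """One cache set with FIFO replacement (illustrative sketch)."""
    def __init__(self, ways):
        self.ways = ways
        self.q = deque()          # front = block resident the longest

    def access(self, tag):
        if tag in self.q:         # hit: FIFO does NOT reorder on a hit
            return True
        if len(self.q) == self.ways:
            self.q.popleft()      # dequeue: evict the oldest resident block
        self.q.append(tag)        # enqueue the newly filled block
        return False
```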
Optimal Replacement Policy
• Evict the block with the longest reuse distance, i.e. the block whose next reference is farthest in the future
• Requires knowledge of the future!
• Cannot be built, but can be modeled with a trace
• Useful, since it reveals the remaining opportunity
• Optimal is better than LRU
• Example (X, A, B, C, D, X): in a 4-way set-associative LRU cache the second X misses, since A, B, C, D push X out; the optimal policy would evict one of A–D instead and hit on the second X
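The slide's trace can be replayed with a small one-set model (the function name `simulate` is hypothetical); OPT is modeled from the full trace, which is exactly the "knowledge of the future" a real cache lacks:

```python
def simulate(trace, ways, policy):
    """Miss count for one set under 'LRU' or 'OPT' (illustrative sketch).
    `resident` is kept in recency order for LRU: front = least recent."""
    resident, misses = [], 0
    for i, blk in enumerate(trace):
        if blk in resident:
            if policy == 'LRU':
                resident.remove(blk)     # refresh recency on a hit
                resident.append(blk)
            continue
        misses += 1
        if len(resident) == ways:
            if policy == 'LRU':
                victim = resident[0]     # least recently used
            else:                        # OPT: farthest next reference wins
                def next_use(b, rest=trace[i + 1:]):
                    return rest.index(b) if b in rest else float('inf')
                victim = max(resident, key=next_use)
            resident.remove(victim)
        resident.append(blk)
    return misses
```

On the slide's trace X, A, B, C, D, X with 4 ways, LRU takes 6 misses (the second X misses) while OPT takes 5 (it keeps X and evicts a never-reused block).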
Least-Recently Used Policy
• For associativity = 2, LRU is equivalent to NMRU (Not Most Recently Used)
  – A single bit per line indicates LRU/MRU
  – Set/clear the bit on each access
• For a > 2, LRU is difficult/expensive
  – Timestamps? How many bits? Must find the minimum timestamp on each eviction
  – Sorted list? Re-sort on every access? List overhead: log2(a) bits per block
  – Shift register implementation
Random vs FIFO vs LRU
• Random policy: the old block to evict is chosen at random
• FIFO policy: the old block to evict is the one present longest
  (insert times: 8:00 am, 7:48 am, 9:05 am, 7:10 am, 7:30 am, 10:10 am, 8:45 am → evict the 7:10 am block)
• LRU policy: the old block to evict is the least recently used
  (last-used times: 7:25 am, 8:12 am, 9:22 am, 6:50 am, 8:20 am, 10:02 am, 9:50 am → evict the 6:50 am block)


LRU Implementation
Recency stack of cache lines (CL), ordered from LRU (top) to MRU (bottom):

          Cycle 1       Cycle 2       Cycle 3       Cycle 4
  Start   Hit in CL 0   Hit in CL 4   Hit in CL 7   Miss: replace CL 6
LRU  4        4             6             6             3
     6        6             3             3             1
     3        3             1             1             5
     1        1             7             5             2
     0        7             5             2             0
     7        5             2             0             4
     5        2             0             4             7
MRU  2        0             4             7             6

Each hit moves the referenced line to the MRU position; the miss evicts the LRU line (CL 6) and installs the refilled line at MRU.
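The stack transitions above can be reproduced with a short list-based model (the function name `lru_update` is hypothetical):

```python
def lru_update(stack, line, hit):
    """stack is ordered LRU -> MRU. On a hit, move the line to MRU; on a
    miss, evict the LRU entry and install the (refilled) line at MRU."""
    if hit:
        stack.remove(line)
    else:
        stack.pop(0)              # victim = stack[0], the LRU line
    stack.append(line)            # referenced/installed line becomes MRU
    return stack

s = [4, 6, 3, 1, 0, 7, 5, 2]      # starting state from the slide, LRU first
lru_update(s, 0, hit=True)        # cycle 1: hit in CL 0
lru_update(s, 4, hit=True)        # cycle 2: hit in CL 4
lru_update(s, 7, hit=True)        # cycle 3: hit in CL 7
lru_update(s, 6, hit=False)       # cycle 4: miss, LRU victim CL 6 replaced
# s is now [3, 1, 5, 2, 0, 4, 7, 6], matching the cycle-4 column
```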


Practical Pseudo-LRU

[Figure: 8-way example tree over blocks J, F, C, B, X, Y, A, Z; each internal node bit marks which half of its subtree is older and which is newer]

• Rather than true LRU, use a binary tree
• Each node records which half of its subtree is older/newer
• Update the nodes along the path on each reference
• Follow the "older" pointers from the root to find the LRU victim
Practical Pseudo-LRU

[Figure: the same tree after the reference sequence J, Y, X, Z, B, C, F, A; the node-bit path 011 leads to the PLRU block (B), the path 110 to the MRU block (A)]

Partial order encoded in the tree: Z<A, Y<X, B<C, J<F, A>X, C<F, A>F
Practical Pseudo-LRU
Refs: J, Y, X, Z, B, C, F, A

[Figure: 8-way PLRU tree for this reference sequence; path 011 reaches the PLRU block B, path 110 reaches the MRU block A]

• The binary tree encodes a PLRU partial order
• At each level, the node bit points to the LRU half of the subtree
• On each access, flip the node bits along the path to the block
• On eviction, follow the LRU path from the root
• Overhead: (a-1)/a bits per block
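The tree mechanics can be sketched for a power-of-two associativity (the class name `TreePLRU` and the bit convention below are assumptions of this sketch, not a fixed standard):

```python
class TreePLRU:
    """Tree pseudo-LRU for one set with a power-of-two number of ways.
    Uses a - 1 node bits per set; bit = 0 means the LEFT half is older."""
    def __init__(self, ways):
        self.ways = ways
        self.bits = [0] * (ways - 1)   # heap layout: children of n at 2n+1, 2n+2

    def touch(self, way):
        # On a reference, make every node on the path point AWAY from `way`.
        node, lo, hi = 0, 0, self.ways
        while hi - lo > 1:
            mid = (lo + hi) // 2
            if way < mid:              # accessed the left half -> right is older
                self.bits[node] = 1
                node, hi = 2 * node + 1, mid
            else:                      # accessed the right half -> left is older
                self.bits[node] = 0
                node, lo = 2 * node + 2, mid

    def victim(self):
        # On eviction, follow the 'older' pointers from the root to a leaf.
        node, lo, hi = 0, 0, self.ways
        while hi - lo > 1:
            mid = (lo + hi) // 2
            if self.bits[node] == 0:
                node, hi = 2 * node + 1, mid
            else:
                node, lo = 2 * node + 2, mid
        return lo
```

With a ways per set this stores a - 1 bits, i.e. (a-1)/a bits per block as the slide states; because only a partial order is kept, the chosen victim can differ from the true LRU block.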
Not Recently Used (NRU)
• Keep NRU state in 1 bit per block
  – The bit is reset to 0 when the block is installed or re-referenced
  – The bit is set to 1 when the block is not referenced while another block in the same set is referenced
• Evictions favor NRU = 1 blocks
  – If all blocks are NRU = 0 (or all are NRU = 1), pick at random
• Provides some scan and thrash resistance by randomizing evictions rather than following strict LRU order
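One common NRU variant can be sketched as follows (the class name `NRUSet` is hypothetical; this variant sets all bits back to 1 in bulk when no NRU = 1 block remains, rather than per-reference):

```python
import random

class NRUSet:
    """One cache set with NRU replacement: 1 status bit per block,
    where 1 = not recently used. Illustrative sketch of one variant."""
    def __init__(self, ways):
        self.tags = [None] * ways
        self.nru = [1] * ways

    def access(self, tag):
        if tag in self.tags:
            self.nru[self.tags.index(tag)] = 0   # re-reference resets to 0
            return True
        candidates = [i for i, b in enumerate(self.nru) if b == 1]
        if not candidates:                       # every block recently used:
            self.nru = [1] * len(self.nru)       # reset all bits to 1
            candidates = list(range(len(self.nru)))
        victim = random.choice(candidates)       # randomized among NRU blocks
        self.tags[victim] = tag
        self.nru[victim] = 0                     # installed block starts at 0
        return False
```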
Re-reference Interval Prediction (RRIP)
• Extends NRU to multiple bits per block
• Start in the middle; promote on a hit; demote over time
• Can predict near-immediate, intermediate, and distant re-reference intervals
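A static-RRIP (SRRIP) style sketch of these rules (the class name `SRRIPSet` is hypothetical; this variant inserts new blocks at a "long" re-reference value of max − 1 rather than exactly mid-range):

```python
class SRRIPSet:
    """Static RRIP sketch with M-bit re-reference prediction values (RRPV).
    RRPV 0 = near-immediate re-reference predicted; 2**M - 1 = distant."""
    def __init__(self, ways, m=2):
        self.max_rrpv = 2 ** m - 1
        self.tags = [None] * ways
        self.rrpv = [self.max_rrpv] * ways

    def access(self, tag):
        if tag in self.tags:
            self.rrpv[self.tags.index(tag)] = 0     # promote on hit
            return True
        # Find a victim predicted 'distant', aging all blocks until one exists.
        while self.max_rrpv not in self.rrpv:
            self.rrpv = [v + 1 for v in self.rrpv]  # demote over time
        victim = self.rrpv.index(self.max_rrpv)
        self.tags[victim] = tag
        self.rrpv[victim] = self.max_rrpv - 1       # insert at a long interval
        return False
```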
Least Frequently Used
• A counter per block, incremented on each reference
• Evictions choose the lowest count
• The logic is not trivial (the minimum must be found among all a counters)
• Storage overhead
  – 1 bit per block: same overhead as NRU
  – How many bits are helpful?
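A dictionary-based sketch of LFU for one set (the class name `LFUSet` is hypothetical; a real implementation would use bounded hardware counters rather than unbounded integers):

```python
class LFUSet:
    """One cache set with Least-Frequently-Used replacement:
    a reference counter per resident block. Illustrative sketch."""
    def __init__(self, ways):
        self.ways = ways
        self.count = {}                    # tag -> reference count

    def access(self, tag):
        if tag in self.count:
            self.count[tag] += 1           # increment on every reference
            return True
        if len(self.count) == self.ways:   # evict the lowest-count block
            victim = min(self.count, key=self.count.get)
            del self.count[victim]
        self.count[tag] = 1
        return False
```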
Write strategy
• Write through: the information is written both to the block in the cache and to the block in the next-level memory
  – Read misses do not need to write back evicted line contents
• Write back: the information is written only to the block in the cache; the modified cache block is written to main memory only when it is replaced
  – Requires tracking whether each block is clean or dirty
  – Repeated writes to a block cause no repeated writes to memory
What About a Write Miss?
• Write allocate: the block is loaded into the cache on a write miss
• No-write allocate: the block is modified in memory but not brought into the cache
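The traffic difference between the two policies can be shown with a tiny model (the class name `OneSetCache` is hypothetical; it assumes write-allocate and counts block-writes to the next level):

```python
class OneSetCache:
    """Tiny one-set, write-allocate model contrasting write-through and
    write-back by counting block-writes to the next memory level."""
    def __init__(self, ways, write_back=True):
        self.ways, self.write_back = ways, write_back
        self.lines = []                    # (tag, dirty), LRU order, front = LRU
        self.mem_writes = 0

    def write(self, tag):
        for i, (t, _) in enumerate(self.lines):
            if t == tag:                   # write hit: remove, re-append as MRU
                self.lines.pop(i)
                break
        else:                              # write miss: allocate the block
            if len(self.lines) == self.ways:
                _, dirty = self.lines.pop(0)
                if dirty:                  # write-back flushes dirty victims
                    self.mem_writes += 1
        if self.write_back:
            self.lines.append((tag, True))   # mark dirty; memory untouched now
        else:
            self.lines.append((tag, False))
            self.mem_writes += 1             # write-through: memory on every write
```

Three writes to the same block cost the write-through cache three memory writes, while the write-back cache pays only one, and only when the dirty block is finally evicted.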
Types of Cache Misses
• Compulsory
  – The very first access to a block
  – Will occur even in an infinite cache
• Capacity
  – The cache cannot contain all the blocks needed
  – These misses occur even in a fully associative cache of the same capacity
• Conflict
  – Too many blocks map to the same set
  – Occurs in set-associative or direct-mapped caches
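The three categories can be separated by simulation: a miss is compulsory on the first-ever touch, capacity if it would also miss in a fully associative LRU cache of the same total size, and conflict otherwise. A sketch (the function name `classify_misses` is hypothetical; blocks are integer IDs mapped to sets by modulo):

```python
def classify_misses(trace, sets, ways):
    """Classify misses for an LRU set-associative cache (illustrative 3C
    sketch). A fully associative LRU shadow cache of equal total size
    separates capacity misses from conflict misses."""
    size = sets * ways
    seen = set()
    fa = []                                 # fully associative LRU, front = LRU
    cache = {s: [] for s in range(sets)}    # per-set LRU lists
    counts = {'hit': 0, 'compulsory': 0, 'capacity': 0, 'conflict': 0}
    for blk in trace:
        ways_list = cache[blk % sets]
        hit = blk in ways_list
        fa_hit = blk in fa
        if fa_hit:                          # update the shadow cache
            fa.remove(blk)
        fa.append(blk)
        if len(fa) > size:
            fa.pop(0)
        if hit:                             # update the real cache
            ways_list.remove(blk)
            ways_list.append(blk)
            counts['hit'] += 1
            continue
        if len(ways_list) == ways:
            ways_list.pop(0)
        ways_list.append(blk)
        if blk not in seen:                 # classify the miss
            counts['compulsory'] += 1
        elif not fa_hit:
            counts['capacity'] += 1
        else:
            counts['conflict'] += 1
        seen.add(blk)
    return counts
```

For example, blocks 0 and 2 in a 2-set direct-mapped cache collide in set 0 even though the cache has room for both, so their repeats are conflict misses; cycling three blocks through a 2-block fully associative cache produces only capacity misses.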
[email protected]
http://www.iitg.ac.in/johnjose/
