
Computer Architecture

A Quantitative Approach, Fifth Edition

Chapter 2
Memory Hierarchy Design

Copyright © 2012, Elsevier Inc. All rights reserved.


Introduction
Memory Hierarchy



Introduction
Causes of misses
 Compulsory
 First reference to a block
 Capacity
 Blocks discarded because the cache cannot hold all the blocks the program needs, then later retrieved
 Conflict
 Program makes repeated references to multiple addresses from different blocks that map to the same location in the cache
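To make the conflict case concrete, here is a minimal C sketch with illustrative cache parameters: two addresses exactly one cache-size apart map to the same direct-mapped set, so alternating accesses to them evict each other.

#include <stdio.h>
#include <stdint.h>

/* Illustrative direct-mapped cache: 64-byte blocks, 512 sets (32 KiB). */
#define BLOCK_SIZE 64
#define NUM_SETS   512

static unsigned set_index(uint64_t addr) {
    return (unsigned)((addr / BLOCK_SIZE) % NUM_SETS);
}

int main(void) {
    uint64_t a = 0x10000;
    uint64_t b = a + (uint64_t)BLOCK_SIZE * NUM_SETS;   /* one cache size apart */
    printf("set(a) = %u, set(b) = %u\n", set_index(a), set_index(b));   /* same set */
    return 0;
}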



Introduction
Memory Hierarchy Basics

 Note that speculative and multithreaded processors may execute other instructions during a miss
 Reduces performance impact of misses



Introduction
Memory Hierarchy Basics
 Six basic cache optimizations:
 Larger block size
 Reduces compulsory misses
 Increases capacity and conflict misses, increases miss penalty
 Larger total cache capacity to reduce miss rate
 Increases hit time, increases power consumption
 Higher associativity
 Reduces conflict misses
 Increases hit time, increases power consumption
 Higher number of cache levels
 Reduces overall memory access time
 Giving priority to read misses over writes
 Use a write buffer!
 Reduces miss penalty
 Avoiding address translation in cache indexing
 Reduces hit time
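These trade-offs are usually weighed with the average memory access time formula from the text; the numbers below are purely illustrative:

\[
\text{AMAT} = \text{Hit time} + \text{Miss rate} \times \text{Miss penalty}
\]

For example, a 1-cycle hit time, a 5% miss rate, and a 100-cycle miss penalty give \(\text{AMAT} = 1 + 0.05 \times 100 = 6\) cycles; halving the miss rate (say, by doubling capacity) would lower this to 3.5 cycles even if the hit time grew slightly.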



ADVANCED CACHE OPTIMIZATIONS



Advanced Optimizations
1. Small and simple first level caches
 Critical timing path:
 addressing tag memory, then
 comparing tags, then
 selecting the correct way
 Direct-mapped caches can overlap tag compare and transmission of data
 Lower associativity reduces power because fewer cache lines are accessed



Advanced Optimizations
2. Way Prediction
 To improve hit time, predict the way to pre-set the mux
 Mis-prediction gives longer hit time
 Prediction accuracy
 > 90% for two-way
 > 80% for four-way
 I-cache has better accuracy than D-cache
 First used on MIPS R10000 in mid-90s
 Used on ARM Cortex-A8
 Extend to predict block as well
 “Way selection”
 Increases mis-prediction penalty
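A minimal software model of the mechanism, assuming a 2-way set-associative cache with 64 sets and 64-byte blocks; every name and parameter here is illustrative, not any shipping design:

#include <stdint.h>
#include <stdbool.h>

#define NSETS 64
#define NWAYS 2

typedef struct {
    bool     valid[NWAYS];
    uint64_t tag[NWAYS];
    int      pred_way;    /* way predicted for the next access to this set */
} cache_set_t;

static cache_set_t cache[NSETS];

/* 0 = miss, 1 = fast hit (prediction right), 2 = slow hit (extra cycle). */
int lookup(uint64_t addr) {
    unsigned set = (unsigned)((addr >> 6) % NSETS);   /* 6 offset bits */
    uint64_t tag = addr >> 12;                        /* 6 offset + 6 index bits */
    cache_set_t *s = &cache[set];

    int p = s->pred_way;
    if (s->valid[p] && s->tag[p] == tag)
        return 1;                      /* predicted way hit: normal hit time */

    for (int w = 0; w < NWAYS; w++)
        if (w != p && s->valid[w] && s->tag[w] == tag) {
            s->pred_way = w;           /* retrain the predictor */
            return 2;                  /* hit, but with mis-prediction penalty */
        }
    return 0;                          /* miss */
}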



Advanced Optimizations
3. Pipelining Cache
 Pipeline cache access to improve bandwidth
 Examples:
 Pentium: 1 cycle
 Pentium Pro – Pentium III: 2 cycles
 Pentium 4 – Core i7: 4 cycles
 Increases branch mis-prediction penalty
 Makes it easier to increase associativity



Advanced Optimizations
4. Nonblocking Caches
 Allow hits before previous misses complete
 “Hit under miss”
 “Hit under multiple miss”
 L2 must support this
 In general, processors can hide L1 miss penalty but not L2 miss penalty
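Nonblocking caches are commonly built around miss status holding registers (MSHRs). The sketch below, with illustrative names and sizes, shows the allocation step that lets the cache keep servicing hits while misses are outstanding:

#include <stdint.h>
#include <stdbool.h>

#define NMSHR 4    /* illustrative: max outstanding misses */

typedef struct {
    bool     valid;
    uint64_t block_addr;
} mshr_t;

static mshr_t mshr[NMSHR];

/* On a miss, record it so later hits can proceed. Returns false when
 * every MSHR is busy and the cache must stall. */
bool allocate_mshr(uint64_t block_addr) {
    for (int i = 0; i < NMSHR; i++)    /* merge with an in-flight miss */
        if (mshr[i].valid && mshr[i].block_addr == block_addr)
            return true;
    for (int i = 0; i < NMSHR; i++)    /* otherwise claim a free entry */
        if (!mshr[i].valid) {
            mshr[i].valid = true;
            mshr[i].block_addr = block_addr;
            return true;
        }
    return false;
}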



Advanced Optimizations
5. Multibanked Caches
 Organize cache as independent banks to support simultaneous access
 ARM Cortex-A8 supports 1-4 banks for L2
 Intel i7 supports 4 banks for L1 and 8 banks for L2
 Interleave banks according to block address
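Sequential interleaving reduces to a one-line mapping from address to bank; the parameters here are illustrative:

#include <stdint.h>

/* Block i goes to bank i mod nbanks ("sequential interleaving"). */
unsigned bank_of(uint64_t addr, unsigned block_size, unsigned nbanks) {
    return (unsigned)((addr / block_size) % nbanks);
}

With 4 banks and 64-byte blocks, consecutive blocks land in banks 0, 1, 2, 3 and then wrap, so a sequential stream keeps all banks busy in parallel.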



Advanced Optimizations
6. Critical Word First, Early Restart
 Critical word first
 Request missed word from memory first
 Send it to the processor as soon as it arrives
 Early restart
 Request words in normal order
 Send missed word to the processor as soon as it arrives
 Effectiveness of these strategies depends on block size and likelihood of another access to the portion of the block that has not yet been fetched
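A worked example under illustrative assumptions: a 64-byte block, one 8-byte bus transfer per cycle after a 10-cycle access latency, and a demand for the fourth 8-byte chunk of the block. The processor can then restart after

\[
10 + 8 = 18 \text{ cycles (neither optimization)}, \quad
10 + 4 = 14 \text{ (early restart)}, \quad
10 + 1 = 11 \text{ (critical word first)}.
\]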



Advanced Optimizations
7. Merging Write Buffer
 When storing to a block that is already pending in the write buffer, update the existing write buffer entry
 Reduces stalls due to a full write buffer
 Do not apply to I/O addresses

[Figure: write buffer contents without vs. with write merging]
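A minimal sketch of the merge check, assuming 64-byte buffer entries; all names and sizes are illustrative:

#include <stdint.h>
#include <stdbool.h>

#define WB_ENTRIES 4
#define BLOCK      64

typedef struct {
    bool     valid;
    uint64_t block_addr;    /* aligned address of the 64-byte block */
    uint8_t  data[BLOCK];
    uint64_t byte_mask;     /* one bit per valid byte */
} wb_entry_t;

static wb_entry_t wb[WB_ENTRIES];

/* Returns true if the store was absorbed; false means the buffer is
 * full and the store must stall until an entry drains to memory. */
bool write_buffer_store(uint64_t addr, uint8_t val) {
    uint64_t blk = addr & ~(uint64_t)(BLOCK - 1);
    unsigned off = (unsigned)(addr & (BLOCK - 1));

    for (int i = 0; i < WB_ENTRIES; i++)    /* merge into a pending entry */
        if (wb[i].valid && wb[i].block_addr == blk) {
            wb[i].data[off] = val;
            wb[i].byte_mask |= 1ULL << off;
            return true;
        }
    for (int i = 0; i < WB_ENTRIES; i++)    /* otherwise take a free entry */
        if (!wb[i].valid) {
            wb[i].valid = true;
            wb[i].block_addr = blk;
            wb[i].data[off] = val;
            wb[i].byte_mask = 1ULL << off;
            return true;
        }
    return false;
}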



Advanced Optimizations
8. Compiler Optimizations
 Loop Interchange
 Swap nested loops to access memory in sequential order
 Blocking
 Instead of accessing entire rows or columns, subdivide matrices into blocks
 Requires more memory accesses but improves locality of accesses (both techniques are sketched below)
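Both transformations in C, in the style of the examples in the text; N and the blocking factor B are illustrative, and N is assumed to be a multiple of B:

#define N 512
#define B 64    /* blocking (tile) factor; assumes N % B == 0 */

double x[N][N], y[N][N], z[N][N];

/* Loop interchange: the first version touches x column by column, one
 * element per cache block in row-major C; swapping the loops makes the
 * inner loop walk consecutive addresses. */
void scale_before(void) {
    for (int j = 0; j < N; j++)
        for (int i = 0; i < N; i++)
            x[i][j] = 2 * x[i][j];
}

void scale_after(void) {
    for (int i = 0; i < N; i++)
        for (int j = 0; j < N; j++)
            x[i][j] = 2 * x[i][j];
}

/* Blocking: accumulate x += y * z one B x B tile at a time so the tiles
 * of y and z stay cache-resident while they are reused. */
void matmul_blocked(void) {
    for (int jj = 0; jj < N; jj += B)
        for (int kk = 0; kk < N; kk += B)
            for (int i = 0; i < N; i++)
                for (int j = jj; j < jj + B; j++) {
                    double r = 0.0;
                    for (int k = kk; k < kk + B; k++)
                        r += y[i][k] * z[k][j];
                    x[i][j] += r;
                }
}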



Advanced Optimizations
9. Hardware Prefetching
 Fetch two blocks on miss (include next sequential block); see the sketch below

[Figure: Pentium 4 hardware prefetching speedup]
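The policy itself is tiny. In this sketch, fetch_block and prefetch_block are hypothetical stand-ins for the memory interface; the prefetched block typically goes to a stream buffer rather than directly into the cache:

#include <stdint.h>

static void fetch_block(uint64_t blk)    { (void)blk; /* hypothetical: demand fetch into cache */ }
static void prefetch_block(uint64_t blk) { (void)blk; /* hypothetical: fetch into stream buffer */ }

/* On a miss to block b, also request the next sequential block b+1. */
void handle_miss(uint64_t blk) {
    fetch_block(blk);
    prefetch_block(blk + 1);
}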



Advanced Optimizations
10. Compiler Prefetching
 Insert prefetch instructions before data is needed
 Non-faulting: prefetch doesn’t cause exceptions
 Register prefetch
 Loads data into register
 Cache prefetch
 Loads data into cache
 Combine with loop unrolling and software pipelining
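A minimal sketch using GCC/Clang's __builtin_prefetch, which compiles to a non-faulting cache-prefetch instruction where the target supports one; the prefetch distance is an illustrative tuning parameter:

#define PREFETCH_DIST 16    /* illustrative: tune to memory latency */

double sum(const double *a, int n) {
    double s = 0.0;
    for (int i = 0; i < n; i++) {
        if (i + PREFETCH_DIST < n)
            __builtin_prefetch(&a[i + PREFETCH_DIST], /*rw=*/0, /*locality=*/3);
        s += a[i];
    }
    return s;
}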



Advanced Optimizations
Summary

