
CMP3010: Computer Architecture

L08: Memory Hierarchy

Dina Tantawy
Computer Engineering Department
Cairo University
Agenda
• Introduction
• The principle of locality
  – Temporal locality
  – Spatial locality
• Memory Hierarchy
• The basics of caches
• Direct-mapped caches
Is it a “Computing” Machine ??

[Figure: the processor computes, fetching instructions from memory and loading/storing data between memory and registers.]
Introduction
• Large memories are slow but cheap.
• Small memories are fast but expensive.
• Make the average access time small by servicing most accesses from a small, fast memory.
Principle of locality
States that programs access a relatively small portion of their address space at any instant of time.

• Temporal locality: if an item is referenced, it will tend to be referenced again soon.
• Spatial locality: if an item is referenced, items whose addresses are close by will tend to be referenced soon.
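As an illustration (a sketch not taken from the slides), a simple C loop exhibits both kinds of locality at once:

    /* Minimal sketch: summing an array shows both kinds of locality. */
    #include <stdio.h>

    int main(void) {
        int a[1024];
        for (int i = 0; i < 1024; i++)
            a[i] = i;

        long sum = 0;
        for (int i = 0; i < 1024; i++)
            sum += a[i];   /* spatial locality: a[i] sits next to a[i+1];
                              temporal locality: sum and i are reused on
                              every iteration */
        printf("%ld\n", sum);
        return 0;
    }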
Memory Hierarchy
• A structure that uses multiple levels of memories; as the distance from the CPU increases, the size of the memories and the access time both increase.
• Main memory is implemented from DRAM, while levels closer to the processor (caches) use SRAM.
• Memory hierarchies take advantage of temporal locality by keeping more recently accessed data items closer to the processor. They take advantage of spatial locality by moving blocks consisting of multiple contiguous words in memory to upper levels of the hierarchy.
Memory Hierarchy
• The data is similarly hierarchical: a level closer to the processor is generally a subset of any level further away, and all the data is stored at the lowest level.
The basic structure of memory hierarchy

[Figure: the memory hierarchy from the processor outward; levels near the processor are smaller and faster, levels further away are larger and slower.]
Memory Hierarchy: Principles of Operation
• A memory hierarchy can consist of multiple levels, but data is copied between only two adjacent levels at a time.
• Upper level (cache): the one closer to the processor.
  – Smaller, faster, and uses more expensive technology.
• Lower level (memory): the one further away from the processor.
  – Bigger, slower, and uses less expensive technology.
• Block (line): the minimum unit of information that can be either present or not present in a cache.
Memory Hierarchy: Terminologies
• Hit: data appears in some block in the upper level.
  – Hit rate: the fraction of memory accesses found in the upper level.
  – Hit time: time to access the upper level, which consists of cache access time + time to determine hit/miss.
• Miss: data needs to be retrieved from a block in the lower level.
  – Miss rate = 1 - (hit rate)
  – Miss penalty = time to replace a block in the upper level + time to deliver the block to the processor
• Hit time << miss penalty
The Basics of Caches

The Basics of Caches
• How do we know if a data item is in the cache?
• How do we find it?

Direct Mapped Cache
Direct-mapped cache: a cache structure in which each memory location is mapped to exactly one location in the cache.
Direct Mapped Cache
Which block should I search?

  (Block address) mod (Number of blocks in cache)

e.g., address 11001 → 25, and 25 % 8 = 1, so it maps to block 1 of an 8-block cache (see the sketch below).
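As a minimal sketch (not from the slides), the mapping can be written in C; the block count of 8 is an assumption matching the example above:

    #include <stdio.h>

    #define NUM_BLOCKS 8   /* assumed, matching the 8-block example */

    /* Map a block address to its only possible cache location. */
    unsigned cache_index(unsigned block_addr) {
        return block_addr % NUM_BLOCKS;   /* equals block_addr & (NUM_BLOCKS - 1)
                                             when NUM_BLOCKS is a power of two */
    }

    int main(void) {
        printf("%u\n", cache_index(25));  /* prints 1, as in the slide */
        return 0;
    }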
Direct Mapped Cache
• Because each cache location can contain the contents of a number of different memory locations, how do we know whether the data in the cache corresponds to a requested word?

  Addresses 11001 and 10001 both map to block 1.

• Tag: a field in a table used for a memory hierarchy that contains the address information required to identify whether the associated block in the hierarchy corresponds to a requested word.
Direct Mapped Cache
• How do we recognize that a cache block does not have valid information?
• Valid bit: a field in the tables of a memory hierarchy that indicates that the associated block in the hierarchy contains valid data.
Accessing Caches

[Figure sequence: a series of memory references is traced through a direct-mapped cache, producing the outcomes MISS, MISS, HIT, HIT, MISS, MISS, HIT, MISS, HIT as lines are filled, reused, and replaced; the sequence is reproduced in the sketch below.]
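A minimal lookup-and-fill sketch in C (an illustration, not the slides' figure), combining the index, tag, and valid bit described above. The geometry and the reference sequence are assumptions chosen to reproduce the slide's HIT/MISS pattern:

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define NUM_BLOCKS 8                  /* assumed cache geometry */

    struct cache_line {
        bool     valid;                   /* valid bit */
        uint32_t tag;                     /* identifies which block is cached */
        uint32_t data;                    /* one word per block, for brevity */
    };

    static struct cache_line cache[NUM_BLOCKS];

    /* Stand-in for the lower level: "memory" just returns the address. */
    static uint32_t memory_fetch(uint32_t block_addr) { return block_addr; }

    uint32_t cache_access(uint32_t block_addr) {
        uint32_t index = block_addr % NUM_BLOCKS;   /* which line to search */
        uint32_t tag   = block_addr / NUM_BLOCKS;   /* remaining address bits */
        struct cache_line *l = &cache[index];
        if (l->valid && l->tag == tag) {
            puts("HIT");
            return l->data;
        }
        puts("MISS");                               /* fetch and fill the line */
        l->valid = true;
        l->tag   = tag;
        l->data  = memory_fetch(block_addr);
        return l->data;
    }

    int main(void) {
        /* Prints MISS MISS HIT HIT MISS MISS HIT MISS HIT. */
        uint32_t refs[] = { 22, 26, 22, 26, 16, 3, 16, 18, 16 };
        for (int i = 0; i < 9; i++)
            cache_access(refs[i]);
        return 0;
    }

Note the reference to address 18: it maps to the same line as 26 but carries a different tag, so it evicts that line and misses.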
Definition of a Cache Block
• Cache block: the cache data that has its own cache tag.
• Example: a 4-byte direct-mapped cache with block size = 1 byte.
  – Takes advantage of temporal locality: if a byte is referenced, it will tend to be referenced again soon.
  – Does not take advantage of spatial locality: if a byte is referenced, its adjacent bytes will tend to be referenced soon.
• To take advantage of spatial locality: increase the block size.

[Figure: a 4-entry direct-mapped cache, one line per byte (Byte 0 to Byte 3), each line with a valid bit and a cache tag.]
Example: 1 KB Direct Mapped Cache with 32-Byte Blocks
• For a 2^N byte cache, the address is split as follows (see the sketch below):
  – The uppermost (32 - N) bits are always the cache tag.
  – The lowest M bits are the byte select (block size = 2^M), i.e. the byte within the block.
  – The bits in between are the cache index.
• For N = 10 and M = 5, a 32-bit address splits into:
  – Bits 31-10: cache tag (example: 0x50), stored as part of the cache "state"
  – Bits 9-5: cache index (example: 0x01)
  – Bits 4-0: byte offset (example: 0x00)

[Figure: 32 cache lines (index 0 to 31), each holding a valid bit, a cache tag, and 32 bytes of data; line 0 holds bytes 0-31 of its block, line 1 with tag 0x50 holds bytes 32-63, and so on up to byte 1023 in line 31.]
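A small C sketch of this address split (the address value is hypothetical, chosen to reproduce the slide's example fields of tag 0x50, index 1, offset 0):

    #include <stdint.h>
    #include <stdio.h>

    /* 1 KB cache with 32-byte blocks: 32 lines, 5 offset bits, 5 index bits. */
    #define OFFSET_BITS 5
    #define INDEX_BITS  5

    int main(void) {
        uint32_t addr   = 0x00014020u;   /* hypothetical example address */
        uint32_t offset = addr & ((1u << OFFSET_BITS) - 1);
        uint32_t index  = (addr >> OFFSET_BITS) & ((1u << INDEX_BITS) - 1);
        uint32_t tag    = addr >> (OFFSET_BITS + INDEX_BITS);
        /* Prints: tag=0x50 index=1 offset=0 */
        printf("tag=0x%x index=%u offset=%u\n",
               (unsigned)tag, (unsigned)index, (unsigned)offset);
        return 0;
    }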
Direct-Mapped Cache

[Figure: the address is divided into a t-bit tag, a k-bit index, and a b-bit byte offset; the index selects one of 2^k lines, each holding a valid bit, a tag, and a data block; a comparator checks the stored tag against the address tag to produce HIT and the selected data word or byte.]
Block Size Tradeoff
• In general, a larger block size takes advantage of spatial locality, BUT:
  – A larger block size means a larger miss penalty: it takes longer to fill up the block.
  – If the block size is too big relative to the cache size, the miss rate will go up: fewer blocks compromises temporal locality.
• Average access time = hit time + miss penalty x miss rate (a worked example follows the figure note)

[Figure: three plots against block size. Miss penalty grows with block size; miss rate follows a U shape, first exploiting spatial locality and then rising as fewer blocks compromise temporal locality; average access time combines the two, increasing at large block sizes due to the increased miss penalty and miss rate.]
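To make the formula concrete, a worked example with illustrative numbers (not from the slides): with a hit time of 1 cycle, a miss rate of 5%, and a miss penalty of 100 cycles,

  Average access time = 1 + 100 x 0.05 = 6 cycles

Halving the miss rate to 2.5% brings this down to 3.5 cycles, which is why block size choices that affect miss rate and miss penalty dominate the average.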
Cache Size
• The total number of bits needed for a cache is a function of the cache size and the address size, because the cache includes both the storage for the data and the tags.
Cache Size
• For the following situation:
  – 32-bit byte addresses
  – A direct-mapped cache
  – The cache size is 2^n blocks, so n bits are used for the index.
  – The block size is 2^m words, so m bits are used for the word within the block, and two bits are used for the byte part of the address.
• The size of the tag field is 32 - (n + m + 2).
• The total number of bits in a direct-mapped cache is:

  2^n x (block data bits + tag bits + valid bit) = 2^n x (2^m x 32 + (32 - n - m - 2) + 1)
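As an illustration of the formula (numbers assumed, since the exercise figures did not survive the transcript): a direct-mapped cache holding 16 KiB of data in 4-word (16-byte) blocks with 32-bit addresses has 2^10 blocks, so n = 10 and m = 2:

  Tag bits   = 32 - (10 + 2 + 2) = 18
  Total bits = 2^10 x (2^2 x 32 + 18 + 1) = 2^10 x 147 = 147 Kibits (about 18.4 KiB)

so the tags and valid bits add roughly 15% on top of the 16 KiB of data storage.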
Exercise

[Exercise figures not recoverable from this transcript.]
What happens if we replace a block that already has data ??
Read and Write Policies
• Cache reads are much easier to handle than cache writes:
  – An instruction cache is much easier to design than a data cache.
• Cache writes:
  – How do we keep the data in the cache and memory consistent?
Read and Write Policies
• Two write options when the data block is in the cache (see the sketch below):
  – Write through: write to the cache and memory at the same time.
    Isn't memory too slow for this?
  – Write back: write to the cache only. Write the cache block to memory when that cache block is being replaced on a cache miss.
    Needs a "dirty" bit for each cache block.
    Control can be complex.
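A minimal sketch of the two write-hit policies in C (illustrative; memory_write and the line layout are assumed names, with the memory write stubbed so the unit compiles):

    #include <stdbool.h>
    #include <stdint.h>

    struct line { bool valid, dirty; uint32_t tag, data; };

    /* Hypothetical lower-level write (e.g., to DRAM), stubbed here. */
    static void memory_write(uint32_t addr, uint32_t word) {
        (void)addr; (void)word;          /* model: the write reaches memory */
    }

    /* Write through: update the cache and memory at the same time. */
    void write_hit_through(struct line *l, uint32_t addr, uint32_t word) {
        l->data = word;
        memory_write(addr, word);        /* memory is always up to date */
    }

    /* Write back: update the cache only and mark the line dirty; the
     * block goes to memory when the line is replaced on a later miss. */
    void write_hit_back(struct line *l, uint32_t word) {
        l->data  = word;
        l->dirty = true;
    }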
Write Buffer for Write Through

[Figure: the processor writes into the cache and into a write buffer sitting between the cache and DRAM; the memory controller drains the buffer into memory.]

• A write buffer is needed between the cache and memory.
  – Processor: writes data into the cache and the write buffer.
  – Memory controller: writes the contents of the buffer to memory.
• The write buffer is just a FIFO (a sketch follows):
  – Typical number of entries: 4
  – Works fine if: store frequency (w.r.t. time) << 1 / DRAM write cycle
• Memory system designer's nightmare:
  – Store frequency (w.r.t. time) -> 1 / DRAM write cycle
  – Write buffer saturation
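A sketch of the buffer itself, a fixed-size ring FIFO with the 4 entries mentioned above (names and structure are illustrative, not from the slides):

    #include <stdbool.h>
    #include <stdint.h>

    #define WB_ENTRIES 4   /* typical size from the slide */

    struct wb_entry { uint32_t addr, data; };

    static struct wb_entry buf[WB_ENTRIES];
    static unsigned head, tail, count;

    /* Processor side: returns false when the buffer is saturated,
     * which would stall the store. */
    bool wb_push(uint32_t addr, uint32_t data) {
        if (count == WB_ENTRIES) return false;   /* write buffer saturation */
        buf[tail] = (struct wb_entry){ addr, data };
        tail = (tail + 1) % WB_ENTRIES;
        count++;
        return true;
    }

    /* Memory controller side: drains one entry per DRAM write cycle. */
    bool wb_pop(struct wb_entry *out) {
        if (count == 0) return false;
        *out = buf[head];
        head = (head + 1) % WB_ENTRIES;
        count--;
        return true;
    }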
What if the data block we are writing to is not in the cache ??
Write Miss Policies
• Write allocate (also called fetch on write): the data at the missed-write location is loaded into the cache, followed by a write-hit operation. In this approach, write misses are like read misses.
• No-write allocate (also called write-no-allocate or write around): the data at the missed-write location is not loaded into the cache, and is written directly to the backing store. In this approach, data is loaded into the cache on read misses only.
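A sketch of the two miss policies in C (illustrative; fetch_block and memory_write are hypothetical helpers, stubbed so the unit compiles):

    #include <stdbool.h>
    #include <stdint.h>

    struct line { bool valid, dirty; uint32_t tag, data; };

    static struct line the_line;                      /* stand-in cache line */
    static struct line *fetch_block(uint32_t addr) {  /* fill line from memory */
        the_line.valid = true;
        the_line.tag   = addr;                        /* simplified tagging */
        the_line.dirty = false;
        return &the_line;
    }
    static void memory_write(uint32_t addr, uint32_t word) {
        (void)addr; (void)word;                       /* write to backing store */
    }

    /* Write allocate: load the block, then proceed as a write hit. */
    void write_miss_allocate(uint32_t addr, uint32_t word) {
        struct line *l = fetch_block(addr);
        l->data  = word;
        l->dirty = true;              /* the natural pairing with write back */
    }

    /* No-write allocate (write around): skip the cache entirely. */
    void write_miss_no_allocate(uint32_t addr, uint32_t word) {
        memory_write(addr, word);     /* the cache fills on read misses only */
    }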
Write Miss Policies
• Both write-through and write-back policies can use either of these write-miss policies, but usually they are paired in this way:
  – A write-back cache uses write allocate, hoping for subsequent writes (or even reads) to the same location, which is now cached.
  – A write-through cache uses no-write allocate. Here, subsequent writes have no advantage, since they still need to be written directly to the backing store.
Thank you
