Cache Memory
Budditha Hettige
Department of Statistics and Computer Science University of Sri Jayewardenepura
Cache Memory
A high-speed, small memory. The most frequently used memory words are kept in the cache. When the CPU needs a word, it first checks the cache; if the word is not found there, it checks main memory.
Locality Principle
PRINCIPLE OF LOCALITY: the tendency to reference data items that are near other recently referenced items, or that were recently referenced themselves.
TEMPORAL LOCALITY: a memory location that is referenced once is likely to be referenced multiple times in the near future.
SPATIAL LOCALITY: if a memory location is referenced once, the program is likely to reference a nearby memory location in the near future.
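A short C sketch of both kinds of locality in an ordinary loop (the array name and sizes are arbitrary, chosen only for illustration):

```c
#include <stdio.h>

int main(void) {
    int a[1024];
    int sum = 0;            /* 'sum' is reused every iteration: temporal locality */

    for (int i = 0; i < 1024; i++)
        a[i] = i;

    for (int i = 0; i < 1024; i++)
        sum += a[i];        /* a[0], a[1], ... are adjacent in memory: spatial locality */

    printf("%d\n", sum);
    return 0;
}
```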
Locality Principle
Let
c = cache access time
m = main memory access time
h = hit ratio (fraction of all references that can be satisfied out of the cache)
miss ratio = 1 − h
mean access time = c + (1 − h)m

Example: suppose that a word is read k times in a short interval. The first reference goes to main memory and the remaining k − 1 references hit the cache, so h = (k − 1)/k.
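A quick numeric check of this formula; the access times (1 ns cache, 10 ns memory) are assumed values for illustration only:

```c
#include <stdio.h>

int main(void) {
    double c = 1.0;     /* cache access time, ns (assumed)       */
    double m = 10.0;    /* main memory access time, ns (assumed) */
    int    k = 100;     /* a word read k times in a short interval */

    double h    = (double)(k - 1) / k;   /* hit ratio: 1 miss, k-1 hits */
    double mean = c + (1.0 - h) * m;     /* mean access time            */

    printf("h = %.2f, mean access time = %.2f ns\n", h, mean);
    return 0;
}
```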
Cache Memory
Main memory and the cache are divided into fixed-size blocks. The blocks inside the cache are called cache lines. On a cache miss, an entire cache line is loaded into the cache from main memory. Example:
A 64 KB cache can be divided into 1K lines of 64 bytes, 2K lines of 32 bytes, etc.
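A minimal check of the line counts quoted above (cache and line sizes taken from the example):

```c
#include <stdio.h>

int main(void) {
    unsigned cache_size   = 64 * 1024;    /* 64 KB cache */
    unsigned line_sizes[] = { 64, 32 };   /* two candidate line sizes */

    /* number of lines = cache size / line size */
    for (int i = 0; i < 2; i++)
        printf("%u-byte lines -> %u lines\n",
               line_sizes[i], cache_size / line_sizes[i]);
    return 0;
}
```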
Unified cache
Instructions and data use the same cache
Split cache
Instructions in one cache and data in another
Replacement Algorithm
Optimal replacement: replace the block that will no longer be needed in the future. If all blocks currently in the cache will be used again, replace the one whose next use is furthest in the future.
Random selection: replace a randomly selected block among all blocks currently in the cache.
Replacement Algorithm
FIFO (first-in, first-out): replace the block that has been in the cache for the longest time.
LRU (least recently used): replace the block in the cache that has not been used for the longest time (a minimal sketch follows this list).
LFU (least frequently used): replace the block in the cache that has been used the fewest times.
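A minimal LRU sketch, assuming a software model with a timestamp per block; real caches use hardware approximations such as pseudo-LRU rather than full timestamps:

```c
#include <stdio.h>

#define NBLOCKS 4

/* One cache block: a tag plus the time of its last use. */
struct block {
    int           tag;
    unsigned long last_used;
};

/* Return the index of the least recently used block. */
int lru_victim(const struct block cache[], int n) {
    int victim = 0;
    for (int i = 1; i < n; i++)
        if (cache[i].last_used < cache[victim].last_used)
            victim = i;
    return victim;
}

int main(void) {
    struct block cache[NBLOCKS] = {
        { 10, 4 }, { 11, 1 }, { 12, 9 }, { 13, 3 }
    };
    int v = lru_victim(cache, NBLOCKS);
    /* The block with tag 11 was used longest ago, so it is evicted. */
    printf("evict block %d (tag %d)\n", v, cache[v].tag);
    return 0;
}
```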
The choice of cache mapping scheme affects cost and performance, and there is no single best method that is appropriate for all situations.
Associative Mapping
A block in main memory can be mapped to any available (not already occupied) block in the cache.
Advantage: flexibility. A main memory block can be placed anywhere in the cache.
Disadvantage: slow or expensive. A search through all the cache blocks is needed to check whether an address matches any of the tags.
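A sketch of that search in C, modeling a fully associative lookup; hardware compares all tags in parallel with one comparator per line, so the sequential loop here is only a model (line count and addresses are illustrative):

```c
#include <stdio.h>

#define NLINES     8
#define LINE_BYTES 64

struct line {
    int      valid;
    unsigned tag;      /* the full block address serves as the tag */
};

/* Fully associative lookup: any block may sit in any line, so every
   line's tag must be checked. Returns the line index on a hit, -1 on a miss. */
int lookup(const struct line cache[], unsigned addr) {
    unsigned tag = addr / LINE_BYTES;
    for (int i = 0; i < NLINES; i++)
        if (cache[i].valid && cache[i].tag == tag)
            return i;
    return -1;
}

int main(void) {
    struct line cache[NLINES] = { { 1, 5 }, { 1, 42 } };  /* two valid lines */
    printf("addr 0x0A80 -> line %d\n", lookup(cache, 0x0A80)); /* tag 42: hit  */
    printf("addr 0x1000 -> line %d\n", lookup(cache, 0x1000)); /* miss -> -1   */
    return 0;
}
```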
Direct Mapping
To avoid the search through all cache blocks needed by associative mapping, this method maps each main memory block to exactly one cache block, so (# blocks in main memory) / (# blocks in cache memory) memory blocks compete for each cache block. Each entry (row) in the cache holds exactly one cache line from main memory. Example: with a 32-byte cache line size, a 64 KB cache holds 2K entries.
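A sketch of the address split under direct mapping, using the 64 KB cache with 32-byte lines from the example (2K entries); the address value itself is arbitrary:

```c
#include <stdio.h>

#define LINE_BYTES 32        /* 32-byte cache line        */
#define NLINES     2048      /* 64 KB / 32 B = 2K entries */

int main(void) {
    unsigned addr   = 0x12345678;
    unsigned offset = addr % LINE_BYTES;            /* low 5 bits: byte within line  */
    unsigned index  = (addr / LINE_BYTES) % NLINES; /* next 11 bits: which entry     */
    unsigned tag    = addr / (LINE_BYTES * NLINES); /* remaining high bits: the tag  */

    printf("offset=%u index=%u tag=0x%X\n", offset, index, tag);
    return 0;
}
```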
Direct Mapping
Advantage: direct mapping is faster than associative mapping, as it avoids searching through all the cache tags for a match.
Disadvantage: it lacks mapping flexibility. For example, if two main memory blocks mapped to the same cache block are needed repeatedly (e.g., in a loop), they will keep replacing each other, even though all other cache blocks may be available.
Set-Associative Mapping
This is a trade-off between associative and direct mapping, where each address maps to a certain set of cache locations. The cache is broken into sets, each containing N cache lines, say 4. Each memory address is assigned to one set and can be cached in any of the 4 locations within that set. In other words, within each set the cache is associative, hence the name.
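A sketch of a 4-way set-associative lookup combining both ideas: the index bits select a set (as in direct mapping), then all four ways in that set are searched associatively. All sizes are illustrative:

```c
#include <stdio.h>

#define LINE_BYTES 32
#define NSETS      512    /* illustrative: 64 KB / (32 B * 4 ways) */
#define WAYS       4

struct line { int valid; unsigned tag; };

/* The address picks one set (direct-mapped part); all WAYS tags in
   that set are then compared (associative part). Returns the way on
   a hit, -1 on a miss. */
int lookup(struct line cache[NSETS][WAYS], unsigned addr) {
    unsigned set = (addr / LINE_BYTES) % NSETS;
    unsigned tag = addr / (LINE_BYTES * NSETS);
    for (int w = 0; w < WAYS; w++)
        if (cache[set][w].valid && cache[set][w].tag == tag)
            return w;
    return -1;
}

int main(void) {
    static struct line cache[NSETS][WAYS];    /* zero-initialized: all invalid */
    unsigned addr = 0x4A80;
    unsigned set  = (addr / LINE_BYTES) % NSETS;

    cache[set][2].valid = 1;                         /* preload way 2 of that set */
    cache[set][2].tag   = addr / (LINE_BYTES * NSETS);

    printf("addr 0x%X -> way %d of set %u\n", addr, lookup(cache, addr), set);
    return 0;
}
```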
Cache Evolution

Problem: External memory slower than the system bus.
Solution: Add external cache using faster memory technology.
First appeared: 386

Problem: Increased processor speed results in the external bus becoming a bottleneck for cache access.
Solution: Move the external cache on-chip, operating at the same speed as the processor.
First appeared: 486

Problem: The internal cache is rather small, due to limited space on the chip.
Solution: Add external L2 cache using faster technology than main memory.
First appeared: 486
Cache Evolution

Problem: Increased processor speed results in the external bus becoming a bottleneck for L2 cache access.
Solution: Create a separate back-side bus (BSB) that runs at a higher speed than the main (front-side) external bus; the BSB is dedicated to the L2 cache.
First appeared: Pentium Pro
Solution: Move the L2 cache onto the processor chip.
First appeared: Pentium II

Problem: Some applications deal with massive databases and must have rapid access to large amounts of data; the on-chip caches are too small.
Solution: Add external L3 cache.
First appeared: Pentium III
Solution: Move the L3 cache on-chip.
First appeared: Pentium 4
Cache Sizes of Some Processors

Processor       | Type            | Year of Introduction | L1 cache      | L2 cache       | L3 cache
IBM 360/85      | Mainframe       | 1968                 | 16 to 32 KB   | —              | —
PDP-11/70       | Minicomputer    | 1975                 | 1 KB          | —              | —
VAX 11/780      | Minicomputer    | 1978                 | 16 KB         | —              | —
IBM 3033        | Mainframe       | 1978                 | 64 KB         | —              | —
IBM 3090        | Mainframe       | 1985                 | 128 to 256 KB | —              | —
Intel 80486     | PC              | 1989                 | 8 KB          | —              | —
Pentium         | PC              | 1993                 | 8 KB/8 KB     | 256 to 512 KB  | —
PowerPC 601     | PC              | 1993                 | 32 KB         | —              | —
PowerPC 620     | PC              | 1996                 | 32 KB/32 KB   | —              | —
PowerPC G4      | PC/server       | 1999                 | 32 KB/32 KB   | 256 KB to 1 MB | 2 MB
IBM S/390 G4    | Mainframe       | 1997                 | 32 KB         | 256 KB         | 2 MB
IBM S/390 G6    | Mainframe       | 1999                 | 256 KB        | 8 MB           | —
Pentium 4       | PC/server       | 2000                 | 8 KB/8 KB     | 256 KB         | —
IBM SP          | High-end server | 2000                 | 64 KB/32 KB   | 8 MB           | —
CRAY MTA        | Supercomputer   | 2000                 | 8 KB          | 2 MB           | —
Itanium         | PC/server       | 2001                 | 16 KB/16 KB   | 96 KB          | 4 MB
SGI Origin 2001 | High-end server | 2001                 | 32 KB/32 KB   | 4 MB           | —
Itanium 2       | PC/server       | 2002                 | 32 KB         | 256 KB         | 6 MB
IBM POWER5      | High-end server | 2003                 | 64 KB         | 1.9 MB         | 36 MB
CRAY XD-1       | Supercomputer   | 2004                 | 64 KB/64 KB   | 1 MB           | —