Computer Organization & Architecture: Cache Memory
Chapter 4
Cache Memory
Characteristics of Computer Memory
• Location
• Capacity
• Unit of transfer
• Access method
• Performance
• Physical type
• Physical characteristics
• Organisation
Location of Memory
• CPU
—Registers
• Internal
—Cache, Main Memory
• External
—Accessible via I/O module
—Hard disk, Optical disk
Memory Hierarchy - Diagram
Memory Hierarchy List
• Registers
• L1 Cache
• L2 Cache
• Main memory
• Virtual memory (OS)
• Disk
• Optical
• Tape
Memory Capacity
• Word size
—The natural unit of organisation
—Number of bits used to represent an integer; usually also related to the instruction length
• Number of words
—or Bytes
• Word length = 8, 16, 32 bits
Unit of Transfer
• Internal
—Unit of transfer = no. of lines in data bus
—32, 64, 128, 256 bits
• External
—Usually a block which is much larger than a
word
• Addressable unit
—Smallest location which can be uniquely
addressed
—Word or block
Access Methods (1)
• Sequential
—Start at the beginning and read through in
order
—Access time depends on location of data and
previous location
—e.g. tape
• Direct
—Individual blocks have unique addresses
—Access is by jumping to vicinity plus sequential
search
—Access time depends on location and previous
location
—e.g. disk
Access Methods (2)
• Random
—Individual addresses identify locations exactly
—Access time is independent of location or
previous access
—e.g. RAM
• Associative
—Data is located by a comparison with contents
of a portion of the store
—Access time is independent of location or
previous access
—Word retrieved based on a portion of its
contents rather than its address
—e.g. cache
Performance Units
• Access time (latency)
—Time between presenting the address and
getting the valid data
• Memory Cycle time
—Time may be required for the memory to
“recover” before next access
—Cycle time = access time + recovery time
• Transfer Rate
—Rate at which data can be moved
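As a rough illustration of how these quantities combine (a minimal sketch in C; the timings and rate below are assumed values, not figures from the slides), the time to move n bytes can be estimated as access (latency) time plus n divided by the transfer rate:

#include <stdio.h>

/* Hedged sketch: total transfer time = access time + n / transfer rate.
   All numbers are illustrative assumptions. */
int main(void)
{
    double access_time_s = 50e-9;    /* 50 ns latency (assumed)        */
    double rate_bytes_s  = 1e9;      /* 1 GB/s transfer rate (assumed) */
    double n_bytes       = 64.0;     /* size of the transfer (assumed) */

    double total_s = access_time_s + n_bytes / rate_bytes_s;
    printf("total transfer time: %.0f ns\n", total_s * 1e9);
    return 0;
}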
Physical Types
• Semiconductor
—RAM
• Magnetic
—Disk & Tape
• Optical
—CD & DVD
• Others
Physical Characteristics
• Decay (e.g. DRAM requires periodic refresh circuitry)
• Volatility (requires power to retain data)
• Erasability (re-writeability)
• Power consumption
• Organization
—Physical arrangement of bits in word
The Bottom Line
• How much?
—Memory Capacity
• How fast?
—Transfer Time
• How expensive?
—Monetary cost
So you want fast?
• It is possible to build a computer which
uses only static RAM
• This would be very fast
• But it would also be extremely expensive, which is why a small, fast
cache is paired with larger, slower main memory
Cache/Main Memory Structure
Cache operation – overview
• CPU requests contents of memory location
• Check cache for this data
• If present, get from cache (fast)
• If not present, read required block from
main memory to cache
• Then deliver from cache to CPU
• Cache includes tags to identify which
block of main memory is in each cache
slot
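A minimal sketch of this read flow in C (the structure, field names, and direct-mapped placement are assumptions for illustration, not the layout of any particular cache; mapping functions are covered later):

#include <stdint.h>
#include <string.h>

#define NUM_LINES  16384   /* example: 16k cache lines */
#define BLOCK_SIZE 4       /* example: 4-byte blocks   */

struct cache_line {
    int      valid;               /* does this line hold a block?        */
    uint32_t tag;                 /* identifies which memory block it is */
    uint8_t  data[BLOCK_SIZE];    /* the cached block itself             */
};

static struct cache_line cache[NUM_LINES];

/* Read one byte: check the cache; on a miss, fetch the whole block
   from main memory into the cache, then deliver from the cache. */
uint8_t cache_read(uint32_t addr, const uint8_t *main_memory)
{
    uint32_t block  = addr / BLOCK_SIZE;
    uint32_t line   = block % NUM_LINES;    /* direct-mapped placement */
    uint32_t tag    = block / NUM_LINES;
    uint32_t offset = addr % BLOCK_SIZE;

    struct cache_line *l = &cache[line];
    if (!(l->valid && l->tag == tag)) {     /* miss: load block from memory */
        memcpy(l->data, &main_memory[block * BLOCK_SIZE], BLOCK_SIZE);
        l->tag   = tag;
        l->valid = 1;
    }
    return l->data[offset];                 /* hit path: serve from cache */
}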
Cache Read Operation - Flowchart
Typical Cache Interconnection
Cache Design Parameters
• Size
• Mapping Function
• Replacement Algorithm
• Write Policy
• Block Size
• Number of Caches
Size does matter
• Cost
—More cache is expensive
• Speed
—More cache is faster (up to a point)
—Checking cache for data takes time
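The speed benefit can be quantified with the usual two-level average access time relation (a standard formula, not stated on the slide; the example timings are assumptions):

/* Hedged sketch: average access time as a function of hit ratio.
   Hits are served from cache; misses pay the main-memory penalty. */
double effective_access_time(double hit_ratio,       /* fraction of hits    */
                             double cache_time_ns,   /* cache access time   */
                             double memory_time_ns)  /* main memory penalty */
{
    return hit_ratio * cache_time_ns
         + (1.0 - hit_ratio) * (cache_time_ns + memory_time_ns);
}
/* e.g. effective_access_time(0.95, 1.0, 50.0) = 0.95*1 + 0.05*51 = 3.5 ns */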
Comparison of Cache Sizes
Finding Cache Size on Computer
http://www.cpuid.com/downloads/cpu-z/1.57-setup-en.exe
Typical Mapping Function
• Cache of 64kByte
—Cache block of 4 bytes
—i.e. cache is 16k (2^14) lines of 4 bytes
• 16MBytes main memory
—24-bit address (2^24 = 16M)
Direct Mapping
• Each block of main memory maps to only
one cache line
—i.e. if a block is in cache, it must be in one
specific place
• Address is in two parts
• Least Significant w bits identify unique
word
• Most Significant s bits specify one memory
block
• The MSBs are split into a cache line field r
and a tag of s-r (most significant)
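For the running example (24-bit address, 4-byte blocks, 16k lines) the split is w = 2 word bits, r = 14 line bits, and s - r = 8 tag bits. A minimal sketch of the split in C (bit widths taken from the example; the sample address and macro names are my own):

#include <stdint.h>
#include <stdio.h>

#define WORD_BITS 2    /* 4-byte block -> 2 offset bits            */
#define LINE_BITS 14   /* 16k lines    -> 14 line bits             */
                       /* 24 - 14 - 2  = 8 tag bits                */

int main(void)
{
    uint32_t addr = 0x16339C;   /* example 24-bit address (arbitrary) */

    uint32_t word = addr & ((1u << WORD_BITS) - 1);
    uint32_t line = (addr >> WORD_BITS) & ((1u << LINE_BITS) - 1);
    uint32_t tag  = addr >> (WORD_BITS + LINE_BITS);

    printf("tag=%u line=%u word=%u\n", tag, line, word);
    return 0;
}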
Direct Mapping
Direct Mapping pros & cons
• Simple
• Inexpensive
• Fixed location for given block
—If a program repeatedly accesses 2 blocks that map to the same line,
cache misses are very high (this constant swapping of blocks is called
thrashing)
Associative Mapping
• A main memory block can load into any
line of cache
• Every line’s tag is examined for a match
• Cache searching gets expensive
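A minimal sketch of the lookup this implies (a sequential loop here for illustration; real hardware compares all tags in parallel; the struct and names are assumptions):

#include <stdint.h>

struct assoc_line { int valid; uint32_t tag; };

/* Fully associative lookup: the tag is the whole block number, so
   every line's tag must be compared against it. */
int find_fully_associative(const struct assoc_line *lines, int num_lines,
                           uint32_t block_number)
{
    for (int i = 0; i < num_lines; i++)
        if (lines[i].valid && lines[i].tag == block_number)
            return i;      /* hit: index of matching line */
    return -1;             /* miss */
}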
Set Associative Mapping
• Cache is divided into a number of sets
• Each set contains a number of lines
• A given block maps to any line in a given
set
—e.g. Block B can be in any line of set i
• e.g. 2 lines per set
—2 way associative mapping
—A given block can be in one of 2 lines in only
one set
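A minimal sketch of 2-way placement under these rules (set count and field handling are assumptions consistent with the earlier 16k-line example):

#include <stdint.h>

#define WAYS     2         /* 2-way set associative (example)     */
#define NUM_SETS 8192      /* e.g. 16k lines / 2 ways = 8k sets   */

struct sa_line { int valid; uint32_t tag; };

/* A block maps to exactly one set (block number mod NUM_SETS) but may
   occupy either line (way) within that set. */
int find_set_associative(struct sa_line lines[NUM_SETS][WAYS],
                         uint32_t block_number)
{
    uint32_t set = block_number % NUM_SETS;
    uint32_t tag = block_number / NUM_SETS;

    for (int way = 0; way < WAYS; way++)
        if (lines[set][way].valid && lines[set][way].tag == tag)
            return way;    /* hit in this way of the set */
    return -1;             /* miss: a victim must be chosen within the set */
}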
Cache Hit Ratio & L2 Cache Size
Cache Misses & Associativity
Replacement Algorithms (1): Direct mapping
• No choice
• Each block only maps to one line
• Replace that line
Replacement Algorithms (2)
Associative & Set Associative
• Hardware implemented algorithm (fast)
• Least Recently used (LRU)
—e.g. in 2 way set associative
—Which of the 2 blocks is LRU?
• First in first out (FIFO)
—replace block that has been in cache longest
• Least frequently used
—replace block which has fewest hits
• Others
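For 2-way set associative, LRU needs only one bit per set, which is why it is cheap to implement in hardware. A minimal sketch pairing with the 2-way layout above (names and layout are assumptions):

#include <stdint.h>

#define NUM_SETS 8192      /* matches the 2-way example above */

/* One LRU bit per set: it names the way that was used least recently,
   i.e. the victim to replace on the next miss in that set. */
static uint8_t lru_way[NUM_SETS];

void lru_touch(uint32_t set, int way_used)   /* call on every access/hit     */
{
    lru_way[set] = (uint8_t)(1 - way_used);  /* the other way is now the LRU */
}

int lru_victim(uint32_t set)                 /* call when a miss needs a line */
{
    return lru_way[set];
}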
Write Policy
• Must not overwrite a cache block unless
main memory is up to date
• Multiple CPUs may have individual caches
• I/O may address main memory directly
Write Through Policy
• All writes go to main memory as well as
cache
• Multiple CPUs can monitor main memory
traffic to keep local cache up to date
• Lots of traffic
• Slows down writes
Write Back Policy
• Updates are initially made in cache only
• Update bit for cache slot is set when
update occurs
• If block is to be replaced, write to main
memory only if update bit is set
• I/O must access main memory through
cache
• 15% of memory references are writes
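A minimal sketch of the write-back bookkeeping (field names are assumptions; a write-through cache would instead forward every store to main memory immediately):

#include <stdint.h>
#include <string.h>

#define BLOCK_SIZE 4

struct wb_line {
    int      valid;
    int      dirty;               /* the "update bit" from the slide      */
    uint32_t block_number;        /* which memory block this line holds   */
    uint8_t  data[BLOCK_SIZE];
};

/* Write back: a store only updates the cache and sets the dirty bit. */
void wb_write_byte(struct wb_line *l, uint32_t offset, uint8_t value)
{
    l->data[offset] = value;
    l->dirty = 1;
}

/* On eviction, the block is copied to main memory only if it was updated. */
void wb_evict(struct wb_line *l, uint8_t *main_memory)
{
    if (l->valid && l->dirty)
        memcpy(&main_memory[l->block_number * BLOCK_SIZE], l->data, BLOCK_SIZE);
    l->valid = 0;
    l->dirty = 0;
}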
Intel Cache Evolution
Pentium 4 Cache
• Pentium (all versions) – two on chip L1 caches
— Data & instructions
• Pentium III – L3 cache added off chip
• Pentium 4
— L1 caches
– 8k bytes
– four way set associative
— L2 cache
– Feeding both L1 caches
– 256k
– 8 way set associative
— L3 cache on chip
Pentium 4 Block Diagram
Pentium 4 Core Processor
• Fetch/Decode Unit
— Fetches instructions from L2 cache
— Decode into micro-ops
— Store micro-ops in L1 cache
• Out of order execution logic
— Schedules micro-ops
— Based on data dependence and resources
— May speculatively execute
• Execution units
— Execute micro-ops
— Data from L1 cache
— Results in registers
• Memory subsystem
— L2 cache and systems bus
Intel Core i7 Block Diagram
IBM PowerPC Cache Organization
• 601 – single 32kB, 8 way set associative
• 603 – 16kB (2 x 8kB), two way set
associative
• 604 – 32kB
• 620 – 64kB
• G3 & G4
—64kB L1 cache
– 8 way set associative
—256kB, 512kB or 1MB L2 cache
– two way set associative
• G5
—32kB instruction cache
—64kB data cache
Questions ???
Virtual Memory
• An Operating System construct