Microprocessor System Design: Error Correcting Codes, Principle of Locality, Cache Architecture
Zeshan Chishti
Electrical and Computer Engineering Dept
Maseeh College of Engineering and Computer Science
ECE 485/585
Error Correction
Motivation
Failures/time proportional to number of bits
As DRAM cell sizes & voltages shrink, cells become more vulnerable
Why was/is this not an issue on your PC?
Failure rate was low
Few consumers would know what to do anyway
DRAM capacity so large that an error is unlikely to land in data actually being used
Servers (almost always) correct memory system errors (i.e., they use ECC)
Sources
Alpha particles (from radioactive impurities introduced in IC manufacturing/packaging)
Cosmic rays (vary with altitude)
Bigger problem in Denver and on space-bound electronics
Noise
Need to handle failures throughout memory subsystem
DRAM chips, module, bus
DRAM chips don’t incorporate ECC
Store the ECC bits in DRAM alongside the data bits
Chipset (or integrated controller) handles ECC
Error Detection: Parity
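A minimal C sketch of the parity idea (an illustration, not taken from the slides): one check bit makes the total number of 1s even, so a single flipped bit is detected, while two flips cancel and go unnoticed.

#include <stdint.h>
#include <stdio.h>

/* Even parity over a 64-bit data word: the check bit makes the total
   number of 1s (data plus check bit) even. A single flipped bit is
   detected because the parity no longer matches; two flips cancel. */
int parity_bit(uint64_t data) {
    int ones = 0;
    for (int i = 0; i < 64; i++)
        ones += (data >> i) & 1;
    return ones & 1;            /* check bit = XOR of all data bits */
}

int main(void) {
    uint64_t word = 0xDEADBEEF12345678ULL;
    int check = parity_bit(word);

    uint64_t received = word ^ (1ULL << 13);   /* flip one bit in transit */
    printf("error detected: %s\n",
           (parity_bit(received) != check) ? "yes" : "no");
    return 0;
}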
Error Correction Codes (ECC)
Single bit error correction
requires n+1 check bits for 2^n data bits
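As an aside, this rule follows from the standard single-error-correction bound 2^c >= d + c + 1 (c check bits must name any of the d + c possible error positions, plus "no error"). A small C sketch, not from the slides, that computes the minimum check bits for common data widths:

#include <stdio.h>

/* Minimum check bits c for single-error correction over d data bits:
   the c-bit syndrome must distinguish "no error" plus one position
   among the d + c stored bits, i.e. 2^c >= d + c + 1. */
int sec_check_bits(int d) {
    int c = 1;
    while ((1 << c) < d + c + 1)
        c++;
    return c;
}

int main(void) {
    int widths[] = { 8, 16, 32, 64, 128 };
    for (int i = 0; i < 5; i++)
        printf("%3d data bits -> %d check bits (SEC), %d (SECDED)\n",
               widths[i], sec_check_bits(widths[i]),
               sec_check_bits(widths[i]) + 1);
    return 0;
}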
Error Correction Codes (ECC)
[Worked example: each check bit is computed as the even parity (XOR) of a subset of the data bits, e.g. 1^0^0^0 = 1.]
Error Correction Codes (ECC)
An example: decoding and verifying
[Worked example: a codeword is sent, one bit is flipped in transit, and recomputing the check bits over the received word (e.g. 1^0^0^0 = 1) yields a nonzero syndrome that identifies the position of the flipped bit.]
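A hedged C sketch of the same decode-and-verify flow, using the classic Hamming(7,4) code (4 data bits, 3 check bits); the data pattern and the flipped bit position below are chosen for illustration and are not taken from the slide's example.

#include <stdio.h>

/* Hamming(7,4): 4 data bits, 3 check bits. Check bit p_k sits at
   codeword position k (k = 1, 2, 4) and covers every position whose
   index has bit k set. The recomputed checks (the syndrome) spell out
   the position of a single flipped bit, or 0 if there is none. */

unsigned encode(unsigned d /* 4 data bits */) {
    unsigned c = 0;
    /* place data bits at codeword positions 3, 5, 6, 7 (1-indexed) */
    c |= ((d >> 0) & 1) << 3;
    c |= ((d >> 1) & 1) << 5;
    c |= ((d >> 2) & 1) << 6;
    c |= ((d >> 3) & 1) << 7;
    /* each check bit is the parity of the positions it covers */
    unsigned p1 = ((c >> 3) ^ (c >> 5) ^ (c >> 7)) & 1;
    unsigned p2 = ((c >> 3) ^ (c >> 6) ^ (c >> 7)) & 1;
    unsigned p4 = ((c >> 5) ^ (c >> 6) ^ (c >> 7)) & 1;
    return c | (p1 << 1) | (p2 << 2) | (p4 << 4);
}

unsigned syndrome(unsigned cw) {
    unsigned s1 = ((cw >> 1) ^ (cw >> 3) ^ (cw >> 5) ^ (cw >> 7)) & 1;
    unsigned s2 = ((cw >> 2) ^ (cw >> 3) ^ (cw >> 6) ^ (cw >> 7)) & 1;
    unsigned s4 = ((cw >> 4) ^ (cw >> 5) ^ (cw >> 6) ^ (cw >> 7)) & 1;
    return s1 | (s2 << 1) | (s4 << 2);   /* = position of the bad bit */
}

int main(void) {
    unsigned sent = encode(0xB);          /* data = 1011 (arbitrary)     */
    unsigned recv = sent ^ (1u << 6);     /* flip codeword bit 6 in transit */
    unsigned s = syndrome(recv);
    printf("syndrome = %u\n", s);         /* points at position 6        */
    if (s) recv ^= 1u << s;               /* correct the flipped bit     */
    printf("corrected %s\n", recv == sent ? "ok" : "FAILED");
    return 0;
}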
Error Correction Codes (ECC)
Add another check bit – SECDED
Single Error Correction Double Error Detection
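A small C sketch of the standard SECDED decision logic (an illustration, not the slides' circuit): the SEC syndrome plus an overall parity bit distinguish no error, a correctable single error, an error in the extra parity bit itself, and an uncorrectable double error.

#include <stdio.h>

/* SECDED decision: an overall parity bit is added on top of the SEC code.
   'syndrome' is the recomputed Hamming syndrome and 'overall_parity' is
   the recomputed parity over the entire codeword. */
typedef enum { NO_ERROR, CORRECTABLE, PARITY_BIT_ERROR, DOUBLE_ERROR } ecc_status;

ecc_status classify(unsigned syndrome, unsigned overall_parity) {
    if (syndrome == 0 && overall_parity == 0)
        return NO_ERROR;            /* nothing flipped                      */
    if (syndrome != 0 && overall_parity != 0)
        return CORRECTABLE;         /* single data/check bit: fix it        */
    if (syndrome == 0 && overall_parity != 0)
        return PARITY_BIT_ERROR;    /* the extra parity bit itself flipped  */
    return DOUBLE_ERROR;            /* syndrome != 0, parity even: 2 flips  */
}

int main(void) {
    printf("%d %d %d %d\n",
           classify(0, 0), classify(6, 1), classify(0, 1), classify(6, 0));
    return 0;
}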
Error Correction Codes (ECC)
Cache Topics
Cache Basics
Memory vs. Processor Performance
The Memory Hierarchy
Registers, SRAM, DRAM, Disk
Spatial and Temporal Locality
Cache Hits and Misses
Cache Organization
Direct Mapped Caches
Two-Way, Four-Way Caches
Fully Associative (N-Way) Caches
Sector-mapped caches
Cache Line Replacement Algorithms
Cache Performance and Performance improvements
Cache Coherence
Intel Cache Evolution
Multicore Caches
Cache Design Issues
The Problem: Memory Wall
[Figure: processor vs. memory performance over time, plotted on a log scale. CPU performance grows far faster than memory (DRAM) performance, opening the "memory wall" gap. From Hennessy & Patterson, Computer Architecture: A Quantitative Approach (4th edition).]
Memory System Design Tradeoffs
SRAM
Complex basic cell circuit => fast access, but high cost per bit
DRAM
Simpler basic cell circuit => less cost per bit, but slower than SRAMs
Flash memory and Magnetic disks
DRAMs provide more storage than SRAM but less than what is necessary
Disks provide a large amount of storage, but are much slower than DRAMs
No single memory technology can provide both large capacity and fast speed at an affordable cost
A Solution: Memory Hierarchy
[Figure: the memory hierarchy, from Hennessy & Patterson, Computer Architecture: A Quantitative Approach (4th edition): processor registers, datapath, and control at the top, then on-chip cache (SRAM), second- and third-level caches (SRAM), main memory (DRAM), secondary storage (disk), and tertiary storage (tape).]
Intel Pentium 4 3.2 GHz Server
How is the Hierarchy Managed?
Registers <-> Memory: Compiler, Programmer
Cache <-> Memory: Hardware
Memory <-> Disk: Operating System (Virtual Memory: Paging), Programmer (File System)
Principle of Locality
Analysis of programs indicates that many instructions in localized areas of a program are executed repeatedly during some period of time, while other instructions are executed relatively less frequently
These frequently executed instructions may be the ones in a loop, nested loops, or a few procedures calling each other repeatedly
This is called "locality of reference"
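For a concrete picture of both kinds of locality (an illustration, not from the slides), consider summing a 2-D array in C: the row-major loop reuses the accumulator and loop variables every iteration (temporal locality) and walks consecutive addresses (spatial locality), while the column-major version strides across cache lines.

#include <stddef.h>

#define N 1024

/* Temporal locality: 'sum' and the loop variables are touched on every
   iteration. Spatial locality: a[i][j] walks consecutive addresses, so
   one 64-byte line fetch serves the next several elements. */
long sum_row_major(int a[N][N]) {
    long sum = 0;
    for (size_t i = 0; i < N; i++)
        for (size_t j = 0; j < N; j++)
            sum += a[i][j];          /* stride-1: good spatial locality */
    return sum;
}

/* Same arithmetic, but column-major traversal strides N*sizeof(int)
   bytes between accesses, touching a new cache line almost every time. */
long sum_col_major(int a[N][N]) {
    long sum = 0;
    for (size_t j = 0; j < N; j++)
        for (size_t i = 0; i < N; i++)
            sum += a[i][j];          /* large stride: poor spatial locality */
    return sum;
}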
Use of a Cache Memory
• A cache is a small but fast SRAM inserted between the processor and main memory
• Data in a cache is organized at the granularity of cache blocks
• When the processor issues a request for a memory address, an entire block (e.g., 64 bytes) is transferred from main memory to the cache
• Later references to the same address can be serviced by the cache (temporal locality)
• References to other addresses in this block can also be serviced by the cache (spatial locality)
Higher locality => More requests serviced by the cache
Caching – Student Advising Analogy
Cache Organization
How is the Cache laid out?
The cache is made up of a number of cache lines (sometimes called blocks)
Data is hauled into the cache from memory in "chunks" (may be smaller than a line)
If the CPU requests 4 bytes of data, the cache gets the entire line (32/64/128 bytes)
Spatial locality says you're likely to need that data anyway
Incur the cost only once rather than each time the CPU needs a piece of data
Ex: The Pentium 4 Xeon's Level 1 Data Cache:
Contains 8K bytes
The cache lines are each 64 bytes
This gives 8192 bytes / 64 bytes = 128 cache lines
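A small C sketch (assuming a 32-bit address; the sizes match the 8 KB / 64 B example above) that derives the line count and the offset, index, and tag field widths:

#include <stdio.h>

/* Direct-mapped cache geometry, assuming a 32-bit physical address.
   Cache size and line size here match the 8 KB / 64 B example above. */
int main(void) {
    const unsigned cache_bytes = 8 * 1024;
    const unsigned line_bytes  = 64;
    const unsigned addr_bits   = 32;

    unsigned lines       = cache_bytes / line_bytes;          /* 128 lines */
    unsigned offset_bits = 0, index_bits = 0;
    while ((1u << offset_bits) < line_bytes) offset_bits++;   /* 6 bits    */
    while ((1u << index_bits)  < lines)      index_bits++;    /* 7 bits    */
    unsigned tag_bits = addr_bits - index_bits - offset_bits; /* 19 bits   */

    printf("lines=%u offset=%u index=%u tag=%u\n",
           lines, offset_bits, index_bits, tag_bits);
    return 0;
}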
Simple Direct Mapped Cache
[Figure: a 32-bit address whose least significant 4 bits (bits 3:0) form the index, selecting one of 16 cache lines ("sets"), each of which holds a block of data.]
Use least significant 4 bits to determine which slot to cache data in
But…2^28 different addresses could have their data cached in the same spot
Simple Direct Mapped Cache (cont’d)
[Figure: the 32-bit address is split into a tag (bits 31:4) and a 4-bit index (bits 3:0); each of the 16 sets now stores a valid bit, a tag, and the data.]
Need to store tag to be sure the data is for this address and not another
(Only need to store the address minus the index bits – 28 bits in this example)
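A hedged C sketch of the lookup implied by the figure above: 4 index bits select one of 16 sets, the remaining 28 address bits are compared against the stored tag, and the valid bit gates the hit. (A real cache would also carve out block-offset bits below the index; they are omitted here because the example indexes whole sets.)

#include <stdbool.h>
#include <stdint.h>

/* Direct-mapped lookup, following the 16-set example above:
   4-bit index (address bits 3:0 select the set), 28-bit tag. */
#define NUM_SETS   16
#define INDEX_BITS 4

struct cache_line {
    bool     valid;
    uint32_t tag;
    uint8_t  data[64];    /* assumed 64-byte line */
};

static struct cache_line cache[NUM_SETS];

/* Returns true on a hit: the selected line is valid and its stored tag
   matches the tag bits of the requested address. */
bool lookup(uint32_t addr) {
    uint32_t index = addr & (NUM_SETS - 1);        /* bits 3:0  */
    uint32_t tag   = addr >> INDEX_BITS;           /* bits 31:4 */
    struct cache_line *line = &cache[index];
    return line->valid && line->tag == tag;
}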
Cache Hits and Misses
• When the processor needs to access some data, that data may or may not be found in the cache
• If the data is found in the cache, it is called a cache hit
• Read hit:
• The processor reads data from the cache and does not need to go to memory
• Write hit:
The cache holds a replica of the contents of main memory, so both the cache and main memory (eventually) need to be updated
Cache Behavior – Reads
Read behavior
Cache Behavior - Writes
Policy decisions for all writes
Write Through
Write data to both the cache and memory
Requires a write buffer to be effective
Allows the CPU to continue w/o waiting for DRAM
Write Back
Write data to the cache only
Requires addition of a "dirty" bit in the tag/valid memory
Write back to memory occurs when:
A cache flush is performed
The line becomes a victim and is cast out
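A toy C sketch contrasting the two policies on a write hit (the names and the single-line "cache" are illustrative assumptions, not the slides' design):

#include <stdbool.h>
#include <stdint.h>

/* Toy single-line cache and flat memory, used only to contrast the two
   write-hit policies. */
#define LINE_BYTES 64
static uint8_t memory[1 << 16];        /* toy 64 KB "DRAM" */

struct line { bool valid, dirty; uint32_t tag; uint8_t data[LINE_BYTES]; };

/* Write-through: every store updates the cache and main memory (a real
   design posts the memory update to a write buffer so the CPU can continue). */
void store_write_through(struct line *l, uint32_t addr, uint8_t byte) {
    l->data[addr % LINE_BYTES] = byte;
    memory[addr] = byte;
}

/* Write-back: the store updates only the cache and sets the dirty bit;
   memory is updated later, on a flush or when the line is cast out. */
void store_write_back(struct line *l, uint32_t addr, uint8_t byte) {
    l->data[addr % LINE_BYTES] = byte;
    l->dirty = true;
}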
Write Buffer for Write-Through
A Write Buffer is needed between the cache and memory if using a Write Through policy, to avoid having the processor wait
The processor writes data into the cache and the write buffer
The memory controller writes the contents of the buffer to memory
The Write Buffer is just a FIFO
Intel: "posted write buffer" (PWB)
Small depth
Store frequency << 1/DRAM write cycle
[Figure: the processor writes into both the cache and the write buffer; the write buffer drains to DRAM.]
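A minimal C sketch of a write buffer as a shallow FIFO (the 4-entry depth and function names are assumptions for illustration): the CPU posts stores, and the memory controller drains them to DRAM when it is idle.

#include <stdbool.h>
#include <stdint.h>

/* A write buffer is a small FIFO of pending (address, data) stores sitting
   between the cache and DRAM. The depth can stay small because the store
   rate is much lower than one store per DRAM write cycle. */
#define WB_DEPTH 4

struct wb_entry { uint32_t addr; uint64_t data; };

static struct wb_entry wb[WB_DEPTH];
static unsigned head, tail, count;

/* CPU side: post a store; returns false (stall) if the buffer is full. */
bool wb_post(uint32_t addr, uint64_t data) {
    if (count == WB_DEPTH)
        return false;
    wb[tail] = (struct wb_entry){ addr, data };
    tail = (tail + 1) % WB_DEPTH;
    count++;
    return true;
}

/* Memory-controller side: drain the oldest entry to DRAM when idle. */
bool wb_drain(struct wb_entry *out) {
    if (count == 0)
        return false;
    *out = wb[head];
    head = (head + 1) % WB_DEPTH;
    count--;
    return true;
}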
Cache Behavior – Writes
[Figure: write behavior. On a write hit, the bytes to be written are merged into the selected cache line, whose entry carries dirty (D) and valid (V) bits and a tag.]
Casting Out a Victim
Depends upon policies
Write Through
Data in the cache isn't the only current copy (memory is up to date)
Just overwrite the victim cache line with the new cache line (change the tag bits)
Write Back
Must check the dirty bit to see if the victim cache line is modified
If so, must write the victim cache line back to memory
Can lead to interesting behavior
A CPU "read" can cause a memory "write" followed by a "read"
Write back the dirty cache line (victim)
Read the new cache line
A CPU "write" can cause a memory "write" followed by a "read"
Write back the dirty cache line (victim)
Read the new cache line into which data will be written in the cache
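A hedged C sketch of this cast-out-then-fill sequence for a write-back cache; the DRAM helper functions are hypothetical stand-ins, not a real controller interface.

#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Miss handling with a write-back cache: if the victim line is dirty, it
   must be written to memory before the new line is read in, which is why a
   single CPU read or write can trigger a memory write followed by a read. */
#define LINE_BYTES 64

struct line { bool valid, dirty; uint32_t tag; uint8_t data[LINE_BYTES]; };

static void write_line_to_dram(uint32_t tag, const uint8_t *data)
{ (void)tag; (void)data; /* stand-in for a real DRAM write */ }

static void read_line_from_dram(uint32_t tag, uint8_t *data)
{ (void)tag; memset(data, 0, LINE_BYTES); /* stand-in for a real DRAM read */ }

void fill_on_miss(struct line *victim, uint32_t new_tag) {
    if (victim->valid && victim->dirty)                   /* cast out the victim */
        write_line_to_dram(victim->tag, victim->data);    /* memory "write"      */
    read_line_from_dram(new_tag, victim->data);           /* memory "read"       */
    victim->tag   = new_tag;
    victim->valid = true;
    victim->dirty = false;   /* a subsequent CPU store will set it again */
}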