The document summarizes techniques for improving main memory performance. It discusses using wider memory buses, interleaving memory across multiple banks, avoiding bank conflicts through software and hardware methods, and DRAM-specific interleaving using RAS and CAS signals. It also briefly covers virtual memory support, interactions between instruction-level parallelism and caching, and cache consistency issues in multi-processor systems.

LECTURE - 22

Topics for Today

- Main memory

Scribe for today?
Main Memory

- DRAM versus SRAM
  - DRAM is cheaper, but slower
- Reducing the number of pins
  - At the cost of some performance
  - Address = RAS + CAS (row address strobe + column address strobe)
- Performance metrics: latency and bandwidth
  - #cycles to send the address
  - #cycles to access a word
  - #cycles to send the data word
Main Memory Performance: One-Word-Wide Memory

Organization: CPU connected to the cache over a one-word bus; cache connected to main memory over a one-word bus.

Suppose:
- #cycles to send address = 4
- #cycles to access 1 word = 24
- #cycles to send data word = 4
- Cache line = 4 words

What is the miss penalty?
4 x (4 + 24 + 4) = 128 cycles
Technique-1: Wider Memory

Organization: CPU connected to the cache over a one-word bus, through a mux that selects the requested word; cache connected to main memory over a two-word bus.

What is the miss penalty now?
2 x (4 + 24 + 4) = 64 cycles

Disadvantages?
- Larger bus width (cost)
- Unit of memory addition is larger
- Read-modify-write for single-byte write, if error correction present
Technique-2: Interleaved-Memory
CPU What is the miss penalty
Bus (1 word) now?

Cache 4 + 24 + 4x4 =44 cycles


Notion of interleaving
Bus (1 word)
factor
Can the interleaving
factor be anything?

Bank-1 Bank-2 Bank-3 Bank-4


Technique-3: Independent Memory Banks

- Multiple independent accesses
  - Separate address and data lines
- Needed for a miss-under-miss scheme
- Also, parallel I/O with the CPU
- Each independent bank may itself be interleaved
  - Super-bank number and bank number
Memory-Bank Conflicts

- Code can often be such that memory-bank conflicts occur
  - No use of the independent memory-bank organization under such conflicts
- Example:

    int x[2][512];
    for (j = 0; j < 512; j++) {
        for (i = 0; i < 2; i++) {
            x[i][j]++;
        }
    }
Technique-4: Avoiding Memory-Bank Conflicts

- Software solutions:
  - Loop interchange (works for this example)
  - Expand the array size so that it is not a power of two
- Hardware solution:
  - Use a prime number of banks
    Bank number = Addr mod #banks
    Addr within bank = Addr / #banks
    Or: Addr within bank = Addr mod #words-within-bank,
    if #words within bank and #banks are co-prime
Technique-5: DRAM-Specific Interleaving

- DRAM has RAS and CAS
  - Usually RAS and CAS are given one after another
  - The same RAS can be used to read multiple columns
  - DRAMs come with separate signals to allow such access

Now, various remarks before finishing up with memory-hierarchy design.
Virtual Memory and Protection

- The OS requires support in terms of:
  - Two modes (at least) of execution: user, supervisor/kernel
  - Some CPU state which is readable but not writable in user mode
    - TLB
    - User/supervisor mode bit
  - Mechanisms to switch between the modes
    - System calls
ILP and Caching

- Superscalar execution:
  - Cache must have enough ports to match the peak bandwidth
  - Hit-under-miss, miss-under-miss required
- Speculative execution:
  - Suppress exceptions on speculative instructions
  - Don't stall the cache on a speculative instruction's cache miss
ILP vs. Caching: Compiler Choices

Version 1 (j outer; loops start at j = 1 so that x[i][j-1] stays in bounds):

    int x[32][512];
    for (j = 1; j < 512; j++) {
        for (i = 0; i < 32; i++) {
            x[i][j] = 2*x[i][j-1];
        }
    }

Version 2 (i outer):

    int x[32][512];
    for (i = 0; i < 32; i++) {
        for (j = 1; j < 512; j++) {
            x[i][j] = 2*x[i][j-1];
        }
    }

Version 1's inner loop has independent iterations (good for ILP) but strides 512 words between consecutive accesses (poor cache locality); Version 2 has unit-stride accesses (good locality) but a serial dependence chain along j (poor ILP).
Caches and Consistency

- I/O through the cache?
  - Interferes with the CPU, may throw out useful blocks
- I/O through main memory
  - Write-through ==> no problem for CPU output
  - What about input?
    - Approach-1: OS marks the memory block as non-cacheable
    - Approach-2: OS flushes the cache block after input
    - Approach-3: h/w checks if the block is present in the cache, invalidates it if cached (parallel set of tags for performance)
- Multi-processors want the same data in many caches: the cache-coherence problem