Tutorial 3

This document contains 6 tutorial problems about cache memory optimizations. The problems cover topics like calculating block size based on a memory address, determining when reducing miss rate or increasing hit latency improves average memory access time, calculating misses per 1000 instructions and memory stall cycles per miss, determining speedup from a perfect cache, and identifying which cache sets would be filled when executing a sequence of instructions and data accesses.

Uploaded by

Rama Devi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

Tutorial 3

Uploaded by

Rama Devi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

Multicore Computer Architecture - Storage and Interconnects

Tutorial 3
Cache Memory Optimizations

Dr. John Jose

Assistant Professor
Department of Computer Science & Engineering
Indian Institute of Technology Guwahati, Assam.
Tutorial Problem-1
 The address of a word in a byte addressable 16MB physical memory is
0xAA0C2A. This word upon bringing to the cache is mapped to set 48.
What is the block size of the cache memory ?
 A A 0 C 2 A
 1010 1010 0000 1100 0010 1010
 1010 1010 0000 1100 0010 1010 offset  64bytes
Tutorial Problem-2
 A cache has access time (hit latency)=10 ns and miss rate is 5%. An
optimization was made to reduce the miss rate to 3 % but the hit latency
was increased to 15 ns. Under what condition this change will result in
better performance (Lower avg. memory access time)?
 AMAT 1 = HT1 + MR1 x MP HT1 = 10ns; MR1=0.05
 AMAT 2 = HT2 + MR2 x MP HT2 = 15ns; MR1=0.03
 AMAT2<AMAT1
Tutorial Problem-3
 A cache has hit rate of 90%, 64 byte block, cache hit latency of 5ns. Main
memory takes 150 ns to return first word (32 bits) of a block and 10 ns for
each subsequent word.
(a) What is the miss latency of the cache?
(b) If doubling the cache block size reduces the miss rate to 3%, does it
reduces average memory access time?
Tutorial Problem-3
 A cache has hit rate of 90%, 64 byte block, cache hit latency of 5ns. Main
memory takes 150 ns to return first word (32 bits) of a block and 10 ns for
each subsequent word.
(a) What is the miss latency of the cache?
(b) If doubling the cache block size reduces the miss rate to 3%, does it
reduces average memory access time?
Tutorial Problem-4
 For a cache, that has a miss rate of 3% and miss penalty of 500 cycles. In
a program 50% of the instructions are memory accesses (load-store)
 (a) Find the misses per 1000 instruction (MPKI)
 (b) Find memory stall cycles per miss
 Miss rate: miss/mem access = (miss / instruction)/(mem acc /instruction)
MR = MPI/MAPI MPI =MR x MAPI MAPI=1.5
Tutorial Problem-5
 Consider a cache system with miss rate of an I-cache is 2% and that of D-
cache is 4%. The processor CPI=2 without memory stalls and miss penalty
=100 cycles for all misses. Determine how much faster the processor
would run with a perfect cache that never missed. Assume frequency of all
loads and store is 36 %.
 Actual CPI real= Base CPI + stall CPI CPI ideal = Base CPI=2
 Stall CPI = (% use of IC x stall of IC)+(% use of DC x stall of DC)
Tutorial Problem-5
 miss penalty =100 cycles for all misses. Assume frequency of all loads and
store is 36 %.
 Actual CPI real= Base CPI + stall CPI CPI ideal = Base CPI=2
 Stall CPI = (% use of IC x stall of IC)+(% use of DC x stall of DC)
Tutorial Problem-6
 Consider a 32 bit processor with 16KB direct mapped L1-cache that uses
a block size of 4 words. It has an L2-cache of 256 KB with 4-way
associativity and block size of 8 words. The system uses a byte
addressable 256 MB DRAM system. Upon running a program, 16
consecutive fixed length instructions (each instruction is one word)
starting at main memory address 0x 8226620 are executed. These
instructions operate on an array A of 8 words, with starting address 0x
42AF5F8 Assuming caches are initially empty; indicate the non empty
sets on L1 cache and L2 cache after the execution of the program.
Tutorial Problem-6
 32 bit processor: 1 word  4 bytes: 256 MB DRAM  28 bit address
 L1 Cache: 16KB, direct mapped, block size= 4 words (16B)

 L2 Cache : 256 KB, 4-way, block size= 8 words (32B).

 Instruction 0x 8226620, 16 consecutive fixed length instructions (each

instruction is one word) Data 0x 42AF5F8 , array of 8 words.
Tutorial Problem-6
 L1 Cache: 16KB, direct mapped, block size= 4 words (16B)
 Instruction 0x 8226620, 16 consecutive fixed length instructions (each
instruction is one word) Data 0x 42AF5F8 , array of 8 words.
Tutorial Problem-6
 L2 Cache : 256 KB, 4-way, block size= 8 words (32B).
 Instruction 0x 8226620, 16 consecutive fixed length instructions (each
instruction is one word) Data 0x 42AF5F8 , array of 8 words.
Tutorial Problem-6
 Non-Empty Blocks
 L1: Sets 610, 611, 612,613 (4 words x 4 = 16 instructions)
Sets 863, 864, 865 ( 2 + 4 +2 words of data array A)

 L2: Sets 817, 818 (8 words x 2 = 16 instructions)

Sets 1967, 1968 ( 2 + 6 words of data array A)
[email protected]
http://www.iitg.ac.in/johnjose/

Cose222 HW4
No ratings yet
Cose222 HW4
5 pages
207 Assignment 6
No ratings yet
207 Assignment 6
7 pages
Computer Organization Exercise Answer7
No ratings yet
Computer Organization Exercise Answer7
7 pages
Lecture 41
No ratings yet
Lecture 41
41 pages
2010 Final Exam Solutions
0% (1)
2010 Final Exam Solutions
13 pages
Assign1 PDF
No ratings yet
Assign1 PDF
5 pages
Cache Memory
No ratings yet
Cache Memory
28 pages
Lect12 Cache
No ratings yet
Lect12 Cache
39 pages
A8 Solution 2
No ratings yet
A8 Solution 2
4 pages
Maths
No ratings yet
Maths
3 pages
Homework4 v2 Solution
No ratings yet
Homework4 v2 Solution
14 pages
Solution of CSE 240A Assignemnt 3
No ratings yet
Solution of CSE 240A Assignemnt 3
5 pages
Test 6 PracticeQuestion Cachememory 1
No ratings yet
Test 6 PracticeQuestion Cachememory 1
21 pages
Review Problems For Exam 1: MIPS (Instruction Count) / (Execution Time X 10
No ratings yet
Review Problems For Exam 1: MIPS (Instruction Count) / (Execution Time X 10
6 pages
5 1
No ratings yet
5 1
39 pages
School of Electronics Engineering (Sense) : Class Number: VL2021220101854 Semester
No ratings yet
School of Electronics Engineering (Sense) : Class Number: VL2021220101854 Semester
4 pages
Cmsc132part1 3rdexam
No ratings yet
Cmsc132part1 3rdexam
2 pages
15IF11 Multicore E PDF
No ratings yet
15IF11 Multicore E PDF
14 pages
10-cacheperf
No ratings yet
10-cacheperf
24 pages
PDF
No ratings yet
PDF
6 pages
Computer Org and Arch: R.Magesh
No ratings yet
Computer Org and Arch: R.Magesh
48 pages
Tutorial 7cache
No ratings yet
Tutorial 7cache
2 pages
DigitalLogic ComputerOrganization L22 CachesP3 Handout
No ratings yet
DigitalLogic ComputerOrganization L22 CachesP3 Handout
52 pages
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
No ratings yet
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
18 pages
Cache TLB
100% (1)
Cache TLB
15 pages
Cache Performance Average Memory Access Time
No ratings yet
Cache Performance Average Memory Access Time
23 pages
CSE 332 L 15 Complete - 26th Sep 2020
No ratings yet
CSE 332 L 15 Complete - 26th Sep 2020
16 pages
Lec 23
No ratings yet
Lec 23
13 pages
Assignment-3
No ratings yet
Assignment-3
4 pages
Test 6 PracticeQuestion Cachememory 1 Updated
No ratings yet
Test 6 PracticeQuestion Cachememory 1 Updated
22 pages
COA Final 19
No ratings yet
COA Final 19
8 pages
BaiTap Chuong4 PDF
No ratings yet
BaiTap Chuong4 PDF
8 pages
Cau 6 Cache
No ratings yet
Cau 6 Cache
25 pages
CA11_2023S1_new
No ratings yet
CA11_2023S1_new
26 pages
HW6 Spring2022 Solution 2
No ratings yet
HW6 Spring2022 Solution 2
10 pages
Cache Memory
No ratings yet
Cache Memory
10 pages
ARM hw5
No ratings yet
ARM hw5
5 pages
COSS MidSem 2020.07.05 MakeUp With Key COPYM06Tq# Name-Rana
No ratings yet
COSS MidSem 2020.07.05 MakeUp With Key COPYM06Tq# Name-Rana
5 pages
Ca Mod 2
No ratings yet
Ca Mod 2
40 pages
Tutorial - 1
No ratings yet
Tutorial - 1
2 pages
IT3030E CA Chap6 Memory
No ratings yet
IT3030E CA Chap6 Memory
65 pages
Final Exam - Fall 2008: COE 308 - Computer Architecture
No ratings yet
Final Exam - Fall 2008: COE 308 - Computer Architecture
8 pages
Advanced Architecture Memory
No ratings yet
Advanced Architecture Memory
13 pages
Midterm2 s2012 Sol
No ratings yet
Midterm2 s2012 Sol
5 pages
Exercise 5_with solution
No ratings yet
Exercise 5_with solution
8 pages
Week 6: Assignment Solutions
No ratings yet
Week 6: Assignment Solutions
4 pages
CPSC 312 Cache Memories: Topics
No ratings yet
CPSC 312 Cache Memories: Topics
39 pages
Solutions: 18-742 Advanced Computer Architecture
No ratings yet
Solutions: 18-742 Advanced Computer Architecture
8 pages
COATut 10
No ratings yet
COATut 10
1 page
hw4 Sol
No ratings yet
hw4 Sol
4 pages
Lecture # 1
No ratings yet
Lecture # 1
22 pages
Cache Org
No ratings yet
Cache Org
19 pages
Week8 SampleMidterm
No ratings yet
Week8 SampleMidterm
2 pages
Week8 SampleMidterm
No ratings yet
Week8 SampleMidterm
2 pages
szdbxcn
No ratings yet
szdbxcn
7 pages
CS2115 chapter-6
No ratings yet
CS2115 chapter-6
45 pages
IT3030E CA Chap6 Memory
No ratings yet
IT3030E CA Chap6 Memory
65 pages
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
From Everand
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
Rodrigo Copetti
No ratings yet
Memory Basics Explained
From Everand
Memory Basics Explained
Alisa Turing
No ratings yet
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
From Everand
Nintendo 64 Architecture: Architecture of Consoles: A Practical Analysis, #8
Rodrigo Copetti
No ratings yet
W3 A3 Detailed
No ratings yet
W3 A3 Detailed
5 pages
Gem5 Practice
No ratings yet
Gem5 Practice
3 pages
Lec 6
No ratings yet
Lec 6
18 pages
Modeling A Hands On Physical Unclonable Functions
No ratings yet
Modeling A Hands On Physical Unclonable Functions
2 pages
Use Case For Administrator:Administrator Login: Projects & Coustomers
No ratings yet
Use Case For Administrator:Administrator Login: Projects & Coustomers
16 pages
Machine-Type Communications
No ratings yet
Machine-Type Communications
30 pages
P 2 Linked List PDF
No ratings yet
P 2 Linked List PDF
16 pages
Leveraging Security Mask On 3DEXPERIENCE Platform: Best Practices
No ratings yet
Leveraging Security Mask On 3DEXPERIENCE Platform: Best Practices
38 pages
BREAKDOWN TIMELINE - Rev1
No ratings yet
BREAKDOWN TIMELINE - Rev1
12 pages
UTD-NSM-2.2-WorkshopGuide-20240726 (1) (3)
No ratings yet
UTD-NSM-2.2-WorkshopGuide-20240726 (1) (3)
84 pages
Questionnaire On Laptop: Personal Details
No ratings yet
Questionnaire On Laptop: Personal Details
6 pages
Dynamo Workflow Handout - 20422 - AR20422-Moore-AU2016
No ratings yet
Dynamo Workflow Handout - 20422 - AR20422-Moore-AU2016
74 pages
PHP - File Inclusion: The Include Function
No ratings yet
PHP - File Inclusion: The Include Function
2 pages
PenMount Device Driver Users Guide Windows V3 2
No ratings yet
PenMount Device Driver Users Guide Windows V3 2
45 pages
SolarisTM Crash Analysis Tool
No ratings yet
SolarisTM Crash Analysis Tool
43 pages
Enterprise Asset Management: Product Management, SAP AG
No ratings yet
Enterprise Asset Management: Product Management, SAP AG
71 pages
Google LLM Conversational Recs
No ratings yet
Google LLM Conversational Recs
24 pages
Efficiency Considerations and Const References
No ratings yet
Efficiency Considerations and Const References
2 pages
How To Find Bapi For Particular Transaction in SAP
No ratings yet
How To Find Bapi For Particular Transaction in SAP
2 pages
The Parts and Components of A Security Camera System
No ratings yet
The Parts and Components of A Security Camera System
6 pages
Java Script W3schools
No ratings yet
Java Script W3schools
77 pages
QuickRide Logcat
No ratings yet
QuickRide Logcat
337 pages
All-In-One Git CheatSheet ?
No ratings yet
All-In-One Git CheatSheet ?
10 pages
HHT Manual(STVF7)
No ratings yet
HHT Manual(STVF7)
24 pages
Thuy Pham Minh: CMC Global
No ratings yet
Thuy Pham Minh: CMC Global
3 pages
D027B Omron
No ratings yet
D027B Omron
141 pages
14 GMD - Cycloconverter Control Terminal
100% (1)
14 GMD - Cycloconverter Control Terminal
15 pages
ICT-10-Basic-Networking-Quarter-4-with-Key-Answer
No ratings yet
ICT-10-Basic-Networking-Quarter-4-with-Key-Answer
83 pages
Adobe AEM Self Learning Topic
No ratings yet
Adobe AEM Self Learning Topic
16 pages
Balaguruswamy Solved Question Papers 2006
No ratings yet
Balaguruswamy Solved Question Papers 2006
73 pages
TJ1400-OLT-Solutions
No ratings yet
TJ1400-OLT-Solutions
2 pages
DDDW 2 Ingles
No ratings yet
DDDW 2 Ingles
5 pages
MIS Report - Group 4
No ratings yet
MIS Report - Group 4
14 pages
Xilinx Answer 65444 Windows
No ratings yet
Xilinx Answer 65444 Windows
7 pages

Tutorial 3

Uploaded by

Tutorial 3

Uploaded by

Multicore Computer Architecture - Storage and Interconnects

Dr. John Jose

 L2 Cache : 256 KB, 4-way, block size= 8 words (32B).

 Instruction 0x 8226620, 16 consecutive fixed length instructions (each

 L2: Sets 817, 818 (8 words x 2 = 16 instructions)

You might also like