
Homework 5

(EE275: Advanced Computer Architecture)

Due Date: 11/3

Answer 1: (20 points)

The CPU has a 64 KB, 2-way set-associative data cache with 64-byte blocks and a 40-bit
address.

a) (10 pts)
Block size = 64 bytes = 2^6 bytes, therefore Offset = 6 bits
Number of sets = 64 KB / (2 ways x 64 B) = 512 = 2^9, therefore Index = 9 bits
Tag = 40 - 9 - 6 = 25 bits

40-bit address layout: | Tag (25 bits) | Index (9 bits) | Offset (6 bits) |
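As a quick check, here is a minimal Python sketch of the same bit-width arithmetic (the constants are the cache parameters given in the problem; the variable names are only illustrative):

```python
import math

CACHE_SIZE    = 64 * 1024   # 64 KB data cache
ASSOCIATIVITY = 2           # 2-way set associative
BLOCK_SIZE    = 64          # 64-byte blocks
ADDRESS_BITS  = 40          # 40-bit addresses

offset_bits = int(math.log2(BLOCK_SIZE))                  # 6
num_sets    = CACHE_SIZE // (ASSOCIATIVITY * BLOCK_SIZE)  # 512 sets
index_bits  = int(math.log2(num_sets))                    # 9
tag_bits    = ADDRESS_BITS - index_bits - offset_bits     # 25

print(offset_bits, index_bits, tag_bits)                  # 6 9 25
```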

b) (10 pts)
Tag array size
= #ways x #sets x tag bits per entry = 2 x 2^9 x 25 bits = 25,600 bits = 25 Kbits
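And the tag-array storage, continuing with the numbers from part a) (a sketch of the same arithmetic; note the result is in bits):

```python
# Tag array storage = ways x sets x tag bits per entry (values from part a).
ways, num_sets, tag_bits = 2, 512, 25
tag_array_bits = ways * num_sets * tag_bits     # 25,600 bits
print(tag_array_bits, tag_array_bits / 1024)    # 25600 bits = 25.0 Kbits
```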

Answer 2: (20 points)

a) (10 pts)
2-way set-associative cache with 4 sets

LRU method emulation

2 hits with the LRU method

b) (10 pts)
FIFO method emulation
3 hits with the FIFO method

The FIFO method performs better on this reference stream, with 3 cache hits versus 2 for LRU.
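The hit counts above were traced by hand. As a rough cross-check, here is a small Python sketch of the same kind of emulation for a 2-way, 4-set cache; the trace below is a hypothetical block-address sequence chosen only to illustrate that FIFO can beat LRU, and is not the reference stream from the assignment:

```python
from collections import OrderedDict, deque

NUM_SETS, WAYS = 4, 2

def simulate(trace, policy):
    """Count hits for a 2-way, 4-set cache under 'lru' or 'fifo' replacement."""
    sets = [OrderedDict() if policy == "lru" else deque() for _ in range(NUM_SETS)]
    hits = 0
    for block in trace:
        ways = sets[block % NUM_SETS]          # set index = block address mod number of sets
        if policy == "lru":
            if block in ways:
                hits += 1
                ways.move_to_end(block)        # refresh recency on a hit
            else:
                if len(ways) == WAYS:
                    ways.popitem(last=False)   # evict the least recently used block
                ways[block] = True
        else:                                  # FIFO
            if block in ways:
                hits += 1                      # FIFO order is NOT updated on a hit
            else:
                if len(ways) == WAYS:
                    ways.popleft()             # evict the oldest resident block
                ways.append(block)
    return hits

# Hypothetical block-address trace (all blocks map to set 0 here).
trace = [0, 4, 0, 8, 4]
print(simulate(trace, "lru"), simulate(trace, "fifo"))   # 1 2: FIFO keeps block 4 and gets the extra hit
```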

Answer 3: (24 points)

a) (3 pts)
Read hit:
LRU: 4 accesses to the data/tag/miscellaneous arrays => 4*(20+5+1) = 104 power units.
FIFO and Random: 4 accesses to the data/tag arrays => 4*(20+5) = 100 power units.

b) (3 pts)
Read miss:
LRU: 4 accesses to the data/tag/miscellaneous arrays => 4*(20+5+1) = 104 power units.
FIFO: 4 accesses to the data/tag arrays plus one access to the FIFO pointer => 4*(20+5) + 1 = 101 power units.
Random: 4 accesses to the data/tag arrays => 4*(20+5) = 100 power units.

c) (3 pts)
Read hit (split access):
LRU: 4 accesses to the tag/miscellaneous arrays plus one access to the hit way's data array => 4*(5+1) + 20 = 44 power units.
FIFO, Random: 4 accesses to the tag arrays plus one access to the hit way's data array => 4*(5) + 20 = 40 power units.

d) (3 pts)
Read miss, split access (cost of line fill ignored):
LRU: 4 accesses to the tag/miscellaneous arrays => 4*(5+1) = 24 power units.
FIFO, Random: 4 accesses to the tag arrays => 4*(5) = 20 power units.

e) (3 pts)
Read hit, split access with way-prediction hit:
LRU: one access to the tag/miscellaneous arrays plus one access to the data array => (5+1) + 20 = 26 power units.
FIFO, Random: one access to the tag array plus one access to the data array => 5 + 20 = 25 power units.

f) (3 pts)
Read hit, split access with way-prediction miss:
LRU: one access to the tag/miscellaneous arrays, plus 4 accesses to the tag/miscellaneous arrays, plus one access to the data array => (5+1) + 4*(5+1) + 20 = 50 power units.
FIFO, Random: one access to the tag array, plus 4 accesses to the tag arrays, plus one access to the data array => 5 + 4*(5) + 20 = 45 power units.

g) (3 pts)
Read miss, split access with way-prediction miss (cost of line fill ignored):
LRU: one access to the tag/miscellaneous arrays, plus 4 accesses to the tag/miscellaneous arrays => (5+1) + 4*(5+1) = 30 power units.
FIFO: one access to the tag array, plus 4 accesses to the tag arrays, plus one access to the miscellaneous component (FIFO pointer) => 5 + 4*(5) + 1 = 26 power units.
Random: one access to the tag array, plus 4 accesses to the tag arrays => 5 + 4*(5) = 25 power units.

h) (3 pts)
For every access:
P(way hit, cache hit) = 0.95
P(way miss, cache hit) = 0.02
P(way miss, cache miss) = 0.03
LRU = 0.95*26 + 0.02*50 + 0.03*30 = 26.60 power units
FIFO = 0.95*25 + 0.02*45 + 0.03*26 = 25.43 power units
Random = 0.95*25 + 0.02*45 + 0.03*25 = 25.40 power units
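A short sketch that reproduces the steady-state costs from parts e) to g) and the expected power per access, using the component costs given in the problem (data array = 20, tag array = 5, miscellaneous/replacement state = 1 power unit, 4-way cache):

```python
DATA, TAG, MISC = 20, 5, 1   # per-array access costs in power units
WAYS = 4

# Per-access cost for the cases weighted in part h):
# e = read hit with way-prediction hit, f = read hit with way-prediction miss,
# g = read miss with way-prediction miss.
cost = {
    "LRU":    {"e": (TAG + MISC) + DATA,
               "f": (TAG + MISC) + WAYS * (TAG + MISC) + DATA,
               "g": (TAG + MISC) + WAYS * (TAG + MISC)},
    "FIFO":   {"e": TAG + DATA,
               "f": TAG + WAYS * TAG + DATA,
               "g": TAG + WAYS * TAG + MISC},
    "Random": {"e": TAG + DATA,
               "f": TAG + WAYS * TAG + DATA,
               "g": TAG + WAYS * TAG},
}

prob = {"e": 0.95, "f": 0.02, "g": 0.03}   # probabilities given in part h)

for policy, c in cost.items():
    expected = sum(prob[case] * c[case] for case in prob)
    print(f"{policy}: {expected:.2f} power units per access")
# LRU: 26.60, FIFO: 25.43, Random: 25.40
```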
Answer 4: (36 points)
a) (18 pts)
The CPU reads a word at virtual address 124A5DF4, assuming the page translation is in the L2 TLB
but not in the L1 TLB, and the word is in memory but not in any cache:
Virtual address: 124A5DF4 (hex) = 0001 0010 0100 1010 0101 1101 1111 0100 (binary)
L1 cache (binary = hex)
Offset (4 bits): 0100 = 4
Index (10 bits): 01 1101 1111 = 1DF
Tag (18 bits): 0001 0010 0100 1010 01 = 4929
L1 TLB
Virtual page number (20 bits): 0001 0010 0100 1010 0101 = 124A5
Page offset (12 bits): 1101 1111 0100 = DF4
L2 TLB
Virtual page number (20 bits): 0001 0010 0100 1010 0101 = 124A5
Page offset (12 bits): 1101 1111 0100 = DF4

Physical address: 036B0DF4 (hex) = 0000 0011 0110 1011 0000 1101 1111 0100 (binary)
L2 cache
Offset (5 bits): 1 0100 = 14
Index (10 bits): 000 1101 111 = 6F
Tag (17 bits): 0000 0011 0110 1011 0 = 6D6
L3 cache
Offset (5 bits): 1 0100 = 14
Index (13 bits): 11 0000 1101 111 = 186F
Tag (14 bits): 0000 0011 0110 10 = DA

Clock Action
0 CPU→L1 cache: look up 4 bytes at tag 4929, index 1DF, offset 4 (miss)
CPU→L1 TLB: look up virtual page 124A5
3 L1 TLB (miss)
4 L1 TLB→L2 TLB: look up virtual page 124A5
13 L2 TLB (hit)
L2 TLB returns translation to physical page 036B0
Construct physical address 036B0DF4
14 L1 cache→L2 cache: look up 16 bytes at tag 6D6, index 6F, offset 14
18 L2 cache (miss)
19 L2 cache→L3 cache: look up 32 bytes at tag DA, index 186F, offset 14
33 L3 cache (miss)
34 L3 cache→Memory: look up 32 bytes with physical address 036B0DF4
133 Memory returns data for physical addresses 036B0DE0 - 036B0DFF
L3 replaces one block in set at index 186F, tag DA
L3 returns data for physical addresses 036B0DE0 - 036B0DFF
L2 replaces one block in set at index 6F, tag 6D6
L2 returns data for physical addresses 036B0DF0 - 036B0DFF,
virtual address 124A5DF0 - 124A5DFF
L1 replaces one block in set at index 1DF, tag 4929
CPU gets data for virtual address 124A5DF4, physical address 036B0DF4
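For reference, a minimal Python sketch of the field extraction used in the tables above; split_address is an illustrative helper, and the offset/index widths are the ones given for each cache level:

```python
def split_address(addr, offset_bits, index_bits):
    """Break a 32-bit address into (tag, index, offset) fields."""
    offset = addr & ((1 << offset_bits) - 1)
    index  = (addr >> offset_bits) & ((1 << index_bits) - 1)
    tag    = addr >> (offset_bits + index_bits)
    return tag, index, offset

VA, PA = 0x124A5DF4, 0x036B0DF4

print([hex(f) for f in split_address(VA, 4, 10)])   # L1 (virtual):  ['0x4929', '0x1df', '0x4']
print([hex(f) for f in split_address(PA, 5, 10)])   # L2 (physical): ['0x6d6', '0x6f', '0x14']
print([hex(f) for f in split_address(PA, 5, 13)])   # L3 (physical): ['0xda', '0x186f', '0x14']
print(hex(VA >> 12), hex(VA & 0xFFF))               # TLB: VPN 0x124a5, page offset 0xdf4
```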

b) (18 pts)
The CPU reads a word at virtual address 124A5DF4 with the same assumptions as in part a), but this
time assuming the L2 cache uses the virtual address for its index and tag instead of the physical address:
Virtual address: 124A5DF4 (hex) = 0001 0010 0100 1010 0101 1101 1111 0100 (binary)
L1 cache (binary = hex)
Offset (4 bits): 0100 = 4
Index (10 bits): 01 1101 1111 = 1DF
Tag (18 bits): 0001 0010 0100 1010 01 = 4929
L2 cache
Offset (5 bits): 1 0100 = 14
Index (10 bits): 101 1101 111 = 2EF
Tag (17 bits): 0001 0010 0100 1010 0 = 2494
L1 TLB
Virtual page number (20 bits): 0001 0010 0100 1010 0101 = 124A5
Page offset (12 bits): 1101 1111 0100 = DF4
L2 TLB
Virtual page number (20 bits): 0001 0010 0100 1010 0101 = 124A5
Page offset (12 bits): 1101 1111 0100 = DF4
Physical address: 036B0DF4 (hex) = 0000 0011 0110 1011 0000 1101 1111 0100 (binary)
L3 cache
Offset (5 bits): 1 0100 = 14
Index (13 bits): 11 0000 1101 111 = 186F
Tag (14 bits): 0000 0011 0110 10 = DA
Clock Action
0 CPU→L1 cache: look up 4 bytes at tag 4929, index 1DF, offset 4 (miss)
CPU→L1 TLB: look up virtual page 124A5
1 L1 cache→L2 cache: look up 16 bytes at tag 2494, index 2EF, offset 14
3 L1 TLB (miss)
4 L1 TLB→L2 TLB: look up virtual page 124A5
5 L2 cache (miss)
13 L2 TLB (hit)
L2 TLB returns translation to physical page 036B0
Construct physical address 036B0DF4
14 L2 cache→L3 cache: look up 32 bytes at tag DA, index 186F, offset 14
28 L3 cache (miss)
29 L3 cache→Memory: look up 32 bytes with physical address 036B0DF4
128 Memory returns data for physical addresses 036B0DE0 - 036B0DFF
L3 replaces one block in set at index 186F, tag DA
L3 returns data for physical addresses 036B0DE0 - 036B0DFF,
virtual address 124A5DE0 - 124A5DFF
L2 replaces one block in set at index 2EF, tag 2494
L2 returns data for virtual address 124A5DF0 - 124A5DFF
L1 replaces one block in set at index 1DF, tag 4929
CPU gets data for virtual address 124A5DF4, physical address 036B0DF4
Converting the L2 cache to use the virtual address saves 5 cycles in total compared to the physically
addressed L2 cache of part a), because the L2 cache lookup can proceed in parallel with the TLB lookups.
It is better to make this change.
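As a back-of-the-envelope check of the 5-cycle saving, the per-step delays below are read off the two clock tables above (they are not specified separately in the problem, so treat them as inferred values):

```python
L1_TLB_MISS = 3    # L1 TLB miss detected at cycle 3
L2_TLB_HIT  = 9    # L2 TLB request at cycle 4, hit at cycle 13
L2_LOOKUP   = 4    # L2 cache miss detected 4 cycles after the request
L3_LOOKUP   = 14   # L3 cache miss detected 14 cycles after the request
MEM_ACCESS  = 99   # memory returns data 99 cycles after the request
REQ         = 1    # one cycle to forward a request to the next level

translation_ready = L1_TLB_MISS + REQ + L2_TLB_HIT                        # cycle 13

# Part a): physically addressed L2, so its lookup must wait for the translation.
a_l2_miss  = translation_ready + REQ + L2_LOOKUP                          # cycle 18
a_mem_done = a_l2_miss + REQ + L3_LOOKUP + REQ + MEM_ACCESS               # cycle 133

# Part b): virtually addressed L2, so its lookup overlaps the TLB accesses.
b_l2_miss  = REQ + L2_LOOKUP                                              # cycle 5
b_mem_done = max(translation_ready, b_l2_miss) + REQ + L3_LOOKUP + REQ + MEM_ACCESS  # cycle 128

print(a_mem_done, b_mem_done, a_mem_done - b_mem_done)                    # 133 128 5
```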
