Cose222 HW4
Assignment #4
Due: Dec 15, 2021 (Wednesday) 11:59pm on Blackboard
Solutions (Total score: 150)
Please answer the questions below. Write your student ID and name at the top of the document. Submit
your homework in PDF format only. (You can easily generate PDF files from Microsoft Word or HWP.
You may also handwrite your answers and scan the handwritten pages to PDF; document-scanning
applications such as Microsoft Lens work well for scanning with a smartphone.)
The answer rules:
(1) You may write your answers in either Korean or English.
(2) Round final numeric answers to three decimal places.
(3) The performance of A is improved by NN% compared to the performance of B if PerfA / PerfB = 1.NN.
1. The following code is written in C, where the elements of an array are allocated contiguously. Array a
has 1024x1024 elements. Arrays b and c each have 1024 elements. Assume each array element is an
8-byte integer, and that all variables are 8-byte integers as well. Cache blocks are allocated on a write
miss, and the size of a cache line is 64 bytes. Assume that the cache size is infinite.
(Hint: in this code, there are 6 variables: a, b, c, i, j, and sum)
int sum;
for (int i = 0; i < 1024; i++)
{
    sum = 0;
    for (int j = 0; j < 1024; j++)
        sum += a[j + i*1024] + b[j];
    c[i] = sum;
}
(c) Let’s focus on the data transfers for arrays a, b, and c. How many ld and sd instructions are issued
while executing this code? [3]
Array a:
Array b:
Array c:
(d) Let’s focus on the cache miss rates for arrays a, b, and c. Calculate the compulsory (cold) miss rate
for each array. (Hint: we assume the cache size is infinite.) [3]
Cold miss rate of a:
Cold miss rate of b:
Cold miss rate of c:
2. Below is a list of 64-bit memory address references, given as word addresses. (1 word = 4 bytes)
0xFD, 0xBA, 0x2C, 0xB5, 0x0E, 0xBE, 0x58, 0xBF, 0x02, 0x2B, 0xB4, 0x03
(a) Let us assume a direct-mapped cache has 16 blocks and a single block includes two words. What is
the size of this cache? [2]
(b) For each of these references, identify the binary word address, the tag, and the index given a direct-
mapped cache with 16 two-word blocks (i.e. the cache has 16 blocks, and the size of a single block is
two words.) Also list whether each reference is a hit or miss, assuming the cache is initially empty. [12]
(c) Calculate the hit rate (in percentage) of the above cache. [2]
(d) Let us assume that the size of a single block increases to four words while the size of the direct-
mapped cache stays the same. For each of these references, identify the binary word address, the tag,
and the index. Also list whether each reference is a hit or a miss, assuming the cache is initially empty. [12]
(e) Calculate the hit rate (in percentage) of the above cache. [2]
(f) Assume that the miss penalty of above caches is proportional to the size of fetched data from the
main memory or lower-level cache. Which cache configuration from questions (b) and (d) exhibits
better performance for the above word address stream? Explain your answer. [4]
3. For a direct-mapped cache design with a 64-bit address, the following bits of the address are used to
access the cache. (1 word = 4 bytes)
(c) What is the ratio of the total bits required for such a cache implementation to the data storage bits?
Assume each cache block includes a 1-bit valid field. [4]
4. Cache access time is usually proportional to the cache capacity. Assume that a main memory access
takes 50 ns and that 36% of all instructions access data memory. The following table shows data for the
L1 caches attached to each of two processors, P1 and P2.
(a) Assuming that the L1 hit time determines the cycle time for P1 and P2, what are their respective
clock rates? [2]
(b) What is the Average Memory Access Time (AMAT) for P1 and P2 (in cycles)? [4]
(c) Assuming a base CPI of 1.0 without any memory stalls, what is the total CPI for P1 and P2? Which
processor is faster? When we say a “base CPI of 1.0”, we mean that instructions complete in one cycle
unless either the instruction access or the data access causes a cache miss. [4]
For the next problems, we will consider the addition of an L2 cache to P1 (to presumably make up for
its limited L1 cache capacity). Use the L1 cache capacities and hit times from the previous table when
solving these problems. The L2 miss rate indicated is its local miss rate, namely the L2 miss counts
divided by the total L2 access counts.
(d) What is the AMAT for P1 with the addition of an L2 cache? Is the AMAT better or worse with the L2
cache? [4]
(e) Assuming a base CPI of 1.0 without any memory stalls, what is the total CPI for P1 with the addition
of an L2 cache? [4]
(f) What would the L2 miss rate need to be in order for P1 with an L2 cache to be faster than P1 without
an L2 cache? [4]
(g) What would the L2 miss rate need to be in order for P1 with an L2 cache to be faster than P2
without an L2 cache? [10]
5. Assume that the cache size is 128 bytes and the size of a single cache block is 32 bytes. Below is a
series of memory read references to the cache. Addresses point to bytes.
0x07, 0x15, 0x4D, 0x2A, 0x79, 0xAB, 0xCE, 0x2E, 0x20, 0x4B, 0x6D, 0x32
0x8A, 0xAF, 0x29, 0xC7, 0xCE, 0x01, 0x18, 0x07, 0x08, 0xAA, 0x08, 0x30
Classify each memory reference as a hit or a miss for each of the following caches. Also calculate the
total number of misses.
(a) Direct-mapped cache [12]
(b) Fully associative cache with LRU replacement [12]
(c) Fully associative cache with FIFO (First In First Out) replacement [12]
6. Let us assume that the virtual address size is 48 bits and the physical memory size is 8 GB. The word
size is 32 bits and the page size is 4 KB. All addresses are byte-addressed.
(a) What is the maximum size of the virtual memory supported by this system? [2]
(c) Let us assume the TLB has 512 entries and is two-way set associative. Which virtual address bits
are used to index the TLB? Which virtual address bits are used as the TLB tag? [8]