Topics Covered: Memory Subsystem

This document provides examples of how different memory subsystem configurations can affect performance. Example 1 shows how interleaving memory across four modules can reduce the time to transfer a block of data from main memory to cache from 38 to 17 clock cycles. Example 2 discusses how cache hit rates impact average memory access time. Example 3 discusses the impact of multi-level caching with L1 and L2 caches. Example 4 calculates address fields for a set-associative cache and estimates a 4.14x improvement from caching frequently accessed data.

Uploaded by

Dhana Lingam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views7 pages

Topics Covered: Memory Subsystem

Uploaded by

Dhana Lingam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 7

Topics covered:

Memory subsystem
Example #1: Effect of Interleaving

Consider a cache which has 8 words per block. On a read miss, the block that
contains the desired word must be copied from the memory into the cache. Assume
that the hardware has following properties. It takes 1 clock cycle to send an address
to the main memory. The first word is accessed in 8 clock cycles, and subsequent
words are accessed in 4 clock cycles. Also, one clock cycle is necessary to send the
word to the cache. How many clock cycles does it take to send the block of words to
the cache?
The total time taken is 1 + 8 + (7x4) +1 = 38

2
Example #1: Effect of Interleaving
If the memory is constructed as four interleaved modules, then when the starting
address of the block arrives at the memory, all four modules being accessing the
required data using the high order bits of the address. After 8 clock cycles, each
module has one word of data in its DBR. These words are transferred to the cache
one word at a time during the next 4 clock cycles. During this time, the next word
in each module is accessed. Then it takes another 4 clock cycles to transfer these
words to the cache.
Therefore the total time taken is 1+8+4+4=17.

Speed up obtained during interleaving is 38/17 = 2.2

3
Example #2: Effect of cache on processor chip
Consider the impact of the cache on the overall performance of the computer. Let
h be the hit rate, M be the miss penalty, that is, the time to access information in the
main memory, and C the time to access information in the cache. Then, the average
access time experienced by the processor is given by:
Refer to page 332 of the text book

Let us consider the following example. If the computer has no cache, then it takes 10
clock cycles for every memory read access. For a computer which has a cache that
holds 8 word blocks and an interleaved main memory, it takes 17 clock cycles to
transfer a block from the main memory to the cache. Assume that 30% of the instructions
require a memory access, so there are 130 memory accesses for every 100 instructions
executed. Assume that the hit rate in the cache are 0.95 for instructions and 0.9 for
data. Then, the improvement in performance is:

130x10/100(0.95x1 + 0.05x17) + 30(0.9x1+0.1x17)=5.04

4
Example #3: Effect of L1 & L2 cache.

Consider the impact of L1 and L2 cache on the overall performance of the processor.
Let h1 be hit rate in cache L1, h2 the hit rate in cache L2, C1 the time to access
information in L1 cache, C2 time to access information in L2 cache, M is the time to
access information in the main memory. Then, the average access time of the
processor is given by:

Refer to page 335 of the text book.

5
Example #4: Set-associative cache

A computer system has a main memory of 64K 16-bit words. It consists of a cache of
128 blocks with 16 words per block organized in a block set associative manner
with 2 blocks per set.
(a) Calculate the number of bits in each of the TAG, SET and WORD fields of the
main memory address format.
(b) Assume that the cache is initially empty. Suppose that the processor fetches 2080
words from locations 0,1,....2079, in that order. It then repeats this fetch sequence
nine more times. If the cache is 10 times faster than the main memory, estimate
the improvement factor resulting from the use of the cache. Assume that the LRU
algorithm is used for block replacement.

(a) The main memory address is 16 bits.

The number of bits in the WORD field is 4.
The number of bits in the SET field is 6.
The number of bits in the TAG field is 16 - (6+4) = 6

6
Example #4: Set-associative cache
Words 0, 1, 2,....,2079 occupy blocks 0 to 129 in the main memory. After blocks 0,
127 have been read from the main memory into the cache on the first pass, the cache
is full. Because the replacement algorithm is LRU, main memory blocks that occupy
the first two sets of the 64 cache sets are always overwritten before they can be used
on a successive pass. In particular main memory blocks 0, 64 and 128 continually
displace each other in competing for the 2 block positions in cache set 0. Similarly,
main memory blocks 1, 65 and 129 continually displace each other in competing for
the 2 block positions in cache set 1. Main memory blocks that occupy the last 62 sets
are fetched once in the first pass and remain in the cache for the next 9 pases. On the
first pass all 130 blocks must be fetched from the main memory. On each of the 9
passes blocks in the last 62 sets of the cache (62x2=124) are found in the cache. The
remaining 6 blocks (130-124) must be fetced from the main memory.
Improvement factor = Time without cache/Time with cache
= 10x130x10t/(1x130x11t + 9(124x1t + 6x11t))
= 4.14

How To Find AMAT - Final - Question
100% (1)
How To Find AMAT - Final - Question
17 pages
53-Cache Memory - Principles, Cache Memory Management Techniques-28!02!2025
No ratings yet
53-Cache Memory - Principles, Cache Memory Management Techniques-28!02!2025
38 pages
Chapter 2z
No ratings yet
Chapter 2z
54 pages
Chap 5 Memory System p1
No ratings yet
Chap 5 Memory System p1
30 pages
CA11 2023S1 New
No ratings yet
CA11 2023S1 New
26 pages
Lec 10
No ratings yet
Lec 10
45 pages
Lec 5
No ratings yet
Lec 5
35 pages
Chap 4 Cache Memory
No ratings yet
Chap 4 Cache Memory
55 pages
Lecture 8
No ratings yet
Lecture 8
33 pages
Revision 1
No ratings yet
Revision 1
33 pages
10 Cacheperf
No ratings yet
10 Cacheperf
24 pages
Lecture 41
No ratings yet
Lecture 41
41 pages
Memory Cache: Computer Architecture and Organization
No ratings yet
Memory Cache: Computer Architecture and Organization
41 pages
23 Cache Memory Basics 11-03-2024
No ratings yet
23 Cache Memory Basics 11-03-2024
19 pages
6.module 2 - Part 2
No ratings yet
6.module 2 - Part 2
39 pages
Cache TLB
100% (1)
Cache TLB
15 pages
EIE3343 Lab 2
No ratings yet
EIE3343 Lab 2
10 pages
Solution To Assignment of COA On Cache
No ratings yet
Solution To Assignment of COA On Cache
8 pages
Chapter 2z
No ratings yet
Chapter 2z
54 pages
5 1
No ratings yet
5 1
39 pages
ch5 Easy
No ratings yet
ch5 Easy
27 pages
Bca 4 Decoa Memory Cache 1
No ratings yet
Bca 4 Decoa Memory Cache 1
5 pages
4.2 Cachememory
No ratings yet
4.2 Cachememory
12 pages
Module 5 - 5 Marks
No ratings yet
Module 5 - 5 Marks
15 pages
Module 4 - Cache Memory Problems
No ratings yet
Module 4 - Cache Memory Problems
8 pages
Cache Performance Average Memory Access Time
No ratings yet
Cache Performance Average Memory Access Time
23 pages
SRAM Main
No ratings yet
SRAM Main
7 pages
Unit 6
No ratings yet
Unit 6
25 pages
Lec 23
No ratings yet
Lec 23
13 pages
CH04 COA10e
No ratings yet
CH04 COA10e
41 pages
Assignment (G)
No ratings yet
Assignment (G)
5 pages
Advanced Architecture Memory
No ratings yet
Advanced Architecture Memory
13 pages
207 Assignment 6
No ratings yet
207 Assignment 6
7 pages
Memory Hierarchy Design
No ratings yet
Memory Hierarchy Design
115 pages
Tutorial 3
No ratings yet
Tutorial 3
14 pages
Cache Memory: A Safe Place For Hiding or Storing Things
No ratings yet
Cache Memory: A Safe Place For Hiding or Storing Things
34 pages
Study Guide 2
No ratings yet
Study Guide 2
4 pages
Cose222 HW4
No ratings yet
Cose222 HW4
5 pages
A8 Solution 2
No ratings yet
A8 Solution 2
4 pages
Sheet 01
No ratings yet
Sheet 01
3 pages
Cache Memory: A Safe Place For Hiding or Storing Things
100% (1)
Cache Memory: A Safe Place For Hiding or Storing Things
34 pages
Memory Hierarchies (Part 2) Review: The Memory Hierarchy
No ratings yet
Memory Hierarchies (Part 2) Review: The Memory Hierarchy
7 pages
ARM hw5
No ratings yet
ARM hw5
5 pages
Computer Organization Exercise Answer7
No ratings yet
Computer Organization Exercise Answer7
7 pages
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
No ratings yet
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
18 pages
Study Set 12 Memory Components and DRAM
No ratings yet
Study Set 12 Memory Components and DRAM
8 pages
Jaimin Brahmbhatt COSC 6351 Advanced Computer Architecture Assignment
No ratings yet
Jaimin Brahmbhatt COSC 6351 Advanced Computer Architecture Assignment
3 pages
Maths
No ratings yet
Maths
3 pages
Memory Latency
No ratings yet
Memory Latency
7 pages
Tutorial 7cache
No ratings yet
Tutorial 7cache
2 pages
SPRING 2015 CDA 3101 Homework 3: Date-Assigned: Mar 27th, 2015 Due Dates: 11:55pm, April 7th, 2015
No ratings yet
SPRING 2015 CDA 3101 Homework 3: Date-Assigned: Mar 27th, 2015 Due Dates: 11:55pm, April 7th, 2015
5 pages
Mekelle Institute of Technology: PC Hardware Troubleshooting (CSE501) Lecture - 4
No ratings yet
Mekelle Institute of Technology: PC Hardware Troubleshooting (CSE501) Lecture - 4
63 pages
cs325 Fall10 Finalexam
No ratings yet
cs325 Fall10 Finalexam
9 pages
ACA Unit-5
No ratings yet
ACA Unit-5
54 pages
Assign1 PDF
No ratings yet
Assign1 PDF
5 pages
Ca Uinit4
No ratings yet
Ca Uinit4
3 pages
CCSA 156-215.80-512Q (2020feb06 Revised)
100% (1)
CCSA 156-215.80-512Q (2020feb06 Revised)
230 pages
BaiTap Chuong4 PDF
No ratings yet
BaiTap Chuong4 PDF
8 pages
HW4
No ratings yet
HW4
3 pages
8.7.1.1 Lab - Configuring A Site-To-Site VPN Using Cisco IOS and CCP - Instructor
No ratings yet
8.7.1.1 Lab - Configuring A Site-To-Site VPN Using Cisco IOS and CCP - Instructor
47 pages
Design Hybrid Identity
No ratings yet
Design Hybrid Identity
869 pages
U.G. Department of Computer Applications N.G.M College 16 UBC 626 - Data Mining and Warehousing Multiple Choice Questions. (K1 Questions) Unit - I
No ratings yet
U.G. Department of Computer Applications N.G.M College 16 UBC 626 - Data Mining and Warehousing Multiple Choice Questions. (K1 Questions) Unit - I
11 pages
1 Chapter 9 Software Evolution
No ratings yet
1 Chapter 9 Software Evolution
23 pages
M.SC Computer Science: Mother Teresa Women'S University
No ratings yet
M.SC Computer Science: Mother Teresa Women'S University
96 pages
Firewalls
100% (1)
Firewalls
94 pages
INT243
No ratings yet
INT243
2 pages
Mol 5 Sat
No ratings yet
Mol 5 Sat
110 pages
Lecture 1 - Cryptography
No ratings yet
Lecture 1 - Cryptography
84 pages
Cassendra
100% (1)
Cassendra
21 pages
Internet Access Methods
No ratings yet
Internet Access Methods
26 pages
Dracula
No ratings yet
Dracula
403 pages
BruteForce SSH Attack Study
No ratings yet
BruteForce SSH Attack Study
36 pages
Configuring NAT Overload On A Cisco Router
No ratings yet
Configuring NAT Overload On A Cisco Router
4 pages
Top Web Hosting Services
No ratings yet
Top Web Hosting Services
5 pages
Evaluating Antivirus Evasion Tools Against Bitdefender Antivirus
No ratings yet
Evaluating Antivirus Evasion Tools Against Bitdefender Antivirus
30 pages
CA Module 26
No ratings yet
CA Module 26
27 pages
Class 10 It - Assignment Booklet
No ratings yet
Class 10 It - Assignment Booklet
10 pages
Unit 5 SoftwareTools
No ratings yet
Unit 5 SoftwareTools
50 pages
4CP0 02 Que 20211116
No ratings yet
4CP0 02 Que 20211116
16 pages
TRU Hacker Vs Hacker Event Details
No ratings yet
TRU Hacker Vs Hacker Event Details
4 pages
Objective: Professional Sumary: - 5 Years of Experience in Various MAINFRAME TECHNOLOGIES
No ratings yet
Objective: Professional Sumary: - 5 Years of Experience in Various MAINFRAME TECHNOLOGIES
4 pages
ECEG-4221-VLSI - Lec - 07 - PLD PAL PLA CPLD FPGA ROM
No ratings yet
ECEG-4221-VLSI - Lec - 07 - PLD PAL PLA CPLD FPGA ROM
39 pages
Bits Pilani: Iotsec: Uml Extension For Internet of Things Systems Security Modelling
No ratings yet
Bits Pilani: Iotsec: Uml Extension For Internet of Things Systems Security Modelling
14 pages
TOPIC 3 - Digital Citizenship and Netiquette
No ratings yet
TOPIC 3 - Digital Citizenship and Netiquette
23 pages
Tutorial MCQ
No ratings yet
Tutorial MCQ
17 pages
TEM-025 Example Installation Qualification Report Sample
No ratings yet
TEM-025 Example Installation Qualification Report Sample
1 page
DHCP Implementation Guide - 0.1
No ratings yet
DHCP Implementation Guide - 0.1
18 pages
A High Performance Blockchain Platform For Intelligent Devices
No ratings yet
A High Performance Blockchain Platform For Intelligent Devices
2 pages
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
From Everand
PlayStation 2 Architecture: Architecture of Consoles: A Practical Analysis, #12
Rodrigo Copetti
No ratings yet
C & C++ Interview Questions You'll Most Likely Be Asked
From Everand
C & C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Topics Covered: Memory Subsystem

Uploaded by

Topics Covered: Memory Subsystem

Uploaded by

Topics covered:

Speed up obtained during interleaving is 38/17 = 2.2

130x10/100(0.95x1 + 0.05x17) + 30(0.9x1+0.1x17)=5.04

Refer to page 335 of the text book.

(a) The main memory address is 16 bits.

You might also like