Unit 3 Notes
Temporal Locality: This suggests that if a particular memory location is accessed, it is likely to be
accessed again soon. For example, loops and frequently called functions exhibit temporal locality.
Spatial Locality: This indicates that if a memory location is accessed, nearby memory locations are likely
to be accessed soon afterward. This is often seen in array accesses and sequential data processing.
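As a small sketch in C (the array name and size are arbitrary choices for illustration), the loop below shows both kinds of locality: the accumulator and the loop counter are reused on every iteration (temporal locality), while the array is traversed element by element in the order it is laid out in memory (spatial locality).

    #include <stdio.h>

    #define N 1024

    int main(void) {
        int a[N];
        long sum = 0;                  /* reused on every iteration: temporal locality */

        for (int i = 0; i < N; i++)    /* 'i' and 'sum' are touched repeatedly: temporal locality */
            a[i] = i;

        for (int i = 0; i < N; i++)
            sum += a[i];               /* consecutive elements of 'a': spatial locality */

        printf("sum = %ld\n", sum);
        return 0;
    }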
To take advantage of the principle of locality, modern computer systems use a hierarchical memory
structure that balances speed, cost, and size. The typical memory hierarchy includes:
Registers: The fastest type of memory, located within the CPU. They store a small amount of data that is
immediately needed by the processor for calculations and operations.
Cache Memory: This is a small, high-speed memory located close to the CPU. It is divided into levels (L1,
L2, and sometimes L3). Caches store frequently accessed data and instructions to reduce the time the
CPU takes to access data from main memory. L1 is the smallest and fastest, while L3 is larger but slower.
Main Memory (RAM): This is the primary storage used for currently running programs and data. It is
larger than cache but slower. RAM is volatile, meaning it loses its contents when the power is turned off.
Secondary Storage: This includes hard drives (HDDs) and solid-state drives (SSDs). It provides long-term
storage for data and programs but is much slower than RAM. SSDs are faster than HDDs but generally
more expensive.
Tertiary Storage: This is used for backup and archival purposes, such as magnetic tape or optical discs. It
has the slowest access times and is often used for data that is not frequently accessed.
Set associative mapping combines direct mapping with fully associative mapping by arranging the lines of a cache into sets. The set that a memory block belongs to is determined using a direct mapping scheme. However, the lines within each set are treated as a small fully associative cache: any block that maps to a set can be stored in any line within that set.
The diagram shows this arrangement using a sample cache with four lines per set.
A set-associative cache with k lines per set is known as a k-way set-associative cache. Because the set is selected using bits of the memory address, just as in direct mapping, the number of lines in a set is normally an integer power of two, for example two, four, eight, or sixteen.
Example − Consider a cache with 2^9 = 512 lines, memory blocks of 2^3 = 8 words, and a full memory space of 2^30 = 1G words. In a direct mapping scheme, this leaves 30 − 9 − 3 = 18 bits for the tag.
By moving from direct mapping to set-associative mapping with two lines per set, the number of sets is half the number of lines. For the cache with 512 lines, this gives 256 sets of two lines each, which requires eight bits of the memory address to identify the set. This leaves 30 − 8 − 3 = 19 bits for the tag. Moving to four lines per set reduces the number of sets to 128, needing 7 bits to identify the set and leaving 20 bits for the tag.
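The arithmetic above can be checked with a small C sketch that splits a 30-bit word address into tag, set-index, and word-offset fields for the three configurations discussed; the constant names, the sample address, and the printout are illustrative assumptions, not part of the original example.

    #include <stdio.h>
    #include <stdint.h>

    /* Cache with 512 lines, 8-word blocks, 2^30-word address space (as in the example). */
    #define ADDR_BITS   30
    #define TOTAL_LINES 512
    #define BLOCK_WORDS 8   /* 3 word-offset bits */

    static void split(uint32_t addr, int ways) {
        int offset_bits = 3;                        /* log2(BLOCK_WORDS) */
        int sets        = TOTAL_LINES / ways;       /* 512, 256, or 128 sets */
        int set_bits    = 0;
        while ((1 << set_bits) < sets) set_bits++;  /* log2(sets): 9, 8, or 7 */
        int tag_bits    = ADDR_BITS - set_bits - offset_bits;

        uint32_t offset = addr & (BLOCK_WORDS - 1);
        uint32_t set    = (addr >> offset_bits) & (sets - 1);
        uint32_t tag    = addr >> (offset_bits + set_bits);

        printf("%d-way: tag=%u (%d bits), set=%u (%d bits), offset=%u\n",
               ways, tag, tag_bits, set, set_bits, offset);
    }

    int main(void) {
        uint32_t addr = 0x12345678 & ((1u << ADDR_BITS) - 1);  /* arbitrary 30-bit word address */
        split(addr, 1);   /* direct mapped: 18 tag bits */
        split(addr, 2);   /* 2-way:         19 tag bits */
        split(addr, 4);   /* 4-way:         20 tag bits */
        return 0;
    }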
3. What is miss rate? Explain the three categories of cache misses in the three Cs model.
Miss Rate
The miss rate is a performance metric used in cache memory systems to indicate the fraction (often expressed as a percentage) of memory accesses that result in a cache miss. It is calculated as:

    Miss Rate = Number of Cache Misses / Total Number of Memory Accesses
A lower miss rate indicates better cache performance, as it means that more memory accesses are being
served by the cache rather than having to go to slower main memory.
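For instance, with purely illustrative numbers: if a program issues 10,000 memory accesses and 400 of them miss in the cache, the miss rate is 400 / 10,000 = 4%, and the hit rate is 96%.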
The Three Cs Model categorizes cache misses into three distinct types: Compulsory Misses, Capacity
Misses, and Conflict Misses. Each type of miss arises from different causes:
Compulsory Misses:
Definition: These misses occur the first time a block is accessed. When data is loaded into the cache for the first time, that access is counted as a compulsory miss (also called a cold-start miss).
Example: If a program is accessing an array for the first time, the initial accesses to that array will result
in compulsory misses until the relevant blocks are loaded into the cache.
Impact: Compulsory misses are inevitable and can’t be eliminated entirely, but their frequency can be
reduced through techniques like prefetching.
Capacity Misses:
Definition: These occur when the cache cannot hold all the blocks that are actively being used by the
program. As a result, previously loaded blocks are evicted before they are reused.
Example: If a cache has a limited size and a program accesses more data than can fit into that cache,
some blocks will be evicted, leading to misses when those blocks are accessed again.
Impact: Capacity misses can be mitigated by increasing cache size or optimizing data access patterns to
fit within the available cache.
Conflict Misses:
Definition: These arise in set-associative or direct-mapped caches when multiple blocks compete for the
same cache line or set. Even if there is space in other cache lines, the particular block being accessed is
not available because it maps to a specific set or line that is already occupied by a different block.
Example: In a direct-mapped cache, if two different memory blocks map to the same cache line,
accessing one block will evict the other, leading to conflict misses.
Impact: Conflict misses can be reduced by using higher associativity (i.e., moving to a more flexible cache
structure) or optimizing the memory access pattern to reduce conflicts.
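The conflict-miss case can be made concrete with a C sketch. Assuming, purely for illustration, a 32 KB direct-mapped cache, the two arrays below are placed exactly one cache-size apart, so a[i] and b[i] always map to the same cache line and repeatedly evict each other even though the rest of the cache is unused.

    #include <stdio.h>
    #include <stdlib.h>

    /* Assumed (for illustration only): a 32 KB direct-mapped cache.
     * Two addresses that differ by a multiple of the cache size share the same line index. */
    #define CACHE_SIZE (32 * 1024)
    #define N          (CACHE_SIZE / sizeof(double))

    int main(void) {
        /* Allocate two arrays exactly one cache-size apart, so a[i] and b[i]
         * contend for the same line in a direct-mapped cache. */
        double *block = malloc(2 * CACHE_SIZE);
        if (!block) return 1;
        double *a = block;
        double *b = (double *)((char *)block + CACHE_SIZE);

        double sum = 0.0;
        for (size_t i = 0; i < N; i++) {
            a[i] = 1.0;
            b[i] = 2.0;            /* evicts the line holding a[i] */
            sum += a[i] + b[i];    /* alternating accesses keep evicting each other */
        }
        printf("%f\n", sum);
        free(block);
        return 0;
    }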
4. Explain the techniques used to reduce the cache miss rate.
1. Larger Cache Size:
Explanation: Increasing the cache size allows it to store more data blocks, reducing capacity misses. A larger cache can hold more of the program's working data, which is particularly beneficial for applications with high data locality.
2. Higher Associativity:
Explanation: Using a higher level of associativity (e.g., going from direct-mapped to 4-way or 8-way set associative) reduces conflict misses by allowing several blocks that map to the same set to reside in the cache at the same time. This flexibility decreases the chances of evicting useful data.
3. Optimized Block Size:
Explanation: Choosing an optimal cache block (line) size is crucial. Larger blocks can exploit spatial
locality by fetching adjacent data along with the requested block. However, if blocks are too large, it can
lead to higher miss penalties and waste space for infrequently used data. A balance must be found based
on the access patterns of applications.
4. Prefetching:
Explanation: Prefetching involves predicting future memory accesses and loading data into the cache
before it is explicitly requested by the processor. This can significantly reduce latency for sequential
accesses and loops, though care must be taken to avoid evicting useful data.
5. Write Policies:
Explanation: Adjusting write policies, such as using write-back instead of write-through caching, can improve performance. Write-back caching writes modified data to main memory only when the block is evicted from the cache, reducing memory traffic and improving overall speed.
6. Replacement Policies:
Explanation: Implementing effective cache replacement policies (e.g., Least Recently Used (LRU),
First-In-First-Out (FIFO), or Random) can influence which cache lines to evict when new data needs to be
loaded. Better policies can reduce misses by keeping frequently accessed data in the cache longer.
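To make the replacement-policy idea concrete, here is a minimal LRU sketch in C for a single 4-way set; the structure names, the timestamp scheme, and the 4-way choice are assumptions for illustration, not a description of real hardware.

    #include <stdio.h>
    #include <stdint.h>
    #include <stdbool.h>

    #define WAYS 4   /* 4-way set-associative: an illustrative choice */

    struct line {
        bool     valid;
        uint32_t tag;
        uint64_t last_used;   /* time of the most recent access to this line */
    };

    struct set {
        struct line lines[WAYS];
    };

    /* Look up 'tag' in the set at time 'now'. On a hit, refresh its timestamp.
     * On a miss, fill an invalid line if one exists, otherwise evict the
     * least recently used line. Returns the way that was used. */
    static int access_set(struct set *s, uint32_t tag, uint64_t now, bool *hit) {
        int victim = 0;

        for (int w = 0; w < WAYS; w++) {
            if (s->lines[w].valid && s->lines[w].tag == tag) {
                s->lines[w].last_used = now;   /* LRU bookkeeping on a hit */
                *hit = true;
                return w;
            }
            if (!s->lines[w].valid)
                victim = w;                    /* prefer an empty line */
            else if (s->lines[victim].valid &&
                     s->lines[w].last_used < s->lines[victim].last_used)
                victim = w;                    /* otherwise track the oldest line */
        }

        *hit = false;
        s->lines[victim].valid     = true;     /* install the new block in the victim line */
        s->lines[victim].tag       = tag;
        s->lines[victim].last_used = now;
        return victim;
    }

    int main(void) {
        struct set s = {0};
        bool hit;
        uint32_t tags[] = {1, 2, 1, 3, 4, 5, 2};   /* five distinct tags overflow a 4-way set */

        for (uint64_t t = 0; t < 7; t++) {
            access_set(&s, tags[t], t, &hit);
            printf("access tag %u -> %s\n", tags[t], hit ? "hit" : "miss");
        }
        return 0;
    }

In this run, the second access to tag 1 hits, tag 5 evicts the least recently used block (tag 2), and the final access to tag 2 therefore misses, which is exactly the behaviour an LRU policy is meant to produce.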
5. What is way prediction? How is it used to reduce the cache hit time?
Way Prediction
Way prediction is a technique used in set-associative caches to improve access times by predicting which
way (or line) within a set will contain the required data. In a set-associative cache, each set contains
multiple cache lines, and way prediction aims to minimize the time taken to access these lines when
looking for data.
Instead of checking all the lines in a set sequentially to find the required data, the system predicts which
line is likely to contain the desired block based on past access patterns. This prediction can be based on
historical usage data or specific algorithms.
Upon receiving a memory request, the cache controller uses the prediction to first check the predicted
line in the set. If the data is found (a hit), access time is reduced since only one line was checked.
If the predicted line does not contain the data (a miss), the system then checks the remaining lines in the
set. This process involves some overhead, but the initial access is often significantly faster than checking
all lines.
Benefits
Reduced Hit Time: By reducing the number of cache lines that need to be checked on average, way prediction decreases the cache access time, leading to faster data retrieval.
Prediction Accuracy: In workloads with predictable access patterns (such as loops or frequently accessed data), the prediction is correct more often, so most hits are served at the faster, single-way access time.
Efficiency:
This technique minimizes the performance impact of the additional complexity introduced by using a
set-associative cache, making the access pattern more efficient.
Implementation Considerations
Prediction Mechanism: The effectiveness of way prediction heavily depends on the accuracy of the
prediction mechanism. Common strategies include:
Simple History: Keeping a record of recent accesses to predict which way to check first.
State Machines: Using finite state machines that track access patterns over time to make predictions.
Trade-offs: While way prediction can reduce hit times, it also introduces additional complexity and
potential overhead in terms of hardware resources and prediction accuracy.
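A minimal C sketch of way prediction, under the assumption of a simple "last way that hit" predictor per set (the data-structure names and the 4-way geometry are illustrative): the lookup probes only the predicted way on the fast path, and falls back to searching the other ways and retraining the predictor when that probe fails.

    #include <stdio.h>
    #include <stdint.h>
    #include <stdbool.h>

    #define WAYS 4   /* illustrative 4-way set */

    struct pset {
        bool     valid[WAYS];
        uint32_t tag[WAYS];
        int      predicted_way;   /* the way that hit most recently */
    };

    /* Returns true on a cache hit. *fast is set when the predicted way was
     * correct, i.e. only one tag comparison was needed (the reduced-hit-time case). */
    static bool lookup(struct pset *s, uint32_t tag, bool *fast) {
        int p = s->predicted_way;

        /* Fast path: probe only the predicted way. */
        if (s->valid[p] && s->tag[p] == tag) {
            *fast = true;
            return true;
        }

        /* Slow path: the prediction was wrong, so search the remaining ways
         * and retrain the predictor if the block is found. */
        *fast = false;
        for (int w = 0; w < WAYS; w++) {
            if (w != p && s->valid[w] && s->tag[w] == tag) {
                s->predicted_way = w;   /* remember this way for the next access */
                return true;
            }
        }
        return false;   /* a genuine cache miss: the block is in no way of this set */
    }

    int main(void) {
        struct pset s = { .valid = {true, true, false, false},
                          .tag   = {10, 20, 0, 0},
                          .predicted_way = 0 };
        bool fast, hit;

        hit = lookup(&s, 10, &fast);   /* predicted way is correct: fast hit */
        printf("tag 10: hit=%d fast=%d\n", hit, fast);
        hit = lookup(&s, 20, &fast);   /* mispredicted: slow hit, predictor retrained */
        printf("tag 20: hit=%d fast=%d\n", hit, fast);
        hit = lookup(&s, 20, &fast);   /* now the fast path hits again */
        printf("tag 20: hit=%d fast=%d\n", hit, fast);
        return 0;
    }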
6. Explain the use of loop interchange to reduce the miss rate, with an example.
Loop interchange is a code optimization technique used to improve data locality and reduce cache miss
rates in nested loops. By changing the order of loop iterations, you can enhance spatial locality, which
helps to maximize cache hits when accessing array elements.
When accessing multidimensional arrays, the order in which loops iterate over the array can greatly
affect cache performance. If the innermost loop accesses data that is not contiguous in memory (leading
to scattered memory accesses), it can result in more cache misses. Loop interchange reorders these
loops to access data in a more cache-friendly manner.
Example
Consider the following nested loop over two M-by-N arrays, written with the column index j as the outer loop and the row index i as the inner loop:

    for (j = 0; j < N; j++)
        for (i = 0; i < M; i++)
            A[i][j] += B[i][j];

In this example, consecutive iterations of the inner loop access A[0][j], A[1][j], A[2][j], and so on, elements that are a full row (N elements) apart in memory.
Cache Behavior
In row-major order (which is how C/C++ stores arrays), this access pattern is cache-unfriendly: each access to A[i][j] loads a cache line that also contains A[i][j+1] and its other neighbours in row i, but the next iteration of the inner loop accesses A[i+1][j], which lies in a different row N elements away. The neighbouring elements brought in with each line are usually evicted before they are ever used, so a large share of the accesses miss.
After loop interchange, the loops are reordered so that j (the column index) becomes the inner loop:

    for (i = 0; i < M; i++)
        for (j = 0; j < N; j++)
            A[i][j] += B[i][j];

By iterating over j (columns) in the inner loop for each i (row), you access contiguous elements in memory, leading to better cache utilization.
With better data locality, the likelihood of cache hits increases, reducing the overall miss rate. Once a cache line is loaded for A[i][j], the next access to A[i][j+1] is very likely to hit in the cache.