4-Module #4-Shared-Memory-Students-Version-Final-October-24-2024

Uploaded by

Omar Amer


Module #4

Shared Memory Architectures

Professor Mostafa Abd-El-Barr

Fall Term 2024-2025

Sunday, October 27, 2024

Outline
1. Introduction to Shared Memory Architecture
2. Classification of Shared Memory
3. Bus-based Symmetric Multiprocessors
4. Cache Incoherence Problem

Introduction
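o A shared memory system is shown in the Figure: the processors (P) reach the memory modules (M) through an interconnection network.
o In a shared memory system, all inter-processor coordination and synchronization are accomplished via the global memory.
[Figure: memory modules (M) connected through an interconnection network to processors (P)]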

o Two main problems in designing a shared memory system:

1. Performance degradation due to contention, and
2. coherence problems.
o Performance degradation might happen when multiple processors are trying to access the shared memory simultaneously.
o Having multiple copies of data, spread throughout the caches, might lead to a coherence problem.
o Copies in the caches are coherent if they all equal the same value.
o If one of the processors writes over the value of one of the copies, then the copy becomes inconsistent because it no longer
equals the value of the other copies.
Classification of Shared Memory Systems
o The simplest shared memory system consists of a single memory module that is accessed by more than one processor.

o An arbitration unit within the memory module passes requests through to a memory controller.
o If the memory module is not busy and a single request arrives, then the arbitration unit passes that request to the memory and the request is
satisfied.
o If the arbitration unit receives two requests, it selects one of them and passes it to the memory controller.
o The module is placed in the busy state while a request is being serviced.
o If a new request arrives while the memory is busy servicing a previous request, the memory module sends a wait signal, through the memory
controller, to the processor making the new request.
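
The arbitration behaviour described above can be sketched in a few lines of Python. This is only an illustrative model: the class name `MemoryModule`, the methods `submit` and `finish`, and the fixed-priority rule (the lowest processor name wins) are assumptions, since the slide only says that the arbitration unit "selects one" of two simultaneous requests. Telling the losing processor to wait is likewise an assumption.

```python
class MemoryModule:
    """Toy model of one shared memory module with an arbitration unit."""

    def __init__(self):
        self.busy = False  # True while a request is being serviced

    def submit(self, requesters):
        """Handle the requests that arrive together.

        Returns (granted, told_to_wait): the processor whose request is
        passed to the memory controller (or None), and the processors
        that receive a wait signal.
        """
        requesters = list(requesters)
        if self.busy:
            # Module is servicing an earlier request: everyone waits.
            return None, requesters
        if not requesters:
            return None, []
        granted = min(requesters)  # assumed fixed-priority selection
        self.busy = True           # module enters the busy state
        return granted, [p for p in requesters if p != granted]

    def finish(self):
        """Mark the current request as completed."""
        self.busy = False


module = MemoryModule()
print(module.submit(["P1"]))        # single request, module idle
print(module.submit(["P2"]))        # arrives while the module is busy
module.finish()
print(module.submit(["P2", "P1"]))  # two simultaneous requests
```

A real arbiter might use round-robin or random selection instead; the fixed-priority choice is used here only because it makes the sketch deterministic.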

o Based on the interconnection network used, shared memory systems can be classified into the following categories:
✓ Uniform Memory Access (UMA)
o In a UMA system, the shared memory is accessible by all processors, and every processor has the same access time to any memory location.
o The interconnection network used in a UMA system can be a single bus, multiple buses, or a crossbar switch.
o Because access to shared memory is balanced, these systems are also called SMP (Symmetric Multiprocessor) systems.
o Each processor has an equal opportunity to read from and write to memory, including equal access speed.
o Commercial examples of SMPs are Sun Microsystems multiprocessor servers and Silicon Graphics Inc. multiprocessor servers.
o A typical bus-structured SMP computer is shown in the Figure.
[Figure: memory modules (M) on a shared bus; each processor (P) attaches to the bus through its own cache (C)]
Classification of Shared Memory Systems
✓ Non-uniform memory access (NUMA)
o In this architecture each processor has part of shared
memory attached.
o The memory has a single address space.
o Any processor could access any memory location
directly using its real address.
o The access time to modules depends on the distance to
the processor, i.e., a non-uniform memory access time.
o Several interconnection architectures are used in NUMA systems, such as tree networks.
o Examples of NUMA architectures are the BBN TC-2000, SGI Origin 3000, and Cray T3E. Figure 4.4 shows the NUMA system organization.

✓ Cache-Only Memory Architecture (COMA)
o As in NUMA, each processor has part of the shared memory; in COMA, however, that part is a cache memory.
o A COMA system requires that data be migrated to the processor requesting it.
o The address space is made up of all the caches.
o A cache directory (D) helps in remote cache access.
o The Figure shows the organization of COMA.
Bus Based Symmetric Multiprocessors
o A typical bus-based design uses high speed caches to solve the bus contention problem.
o We define the variables for hit rate, number of processors, processor speed, bus speed, and processor duty cycle rates as
follows:
▪ N = Number of processors
▪ h = hit rate of each cache, assumed to be the same for all caches
▪ (1 – h) = miss rate of all caches
▪ B = Bandwidth of the bus, measured in cycles/second
▪ I = Processor duty cycle, assumed to be identical for all processors, in fetches/cycle
▪ V = Peak processor speed, in fetches/second
o The effective bandwidth of the bus is BI fetches/second.
o If each processor is running at a speed of V, then misses are being generated at a rate of V(1 – h).
o For an N-processor system, misses are simultaneously being generated at a rate of N(1 – h)V.
o The bus saturates when this aggregate miss rate reaches the effective bus bandwidth, so the system must satisfy N(1 – h)V ≤ BI.
o The maximum number of processors with cache memories that the bus can support is therefore given by N ≤ BI/((1 – h)V).

o Example 1
▪ Suppose a shared memory system is constructed from processors that can execute V = 110 instructions/second and the
processor duty cycle I = 1.
▪ The caches are designed to support a hit rate of 97%, and the bus supports a peak bandwidth of B = 106 cycles/second.
▪ Then, (1 – h) = 0.03, and the maximum number of processors N is N ≤ 106/(0.03 * 110) = 32.
▪ Thus, the system we have in mind can support only 32 processors!
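
The bound N ≤ BI/((1 – h)V) can be evaluated with a short Python sketch. The function name `max_processors` and the parameter values in the two calls are hypothetical; they were chosen only for illustration and are not the figures of Example 1.

```python
import math


def max_processors(bus_bandwidth, duty_cycle, hit_rate, peak_speed):
    """Largest integer N satisfying N * (1 - h) * V <= B * I."""
    if not 0.0 <= hit_rate < 1.0:
        raise ValueError("hit_rate must be in [0, 1)")
    # Fetches/second that each processor sends to the bus (cache misses).
    miss_traffic_per_cpu = (1.0 - hit_rate) * peak_speed
    # Fetches/second that the bus can carry.
    effective_bandwidth = bus_bandwidth * duty_cycle
    return math.floor(effective_bandwidth / miss_traffic_per_cpu)


# Hypothetical parameters: B = 2e6 cycles/s, I = 1 fetch/cycle, V = 1e7 fetches/s.
print(max_processors(bus_bandwidth=2e6, duty_cycle=1.0, hit_rate=0.97, peak_speed=1e7))
print(max_processors(bus_bandwidth=2e6, duty_cycle=1.0, hit_rate=0.985, peak_speed=1e7))
```

Halving the miss rate from 3% to 1.5% roughly doubles the number of processors the bus can carry, which is why the hit rate dominates the scalability of a bus-based design.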
Cache Incoherence Problem
Single Processor Case: Caching
Hit: the data is in the cache.
Miss: the data is not in the cache.
[Figure: processor P with a cache holding a copy of x, backed by memory holding x]
Cache hit rate (%): h
Cache miss rate (%): m = (1 – h)
Effect of Cache Hit Ratio
$t_{av} = h_1 t_1 + (1 - h_1)\,[\,t_1 + h_2 t_2 + (1 - h_2)(t_2 + t_3)\,]$

$h_1 = 0.8$, $t_1 = 5$, $t_2 = 50$ ns, $t_3 = 200$ ns, $h_2 = 0.8$
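
As a worked check, the average access time can be computed in Python. The sketch assumes that the formula above is bracketed as t_av = h1·t1 + (1 – h1)[t1 + h2·t2 + (1 – h2)(t2 + t3)], that is, a two-level cache in front of main memory, and that t1 = 5 is in nanoseconds like the other times. Both are assumptions, and the function name `average_access_time` is hypothetical.

```python
def average_access_time(h1, t1, h2, t2, t3):
    """Average access time for two cache levels in front of main memory.

    Assumed bracketing:
        t_av = h1*t1 + (1 - h1) * (t1 + h2*t2 + (1 - h2) * (t2 + t3))
    """
    # Expected cost of going past level 1: hit in level 2, or miss and go to memory.
    beyond_level1 = h2 * t2 + (1 - h2) * (t2 + t3)
    return h1 * t1 + (1 - h1) * (t1 + beyond_level1)


t_av = average_access_time(h1=0.8, t1=5, h2=0.8, t2=50, t3=200)
print(round(t_av, 6))  # in ns, under the stated assumptions
```

By hand, under the same assumptions: 0.8 × 50 + 0.2 × 250 = 90, so the miss path costs 5 + 90 = 95 ns, and t_av = 0.8 × 5 + 0.2 × 95 = 23 ns.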

Cache Incoherence Problem
✓ Writing to Cache in the Two processors case
• Write Through
• Write Back
Let X be an element of shared data that has been referenced by two processors, P1 and P2.

1. In the beginning, the three copies of X (one in main memory and one in each of the two caches) are consistent.

2. If processor P1 writes new data X1 into its cache under a write-through policy, the same value is written immediately into the shared memory.
3. In this case, P1's cache and the main memory remain consistent, but the copy in P2's cache still holds the old value.
4. Under a write-back policy, the main memory is updated only when the modified data in the cache is replaced or invalidated.
Cache Incoherence Problem
Multiprocessor Case caching
In a shared memory multiprocessor system with a separate cache memory for each processor, it is possible to have
many copies of shared data: one copy in the main memory and one in the local cache of each processor that requested
it.
When one of the copies of data is changed, the other copies must reflect that change.
Cache coherence: ensures that the changes in the values of shared operands (data) are propagated throughout the
system in a timely fashion.
The following are the requirements for cache coherence:
1. Write Propagation
Changes to the data in any cache must be propagated to other copies (of that cache line) in the peer caches.

2. Transaction Serialization
Reads/Writes to a single memory location must be seen by all processors in the same order.

Thus, if location X received two different values A and B, in this order, from any two processors, the processors can never read
location X as B and then read it as A. The location X must be seen with values A and B in that order.
Cache Incoherence Problem
Example: Consider that more than one processor has cached a copy of the memory location X.

The following conditions are necessary to achieve cache coherence:

1. A read by processor P1 of location X that follows a write by P1 to X, with no writes to X by another processor between the write and the read, must always return the value written by P1.

2. A read by processor P1 of location X that follows a write by another processor P2 to X, with no other writes to X by any processor between the two accesses and with the read and write sufficiently separated in time, must always return the value written by P2.

This condition defines the concept of a coherent view of memory.

If processor P1 reads the old value of X even after the write by P2, the memory is said to be incoherent.
Cache Incoherence Problem
✓ Writing to Cache in Multiple processors case
[Figure: shared memory holds X; the caches of P1, P2, P3, ..., Pn hold copies x of it]
- Multiple copies of x exist.
- What happens if P1 updates x?
Cache Incoherence Problem
Cache Coherence Policies
✓Four Cases to handle Writing to Cache in n processors case
• Write Update - Write Through
• Write Update - Write Back
• Write Invalidate - Write Through
• Write Invalidate - Write Back
✓ Cache-Memory Coherence
Illustration of Write-Through and Write-Back in a single processor

                                 Write Through        Write Back
Serial  Event                    Memory   Cache       Memory   Cache
1                                X                    X
2       P reads X                X        X           X        X
3       P updates (writes) X     X’       X’          X        X’
Cache Coherence Problem
✓ Cache – Cache Coherence
o When a task running on processor P requests the data in global memory location X, the contents of X are copied to
processor P’s local cache.
o Suppose processor Q also accesses X and wants to Write a new value to X.
o There are two fundamental cache coherence policies:
(1) write-invalidate, and
(2) write-update.
o Write-invalidate maintains consistency by reading from local caches until a write occurs.
o When any processor updates the value of X through a write, it posts a dirty bit for X, which invalidates all other copies.
o For example, when processor Q writes a new value into its cache, it invalidates all other copies of X and sets the dirty bit for X.
o Q can continue to change X without further notifications to other caches because Q has the only valid copy of X.
o When processor P wants to read X, it must wait until X is updated and the dirty bit is cleared.

o Write-update maintains consistency by immediately updating all copies in all caches.

o The Table compares the write-update and write-invalidate policies.

                           Write update              Write invalidate
Serial  Event              P’s Cache   Q’s Cache     P’s Cache   Q’s Cache
1       P reads X          X                         X
2       Q reads X          X           X             X           X
3       Q updates X        X’          X’            INV         X’
4       Q updates X’       X’’         X’’           INV         X’’
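
The two policies in the table can be replayed with a small Python sketch. It is a simplified model rather than a hardware description: the function name `simulate`, the symbolic values `X`, `X'`, `X''`, and the single `latest` variable that stands in for wherever the up-to-date value lives are all assumptions made for illustration.

```python
INV = "INV"  # marker for an invalidated cache copy


def simulate(policy, events):
    """Replay (processor, operation) events under 'update' or 'invalidate'.

    Returns the (P's cache, Q's cache) contents after each event.
    """
    if policy not in ("update", "invalidate"):
        raise ValueError(policy)
    caches = {"P": None, "Q": None}  # None means the block is not cached
    latest = "X"                     # most recent value of the location
    history = []
    for proc, op in events:
        if op == "read":
            if caches[proc] in (None, INV):  # read miss: fetch current value
                caches[proc] = latest
        elif op == "write":
            latest = latest + "'"            # X -> X' -> X''
            caches[proc] = latest
            for other in caches:
                if other != proc and caches[other] is not None:
                    # Update pushes the new value; invalidate marks the copy stale.
                    caches[other] = latest if policy == "update" else INV
        else:
            raise ValueError(op)
        history.append((caches["P"], caches["Q"]))
    return history


events = [("P", "read"), ("Q", "read"), ("Q", "write"), ("Q", "write")]
for policy in ("update", "invalidate"):
    print(policy, simulate(policy, events))
```

Under write-invalidate the second write by Q causes no further change in P's cache, which matches the observation that Q can keep changing X without notifying the other caches.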
Cache Incoherence Problem
Write-invalidate
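[Figure: write-invalidate with processors P1, P2, P3, shown before the write and after it under each write policy; I = invalidated copy]

                 Before     Write Through     Write Back
Memory           x          x’                x
Cached copies    x, x       x’, I             x’, I
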
Cache Incoherence Problem
Write-Update
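[Figure: write-update with processors P1, P2, P3, shown before the write and after it under each write policy]

                 Before     Write Through     Write Back
Memory           x          x’                x
Cached copies    x, x       x’, x’            x’, x’
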
Cache Incoherence Problem
✓ Write Invalidate Write Through
o Multiple processors can read block copies from main memory safely until one processor
updates its copy.
o At this time, all cache copies are invalidated and the memory is updated to remain consistent.

State              Description
Valid [VALID]      The copy is consistent with global memory.
Invalid [INV]      The copy is inconsistent.

Cache Incoherence Problem
Write Invalidate Write Through (cont.)

Event        Actions
Read Hit     Use the local copy from the cache.
Read Miss    Fetch a copy from global memory. Set the state of this copy to Valid.
Write Hit    Perform the write locally. Broadcast an Invalid command to all caches. Update the global memory.
Write Miss   Get a copy from global memory. Broadcast an Invalid command to all caches. Update the global memory. Update the local copy and set its state to Valid.
Replace      Since memory is always consistent, no write back is needed when a block is replaced.

Cache Incoherence Problem
✓ Write Invalidate Write Through

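Example 1
Two processors P and Q, each with its own cache (C), share a memory (M) in which X = 5 initially. The following operations are performed in order:
1. P reads X
2. Q reads X
3. Q updates X, X=10
4. Q reads X
5. Q updates X, X=15
6. P updates X, X=20
7. Q reads X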
Cache Incoherence Problem
• The Table shows the contents of memory and the two caches after the execution of each operation when Write Invalidate Write
Through was used for cache coherence.
• The table also shows the state of the block containing X in P’s cache and Q’s cache.
                                      Memory    P’s Cache        Q’s Cache
Serial  Event                         X         X      State     X      State
0       Original value                5
1       P reads X (Read Miss)         5         5      VALID
2       Q reads X (Read Miss)         5         5      VALID     5      VALID
3       Q updates X (Write Hit)       10        5      INV       10     VALID
4       Q reads X (Read Hit)          10        5      INV       10     VALID
5       Q updates X (Write Hit)       15        5      INV       15     VALID
6       P updates X (Write Miss)      20        20     VALID     15     INV
7       Q reads X (Read Miss)         20        20     VALID     20     VALID
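
The table can be reproduced by replaying the seven operations through a small Python model of the Write Invalidate Write Through rules. This is a sketch under simplifying assumptions: one block holding a single word X, two caches, and hypothetical names (`simulate_wt_invalidate`, the event tuples). It is not a description of any particular hardware.

```python
def simulate_wt_invalidate(events, initial=5, procs=("P", "Q")):
    """Replay (processor, op, value) events; op is 'read' or 'write'.

    Each returned row is (event kind, memory X, (P's X, P's state), (Q's X, Q's state)).
    """
    memory = initial
    caches = {p: {"value": None, "state": None} for p in procs}
    rows = []
    for proc, op, value in events:
        line = caches[proc]
        hit = line["state"] == "VALID"
        if op == "read":
            kind = "Read Hit" if hit else "Read Miss"
            if not hit:
                # Read miss: fetch a copy from global memory, mark it Valid.
                line["value"], line["state"] = memory, "VALID"
        elif op == "write":
            kind = "Write Hit" if hit else "Write Miss"
            # Broadcast Invalid to every other cache that holds the block.
            for other, other_line in caches.items():
                if other != proc and other_line["state"] is not None:
                    other_line["state"] = "INV"
            memory = value  # write-through: memory is updated immediately
            line["value"], line["state"] = value, "VALID"
        else:
            raise ValueError(op)
        rows.append((kind, memory)
                    + tuple((caches[p]["value"], caches[p]["state"]) for p in procs))
    return rows


example = [("P", "read", None), ("Q", "read", None), ("Q", "write", 10),
           ("Q", "read", None), ("Q", "write", 15), ("P", "write", 20),
           ("Q", "read", None)]
for serial, row in enumerate(simulate_wt_invalidate(example), start=1):
    print(serial, row)
```

Because every write goes straight to memory, a read miss can always be served from memory, which is why the Replace event needs no write back.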
Cache Incoherence Problem
✓ Write Back- Write Invalidate (ownership)
o A valid block can be owned by memory and shared in multiple caches that can
contain only the shared copies of the block.

o Multiple processors can safely read these blocks from their caches until one
processor updates its copy.

o At this time, the writer becomes the only owner of the valid block and all other
copies are invalidated.

Cache Incoherence Problem
✓ Write Back- Write Invalidate (ownership)
State                          Description
Shared (Read-Only) [RO]        Data is valid and can be read safely. Multiple copies can be in this state.
Exclusive (Read-Write) [RW]    Only one valid cache copy exists, and it can be read from and written to safely. Copies in other caches are invalid.
Invalid [INV]                  The copy is inconsistent.

Cache Incoherence Problem

✓ Write Back- Write Invalidate (ownership)

Event               Action
Read Hit            Use the local copy from the cache.
Read Miss           If no Exclusive (Read-Write) copy exists, supply a copy from global memory and set the state of this copy to Shared (Read-Only). If an Exclusive (Read-Write) copy exists, take a copy from the cache that holds it, update global memory and the local cache with that copy, and set the state to Shared (Read-Only) in both caches.
Write Hit           If the copy is Exclusive (Read-Write), perform the write locally. If the state is Shared (Read-Only), broadcast an Invalid command to all caches and set the state to Exclusive (Read-Write).
Write Miss          Get a copy from either a cache with an Exclusive (Read-Write) copy or from global memory itself. Broadcast an Invalid command to all caches. Update the local copy and set its state to Exclusive (Read-Write).
Block Replacement   If a copy is in the Exclusive (Read-Write) state, it has to be written back to main memory when the block is replaced. If the copy is in the Invalid or Shared (Read-Only) state, no write back is needed when the block is replaced.
Cache Incoherence Problem

✓ Write Back- Write Invalidate (ownership)

Example:
Consider the shared memory system previously shown in the Figure, in which processors P and Q, each with its own cache (C), share a memory (M), and the following operations:
1) P reads X,
2) Q reads X,
3) Q updates X,
4) Q reads X,
5) Q updates X,
6) P updates X,
7) Q reads X.
Cache Incoherence Problem
✓ Example (cont.):
o The Table shows the contents of memory and the two caches after the execution of each of the operations listed above when Write Invalidate Write Back is used for cache coherence. The table also shows the state of the block containing X in P’s cache and Q’s cache.
                                      Memory    P’s Cache        Q’s Cache
Serial  Event                         X         X      State     X      State
0       Original value                5
1       P reads X (Read Miss)         5         5      RO
2       Q reads X (Read Miss)         5         5      RO        5      RO
3       Q updates X (Write Hit)       5         5      INV       10     RW
4       Q reads X (Read Hit)          5         5      INV       10     RW
5       Q updates X (Write Hit)       5         5      INV       15     RW
6       P updates X (Write Miss)      5         20     RW        15     INV
7       Q reads X (Read Miss)         20        20     RO        20     RO
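
The same operations can be replayed through a Python sketch of the Write Back Write Invalidate (ownership) rules to check the table. The model is deliberately simplified and its names (`simulate_wb_invalidate`, the event tuples) are hypothetical. It assumes a single-word block, so the copy fetched on a write miss is overwritten at once, and it follows the table in leaving memory unchanged on a write miss.

```python
def simulate_wb_invalidate(events, initial=5, procs=("P", "Q")):
    """Replay (processor, op, value) events; op is 'read' or 'write'.

    Each returned row is (event kind, memory X, (P's X, P's state), (Q's X, Q's state)).
    """
    memory = initial
    caches = {p: {"value": None, "state": None} for p in procs}
    rows = []

    def exclusive_owner(excluding):
        """Name of another cache holding the block in RW state, if any."""
        for name, line in caches.items():
            if name != excluding and line["state"] == "RW":
                return name
        return None

    for proc, op, value in events:
        line = caches[proc]
        hit = line["state"] in ("RO", "RW")
        if op == "read":
            kind = "Read Hit" if hit else "Read Miss"
            if not hit:
                owner = exclusive_owner(proc)
                if owner is not None:
                    # Owner supplies the block; memory is updated, owner drops to RO.
                    memory = caches[owner]["value"]
                    caches[owner]["state"] = "RO"
                line["value"], line["state"] = memory, "RO"
        elif op == "write":
            kind = "Write Hit" if hit else "Write Miss"
            if line["state"] != "RW":
                # Shared hit or miss: broadcast Invalid to the other caches.
                for other, other_line in caches.items():
                    if other != proc and other_line["state"] is not None:
                        other_line["state"] = "INV"
            # Write locally only; memory is updated later (write back).
            line["value"], line["state"] = value, "RW"
        else:
            raise ValueError(op)
        rows.append((kind, memory)
                    + tuple((caches[p]["value"], caches[p]["state"]) for p in procs))
    return rows


example = [("P", "read", None), ("Q", "read", None), ("Q", "write", 10),
           ("Q", "read", None), ("Q", "write", 15), ("P", "write", 20),
           ("Q", "read", None)]
for serial, row in enumerate(simulate_wb_invalidate(example), start=1):
    print(serial, row)
```

Compared with the write-through variant, memory keeps the stale value 5 until step 7, when Q's read miss forces the owner P to supply the block and memory is brought up to date.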
References

• Textbook Chapter 4
