Lecture-3 Parallel Computer Memory Architecture

Parallel Computer Memory Architecture

Uploaded by

Meherab Hasan


Parallel Computer Memory Architectures

❖Shared Memory

❖Distributed Memory

❖Hybrid Shared-Distributed Memory

❖ Shared Memory
❖ General Characteristics
▪ In shared memory parallel computers, all processors can access all memory as a single global address space.

▪ Multiple processors can operate independently but share the same memory resources.

▪ Changes in a memory location made by one processor are visible to all other processors.

▪ Shared memory machines are classified as UMA or NUMA, based on memory access times.
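As a rough illustration of the characteristics above, the following Python sketch uses threads inside one process as a stand-in for processors that share a global address space. The names `shared`, `writer`, and `reader` are hypothetical and chosen only for this example.

```python
import threading

# One list object lives in the process's single address space;
# every thread ("processor") refers to the same memory.
shared = [0, 0, 0, 0]
written = threading.Event()   # signals that the write has happened
seen = []                     # what the reader observed

def writer():
    shared[2] = 42            # change one "memory location"
    written.set()

def reader():
    written.wait()            # wait until the writer has finished
    seen.append(shared[2])    # the change is visible without any copy

threads = [threading.Thread(target=reader), threading.Thread(target=writer)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(seen)  # [42]
```

The reader never receives a message containing the value. It simply reads the same location the writer changed.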
❖ Shared Memory
❖ Advantages
▪ Data sharing between processes is both fast and uniform due to the proximity of memory to CPUs.

▪ Global address space provides a user-friendly programming perspective to memory.

❖ Disadvantages
▪ The primary disadvantage is the lack of scalability between memory and CPUs.

▪ Adding more CPUs can geometrically increase traffic on the shared memory-CPU path and, for cache coherent systems, geometrically increase traffic associated with cache/memory management.

▪ Programmer responsibility for synchronization constructs that ensure "correct" access of global memory also increases.
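The synchronization burden mentioned above can be sketched as follows. This is a minimal example, assuming Python threads stand in for processors; `counter`, `lock`, and `add` are hypothetical names. Without the lock, two threads could interleave their read-modify-write steps and updates may be lost.

```python
import threading

counter = 0                      # a location in shared memory
lock = threading.Lock()          # synchronization construct

def add(n):
    global counter
    for _ in range(n):
        with lock:               # only one thread updates at a time
            counter += 1         # read-modify-write on shared data

workers = [threading.Thread(target=add, args=(10_000,)) for _ in range(4)]
for w in workers:
    w.start()
for w in workers:
    w.join()

print(counter)  # 40000
```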
❖ Uniform Memory Access (UMA)
▪ Most commonly represented today by Symmetric Multiprocessor (SMP) machines.

▪ Identical processors.

▪ Equal access and access times to memory.

▪ Sometimes called CC-UMA (Cache Coherent UMA). Cache coherent means that if one processor updates a location in shared memory, all the other processors know about the update.
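To make the idea of cache coherence concrete, here is a toy model in Python. It assumes a write-through, write-invalidate scheme, which is only one possible approach chosen for illustration; the slides do not specify a protocol, and real hardware is far more involved. The class `ToyCoherentSystem` and its methods are hypothetical.

```python
class ToyCoherentSystem:
    """Toy model: one shared memory plus a private cache per processor."""

    def __init__(self, n_procs):
        self.memory = {}
        self.caches = [dict() for _ in range(n_procs)]

    def read(self, proc, addr):
        cache = self.caches[proc]
        if addr not in cache:              # miss: fetch from shared memory
            cache[addr] = self.memory.get(addr, 0)
        return cache[addr]

    def write(self, proc, addr, value):
        self.memory[addr] = value          # write through to shared memory
        self.caches[proc][addr] = value
        for other, cache in enumerate(self.caches):
            if other != proc:
                cache.pop(addr, None)      # invalidate stale copies

system = ToyCoherentSystem(n_procs=2)
first = system.read(1, 0x10)    # processor 1 caches the initial value 0
system.write(0, 0x10, 7)        # processor 0 updates the location
second = system.read(1, 0x10)   # processor 1 misses and refetches
print(first, second)            # 0 7
```

Without the invalidation step, processor 1 would keep returning its stale cached value of 0.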
❖ Non-Uniform Memory Access (NUMA)
▪ Often made by physically linking two or more SMPs.

▪ One SMP can directly access memory of another SMP.

▪ Not all processors have equal access time to all memories.

▪ Memory access across the link is slower (a processor can access its own local memory faster than non-local memory).

▪ If cache coherency is maintained, then it may also be called CC-NUMA (Cache Coherent NUMA).
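The effect of slower remote access can be estimated with a simple weighted average. The sketch below is a back-of-the-envelope model only; the function name `average_access_ns` and the latency values are assumptions made for illustration, not measurements.

```python
def average_access_ns(local_fraction, local_ns, remote_ns):
    """Expected latency when a fraction of accesses hit local memory."""
    if not 0.0 <= local_fraction <= 1.0:
        raise ValueError("local_fraction must be between 0 and 1")
    return local_fraction * local_ns + (1.0 - local_fraction) * remote_ns

# Hypothetical latencies, for illustration only.
LOCAL_NS, REMOTE_NS = 100.0, 200.0
for frac in (1.0, 0.75, 0.5):
    print(frac, average_access_ns(frac, LOCAL_NS, REMOTE_NS))
```

Under these assumed numbers, the average rises from 100 ns to 150 ns as the share of local accesses drops from 100% to 50%, which is why data placement matters on NUMA machines.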

Basis for comparison | UMA | NUMA
Basic | Uses a single memory controller | Uses multiple memory controllers
Memory access time | Equal | Varies with the distance between the processor and the memory
Suitable for | General-purpose and time-sharing applications | Real-time and time-critical applications
Bandwidth | Limited | More than UMA
❖ Distributed Memory

❖ General Characteristics

▪ Distributed memory systems require a communication network to connect inter-processor memory.

▪ Processors have their own local memory. Memory addresses in one processor do not map to another
processor, so there is no concept of global address space across all processors.

▪ Because each processor has its own local memory, it operates independently. Changes it makes to
its local memory have no effect on the memory of other processors. Hence, the concept of cache
coherency does not apply.

▪ When a processor needs access to data in another processor, it is usually the task of the
programmer to explicitly define how and when data is communicated. Synchronization between
tasks is likewise the programmer's responsibility.
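The explicit communication described above can be sketched in Python. This is a simulation inside one process: each hypothetical `Node` keeps a private dictionary as its local memory, and a `queue.Queue` stands in for the network. On real distributed memory systems this role is typically played by a message-passing library such as MPI.

```python
import queue
import threading

class Node:
    """A simulated processor with private memory and an inbox."""

    def __init__(self, rank):
        self.rank = rank
        self.local = {}               # private: no other node touches it
        self.inbox = queue.Queue()    # stands in for the network link

def send(dest, key, value):
    dest.inbox.put((key, value))      # explicit communication

def run_node0(node0, node1):
    node0.local["x"] = 3.14             # change stays local to node 0 ...
    send(node1, "x", node0.local["x"])  # ... until it is explicitly sent

def run_node1(node1):
    key, value = node1.inbox.get()    # blocking receive also synchronizes
    node1.local[key] = value

n0, n1 = Node(0), Node(1)
t0 = threading.Thread(target=run_node0, args=(n0, n1))
t1 = threading.Thread(target=run_node1, args=(n1,))
t1.start()
t0.start()
t0.join()
t1.join()

print(n1.local)  # {'x': 3.14}
```

Node 1 only learns the value of `x` because node 0 sent it. Nothing node 0 writes to its own `local` dictionary reaches node 1 otherwise.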
❖ Distributed Memory

❖ Advantages

• Memory is scalable with the number of processors. Increase the number of processors and the size of memory increases proportionately.

• Each processor can rapidly access its own memory without interference from other processors.

• There is no overhead for maintaining cache coherency.
❖ Distributed Memory
❖ Disadvantages
• The programmer is responsible for many of the details associated with data communication between processors.

• It may be difficult to map existing data structures, based on global memory, to this memory organization.

• A distributed data structure is needed.

• Memory access times are non-uniform.

• Data residing on a remote node takes longer to access than node-local data.
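One common way to turn a global array into a distributed data structure is a block layout, where each processor owns one contiguous slice. The sketch below shows the index mapping under that assumption; the function name `block_owner` is hypothetical, and other layouts (for example cyclic) are equally possible.

```python
def block_owner(global_index, n_items, n_procs):
    """Map a global index to (owner rank, local index) for a block layout."""
    if not 0 <= global_index < n_items:
        raise IndexError("global_index out of range")
    base, extra = divmod(n_items, n_procs)
    # The first `extra` ranks hold base + 1 items, the rest hold base.
    boundary = extra * (base + 1)
    if global_index < boundary:
        return global_index // (base + 1), global_index % (base + 1)
    offset = global_index - boundary
    return extra + offset // base, offset % base

# 10 items over 3 processors: rank 0 holds 4 items, ranks 1 and 2 hold 3.
print([block_owner(i, 10, 3) for i in range(10)])
```

Code written for a global array must now ask which rank owns an element before touching it, and must communicate when the owner is a different processor.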
❖ Hybrid Shared-Distributed Memory
❖ General Characteristics
▪ The largest and fastest computers in the world today employ both shared and distributed memory architectures.

▪ The shared memory component can be a shared memory machine and/or graphics processing units (GPUs).

▪ The distributed memory component is the networking of multiple shared memory/GPU machines, which know only about their own memory, not the memory on another machine. Therefore, network communications are required to move data from one machine to another.
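The two levels of a hybrid system can be sketched together in Python. This is a single-process simulation under stated assumptions: each call to the hypothetical `node_sum` plays the role of one machine whose threads share the list `partial`, and the `network` queue stands in for inter-machine communication. Real hybrid codes commonly pair a message-passing library between machines with threads inside each machine.

```python
import queue
import threading

def node_sum(chunk, n_threads, outbox):
    """One 'machine': threads share `partial`, then one message leaves."""
    partial = [0] * n_threads                      # shared within this node

    def work(tid):
        partial[tid] = sum(chunk[tid::n_threads])  # each thread owns a slot

    threads = [threading.Thread(target=work, args=(t,)) for t in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    outbox.put(sum(partial))                       # explicit send over the "network"

data = list(range(1, 101))                         # 1..100
chunks = [data[:50], data[50:]]                    # data partitioned across 2 nodes
network = queue.Queue()

nodes = [threading.Thread(target=node_sum, args=(c, 4, network)) for c in chunks]
for n in nodes:
    n.start()
for n in nodes:
    n.join()

total = sum(network.get() for _ in chunks)         # root gathers partial sums
print(total)  # 5050
```

Inside a node, threads cooperate through shared memory with no messages. Between nodes, only the partial sums travel over the network.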
❖ Hybrid Shared-Distributed Memory

❖ Advantages and Disadvantages

▪ Increased scalability is an important advantage.
▪ Increased programming complexity is an important disadvantage.
• What is SMP?
• Identify the memory architecture
• In which type of memory architecture does each processor have its own
local memory?
a) Shared memory
b) Distributed memory
• Which one provides a user-friendly programming perspective to memory?
a) Shared memory
b) Distributed memory
• Which type of memory architecture requires the use of message passing to
exchange data between processors?
a) Shared memory
b) Distributed memory
