The document discusses parallel computer memory architectures. It describes three main types:
1. Shared memory systems, which allow all processors access to the same global memory space. These include Uniform Memory Access (UMA) and Non-Uniform Memory Access (NUMA) systems.
2. Distributed memory systems, where each processor has its own local memory and communication is needed to access data on other processors.
3. Hybrid distributed shared memory systems, which combine shared and distributed memory by networking multiple shared memory machines together. This allows scalability while maintaining shared memory within individual nodes.
Memory Models
• Data and instructions in a parallel program are stored in main memory, where they are accessible to the processors during execution.
• The way in which main memory is used by the processors of a multiprocessor system divides parallel systems into:
  1. Shared memory systems
  2. Distributed memory systems
STRUCTURAL CLASSIFICATION
1. Shared Memory
General Characteristics
• Shared memory parallel computers vary widely, but they all share the ability for every processor to access all memory as a global address space (see the sketch below).
• Multiple processors can operate independently but share the same memory resources.
• Changes in a memory location effected by one processor are visible to all other processors.
• Shared memory machines are classified as UMA and NUMA, based upon memory access times.
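To make the global address space concrete, here is a minimal sketch (not from the slides) in C with OpenMP: every thread reads and writes the same array, so updates made by one thread are visible to all others, as described above. Compile with e.g. `gcc -fopenmp`.

```c
/* Minimal shared-memory sketch: all threads operate on one array in a
 * single global address space. (Hypothetical example.) */
#include <stdio.h>
#include <omp.h>

#define N 8

int main(void) {
    int data[N];                         /* single array visible to all threads */

    #pragma omp parallel for             /* threads share 'data'; iterations are split */
    for (int i = 0; i < N; i++)
        data[i] = omp_get_thread_num();  /* each thread writes into the shared array */

    for (int i = 0; i < N; i++)          /* every thread's update is visible here */
        printf("data[%d] written by thread %d\n", i, data[i]);

    return 0;
}
```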
• Uniform Memory Access (UMA)
  – Most commonly represented today by Symmetric Multiprocessor (SMP) machines
  – Identical processors
  – Equal access and access times to memory
  – Sometimes called CC-UMA (Cache Coherent UMA)
• Coherency is an issue whenever there are multiple cores or processors, each with its own cache. An update made on one core may not be seen by another core if the second core's local cache holds an old value of the affected memory location.
• Thus, whenever an update occurs to a memory location, copies of that memory location held in other caches must be invalidated.
• In many processor architectures such invalidation is done lazily, which can lead to temporary inconsistency.
• Cache coherency is accomplished at the hardware level.
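Cache coherency itself is enforced in hardware, but the visibility problem it addresses can be illustrated from software. The hypothetical C11/pthreads sketch below has one thread publish a value and another wait for it; the release/acquire atomics ensure the reader does not observe a stale copy of `payload`. Compile with `-pthread`.

```c
/* Hypothetical sketch of update visibility between cores.  Thread 1 writes
 * a value and sets a flag; thread 2 waits for the flag and then reads the
 * value.  Release/acquire ordering guarantees the reader sees the update;
 * the underlying propagation between caches is done by the hardware. */
#include <stdatomic.h>
#include <pthread.h>
#include <stdio.h>

static int payload = 0;           /* data written by the producer        */
static atomic_int ready = 0;      /* flag that publishes the data        */

static void *producer(void *arg) {
    payload = 42;                                            /* update the memory location */
    atomic_store_explicit(&ready, 1, memory_order_release);  /* make it visible to others  */
    return NULL;
}

static void *consumer(void *arg) {
    while (atomic_load_explicit(&ready, memory_order_acquire) == 0)
        ;                                     /* spin until the update becomes visible */
    printf("consumer sees payload = %d\n", payload);   /* guaranteed to print 42 */
    return NULL;
}

int main(void) {
    pthread_t p, c;
    pthread_create(&c, NULL, consumer, NULL);
    pthread_create(&p, NULL, producer, NULL);
    pthread_join(p, NULL);
    pthread_join(c, NULL);
    return 0;
}
```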
Non-Uniform Memory Access (NUMA)
  – Often made by physically linking two or more SMPs
  – One SMP can directly access memory of another SMP
  – Not all processors have equal access time to all memories
  – Memory access across the link is slower
  – If cache coherency is maintained, may also be called CC-NUMA (Cache Coherent NUMA)
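As an illustration of non-uniform access, the sketch below uses the Linux libnuma library (an assumption, not mentioned in the slides) to place one buffer on node 0 and one on the highest-numbered node; accesses to the remote buffer must cross the link and are therefore slower. Link with `-lnuma`.

```c
/* Hedged NUMA sketch using libnuma: memory is explicitly placed on a chosen
 * node, so CPUs on that node access it locally (fast) while CPUs on other
 * nodes go across the link (slower).  Assumes Linux with libnuma installed. */
#include <numa.h>
#include <stdio.h>

int main(void) {
    if (numa_available() < 0) {            /* NUMA not supported on this system */
        fprintf(stderr, "no NUMA support\n");
        return 1;
    }

    int last_node = numa_max_node();       /* highest node number in the machine */
    size_t size = 1024 * 1024;

    /* Allocate 1 MiB backed by memory on node 0 and on the last node. */
    char *near = numa_alloc_onnode(size, 0);
    char *far  = numa_alloc_onnode(size, last_node);
    if (!near || !far) {
        fprintf(stderr, "allocation failed\n");
        return 1;
    }

    near[0] = 1;   /* local access for CPUs on node 0              */
    far[0]  = 1;   /* remote (slower) access when run from node 0  */

    numa_free(near, size);
    numa_free(far, size);
    return 0;
}
```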
Advantages
  – Global address space provides a user-friendly programming perspective to memory.
  – Data sharing between tasks is both fast and uniform due to the proximity of memory to CPUs.
Disadvantages
  – The primary disadvantage is the lack of scalability between memory and CPUs. Adding more CPUs can geometrically increase traffic on the shared memory–CPU path and, for cache coherent systems, geometrically increase the traffic associated with cache/memory management.
  – The programmer is responsible for synchronization constructs that ensure "correct" access to global memory (see the sketch below).
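The synchronization responsibility mentioned in the last disadvantage can be illustrated with a small OpenMP sketch (hypothetical, assuming a shared counter): without the `atomic` directive, the concurrent increments would race and the final total would be wrong.

```c
/* Sketch of the programmer's synchronization responsibility: the shared
 * counter is updated by many threads, so each increment must be protected. */
#include <stdio.h>
#include <omp.h>

int main(void) {
    long counter = 0;                  /* shared global variable */

    #pragma omp parallel for
    for (int i = 0; i < 1000000; i++) {
        #pragma omp atomic             /* serialize each increment to keep it correct */
        counter++;
    }

    printf("counter = %ld\n", counter);   /* prints 1000000 */
    return 0;
}
```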
2. Distributed Memory
General Characteristics
  – Distributed memory systems require a communication network to connect inter-processor memory.
  – Each processor has its own local memory.
  – Memory addresses in one processor do not map to another processor, so there is no concept of a global address space.
  – Because each processor has its own local memory, it operates independently; changes it makes to its local memory have no effect on the memory of other processors. Hence there is no concept of cache coherency across processors.
General Characteristics
  – When a processor needs access to data in another processor, it is usually the task of the programmer to explicitly define how and when data is communicated (as sketched below).
  – Synchronization between tasks is likewise the programmer's responsibility.
  – The network "fabric" used for data transfer varies widely, though it can be as simple as Ethernet.
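A minimal MPI sketch of this explicit communication (assuming an MPI installation such as MPICH or Open MPI): each rank owns a private `value`, and the only way rank 1 can see rank 0's data is through an explicit send/receive pair. Run with e.g. `mpirun -np 2 ./a.out`.

```c
/* Hedged distributed-memory sketch: each rank has its own local buffer,
 * and data moves between ranks only through explicit MPI communication. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    int value = -1;                          /* local memory, private to this rank */

    if (rank == 0) {
        value = 99;
        /* Explicit communication: the programmer decides how and when to send. */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("rank 1 received %d from rank 0\n", value);
    }

    MPI_Finalize();
    return 0;
}
```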
Advantages
  – Memory and processors are highly scalable: increasing the number of processors increases the total memory proportionately.
  – Each processor can rapidly access its own memory without interference and without the overhead of maintaining cache coherency.
  – Cost effectiveness: can use commodity, off-the-shelf processors and networking.
Disadvantages
  – The programmer is responsible for data communication between processors.
  – It may be difficult to map existing data structures, based on global memory, to this memory organization.
  – Non-uniform memory access times: data residing on a remote node takes longer to access than node-local data.

3. Hybrid Distributed Shared Memory
General Characteristics
• The largest and fastest computers in the world today employ both shared and distributed memory architectures.
• The shared memory component is usually a cache coherent SMP machine. Processors on a given SMP can address that machine's memory as global.
• The distributed memory component is the networking of multiple shared memory machines, which know only about their own memory, not the memory on another machine.
• Therefore, network communications are required to move data from one machine to another (see the sketch below).
• Current trends indicate that this type of memory architecture will continue to prevail and increase at the high end of computing for the foreseeable future.
Advantages and Disadvantages
• Increased scalability is an important advantage.
• Increased programmer complexity is an important disadvantage.
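A hedged sketch of the hybrid model, assuming MPI plus OpenMP (compile with e.g. `mpicc -fopenmp`): OpenMP threads share memory inside each node to compute a node-local sum, and MPI moves data between nodes to combine the per-node results.

```c
/* Hybrid sketch: OpenMP for shared memory within a node, MPI for
 * communication across nodes.  Assumes an MPI library with thread support. */
#include <mpi.h>
#include <omp.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int provided;
    /* Request thread support so OpenMP threads can coexist with MPI. */
    MPI_Init_thread(&argc, &argv, MPI_THREAD_FUNNELED, &provided);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    long local_sum = 0;

    /* Shared memory inside the node: threads cooperate on node-local data. */
    #pragma omp parallel for reduction(+:local_sum)
    for (int i = 0; i < 1000; i++)
        local_sum += i;

    /* Distributed memory across nodes: combine per-node results via MPI. */
    long global_sum = 0;
    MPI_Reduce(&local_sum, &global_sum, 1, MPI_LONG, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("global sum = %ld\n", global_sum);

    MPI_Finalize();
    return 0;
}
```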