Introduction to Parallel Computing
George Karypis
Parallel Programming Platforms
Elements of a Parallel Computer
Hardware
Multiple Processors, Multiple Memories, Interconnection Network
System Software
Parallel Operating System
Programming Constructs to Express/Orchestrate Concurrency
Application Software
Parallel Algorithms
Goal: Utilize the Hardware, System, & Application Software to either
Achieve speedup: Tp = Ts/p (ideally), or
Solve problems requiring a large amount of memory.
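The speedup and efficiency definitions above can be sketched as follows; the timing values are assumed for illustration, not taken from the text:

```python
# Speedup S = Ts / Tp; ideal (linear) speedup means Tp = Ts / p, i.e. S = p.
# Efficiency E = S / p measures how close we are to the ideal.
def speedup(Ts, Tp):
    return Ts / Tp

def efficiency(Ts, Tp, p):
    return speedup(Ts, Tp) / p

Ts = 100.0   # serial run time (assumed value)
p = 8        # number of processors
Tp = 14.0    # measured parallel run time (assumed value)

print(speedup(Ts, Tp))        # ~7.14 (below the ideal of 8)
print(efficiency(Ts, Tp, p))  # ~0.89
```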
Parallel Computing Platform
Logical Organization
The user's view of the machine as presented via its system software
Physical Organization
The actual hardware architecture
Physical Architecture is to a large extent independent of the Logical Architecture
Logical Organization Elements
Control Mechanism
SISD/SIMD/MIMD/MISD
Single/Multiple Instruction Stream & Single/Multiple Data Stream
SPMD: Single Program Multiple Data
Logical Organization Elements
Communication Model
Shared-Address Space
UMA/NUMA/ccNUMA
Message-Passing
Physical Organization
Ideal Parallel Computer Architecture
PRAM: Parallel Random Access Machine
PRAM Models
EREW/ERCW/CREW/CRCW
Exclusive/Concurrent Read and/or Write
Concurrent Writes are resolved via
Common/Arbitrary/Priority/Sum
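The four concurrent-write resolution policies can be sketched as a small dispatcher; the `resolve` function and its input format are illustrative, not part of any PRAM formalism:

```python
# Sketch of CRCW PRAM concurrent-write resolution.
# writes: list of (processor_id, value) pairs targeting the same memory cell.
def resolve(writes, policy):
    if policy == "common":
        # COMMON: all writers must attempt to write the same value.
        vals = {v for _, v in writes}
        assert len(vals) == 1, "COMMON requires identical values"
        return vals.pop()
    if policy == "arbitrary":
        return writes[0][1]          # any single write succeeds
    if policy == "priority":
        return min(writes)[1]        # lowest processor id wins
    if policy == "sum":
        return sum(v for _, v in writes)
    raise ValueError(policy)

writes = [(2, 5), (0, 7), (1, 3)]
print(resolve(writes, "priority"))   # 7  (processor 0 has highest priority)
print(resolve(writes, "sum"))        # 15
```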
Physical Organization
Interconnection Networks (ICNs)
Provide processor-to-processor and processor-to-memory connections
Networks are classified as:
Static
Consist of a number of point-to-point links
direct network
Dynamic
The network consists of switching elements that the various processors attach to
indirect network
Historically used to link processors-to-processors
distributed-memory system
Historically used to link processors-to-memory
shared-memory systems
Static & Dynamic ICNs
Evaluation Metrics for ICNs
Diameter
The maximum distance between any two nodes
Smaller the better.
Connectivity
The minimum number of arcs that must be removed to break it into two disconnected networks
Larger the better
Measures the multiplicity of paths
Bisection width
The minimum number of arcs that must be removed to partition the network into two equal halves.
Larger the better
Bisection bandwidth
Applies to networks with weighted arcs; weights correspond to the link width (how much data it can transfer)
The minimum volume of communication allowed between any two equal halves of the network
Larger the better
Cost
The number of links in the network
Smaller the better
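For a small network these metrics can be computed by brute force. The sketch below evaluates a ring, whose known values (diameter p/2, bisection width 2, cost p) serve as a check; the helper names are my own:

```python
from itertools import combinations
from collections import deque

def ring(p):
    # edge set of a p-node ring
    return [(i, (i + 1) % p) for i in range(p)]

def bfs_dist(adj, s):
    # hop distances from node s
    dist = {s: 0}
    q = deque([s])
    while q:
        u = q.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                q.append(v)
    return dist

def metrics(edges, p):
    adj = {i: set() for i in range(p)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    diameter = max(max(bfs_dist(adj, s).values()) for s in range(p))
    # bisection width: fewest edges crossing any equal split of the nodes
    # (exponential in p -- fine only for tiny examples like this one)
    bisection = min(
        sum(1 for u, v in edges if (u in half) != (v in half))
        for half in map(set, combinations(range(p), p // 2))
    )
    cost = len(edges)
    return diameter, bisection, cost

print(metrics(ring(8), 8))  # (4, 2, 8)
```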
Metrics and Dynamic Networks
Network Topologies
Bus-Based Networks
Shared medium; information is broadcast
Evaluation:
Diameter: O(1), Connectivity: O(1), Bisection width: O(1), Cost: O(p)
Network Topologies
Crossbar Networks
Switch-based network; supports simultaneous connections
Evaluation:
Diameter: O(1), Connectivity: O(1)?, Bisection width: O(p)?, Cost: O(p²)
Network Topologies
Multistage Interconnection Networks
Multistage Switch Architecture
Pass-through
Cross-over
Connecting the Various Stages
Blocking in a Multistage Switch
Routing is done by comparing the bit-level representations of the source and destination addresses:
- a match goes via pass-through
- a mismatch goes via cross-over
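The bit-comparison rule can be sketched for a p-input, log2(p)-stage network; this only records the per-stage switch setting, not the full wiring between stages, and the function name is my own:

```python
# At stage i, compare the i-th most significant bit of the source and
# destination addresses: a match sets the switch to pass-through,
# a mismatch sets it to cross-over.
def route(src, dst, p):
    stages = p.bit_length() - 1  # log2(p) stages for p inputs
    decisions = []
    for i in range(stages):
        shift = stages - 1 - i   # walk bits from most to least significant
        s_bit = (src >> shift) & 1
        d_bit = (dst >> shift) & 1
        decisions.append("pass-through" if s_bit == d_bit else "cross-over")
    return decisions

# source 010 -> destination 111 in an 8-input network:
print(route(0b010, 0b111, 8))
# ['cross-over', 'pass-through', 'cross-over']
```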
Network Topologies
Complete and star-connected networks.
Network Topologies
Cartesian Topologies
Network Topologies
Hypercubes
Network Topologies
Trees
Summary of Performance Metrics
Physical Organization
Cache Coherence in Shared Memory Systems
A certain level of consistency must be maintained for multiple copies of the same data
Required to ensure proper semantics and correct program execution
serializability
Two general protocols for dealing with it
invalidate & update
Invalidate/Update Protocols
The preferred scheme depends on the characteristics of the underlying application
frequency of reads/writes to shared variables
Classical trade-off between communication overhead (updates) and idling (stalling in invalidates)
Additional problems with false sharing
Existing schemes are based on the invalidate protocol
A number of approaches have been developed for maintaining the state/ownership of the shared data
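A minimal sketch of the invalidate idea, assuming a simple snooping bus with valid/invalid cache states (not a full MSI/MESI state machine, and write-through to memory for simplicity):

```python
# On a write, every other cached copy of the address is invalidated,
# so only the writer holds a valid copy; later reads by other
# processors miss and re-fetch the up-to-date value.
class Bus:
    def __init__(self, n_caches):
        self.caches = [dict() for _ in range(n_caches)]  # addr -> value

    def read(self, cache_id, addr, memory):
        c = self.caches[cache_id]
        if addr not in c:
            c[addr] = memory[addr]       # miss: fetch from memory
        return c[addr]

    def write(self, cache_id, addr, value, memory):
        for i, c in enumerate(self.caches):
            if i != cache_id:
                c.pop(addr, None)        # invalidate other copies
        self.caches[cache_id][addr] = value
        memory[addr] = value             # write-through for simplicity

mem = {0x10: 1}
bus = Bus(2)
print(bus.read(0, 0x10, mem))    # 1  (P0 caches the line)
bus.write(1, 0x10, 2, mem)       # P1 writes: P0's copy is invalidated
print(0x10 in bus.caches[0])     # False
print(bus.read(0, 0x10, mem))    # 2  (P0 re-fetches the updated value)
```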
Communication Costs in Parallel Systems
Message-Passing Systems
The communication cost of a data-transfer operation depends on:
start-up time: ts
add headers/trailer, error-correction, execute the routing algorithm, establish the connection between source & destination
per-hop time: th
time to travel between two directly connected nodes (node latency)
per-word transfer time: tw
inversely proportional to the channel width: tw = 1/(channel width)
Store-and-Forward & Cut-Through Routing
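The standard cost formulas for the two schemes, using the ts/th/tw symbols defined above for a message of m words traversing l links, can be compared directly (parameter values below are assumed for illustration):

```python
def t_store_and_forward(ts, th, tw, m, l):
    # the entire message is received and re-sent at each of the l hops
    return ts + (th + tw * m) * l

def t_cut_through(ts, th, tw, m, l):
    # the header establishes the path; the message pipelines behind it
    return ts + th * l + tw * m

# Illustrative values (assumed, not from the text):
ts, th, tw, m, l = 50.0, 1.0, 0.5, 1000, 4
print(t_store_and_forward(ts, th, tw, m, l))  # 2054.0
print(t_cut_through(ts, th, tw, m, l))        # 554.0
```

Cut-through pays the per-hop cost only on the header, so its cost is nearly independent of the path length for large messages.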
Cut-through Routing Deadlocks
Messages 0, 1, 2, and 3 need to go to nodes A, B, C, and D, respectively
Communication Model Used for this Class
We will assume that the cost of sending a message of size m is: t_comm = ts + m*tw
In general true because ts is much larger than th, and for most of the algorithms that we will study m*tw is much larger than l*th
Routing Mechanisms
Routing:
The algorithm used to determine the path that a message will take to go from the source to destination
Can be classified along different dimensions
minimal vs. non-minimal
deterministic vs. adaptive
Dimension Ordered Routing
There is a predefined ordering of the dimensions
Messages are routed along the dimensions in that order until they cannot move any further
X-Y routing for meshes
E-cube routing for hypercubes
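E-cube routing on a hypercube can be sketched in a few lines: XOR the source and destination labels and correct the differing bits one dimension at a time, lowest dimension first (the function name is my own):

```python
# Dimension-ordered (E-cube) routing on a hypercube whose nodes are
# labeled so that neighbors differ in exactly one bit.
def ecube_route(src, dst):
    path = [src]
    cur = src
    diff = src ^ dst        # bits where the addresses differ
    dim = 0
    while diff:
        if diff & 1:
            cur ^= 1 << dim  # flip bit 'dim': one hop along that dimension
            path.append(cur)
        diff >>= 1
        dim += 1
    return path

print(ecube_route(0b000, 0b101))  # [0, 1, 5], i.e. 000 -> 001 -> 101
```

Because every message corrects dimensions in the same fixed order, the routing is deterministic and minimal.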
Topology Embeddings
Mapping between networks
Useful in the early days of parallel computing when topology-specific algorithms were being developed.
Embedding quality metrics
dilation
maximum number of links an edge is mapped to
congestion
maximum number of edges mapped on a single link
Mapping a Cartesian Topology onto a Hypercube
Cool things
Mapping a Cartesian Topology onto a Hypercube
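The classic embedding of a p-node ring into a log2(p)-dimensional hypercube uses the reflected Gray code: consecutive ring nodes map to hypercube labels differing in exactly one bit, so every ring edge maps to a single hypercube link (dilation 1, congestion 1). A minimal sketch:

```python
# Reflected Gray code: gray(i) and gray(i+1) differ in exactly one bit.
def gray(i):
    return i ^ (i >> 1)

p = 8
labels = [gray(i) for i in range(p)]
print([format(x, "03b") for x in labels])
# ['000', '001', '011', '010', '110', '111', '101', '100']

# every consecutive ring pair (including the wrap-around edge) maps to
# hypercube neighbors, i.e. labels differing in exactly one bit:
assert all(
    bin(labels[i] ^ labels[(i + 1) % p]).count("1") == 1
    for i in range(p)
)
```

Multi-dimensional Cartesian topologies embed the same way, using one Gray code per dimension and concatenating the resulting bit strings.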