The document discusses various parallel programming models, including Shared Memory, Distributed Memory, and Hybrid Models, highlighting their architectures and programming paradigms such as OpenMP and MPI. It emphasizes the importance of choosing the right model based on available resources and personal preference, along with the significance of performance metrics, security, and energy efficiency in parallel computing. Additionally, it introduces high-level programming models like SPMD and MPMD, and mentions applications in cloud computing and GPU programming.
Lecture 6 Parallel Programming Models
University of Lahore, Sargodha Campus

Parallel Computers
• Programming model types
  – Shared Memory
  – Distributed Memory
  – Hybrid Model

Parallel Programming Models
• Parallel programming models exist as an abstraction above hardware and memory architectures
• These models are NOT specific to a particular type of machine or memory architecture
• These models can (theoretically) be implemented on any underlying hardware
• Examples from the past:
  – SHARED memory model on a DISTRIBUTED memory machine: the Kendall Square Research (KSR) ALLCACHE approach, "virtual shared memory"
  – DISTRIBUTED memory model on a SHARED memory machine: Message Passing Interface (MPI) on the SGI Origin 2000, which employed the CC-NUMA type of shared memory architecture; however, MPI is more commonly run over a network of distributed memory machines
• Which model to use? A combination of what is available and personal choice

Shared Memory
• Architecture: processors have direct access to global memory and I/O through a bus or fast switching network
• A cache coherency protocol guarantees consistency of memory and I/O accesses
• Each processor also has its own memory (cache)
• Data structures are shared in the global address space
• Concurrent access to shared memory must be coordinated
• Programming Models
  – Multithreading (thread libraries)
  – OpenMP

[Figure: processors P0 ... Pn, each with a private cache, connected by a shared bus to a global shared memory]

Threads Model
• Threads implementations commonly comprise:
  – A library of subroutines that are called from within parallel source code
  – A set of compiler directives embedded in either serial or parallel source code
• Historically, hardware vendors implemented their own proprietary versions of threads, making it difficult for programmers to develop portable threaded applications
• Standardization efforts: POSIX Threads (IEEE POSIX 1003.1c) and OpenMP (industry standard)
  – POSIX Threads: part of Unix/Linux, library based
  – OpenMP: compiler directive based, portable/multi-platform
• Others: Microsoft threads, Java and Python threads, CUDA threads for GPUs

OpenMP
• OpenMP: portable shared memory parallelism
• Higher-level API for writing portable multithreaded applications
• Provides a set of compiler directives and library routines for parallel application programmers
• API bindings for Fortran, C, and C++

Distributed Memory Architecture
• Each processor has direct access only to its local memory
• Processors are connected via a high-speed interconnect
• Data structures must be distributed
• Data exchange is done via explicit processor-to-processor communication: send/receive messages
• Programming Models
  – Widely used standard: MPI
  – Others: PVM, Express, P4, Chameleon, PARMACS, ...
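To make the send/receive style concrete, here is a minimal sketch in Python using `multiprocessing.Pipe`. It is an analogy to the message passing model, not the actual MPI API; the names `worker` and `rank` are illustrative, borrowed from MPI convention.

```python
# Sketch of explicit processor-to-processor message passing:
# each process owns its data locally and sends results as messages.
from multiprocessing import Process, Pipe

def worker(conn, rank):
    # Each "processor" holds only its local data; nothing is shared.
    local_data = list(range(rank * 4, rank * 4 + 4))
    conn.send(sum(local_data))   # explicit send to the parent process
    conn.close()

if __name__ == "__main__":
    conns, procs = [], []
    for rank in range(4):
        parent, child = Pipe()
        p = Process(target=worker, args=(child, rank))
        p.start()
        conns.append(parent)
        procs.append(p)
    # "Gather": the parent explicitly receives one partial sum per process.
    total = sum(conn.recv() for conn in conns)
    for p in procs:
        p.join()
    print(total)   # sum of 0..15 = 120
```

Note that, unlike the shared memory model, no coordination of concurrent access is needed here: data is exchanged only through explicit messages.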
[Figure: processors P0 ... Pn, each with its own local memory, connected by a communication interconnect]

Message Passing Interface
MPI provides:
• Point-to-point communication
• Collective operations
  – Barrier synchronization
  – Gather/scatter operations
  – Broadcast, reductions
• Different communication modes
  – Synchronous/asynchronous
  – Blocking/non-blocking
  – Buffered/unbuffered
• Predefined and derived datatypes
• Virtual topologies
• Parallel I/O (MPI-2)
• C/C++ and Fortran bindings

Hybrid Model
• A hybrid model combines more than one of the previously described programming models.
• A common example is the combination of the message passing model (MPI) with the threads model (OpenMP):
  – Threads perform computationally intensive kernels using local, on-node data
  – Communication between processes on different nodes occurs over the network using MPI
• Another similar and increasingly popular example is using MPI with CPU-GPU (Graphics Processing Unit) programming:
  – MPI tasks run on CPUs using local memory and communicate with each other over a network
  – Computationally intensive kernels are off-loaded to the GPUs on each node
  – Data exchange between node-local memory and GPUs uses CUDA (or an equivalent)

High-Level Programming Models
• Single Program Multiple Data (SPMD)
• Multiple Program Multiple Data (MPMD)

SPMD
• Built upon any combination of the previously mentioned parallel programming models
• SINGLE PROGRAM: all tasks execute their own copy of the same program simultaneously. The program can use threads, message passing, data parallelism, or a hybrid.
• MULTIPLE DATA: all tasks may use different data
• Tasks do not necessarily have to execute the entire program; perhaps only a portion of it

MPMD
• Built upon any combination of the previously mentioned parallel programming models
• MULTIPLE PROGRAM: tasks may execute different programs simultaneously. The programs can use threads, message passing, data parallelism, or a hybrid.
• MULTIPLE DATA: all tasks may use different data
• MPMD applications are not as common as SPMD applications

Parallel and Distributed Programming Models
• OpenMP
• MPI
  – For message passing systems
• MapReduce and BigTable
  – For internet clouds and data centers
  – Service clouds require extending Hadoop, EC2, and S3 to facilitate distributed computing over a distributed storage system
• CUDA
  – For NVIDIA GPUs
• Open Grid Services Architecture (OGSA)
  – For grid application development

Performance, Security, and Energy Efficiency
• Performance metrics
  – CPU speed, FLOPS, job response time, network latency, system throughput, network bandwidth, system overhead (OS boot time, compile time, etc.)
• Scalability
  – Machine (size), software, application, and technology scalability
  – Amdahl's law
• Security
  – Threats to systems and networks
  – Confidentiality, integrity, and availability
  – Copyright protection
  – System defense technologies
  – Data protection infrastructures (e.g., intrusion detection systems, IDS)
• Energy efficiency
  – Distributed power management
  – Energy consumption of unused servers
  – Reducing energy in active servers

That's all for today!!
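The SPMD pattern above, one program whose behavior is selected by each task's identity, can be sketched in Python. This is illustrative only; the names `rank` and `WORLD_SIZE` follow MPI convention and are not part of any real API.

```python
# SPMD sketch: every task runs the SAME program, but uses its rank
# to select DIFFERENT data (and could branch to run only a portion
# of the program).
from multiprocessing import Pool

WORLD_SIZE = 4
DATA = list(range(100))

def spmd_task(rank):
    # Same code for all ranks; the rank picks this task's slice of
    # the data (a cyclic distribution across WORLD_SIZE tasks).
    chunk = DATA[rank::WORLD_SIZE]
    return sum(x * x for x in chunk)

if __name__ == "__main__":
    with Pool(WORLD_SIZE) as pool:
        partials = pool.map(spmd_task, range(WORLD_SIZE))
    print(sum(partials))   # equals the sum of squares 0..99
```

An MPMD version would instead launch different programs for different tasks, e.g. one coordinator program and several distinct worker programs.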
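As a footnote to the Amdahl's law bullet under scalability: the law bounds the speedup of a program whose parallelizable fraction is P when run on N processors. A quick sketch (the helper name is illustrative):

```python
# Amdahl's law: speedup S(N) = 1 / ((1 - P) + P / N), where P is the
# parallelizable fraction of the program and N the processor count.
def amdahl_speedup(P, N):
    return 1.0 / ((1.0 - P) + P / N)

# Example: with 90% of the work parallelizable, 8 processors give only
# about 4.7x speedup, and the limit as N grows is 1 / (1 - P) = 10x.
print(round(amdahl_speedup(0.9, 8), 2))   # 4.71
```

This is why the serial fraction, not the processor count, ultimately limits application scalability.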