
CS-3006 Lecture 7: MPI Advanced Topics

The document provides an overview of collective communication in the Message Passing Interface (MPI), detailing key operations such as Broadcast, Scatter, Gather, and Reduce. It outlines the properties and functionalities of these operations, including synchronization and data reduction methods. Additionally, it introduces various MPI functions like MPI_Bcast, MPI_Scatter, MPI_Gather, and their variations, along with examples and demos for practical understanding.

Message Passing Interface (MPI) - Collective Communication

Dr. Muhammad Mateen Yaqoob,

Department of AI & DS,


National University of Computer & Emerging Sciences,
Islamabad Campus
Collective Communications

• Broadcast
• Scatter
• Gather
• Reduce
Collective Communications
• Processes may need to communicate with everyone else

• Three Main Classes:


1. Communications: Broadcast, Gather, Scatter
2. Synchronization: Barriers
3. Reductions: sum, max, etc.

• Properties:
– Must be executed by all processes of the communicator
– All processes in the group call the same operation at (roughly) the same time
– All collective operations are blocking operations
Broadcast
• A one-to-many communication

[Figure: before the bcast only the root (e.g., root=1) holds the data item; after the bcast every process holds its own copy.]
• root: rank of the sending process (i.e., the root process)
• must be given identically by all processes
Collective communication: Broadcast
Broadcasting with MPI_Bcast

• The contents of the send buffer are copied from the sender (i.e., the root process) to all processes in the communicator (the root keeps its own copy)
• The type signature (number of elements, data type) on every process must be the same as on the root process
Broadcasting with MPI_Bcast
Demo: BroadCast.c
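The demo file itself is not reproduced here; the following is a minimal sketch of what a BroadCast.c-style program might look like (the buffer name, value, and choice of root 0 are illustrative assumptions):

/* Minimal MPI_Bcast sketch: root 0 broadcasts one int to all processes. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, value = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    /* Only the root fills the buffer before the broadcast. */
    if (rank == 0)
        value = 42;

    /* Every process calls MPI_Bcast with the same root and type signature. */
    MPI_Bcast(&value, 1, MPI_INT, 0, MPI_COMM_WORLD);

    printf("Process %d received value %d\n", rank, value);

    MPI_Finalize();
    return 0;
}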
Collective communication: Scatter
MPI_Scatter
• MPI_Scatter is a collective routine that is similar to MPI_Bcast
• It sends chunks of an array to different processes
MPI_Scatter

• The root sends a part of its send buffer to each process
• Process k receives sendcount elements starting at sendbuf + k*sendcount
MPI_Scatter - Example
Demo: Scatter.c
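As a hedged illustration (not necessarily the actual Scatter.c), the sketch below has the root scatter one int to each process; the array contents are assumptions for the example:

/* Minimal MPI_Scatter sketch: root 0 sends sendbuf[k] to process k. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    int rank, size, recv;
    int *sendbuf = NULL;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0) {                      /* send buffer only matters at the root */
        sendbuf = malloc(size * sizeof(int));
        for (int i = 0; i < size; i++)
            sendbuf[i] = 10 * i;          /* element i goes to process i */
    }

    /* sendcount = 1: each process receives one element, process k gets sendbuf[k]. */
    MPI_Scatter(sendbuf, 1, MPI_INT, &recv, 1, MPI_INT, 0, MPI_COMM_WORLD);

    printf("Process %d got %d\n", rank, recv);

    free(sendbuf);
    MPI_Finalize();
    return 0;
}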
Collective communication: Gather
MPI_Gather
• MPI_Gather is the inverse of MPI_Scatter
• It takes elements from many processes and gathers them at a single root process
MPI_Gather

• The root receives data from all processes (from send buffers)
• It stores the data in the receive buffer ordered by the process
number of the senders
MPI_Gather - Example
Demo: Gather.c
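A minimal sketch of a Gather.c-style program (assumed, not the original demo), where each process contributes its rank and root 0 collects the values in rank order:

/* Minimal MPI_Gather sketch: the root collects one int from every process. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    int rank, size;
    int *recvbuf = NULL;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    if (rank == 0)                        /* receive buffer only needed at the root */
        recvbuf = malloc(size * sizeof(int));

    /* recvcount = 1 is the count received from EACH process, not the total. */
    MPI_Gather(&rank, 1, MPI_INT, recvbuf, 1, MPI_INT, 0, MPI_COMM_WORLD);

    if (rank == 0) {
        for (int i = 0; i < size; i++)
            printf("recvbuf[%d] = %d\n", i, recvbuf[i]);
        free(recvbuf);
    }

    MPI_Finalize();
    return 0;
}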
MPI_Scatterv
• MPI_Scatterv is a collective routine that is similar to MPI_Scatter
• It sends variable chunks of an array to different processes

[Figure: the root's send buffer is split into variable-sized chunks that are distributed to Process-0, Process-1, Process-2, and Process-3.]
Credits: https://www.cineca.it/
MPI_Scatterv
Demo: ScatterV.c

sendbuf: address of send buffer (significant only at root)
sendcounts: integer array (of length group size) specifying the number of elements to send to each process
displs: integer array (of length group size); entry i specifies the displacement (relative to sendbuf) from which to take the outgoing data to process i
sendtype: data type of send buffer elements
recvbuf: address of receive buffer
recvcount: number of elements in receive buffer (integer)
recvtype: data type of receive buffer elements
root: rank of sending process (integer)
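To show how sendcounts and displs fit together, here is a hedged sketch (not the course's ScatterV.c) in which process i receives i+1 elements; the data values are assumptions:

/* MPI_Scatterv sketch: variable-sized chunks, process i gets i+1 ints. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    int rank, size;
    int *sendbuf = NULL, *sendcounts = NULL, *displs = NULL;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int recvcount = rank + 1;                     /* variable chunk size */
    int *recvbuf = malloc(recvcount * sizeof(int));

    if (rank == 0) {
        int total = size * (size + 1) / 2;        /* 1 + 2 + ... + size elements */
        sendbuf    = malloc(total * sizeof(int));
        sendcounts = malloc(size * sizeof(int));
        displs     = malloc(size * sizeof(int));
        for (int i = 0, offset = 0; i < size; i++) {
            sendcounts[i] = i + 1;                /* i+1 elements for process i */
            displs[i]     = offset;               /* offset into sendbuf */
            offset       += sendcounts[i];
        }
        for (int j = 0; j < total; j++)
            sendbuf[j] = j;
    }

    MPI_Scatterv(sendbuf, sendcounts, displs, MPI_INT,
                 recvbuf, recvcount, MPI_INT, 0, MPI_COMM_WORLD);

    printf("Process %d received %d element(s)\n", rank, recvcount);

    free(recvbuf); free(sendbuf); free(sendcounts); free(displs);
    MPI_Finalize();
    return 0;
}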
MPI_Gatherv
• A different number of elements can be received from each process by the root
• Individual messages are stored in the receive buffer according to displs

Credits: https://www.cineca.it/
MPI_Gatherv
Demo: GatherV.c

sendbuf: address of send buffer
sendcount: number of elements in send buffer (integer)
sendtype: data type of send buffer elements
recvbuf: address of receive buffer (significant only at root)
recvcounts: integer array (of length group size) containing the number of elements that are to be received from each process (significant only at root)
displs: integer array (of length group size); entry i specifies the displacement relative to recvbuf at which to place the data from process i (significant only at root)
recvtype: data type of receive buffer elements (handle)
root: rank of receiving process (integer)
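A matching hedged sketch for MPI_Gatherv (not necessarily the course's GatherV.c): process i contributes i+1 elements and the root places them using recvcounts and displs. The data values are assumptions:

/* MPI_Gatherv sketch: root 0 gathers i+1 ints from process i. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    int rank, size;
    int *recvbuf = NULL, *recvcounts = NULL, *displs = NULL;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int sendcount = rank + 1;
    int *sendbuf = malloc(sendcount * sizeof(int));
    for (int i = 0; i < sendcount; i++)
        sendbuf[i] = rank;                    /* send my rank, (rank+1) times */

    int total = size * (size + 1) / 2;        /* total elements gathered at root */
    if (rank == 0) {
        recvbuf    = malloc(total * sizeof(int));
        recvcounts = malloc(size * sizeof(int));
        displs     = malloc(size * sizeof(int));
        for (int i = 0, offset = 0; i < size; i++) {
            recvcounts[i] = i + 1;            /* expect i+1 ints from process i */
            displs[i]     = offset;           /* where to place them in recvbuf */
            offset       += recvcounts[i];
        }
    }

    MPI_Gatherv(sendbuf, sendcount, MPI_INT,
                recvbuf, recvcounts, displs, MPI_INT, 0, MPI_COMM_WORLD);

    if (rank == 0) {
        for (int i = 0; i < total; i++)
            printf("%d ", recvbuf[i]);
        printf("\n");
    }

    free(sendbuf); free(recvbuf); free(recvcounts); free(displs);
    MPI_Finalize();
    return 0;
}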
Home Tasks
• MPI_Allgather
• Similar to MPI_Gather, but the result is available to all
processes
• MPI_Allgatherv
• Similar to MPI_Gatherv, but the result is available to all
processes
• MPI_Alltoall
• Similar to MPI_Allgather, but each process performs a
scatter followed by a gather
• MPI_Alltoallv
• Similar to MPI_Alltoall, but messages to different
processes can have different lengths
MPI_Alltoall
MPI_Alltoall redistributes data so that each process receives one block from the send buffer of every other process. It is one way to implement a matrix (data) transposition.
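A hedged sketch of MPI_Alltoall with one int per destination; the value scheme (100*rank + j) is only an assumption chosen to make the redistribution visible:

/* MPI_Alltoall sketch: process i sends 100*i + j to process j. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    int rank, size;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int *sendbuf = malloc(size * sizeof(int));
    int *recvbuf = malloc(size * sizeof(int));
    for (int j = 0; j < size; j++)
        sendbuf[j] = 100 * rank + j;      /* block j is destined for process j */

    /* After the call, recvbuf[i] on this process holds the block that
       process i addressed to this rank -- a transposition of the layout. */
    MPI_Alltoall(sendbuf, 1, MPI_INT, recvbuf, 1, MPI_INT, MPI_COMM_WORLD);

    printf("Process %d received:", rank);
    for (int i = 0; i < size; i++)
        printf(" %d", recvbuf[i]);
    printf("\n");

    free(sendbuf); free(recvbuf);
    MPI_Finalize();
    return 0;
}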
Synchronization
Barrier Synchronization

Credits: https://medium.com/@jaydesai36/barrier-synchronization-in-threads-3c56f947047
MPI_Barrier
Demo: Barrier.c

It synchronizes ALL processes in the communicator: each process blocks until every process has called MPI_Barrier.
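A minimal sketch of a Barrier.c-style program (assumed, not the original demo): no process prints its "after" line before all processes have reached the barrier, although the ordering of the printed lines across ranks is still not guaranteed.

/* MPI_Barrier sketch: all processes wait for each other at the barrier. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    printf("Process %d: before the barrier\n", rank);

    MPI_Barrier(MPI_COMM_WORLD);   /* blocks until all processes have arrived */

    printf("Process %d: after the barrier\n", rank);

    MPI_Finalize();
    return 0;
}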
Reductions
Collective communication: Reduce

• Data reduction involves reducing a set of numbers into a smaller set of numbers via a function.
• Example: Consider the list [1, 2, 3, 4, 5]. Reducing this list with the sum function produces sum([1, 2, 3, 4, 5]) = 15.
• Similarly, the multiplication reduction would yield multiply([1, 2, 3, 4, 5]) = 120.
Reductions
The communicated data of the processes are combined via a specified operation, e.g. '+'.

Two different variants:
– Result is only available at the root process (MPI_Reduce)
– Result is available at all processes (MPI_Allreduce)

Input values (at each process):
– Scalar variable: the operation combines the values of all processes
– Array: the elements of the arrays are combined element-wise; the result is an array
MPI_Reduce

• This operation combines the elements in the send buffers of all processes and delivers the result to the root.
• count, op, and root have to be identical on all processes.
MPI_Reduce - Example
Demo: Reduction.c

Credits: https://dps.uibk.ac.at
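Since the demo file is not shown here, the following is a minimal Reduction.c-style sketch (assumed) in which every process contributes its rank and root 0 receives the sum:

/* MPI_Reduce sketch: sum of all ranks delivered to root 0. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, size, sum = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* count, op and root must be identical on all processes. */
    MPI_Reduce(&rank, &sum, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("Sum of ranks 0..%d = %d\n", size - 1, sum);

    MPI_Finalize();
    return 0;
}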
Reduction Operations
– Predefined operations include MPI_SUM, MPI_PROD, MPI_MAX, and MPI_MIN, among others
Data types:
– Operations are defined for appropriate data types
MPI_Allreduce

Similar to MPI_Reduce, but the result is returned to all processes

Credits: https://dps.uibk.ac.at
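A hedged sketch of MPI_Allreduce: the same sum-of-ranks reduction as above, but the result arrives at every process, so there is no root argument:

/* MPI_Allreduce sketch: every process obtains the sum of all ranks. */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    int rank, sum = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    MPI_Allreduce(&rank, &sum, 1, MPI_INT, MPI_SUM, MPI_COMM_WORLD);

    printf("Process %d sees sum = %d\n", rank, sum);

    MPI_Finalize();
    return 0;
}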
Any Questions?
