
CSC4005

Parallel Programming
Tutorial 3

Introduction to MPI Programming


Yangzhixin Luo, [email protected]
Outline of Tutorial 3
• Cluster Topology
• MPI: Blocking vs Non-blocking
• Point-wise Communication
▪ Blocking Communication
▪ MPI Supported Datatypes
▪ Probing
▪ Immediate (Non-blocking) Communication
• Multipoint Communication
▪ Broadcast
▪ Scatter
▪ Gather
▪ Allgather
▪ Barrier
▪ More Information
• Debugging
Cluster Topology
MPI: Blocking vs Non-blocking
・Blocking communication
Blocking communication is done using MPI_Send() and MPI_Recv().
These functions do not return (i.e., they block) until the communication is
finished.
When MPI_Send() returns, the buffer passed to it can safely be reused, either
because MPI has copied the data internally or because it has already been
received by the destination. Similarly, MPI_Recv() returns only once the
receive buffer has been filled with valid data.

・Non-blocking communication
Non-blocking communication is done using MPI_Isend() and MPI_Irecv().
These functions return immediately (i.e., they do not block) even if the
communication has not finished yet. You must call MPI_Wait() or MPI_Test() to
find out whether the communication has finished.
Blocking Point-wise Communication
MPI_Send(
void* data,
int count,
MPI_Datatype datatype,
int destination,
int tag,
MPI_Comm communicator);
Blocking Point-wise Communication
MPI_Recv(
void* data,
int count,
MPI_Datatype datatype,
int source,
int tag,
MPI_Comm communicator,
MPI_Status* status);
MPI Supported Datatypes
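Some commonly used MPI datatypes and the C/C++ types they correspond to (a partial list; the full table is defined by the MPI standard):

・MPI_CHAR   (char)
・MPI_INT    (int)
・MPI_UNSIGNED (unsigned int)
・MPI_LONG   (long)
・MPI_FLOAT  (float)
・MPI_DOUBLE (double)
・MPI_BYTE   (raw, untyped bytes)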
Blocking Point-wise Communication
Example 1
// Find out rank, size
int world_rank;
MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);
int world_size;
MPI_Comm_size(MPI_COMM_WORLD, &world_size);

int number;
if (world_rank == 0) {
    number = -1;
    MPI_Send(&number, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
} else if (world_rank == 1) {
    MPI_Recv(&number, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    printf("Process 1 received number %d from process 0\n", number);
}
Blocking Point-wise Communication
Example 2
if (0 == rank) {
    data = 1999'12'08;   // 19991208, written with C++14 digit separators
    for (int i = 1; i < size; ++i) {
        data = data + i;
        std::cout << "sending " << data << " to " << i << std::endl;
        MPI_Send(&data, 1, MPI_INT, i, 0, MPI_COMM_WORLD);
    }
} else {
    MPI_Recv(&data, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    std::cout << "received " << data << " at " << rank << std::endl;
}
Blocking Point-wise Communication
Example 2 Output
Blocking Point-wise Communication
int MPI_Sendrecv(
const void *sendbuf,
int sendcount,
MPI_Datatype sendtype,
int dest,
int sendtag,
void *recvbuf,
int recvcount,
MPI_Datatype recvtype,
int source,
int recvtag,
MPI_Comm comm,
MPI_Status *status);
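A minimal sketch of a deadlock-free ring exchange with MPI_Sendrecv (the variable names rank, size, left, and right are illustrative, not taken from the slides):

// Each rank sends its own rank number to the right neighbour and
// receives from the left neighbour in one combined, deadlock-free call.
int right = (rank + 1) % size;
int left  = (rank + size - 1) % size;
int send_val = rank;
int recv_val = -1;
MPI_Sendrecv(&send_val, 1, MPI_INT, right, 0,
             &recv_val, 1, MPI_INT, left, 0,
             MPI_COMM_WORLD, MPI_STATUS_IGNORE);
std::cout << "rank " << rank << " received " << recv_val
          << " from rank " << left << std::endl;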
Blocking Point-wise Communication
Example 3
Blocking Point-wise Communication
Example 3 Output
Probing
MPI_Get_count(
MPI_Status* status,
MPI_Datatype datatype,
int* count);

MPI_Probe(
int source,
int tag,
MPI_Comm comm,
MPI_Status* status);
MPI_Status
If we pass an MPI_Status structure to the MPI_Recv function, it
will be populated with additional information about the receive
operation after it completes. The three primary pieces of information
include:

・The rank of the sender
The rank of the sender is stored in the MPI_SOURCE element of the
structure. That is, if we declare an MPI_Status stat variable, the rank can
be accessed with stat.MPI_SOURCE.
・The tag of the message
The tag of the message can be accessed by the MPI_TAG element of the
structure (similar to MPI_SOURCE).
・The length of the message
The length of the message does not have a predefined element in the
status structure. Instead, we have to find out the length of the message
with MPI_Get_count.
MPI_Get_count
Why would this information be necessary?
The MPI_Get_count function is used to determine the actual
receive amount.
It turns out that MPI_Recv can take MPI_ANY_SOURCE for the
rank of the sender and MPI_ANY_TAG for the tag of the message. For
this case, the MPI_Status structure is the only way to find out the actual
sender and tag of the message. Furthermore, MPI_Recv is not
guaranteed to receive the entire amount of elements passed as the
argument to the function call. Instead, it receives the amount of
elements that were sent to it (and returns an error if more elements were
sent than the desired receive amount).
MPI_Get_count Example
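A minimal sketch in the spirit of this example, assuming the receiver posts a buffer of at most MAX_NUMBERS ints (the constant and variable names are illustrative):

const int MAX_NUMBERS = 100;
int numbers[MAX_NUMBERS];
MPI_Status status;
// Accept up to MAX_NUMBERS ints from any sender with any tag.
MPI_Recv(numbers, MAX_NUMBERS, MPI_INT, MPI_ANY_SOURCE, MPI_ANY_TAG,
         MPI_COMM_WORLD, &status);
// Ask how many ints actually arrived, and from whom.
int count;
MPI_Get_count(&status, MPI_INT, &count);
std::cout << "received " << count << " ints from rank " << status.MPI_SOURCE
          << " with tag " << status.MPI_TAG << std::endl;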
MPI_Probe
Instead of posting a receive and simply providing a really
large buffer to handle all possible sizes of messages, you can use
MPI_Probe to query the message size before actually receiving it.
MPI_Probe looks quite similar to MPI_Recv. In fact, you can
think of MPI_Probe as an MPI_Recv that does everything but
receive the message. Similar to MPI_Recv, MPI_Probe will block for
a message with a matching tag and sender. When the message is
available, it will fill the status structure with information. The user
can then use MPI_Recv to receive the actual message.
MPI_Probe Example
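A minimal sketch in the spirit of this example: probe for the pending message, size the buffer exactly, then receive (uses std::vector; the sender rank 0 and tag 0 are illustrative):

MPI_Status status;
// Block until a message from rank 0 with tag 0 is pending.
MPI_Probe(0, 0, MPI_COMM_WORLD, &status);
// Ask how many ints the pending message contains.
int count;
MPI_Get_count(&status, MPI_INT, &count);
// Allocate exactly that much space, then actually receive the message.
std::vector<int> buf(count);
MPI_Recv(buf.data(), count, MPI_INT, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);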
Immediate (Non-blocking) Point-wise
Communication
int MPI_Isend(
const void *buf,
int count,
MPI_Datatype datatype,
int dest,
int tag,
MPI_Comm comm,
MPI_Request *request);
Immediate (Non-blocking) Point-wise
Communication
int MPI_Irecv(
void *buf,
int count,
MPI_Datatype datatype,
int source,
int tag,
MPI_Comm comm,
MPI_Request *request);
Immediate (Non-blocking) Point-wise
Communication
MPI_Wait
MPI_Wait is a blocking call that returns only when a specified
operation has been completed (e.g., the send buffer is safe to
access). This call should be inserted at the point where the next
section of code depends on the buffer, because it forces the
process to block until the buffer is ready.

int MPI_Wait(
MPI_Request *request,
MPI_Status *status);
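A minimal sketch of the typical pattern (dest and the data buffer are illustrative):

MPI_Request request;
MPI_Isend(&data, 1, MPI_INT, dest, 0, MPI_COMM_WORLD, &request);
// ... do useful work that does not touch 'data' ...
MPI_Wait(&request, MPI_STATUS_IGNORE);   // after this, 'data' may be reused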
Immediate (Non-blocking) Point-wise
Communication
MPI_Test
MPI_Test is the nonblocking counterpart to MPI_Wait. Instead of
blocking until the specified message is complete, this function returns
immediately with a flag that says whether the requested message is complete
(true) or not (false). MPI_Test is basically a safe polling mechanism, and this
means we can again emulate blocking behavior by executing MPI_Test inside
of a while-loop.

int MPI_Test(
MPI_Request *request,
int *flag,
MPI_Status *status);
Immediate (Non-blocking) Point-wise
Communication Example
MPI_Request send_req, recv_req;
// Start both transfers; neither call blocks.
MPI_Isend(send.data(), size, MPI_INT, target, 0, MPI_COMM_WORLD, &send_req);
MPI_Irecv(recv.data(), size, MPI_INT, source, 0, MPI_COMM_WORLD, &recv_req);
int sflag = 0, rflag = 0;
// Poll both requests until both transfers have completed.
do {
    MPI_Test(&send_req, &sflag, MPI_STATUS_IGNORE);
    MPI_Test(&recv_req, &rflag, MPI_STATUS_IGNORE);
} while (!sflag || !rflag);
Broadcast
A broadcast is one of the standard
collective communication techniques.
During a broadcast, one process sends the
same data to all processes in a
communicator. One of the main uses of
broadcasting is to send out user input to a
parallel program, or send out configuration
parameters to all processes.
In this example, process zero is the
root process, and it has the initial copy of
data. All of the other processes receive the
copy of data.
Broadcast
MPI_Bcast(
void* data,
int count,
MPI_Datatype datatype,
int root,
MPI_Comm communicator);
Broadcast Example
#include <iostream>
#include <mpi.h>

int main(int argc, char **argv) {
    int data;
    int rank;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    if (0 == rank) {
        std::cin >> data;                // only the root reads the input
    }
    MPI_Bcast(&data, 1, MPI_INT, 0, MPI_COMM_WORLD);
    std::cout << data << std::endl;      // every process prints the same value
    MPI_Finalize();
}
MPI_Scatter
MPI_Scatter is a collective routine
that is very similar to MPI_Bcast.
MPI_Scatter involves a designated root
process sending data to all processes in a
communicator.
The primary difference between
MPI_Bcast and MPI_Scatter is small but
important. MPI_Bcast sends the same
piece of data to all processes while
MPI_Scatter sends chunks of an array to
different processes.
MPI_Scatter
MPI_Scatter(
void* send_data,
int send_count,
MPI_Datatype send_datatype,
void* recv_data,
int recv_count,
MPI_Datatype recv_datatype,
int root,
MPI_Comm communicator);
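A minimal sketch that scatters one int to each process; note that send_count is the count sent to each process, not the total (buffer and variable names are illustrative):

std::vector<int> send_buf;
if (rank == 0) {
    send_buf.resize(size);                    // one element per process
    for (int i = 0; i < size; ++i) send_buf[i] = i * i;
}
int my_value = 0;
// Every process, including the root, receives exactly one int.
MPI_Scatter(send_buf.data(), 1, MPI_INT,
            &my_value, 1, MPI_INT,
            0, MPI_COMM_WORLD);
std::cout << "rank " << rank << " got " << my_value << std::endl;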
MPI_Gather
MPI_Gather is the inverse of MPI_Scatter.
Instead of spreading elements from one process to
many processes, MPI_Gather takes elements from
many processes and gathers them to one single
process. This routine is highly useful to many parallel
algorithms, such as parallel sorting and searching.
Similar to MPI_Scatter, MPI_Gather takes
elements from each process and gathers them to the
root process. The elements are ordered by the rank
of the process from which they were received.
MPI_Gather
MPI_Gather(
void* send_data,
int send_count,
MPI_Datatype send_datatype,
void* recv_data,
int recv_count,
MPI_Datatype recv_datatype,
int root,
MPI_Comm communicator);
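A minimal sketch that gathers one int from every process onto the root; recv_count is the count received from each process, and the receive buffer only needs to be allocated on the root (names are illustrative):

int my_value = rank * rank;                   // each process contributes one int
std::vector<int> recv_buf;
if (rank == 0) recv_buf.resize(size);         // root needs room for 'size' ints
MPI_Gather(&my_value, 1, MPI_INT,
           recv_buf.data(), 1, MPI_INT,
           0, MPI_COMM_WORLD);
if (rank == 0) {
    for (int i = 0; i < size; ++i)
        std::cout << "from rank " << i << ": " << recv_buf[i] << std::endl;
}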
MPI_Allgather
Given a set of elements distributed across
all processes, MPI_Allgather will gather all of the
elements to all the processes. In the most basic
sense, MPI_Allgather is an MPI_Gather followed
by an MPI_Bcast.
Just like MPI_Gather, the elements from
each process are gathered in order of their rank,
except this time the elements are gathered to all
processes. The function declaration for
MPI_Allgather is almost identical to MPI_Gather
with the difference that there is no root process
in MPI_Allgather.
MPI_Allgather
MPI_Allgather(
void* send_data,
int send_count,
MPI_Datatype send_datatype,
void* recv_data,
int recv_count,
MPI_Datatype recv_datatype,
MPI_Comm communicator);
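A minimal sketch in which every process contributes one int and every process ends up with the full array (names are illustrative):

int my_value = rank + 1;
std::vector<int> all_values(size);            // every rank gets the full array
MPI_Allgather(&my_value, 1, MPI_INT,
              all_values.data(), 1, MPI_INT,
              MPI_COMM_WORLD);
// After the call, all_values holds {1, 2, ..., size} on every rank.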
MPI_Barrier
MPI_Barrier(MPI_COMM_WORLD);
MPI_Barrier
{
    // Without a barrier: each rank sleeps for a different time and prints
    // whenever it is ready, so the output appears staggered.
    std::this_thread::sleep_for(std::chrono::milliseconds{100} * rank);
    std::cout << "hi (no bar)" << std::endl;
}
MPI_Barrier(MPI_COMM_WORLD);
{
    // With a barrier: every rank waits until the slowest one arrives,
    // so the prints are issued only after all ranks have finished sleeping.
    std::this_thread::sleep_for(std::chrono::milliseconds{100} * rank);
    MPI_Barrier(MPI_COMM_WORLD);
    std::cout << "hi (bar)" << std::endl;
}
More Information
・Reduce:
https://mpitutorial.com/tutorials/mpi-reduce-and-allreduce/

・Group Division:
https://mpitutorial.com/tutorials/introduction-to-groups-and-communicators/
Debugging: Stacktrace
Debugging: Stacktrace
mpirun --timeout 5 --get-stack-traces ./main
Pass in Arguments
#include <iostream>
using namespace std;

int main(int argc, char** argv)
{
    cout << "You have entered " << argc
         << " arguments:" << "\n";

    for (int i = 0; i < argc; ++i)
        cout << argv[i] << "\n";

    return 0;
}

・Input:
./main CSC4005 Parallel Programming

・Output:
You have entered 4 arguments:
./main
CSC4005
Parallel
Programming
Pass in Arguments
・argc (ARGument Count) is an int that stores the number of command-line
arguments passed by the user, including the name of the program. The value
of argc is always nonnegative.

・argv (ARGument Vector) is an array of character pointers listing all
the arguments.