SERC IntroMPI 2019-09-14 v0
Cluster - 90’s
Modern HPC Facilities
A simple MPI program in C

#include <stdio.h>
#include <stdlib.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank;

    /* Initialize MPI */
    MPI_Init(&argc,&argv);

    /* Get the number of processes and my rank */
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    printf("Hello from %d.\n",myrank);

    /* Finalize */
    MPI_Finalize();
    return 0;
}

Run on any cluster:
$ mpirun -n 8 ./a.out

Run on SahasraT:
$ aprun -n 8 ./a.out

Output (order may vary):
Hello from 0.
Hello from 1.
Hello from 2.
Hello from 3.
Hello from 4.
Hello from 5.
Hello from 6.
Hello from 7.
Header file

#include "mpi.h"

• Defines MPI-related parameters and functions
• Must be included in all routines calling MPI functions
• Fortran codes can instead use the include file:
  include 'mpif.h'
Initialization

/* Initialize MPI */
MPI_Init(&argc,&argv);

• Must be called at the beginning of the code, before any other calls to MPI functions
• Sets up the communication channels between the processes and gives each one a rank
How many processes do we have?

/* Get the number of processes */
MPI_Comm_size(MPI_COMM_WORLD,&nproc);

• Returns the number of processes available under the MPI_COMM_WORLD communicator
• This is the number used on the mpiexec (or mpirun) command line:
  mpiexec -n nproc a.out
What is my rank?

/* Get my process number (rank) */
MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

• Gets my rank among all of the nproc processes under MPI_COMM_WORLD
• This is a unique number that can be used to distinguish this process from the others
Termination

/* Finalize */
MPI_Finalize();

• Must be called at the end of the program, after all other MPI calls

The Fortran equivalents of initialization and termination are:

! Initialize MPI
call MPI_Init(ierr)

! Finalize
call MPI_Finalize(ierr)
Running the hello program

$ mpirun -n 8 ./a.out        (any cluster)
$ aprun -n 8 ./a.out         (SahasraT)

The launcher starts eight copies of the same executable. Every process runs the
full program, obtains its own rank from MPI_Comm_rank, and prints its own
"Hello from ..." line, so the eight output lines appear in an arbitrary order.
MPI_COMM_WORLD is the default communicator containing all the processes. We don't have any subsets yet, so we just choose the default.
Point to point: 2 processes at a time

MPI_Send(sendbuf,count,datatype,dest,tag,comm)

MPI_Recv(recvbuf,count,datatype,source,tag,comm,status)

MPI_Sendrecv(sendbuf,sendcount,sendtype,dest,sendtag,
             recvbuf,recvcount,recvtype,source,recvtag,comm,status)
Datatypes are:
C: MPI_INT, MPI_FLOAT, MPI_DOUBLE, MPI_CHAR, etc.
FORTRAN: MPI_INTEGER, MPI_REAL, MPI_DOUBLE_PRECISION,
         MPI_COMPLEX, MPI_CHARACTER, MPI_LOGICAL, etc.
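As an illustration, a minimal sketch of a two-process exchange in C (the payload value 42 and tag 0 are arbitrary choices; run with at least two processes):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank, value;
    MPI_Status status;

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    if (myrank == 0) {
        value = 42;                                   /* arbitrary payload */
        MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
    } else if (myrank == 1) {
        MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &status);
        printf("rank 1 received %d from rank 0\n", value);
    }

    MPI_Finalize();
    return 0;
}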
Collective communication:
Broadcast

MPI_Bcast(buffer,count,datatype,root,comm,ierr)

Before               After Broadcast
P0: A B C D          P0: A B C D
P1:                  P1: A B C D
P2:                  P2: A B C D
P3:                  P3: A B C D

• One process (called "root") sends data to all the other processes in the same
  communicator
• Must be called by ALL processes with the same arguments
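A small sketch of MPI_Bcast usage in C (the array contents are illustrative; note that every rank makes the call, not just the root):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int myrank;
    int data[4] = {0, 0, 0, 0};

    MPI_Init(&argc,&argv);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    if (myrank == 0) {            /* only the root fills the buffer before the call */
        data[0] = 1; data[1] = 2; data[2] = 3; data[3] = 4;
    }

    /* all processes call MPI_Bcast with the same arguments */
    MPI_Bcast(data, 4, MPI_INT, 0, MPI_COMM_WORLD);

    printf("rank %d now has %d %d %d %d\n",
           myrank, data[0], data[1], data[2], data[3]);

    MPI_Finalize();
    return 0;
}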
Collective communication:
Gather

MPI_Gather(sendbuf,sendcount,sendtype,recvbuf,recvcount,
           recvtype,root,comm,ierr)

Before               After Gather
P0: A                P0: A B C D
P1: B                P1:
P2: C                P2:
P3: D                P3:

• One root process collects data from all the other processes in the same communicator
• Must be called by all the processes in the communicator with the same arguments
• "sendcount" is the number of basic datatypes sent, not received (the example above would
  be sendcount = 1)
• Make sure that you have enough space in your receiving buffer!
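A sketch of MPI_Gather in C where each rank contributes one integer and rank 0 collects them all (the fixed receive buffer of 64 entries is an assumption of this sketch):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank, i;
    int myval, recvbuf[64];                 /* assumes nproc <= 64 */

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    myval = 10 * myrank;                    /* one element per rank: sendcount = 1 */
    MPI_Gather(&myval, 1, MPI_INT, recvbuf, 1, MPI_INT, 0, MPI_COMM_WORLD);

    if (myrank == 0)                        /* only the root has the gathered data */
        for (i = 0; i < nproc; i++)
            printf("got %d from rank %d\n", recvbuf[i], i);

    MPI_Finalize();
    return 0;
}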
Collective communication:
Gather to All

MPI_Allgather(sendbuf,sendcount,sendtype,recvbuf,recvcount,
              recvtype,comm,ierr)

Before               After Allgather
P0: A                P0: A B C D
P1: B                P1: A B C D
P2: C                P2: A B C D
P3: D                P3: A B C D

• All processes within a communicator collect data from each other and end up with the
  same information
• Must be called by all the processes in the communicator with the same arguments
• Again, sendcount is the number of elements sent
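The same pattern with MPI_Allgather, after which every rank holds the full array (again the fixed buffer size is only an assumption of the sketch):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank;
    int myval, allvals[64];                 /* assumes nproc <= 64 */

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    myval = myrank + 100;                   /* illustrative payload */
    MPI_Allgather(&myval, 1, MPI_INT, allvals, 1, MPI_INT, MPI_COMM_WORLD);

    /* every rank now holds allvals[0..nproc-1] */
    printf("rank %d sees first=%d last=%d\n", myrank, allvals[0], allvals[nproc-1]);

    MPI_Finalize();
    return 0;
}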
Collective communication:
Reduction

MPI_Reduce(sendbuf,recvbuf,count,datatype,op,root,comm,ierr)

Before               After Reduce (+)
P0: A                P0: A+B+C+D
P1: B                P1:
P2: C                P2:
P3: D                P3:

• One root process collects data from all the other processes in the same communicator and
  performs an operation on the received data
• Called by all the processes with the same arguments
• Operations are: MPI_SUM, MPI_MIN, MPI_MAX, MPI_PROD, logical AND, OR,
  XOR, and a few more
• Users can define their own operation with MPI_Op_create()
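A sketch of MPI_Reduce in C, summing one value per rank onto rank 0:

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank;
    int myval, total = 0;

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    myval = myrank + 1;                     /* ranks contribute 1, 2, ..., nproc */
    MPI_Reduce(&myval, &total, 1, MPI_INT, MPI_SUM, 0, MPI_COMM_WORLD);

    if (myrank == 0)                        /* only the root holds the result */
        printf("sum over %d ranks = %d\n", nproc, total);

    MPI_Finalize();
    return 0;
}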
Collective communication:
Reduction to All

MPI_Allreduce(sendbuf,recvbuf,count,datatype,op,comm,ierr)

Before               After Allreduce (+)
P0: A                P0: A+B+C+D
P1: B                P1: A+B+C+D
P2: C                P2: A+B+C+D
P3: D                P3: A+B+C+D

• All processes within a communicator collect data from all the other processes and
  perform an operation on the received data
• Called by all the processes with the same arguments
• Operations are the same as for MPI_Reduce
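A short sketch of MPI_Allreduce, here with MPI_MAX so that every rank ends up with the global maximum (the per-rank values are arbitrary):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int myrank, myval, gmax;

    MPI_Init(&argc,&argv);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    myval = (7 * myrank) % 5;               /* arbitrary per-rank value */
    MPI_Allreduce(&myval, &gmax, 1, MPI_INT, MPI_MAX, MPI_COMM_WORLD);

    /* unlike MPI_Reduce, every rank holds the result */
    printf("rank %d: global max = %d\n", myrank, gmax);

    MPI_Finalize();
    return 0;
}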
More MPI collective calls

One "root" process sends a different piece of the data to each one of the other
processes (the inverse of gather):

MPI_Scatter(sendbuf,sendcnt,sendtype,recvbuf,recvcnt,
            recvtype,root,comm,ierr)
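A sketch of MPI_Scatter in C where rank 0 hands one element to every rank (the fixed send buffer size is an assumption of the sketch):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int nproc, myrank, i;
    int sendbuf[64], myval;                 /* assumes nproc <= 64 */

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&nproc);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    if (myrank == 0)                        /* only the root's send buffer is read */
        for (i = 0; i < nproc; i++)
            sendbuf[i] = i * i;

    /* sendcnt and recvcnt are the per-process counts (1 here) */
    MPI_Scatter(sendbuf, 1, MPI_INT, &myval, 1, MPI_INT, 0, MPI_COMM_WORLD);

    printf("rank %d received %d\n", myrank, myval);

    MPI_Finalize();
    return 0;
}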
Timing

Fortran:
starttime = MPI_WTIME()
... program body ...
endtime = MPI_WTIME()
elapsedtime = endtime - starttime
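In C the equivalent is MPI_Wtime(), which returns wall-clock time in seconds as a double; a minimal sketch (the loop is just a stand-in for the program body):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    double starttime, endtime;
    long i, s = 0;

    MPI_Init(&argc,&argv);

    starttime = MPI_Wtime();
    for (i = 0; i < 100000000L; i++)        /* stand-in for the program body */
        s += i;
    endtime = MPI_Wtime();

    printf("elapsed time = %f s (checksum %ld)\n", endtime - starttime, s);

    MPI_Finalize();
    return 0;
}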
Blocking communications

• The call waits until the data transfer is done
  – The sending process waits until all data are transferred to the system buffer
    (differences for eager vs rendezvous protocols...)
  – The receiving process waits until all data are transferred from the system buffer
    to the receive buffer
• All collective communications are blocking
Non-blocking communications

• The call returns immediately after the data transfer is initiated
• Allows overlapping computation with communication
• Need to be careful, though
  – If the send or receive buffers are updated before the transfer is over,
    the result will be wrong
Debugging tips

Use "unbuffered" writes to do "printf-debugging" and always write out the process id:

C:       fprintf(stderr,"%d: ...",myid,...);
Fortran: write(0,*) myid,': ...'

If the code detects an error and needs to terminate, use MPI_Abort. The
error code is returned to the calling environment, so it can be any number.

C:       MPI_Abort(MPI_Comm comm, int errorcode);
Fortran: call MPI_ABORT(comm, errorcode, ierr)
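Putting the two tips together, a sketch of the error-handling pattern in C (the input file name and error code 1 are arbitrary):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int myrank;
    FILE *ifp;

    MPI_Init(&argc,&argv);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    ifp = fopen("ex4.in","r");
    if (ifp == NULL) {
        fprintf(stderr, "%d: cannot open ex4.in\n", myrank);   /* unbuffered, with rank */
        MPI_Abort(MPI_COMM_WORLD, 1);                          /* terminates all processes */
    }
    fclose(ifp);

    MPI_Finalize();
    return 0;
}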
Example: computing pi by numerical integration, serial C version

#include <stdio.h>
#include <math.h>

int main( int argc, char *argv[] )
{
    int n, i;
    double PI25DT = 3.141592653589793238462643;
    double mypi, pi, h, sum, x;
    FILE *ifp;

    ifp = fopen("ex4.in","r");
    fscanf(ifp,"%d",&n);
    fclose(ifp);
    printf("number of intervals = %d\n",n);

    h = 1.0 / (double) n;
    sum = 0.0;
    for (i = 1; i <= n; i++) {
        x = h * ((double)i - 0.5);
        sum += (4.0 / (1.0 + x*x));
    }
    mypi = h * sum;
    pi = mypi;

    printf("pi is approximately %.16f, Error is %.16f\n",
           pi, fabs(pi - PI25DT));
    return 0;
}
Parallel version: the root reads the input and broadcasts it to all processes;
each process then calculates its own section of the integral, and the partial
sums are added up with MPI_Reduce.

#include "mpi.h"
#include <stdio.h>
#include <math.h>

int main( int argc, char *argv[] )
{
    int n, myid, numprocs, i, j, tag, my_n;
    double PI25DT = 3.141592653589793238462643;
    double mypi,pi,h,sum,x,pi_frac,tt0,tt1,ttf;
    FILE *ifp;
    MPI_Status Stat;
    MPI_Request request;

    n = 1;
    tag = 1;

    MPI_Init(&argc,&argv);
    MPI_Comm_size(MPI_COMM_WORLD,&numprocs);
    MPI_Comm_rank(MPI_COMM_WORLD,&myid);

    tt0 = MPI_Wtime();

    /* Root reads the input */
    if (myid == 0) {
        ifp = fopen("ex4.in","r");
        fscanf(ifp,"%d",&n);
        fclose(ifp);
        //printf("number of intervals = %d\n",n);
    }

    /* Global communication. Process 0 "broadcasts" n to all other processes */
    MPI_Bcast(&n, 1, MPI_INT, 0, MPI_COMM_WORLD);

    /* Each process calculates its section of the integral */
    h = 1.0 / (double) n;
    sum = 0.0;
    for (i = myid*n/numprocs+1; i <= (myid+1)*n/numprocs; i++) {
        x = h * ((double)i - 0.5);
        sum += (4.0 / (1.0 + x*x));
    }
    mypi = h * sum;

    /* Partial sums are added up with MPI_Reduce; pi is valid only on rank 0 */
    MPI_Reduce(&mypi, &pi, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    ttf = MPI_Wtime();
    printf("myid=%d pi is approximately %.16f, Error is %.16f time = %10f\n",
           myid, pi, fabs(pi - PI25DT), (ttf-tt0));

    MPI_Finalize();
    return 0;
}
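To build and launch the example, something along these lines is typical (mpicc is the usual MPI compiler wrapper, but the exact wrapper and launcher depend on the installation; on SahasraT the Cray wrappers and aprun are used instead):

$ mpicc -o pi pi_mpi.c        (pi_mpi.c and pi are placeholder names)
$ mpirun -n 8 ./pi            (or: aprun -n 8 ./pi)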
Thank you...
Non-blocking send and receive
Point to point:
MPI_Isend(buf,count,datatype,dest,tag,comm,request,ierr)
MPI_Irecv(buf,count,datatype,source,tag,comm,request,ierr)
The functions MPI_Wait and MPI_Test are used to complete a nonblocking communication
MPI_Wait(request,status,ierr)
MPI_Test(request,flag,status,ierr)
MPI_Wait returns when the operation identified by “request” is complete. This is a non-local operation.
MPI_Test returns “flag = true” if the operation identified by “request” is complete. Otherwise it returns
“flag = false”. This is a local operation.
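A sketch in C of a non-blocking exchange between ranks 0 and 1: the receive is posted first, independent work could overlap the transfer, and MPI_Wait completes both requests (the payload values are arbitrary):

#include <stdio.h>
#include "mpi.h"

int main( int argc, char *argv[] )
{
    int myrank, other, sendval, recvval;
    MPI_Request sreq, rreq;
    MPI_Status status;

    MPI_Init(&argc,&argv);
    MPI_Comm_rank(MPI_COMM_WORLD,&myrank);

    if (myrank < 2) {                        /* only ranks 0 and 1 take part in this sketch */
        other = 1 - myrank;
        sendval = myrank + 1000;             /* arbitrary payload */

        MPI_Irecv(&recvval, 1, MPI_INT, other, 0, MPI_COMM_WORLD, &rreq);
        MPI_Isend(&sendval, 1, MPI_INT, other, 0, MPI_COMM_WORLD, &sreq);

        /* ... computation that touches neither sendval nor recvval could go here ... */

        MPI_Wait(&sreq, &status);            /* do not reuse sendval before this returns */
        MPI_Wait(&rreq, &status);            /* recvval is valid only after this returns */

        printf("rank %d received %d\n", myrank, recvval);
    }

    MPI_Finalize();
    return 0;
}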