Shared Memory Parallel Programming
Introduction to OpenMP

These slides were originally written by Dr. Barbara Chapman, University of Houston
Outline
• Introduction to OpenMP
• Parallel Programming with OpenMP
– Worksharing, tasks, data environment, synchronization
• OpenMP Performance and Best Practices
• Hybrid MPI/OpenMP
• Case Studies and Examples
• Reference Materials

OpenMP* Overview

OpenMP: An API for Writing Multithreaded Applications
• Stands for Open Multi-Processing
• A set of compiler directives and library routines for parallel application programmers
• Greatly simplifies writing multi-threaded (MT) programs in Fortran, C and C++
• Standardizes the last 20 years of SMP practice

(The original slide's background shows sample OpenMP usage, e.g. #pragma omp critical, C$OMP PARALLEL DO, call omp_test_lock(jlok), setenv OMP_SCHEDULE “dynamic”.)

* The name “OpenMP” is the property of the OpenMP Architecture Review Board.
What is OpenMP?
• Industry standard for shared memory programming
  – Initial focus on scientific applications
• Main ideas:
  – Support productivity
  – Provide portability
• For Fortran, C and C++
Using OpenMP
• Widely available
• Single source code: parallel and sequential code
• Ease of use, incremental approach to programming
• Can be combined with MPI to create “hybrid” code
• Flexibility: thread IDs allow for explicit multithreaded programming
The OpenMP ARB 2008
• OpenMP is maintained by the OpenMP Architecture Review Board (the ARB), which
  – Interprets OpenMP
  – Writes new specifications - keeps OpenMP relevant
  – Works to increase the impact of OpenMP
• Members are organizations - not individuals
  – Current members (2008)
    • Permanent: AMD, Cray, Fujitsu, HP, IBM, Intel, Microsoft, NEC, PGI, SGI, Sun
    • Auxiliary: ASCI, cOMPunity, EPCC, KSL, NASA, RWTH Aachen

www.compunity.org
The OpenMP ARB 2011
• OpenMP is maintained by the OpenMP Architecture Review Board (the ARB), which
  – Interprets OpenMP
  – Writes new specifications - keeps OpenMP relevant
  – Works to increase the impact of OpenMP
• Members are organizations - not individuals
  – Current members (2011)
    • Permanent: AMD, CAPS Entreprise, Cray, Fujitsu, HP, IBM, Intel, Microsoft, NEC, Nvidia, Oracle, PGI, Texas Instruments
    • Auxiliary: ANL, cOMPunity, EPCC, NASA, LANL, ASC/LLNL, ORNL, RWTH Aachen, TACC

www.openmp.org
OpenMP Meeting 2013

OpenMP Release History
• Oct 1997 – 1.0 Fortran
• Oct 1998 – 1.0 C/C++
• Nov 1999 – 1.1 Fortran (interpretations added)
• Nov 2000 – 2.0 Fortran
• Mar 2002 – 2.0 C/C++
• May 2005 – 2.5 Fortran/C/C++ (mostly a merge)
• Apr 2008 – 3.0 Fortran/C/C++ (extensions)
• July 2011 – 3.1 Fortran/C/C++ (extensions)
• March 2013 – 4.0 Fortran/C/C++ (extensions)
• Nov 2015 – 4.5 Fortran/C/C++ (extensions)
• Nov 2018 – 5.0 Fortran/C/C++ (extensions)
• Nov 2020 – 5.1 Fortran/C/C++ (extensions)

• Committees work to maintain the API and keep it relevant:
  – “Keep it simple”
  – As far as possible, keep implementations consistent

www.openmp.org
OpenMP Overview
• A set of compiler directives inserted in the source program
• Also some library functions
• Ideally, the compiler directives do not affect sequential code
  – pragmas in C/C++
  – (specially written) comments in Fortran code
The OpenMP Shared Memory API
• High-level directive-based multithreaded programming
  – The user makes strategic decisions
  – Compiler figures out details
  – Threads communicate by sharing variables
  – Synchronization to order accesses and prevent data conflicts
  – Structured programming to reduce likelihood of bugs

#pragma omp parallel
#pragma omp for schedule(dynamic)
for (I = 0; I < N; I++) {
    NEAT_STUFF(I);
}   /* implicit barrier here */
Summary: What is OpenMP?
• De-facto standard API to write shared memory parallel applications in C, C++, and Fortran
• Consists of:
  – Compiler directives
  – Runtime routines
  – Environment variables
• Initial version released end of 1997
  – For Fortran only
  – Subsequent releases for C, C++
• Version 2.5 merged the specs for all three languages
OpenMP Components

Directives:
• Parallel region
• Worksharing constructs
• Tasking
• Synchronization
• Data-sharing attributes

Runtime environment (library routines):
• Number of threads
• Thread ID
• Dynamic thread adjustment
• Nested parallelism
• Schedule
• Active levels
• Thread limit
• Nesting level
• Ancestor thread
• Team size
• Locking
• Wallclock timer

Environment variables:
• Number of threads
• Scheduling type
• Dynamic thread adjustment
• Nested parallelism
• Stacksize
• Idle threads
• Active levels
• Thread limit
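As a small illustration (not from the original slides) of a few of the runtime routines and environment controls listed above, the sketch below queries the thread ID, the team size, and the wallclock timer; the number of threads could equally be set with the OMP_NUM_THREADS environment variable instead of omp_set_num_threads().

#include <stdio.h>
#include <omp.h>

int main(void)
{
    double t0 = omp_get_wtime();          /* wallclock timer */
    omp_set_num_threads(4);               /* number of threads (or set OMP_NUM_THREADS) */

    #pragma omp parallel                  /* parallel region directive */
    {
        int id = omp_get_thread_num();    /* thread ID */
        int nt = omp_get_num_threads();   /* team size */
        printf("thread %d of %d\n", id, nt);
    }

    printf("elapsed: %f s\n", omp_get_wtime() - t0);
    return 0;
}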
OpenMP Syntax
• Most OpenMP constructs are compiler directives using pragmas.
– For C and C++, the pragmas take the form:
#pragma omp construct [clause [clause]…]
– For Fortran, the directives take one of the forms:
• Fixed form
*$OMP construct [clause [clause]…]
C$OMP construct [clause [clause]…]
• Free form (but works for fixed form too)
!$OMP construct [clause [clause]…]
• Include file and the OpenMP lib module
  #include <omp.h>
  use omp_lib

OpenMP sentinel forms: #pragma omp (C/C++) and !$OMP (Fortran)
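As a small illustrative example (not from the slides), a C construct with two clauses and the include file might look like this; the function and clause choices here are only one possibility.

#include <omp.h>   /* prototypes and types for the runtime routines */

void scale(int n, double *a, double c)
{
    /* "construct [clause [clause]...]": a parallel for with a schedule
       clause and a data-sharing clause */
    #pragma omp parallel for schedule(static) firstprivate(c)
    for (int i = 0; i < n; i++)
        a[i] = c * a[i];
}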
Idea of OpenMP
Sequential code:
statement1;
statement2;
statement3;
Assume we want to execute statement 2 in parallel, and statements 1 and 3 sequentially.
Idea of OpenMP

statement1;
#pragma omp <specific OpenMP directive>
statement2;
statement3;

Statement 2 may be executed in parallel.
Statements 1 and 3 are executed sequentially.
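A concrete (illustrative) instance of this pattern, assuming the directive is a parallel loop:

#include <stdio.h>
#define N 8

int main(void)
{
    double a[N];

    for (int i = 0; i < N; i++)      /* statement 1: sequential          */
        a[i] = i;

    #pragma omp parallel for         /* the specific OpenMP directive    */
    for (int i = 0; i < N; i++)      /* statement 2: may run in parallel */
        a[i] = 2.0 * a[i];

    for (int i = 0; i < N; i++)      /* statement 3: sequential again    */
        printf("%g ", a[i]);
    printf("\n");
    return 0;
}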
Idea of OpenMP

statement1
!$OMP <specific OpenMP directive>
statement2
!$OMP END <specific OpenMP directive>
statement3

Statement 2 may be executed in parallel.
Statements 1 and 3 are executed sequentially.
Basic Idea of OpenMP
• Program has sequential parts and parallel parts
• Initial (master) thread executes the sequential parts
• Master and slaves execute the parallel parts
  – Initial thread creates a team of slave threads and becomes master of the team
  – fork-join approach
OpenMP Fork-Join Execution Model
• Master thread spawns multiple worker threads as needed; together they form a team
• A parallel region is a block of code executed by all threads in a team simultaneously

(Figure: the master thread forks a team of worker threads at each parallel region and joins them at a barrier at its end; parallel regions may be nested.)
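A minimal sketch of fork-join with one nested region follows; it assumes an OpenMP 3.0 or later compiler (omp_set_max_active_levels enables the inner fork; older codes used omp_set_nested).

#include <stdio.h>
#include <omp.h>

int main(void)
{
    omp_set_max_active_levels(2);            /* allow one level of nesting */

    #pragma omp parallel num_threads(2)      /* outer fork: master + 1 worker */
    {
        int outer = omp_get_thread_num();

        #pragma omp parallel num_threads(2)  /* nested fork inside the region */
        printf("outer %d, inner %d\n", outer, omp_get_thread_num());
    }                                        /* implicit barrier, then join */
    return 0;
}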
Basic Idea of OpenMP
• Each thread performs part of the work
• One thread per processor (or more if multicore or multithreading)
  – But not time-slicing
• Sequential parts executed by a single thread
• Dependences in parallel parts require synchronization between threads
Role of User
• User inserts directives telling the compiler how statements are to be executed
  – what parts of the program are parallel
  – how to assign code in parallel regions to threads
  – what data is private (local) to threads
• Compiler generates explicit threaded code
Role of User
• User must remove any dependences in parallel parts
  – Or introduce appropriate synchronization
• The OpenMP compiler does not check for them!
  – It is up to the programmer to ensure correctness
  – Some tools exist to help check this
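For illustration (not from the slides): the first loop below has a loop-carried dependence, so simply adding a parallel-for directive to it would produce wrong results and the compiler would not complain; the second loop's iterations are independent and safe to parallelize.

void example(int n, double *a, double *b)
{
    /* NOT safe: iteration i reads a[i-1] written by iteration i-1 */
    for (int i = 1; i < n; i++)
        a[i] = a[i-1] + b[i];

    /* Safe: iterations are independent, no synchronization needed */
    #pragma omp parallel for
    for (int i = 0; i < n; i++)
        b[i] = 2.0 * a[i];
}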
OpenMP Compiler
• OpenMP: thread programming at a “high level”
  – The user does not need to specify all the details
    • Assignment of work to threads
    • Creation of threads
• Compiler figures out details
  – Generates multithreaded code with calls to its runtime
  – Runtime starts threads, passes work to them, organizes synchronization
• Compiler must be “told” to process OpenMP
  – Compiler flags (non-standard) enable OpenMP (e.g. -openmp, -xopenmp, -fopenmp, -mp)
  – Otherwise the OpenMP directives are ignored
Status of OpenMP Implementation
• OpenMP compiler translates code and user directives into a multithreaded application
  – It is part of most standard compilers today
• Works on true shared memory machines (SMPs) and DSM architectures
• The runtime is custom: each compiler has its own
• We look briefly at implementation strategy later
OpenMP Usage

(Diagram: annotated Fortran/C/C++ source compiled with a sequential compiler yields a sequential program; compiled with an OpenMP compiler it yields a parallel program.)
OpenMP Usage
• If the program is compiled sequentially
  – OpenMP comments and pragmas are ignored
• If the code is compiled for parallel execution
  – comments and/or pragmas are read, and
  – drive translation into a parallel program
• Ideally, one source for both the sequential and the parallel program (a big maintenance plus)
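A minimal sketch (not from the slides) of this single-source idea: compiled without OpenMP support the pragma is ignored and the standard _OPENMP macro is undefined, so the same file builds as either a sequential or a parallel program.

#include <stdio.h>
#ifdef _OPENMP
#include <omp.h>          /* only available when compiling with OpenMP */
#endif

int main(void)
{
    #pragma omp parallel  /* ignored by a sequential compile */
    {
#ifdef _OPENMP
        printf("hello from thread %d\n", omp_get_thread_num());
#else
        printf("hello from the sequential program\n");
#endif
    }
    return 0;
}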
Where Does OpenMP Run?
• Shared Memory Systems – Available
• Distributed Shared Memory Systems (ccNUMA) – Available
• Distributed Memory Systems – via Software DSM
• Chip MultiThreading (Hyperthreading and other kinds of chip multithreading) – Available

(Figure: a shared memory architecture – several CPUs, each with its own cache, attached to shared memory over a shared bus.)
Are Caches “Coherent” or Not?
• Coherence means different copies of the same location have the same value; incoherent otherwise:
  – p1 and p2 both have cached copies of data (= 0)
  – p1 writes data = 1
    • May “write through” to memory
  – p2 reads data, but gets the “stale” cached copy
    • This may happen even if p2 read an updated value of another variable, flag, that came from memory

(Figure: memory holds data = 0; after the write, p1’s cache holds data = 1 while p2’s cache still holds the stale data = 0.)
OpenMP Memory Model
• OpenMP assumes a shared memory
• Threads communicate by sharing variables
• Synchronization protects against data conflicts
  – Synchronization is expensive
  – Change how data is accessed to minimize the need for synchronization
How do threads interact?
• OpenMP is a shared memory model.
• Threads interact (“communicate”) by sharing variables.
• Unintended sharing of data causes race conditions:
  – the program’s outcome may change if the threads are scheduled differently
• To prevent race conditions:
  – Use synchronization to order data access and protect data conflicts
• Synchronization is expensive, so:
  – Do what you can to change how data is accessed to minimize the need for synchronization (see the sketch below)
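A minimal sketch (illustrative, not from the slides): each iteration updates the shared variable sum, which is a race without synchronization; an atomic directive, or better a reduction clause, removes the conflict.

#include <stdio.h>

int main(void)
{
    const int n = 1000000;
    double sum = 0.0;

    #pragma omp parallel for
    for (int i = 0; i < n; i++) {
        #pragma omp atomic          /* without this the update races */
        sum += 1.0 / (i + 1);
    }
    /* Cheaper alternative: #pragma omp parallel for reduction(+:sum) */

    printf("sum = %f\n", sum);
    return 0;
}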
OpenMP Parallel Computing Solution Stack

(Diagram, top to bottom: End User; Application; the OpenMP layer of directives/compiler, OpenMP library, and environment variables; the runtime library; the OS/system.)
Parallel Regions
• You create threads in OpenMP with the “omp parallel” pragma.
• For example, to create a 4-thread parallel region:

double A[1000];
omp_set_num_threads(4);              /* runtime function to request a certain number of threads */
#pragma omp parallel
{
    int ID = omp_get_thread_num();   /* runtime function returning a thread ID */
    pooh(ID, A);                     /* each thread executes a copy of the code
                                        within the structured block */
}

• Each thread calls pooh(ID,A) for ID = 0 to 3
Parallel Regions
• Each thread executes the same code redundantly.

double A[1000];
omp_set_num_threads(4);
#pragma omp parallel
{
    int ID = omp_get_thread_num();
    pooh(ID, A);
}
printf("all done\n");

• A single copy of A is shared between all threads.
• The calls pooh(0,A), pooh(1,A), pooh(2,A), pooh(3,A) execute concurrently.
• Threads wait at the end of the parallel region for all threads to finish before proceeding (i.e. a barrier); only then is printf("all done\n") executed.
OpenMP: Structured blocks (C/C++)
• Most constructs apply to structured blocks
• Structured block: a block with one point of entry at the top and one point of exit at the bottom.
• The only “branches” allowed are STOP statements in Fortran and exit() in C/C++

A structured block (OK):

#pragma omp parallel
{
    int id = omp_get_thread_num();
more: res[id] = do_big_job(id);
    if (!conv(res[id])) goto more;    /* branch stays inside the block */
}
printf(" All done \n");

Not a structured block (NOT OK):

if (go_now()) goto more;              /* jumps into the block */
#pragma omp parallel
{
    int id = omp_get_thread_num();
more: res[id] = do_big_job(id);
    if (conv(res[id])) goto done;     /* jumps out of the block */
    goto more;
}
done: if (!really_done()) goto more;
OpenMP Parallel Regions
• In C/C++: a block is a single statement or a group of statements between { }

#pragma omp parallel
{
    id = omp_get_thread_num();
    res[id] = lots_of_work(id);
}

#pragma omp parallel for
for (i = 0; i < N; i++) {
    res[i] = big_calc(i);
    A[i] = B[i] + res[i];
}

• In Fortran: a block is a single statement or a group of statements between directive/end-directive pairs.

C$OMP PARALLEL
10    wrk(id) = garbage(id)
      res(id) = wrk(id)**2
      if(.not.conv(res(id))) goto 10
C$OMP END PARALLEL

C$OMP PARALLEL DO
      do i=1,N
        res(i) = bigComp(i)
      end do
C$OMP END PARALLEL DO
Scope of OpenMP Region
A parallel region can span multiple source files.

foo.f:
C$OMP PARALLEL
      call whoami
C$OMP END PARALLEL

bar.f:
      subroutine whoami
      external omp_get_thread_num
      integer iam, omp_get_thread_num
      iam = omp_get_thread_num()
C$OMP CRITICAL
      print*, 'Hello from ', iam
C$OMP END CRITICAL
      return
      end

• The static/lexical extent of the parallel region is the code between the PARALLEL directives in foo.f.
• The dynamic extent of the parallel region also includes the code executed inside whoami.
• Orphaned directives (here the CRITICAL construct) can appear outside the lexical extent of a parallel construct.
Exercise:
A multi-threaded “Hello world” program
• Write a multithreaded program where each thread prints “hello world”.
• Sequential starting point:

#include <stdio.h>   /* needed for printf */
void main()
{
    int ID = 0;
    printf(" hello(%d) ", ID);
    printf(" world(%d) \n", ID);
}
A multi-threaded “Hello world” program
• Write a multithreaded program where each thread prints “hello world”.

#include <stdio.h>               /* needed for printf */
#include "omp.h"                 /* OpenMP include file */
void main()
{
    #pragma omp parallel         /* parallel region with default number of threads */
    {
        int ID = omp_get_thread_num();   /* runtime library function to return a thread ID */
        printf(" hello(%d) ", ID);
        printf(" world(%d) \n", ID);
    }                            /* end of the parallel region */
}

Sample output:
hello(1) hello(0) world(1)
world(0)
hello(3) hello(2)
world(2)
world(3)
