Programming Shared-Memory Parallel
Systems - OpenMP
Dr. Muhammad Aleem,
Department of Computer Science,
National University of Computer & Emerging Sciences,
Islamabad Campus
Parallel and Distributed Computing
• The key difference is whether the processes communicate through shared memory or distributed memory
OpenMP
• Programming of shared memory systems
• An API for Fortran and C/C++
– Directives
– Runtime routines
– Environment variables
Lecture credits: https://computing.llnl.gov/tutorials/openMP/
Memory Models
Goals
• Standardization
– Provide a standard among a variety of shared
memory architectures (platforms)
• High-level interfaces to thread programming
• Multi-vendor support
• Multi-OS support (Unix, Windows, Mac, etc.)
• The MP in OpenMP is for Multi-Processing
• Don’t confuse OpenMP with Open MPI! :)
Credits: http://its.unc.edu/research-computing.html
Release History
Programming Shared Memory Systems
• Explicit Parallelism
– For example, pthreads
• Programmer Directed
– For example, OpenMP
Hello World – pthreads based version
#include <pthread.h>
#include <stdio.h>
void* thrfunc(void* arg) {
    printf("hello from thread %d\n", *(int*)arg);
    return NULL;
}
int main(void) {
    pthread_t thread[4];
    pthread_attr_t attr;
    int arg[4] = {0,1,2,3};
    int i;
    // set up joinable threads
    pthread_attr_init(&attr);
    pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_JOINABLE);
    // create the 4 threads
    for (i = 0; i < 4; i++)
        pthread_create(&thread[i], &attr, thrfunc, (void*)&arg[i]);
    // wait for the 4 threads to finish
    for (i = 0; i < 4; i++)
        pthread_join(thread[i], NULL);
    return 0;
}
... and the OpenMP version
#include <omp.h>
#include <stdio.h>
int main(int argc, char* argv[])
{
    #pragma omp parallel
    {
        printf("Hello World... from thread = %d\n",
               omp_get_thread_num());
    }
}
Compilation: $ gcc -fopenmp hello.c -o hello
Execution: $ export OMP_NUM_THREADS=4
$ ./hello
Demo: hello.c
Compiling
• Intel (icc, ifort, icpc): -openmp
• PGI (pgcc, pgf90, …): -mp
• GNU (gcc, gfortran, g++): -fopenmp
OpenMP - User Interface Model
• Shared Memory with thread based parallelism
• Not a new language
• Compiler directives, library calls, and environment
variables extend the base language
– f77, f90, f95, C, C++
• Not automatic parallelization
– User explicitly specifies parallelism
– NOTE: The compiler follows the user's directives even when they are
wrong; correctness is the programmer's responsibility
OpenMP - Syntax
• Parallelism is expressed using compiler directives (pragmas)
• For C and C++, the pragmas take the form:
#pragma omp construct [clause [clause]…]
• Any compiler (even if it does not have OpenMP support)
can compile the program (with no parallelism though)
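• A minimal sketch (not from the original slides) using the standard _OPENMP macro so the same source also builds with a compiler that ignores the pragmas:
#include <stdio.h>
#ifdef _OPENMP
#include <omp.h>                      // only available when OpenMP support is enabled
#endif
int main(void)
{
    #pragma omp parallel              // silently ignored by non-OpenMP compilers
    {
    #ifdef _OPENMP
        printf("thread %d\n", omp_get_thread_num());
    #else
        printf("compiled without OpenMP\n");
    #endif
    }
    return 0;
}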
Fork*/Join Execution Model
• An OpenMP program starts as a single thread (master thread).
• Additional threads (a team) are created when the master thread reaches a
parallel region.
• When all threads have finished the parallel region, the extra threads are
returned to the runtime or the operating system
*Not to be confused with fork() system call
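• A minimal sketch (not from the original slides) of the fork/join model: serial code, a parallel region executed by a team, then serial code again after the implicit join:
#include <omp.h>
#include <stdio.h>
int main(void)
{
    printf("serial: %d thread(s)\n", omp_get_num_threads());    // master only: prints 1
    #pragma omp parallel num_threads(4)                          // fork: team of 4 threads
    {
        printf("parallel: thread %d of %d\n",
               omp_get_thread_num(), omp_get_num_threads());
    }                                                            // join: implicit barrier here
    printf("serial again: master continues alone\n");
    return 0;
}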
Using OpenMP
• OpenMP is usually used to parallelize loops:
– Find most time consuming loops
– Split them among threads
Split-up this loop between multiple threads
Sequential program:
void main( )
{
    double Res[1000];
    for (int i = 0; i < 1000; i++) {
        do_huge_comp(Res[i]);
    }
}
Parallel program:
void main( )
{
    double Res[1000];
    #pragma omp parallel for
    for (int i = 0; i < 1000; i++) {
        do_huge_comp(Res[i]);
    }
}
OpenMP Directives
OpenMP - Directives
• OpenMP compiler directives are used for various
purposes:
– Spawning a parallel region
– Dividing blocks of code among threads
– Distributing loop iterations between threads
–…
sentinel directive-name [clause, ...]
#pragma omp parallel private(var)
Supported Clauses for the Parallel Construct
Valid Clauses:
if (logical expression)
num_threads (integer)
private (list of variables)
firstprivate (list of variables)
shared (list of variables)
default (none|shared|private *fortran only*)
copyin (list of variables)
reduction (operator: list)
…
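• A minimal sketch (not from the original slides) combining several of these clauses on one parallel construct; n, x and sum are illustrative variables:
int n = 100000;
double x = 3.14, sum = 0.0;
#pragma omp parallel if(n > 1000) num_threads(4) firstprivate(x) shared(n) reduction(+:sum)
{
    sum += x;     // each thread adds its private, pre-initialized copy of x
}
// sum ends up as 4 * 3.14 when the region runs with 4 threads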
OpenMP Constructs
• OpenMP constructs can be divided into 5
categories:
1. Parallel Regions
2. Work-sharing
3. Data Environment
4. Synchronization
5. Runtime functions/environment variables
OpenMP: Parallel Regions
• You create threads in OpenMP with “omp parallel” pragma
• For example: a 4-thread based Parallel region:
int A[10];
omp_set_num_threads(4);
#pragma omp parallel
{
    int ID = omp_get_thread_num();
    fun1(ID, A);
}
Demo: helloFun.c
• Implicit barrier at the end of parallel block
• Each thread calls fun1(ID,A) for ID = 0 to 3
• Each thread executes the same code within the block
Credits: University of Houston
The parallel directive
• A parallel region is a block of code that will be executed by
multiple threads
• When the (serial) program reaches a PARALLEL directive, a
team of threads is created and the main (serial execution)
thread becomes the master of the team
• Master thread has id or number 0 (within that team)
• The code is duplicated and all threads will execute that
code
• There is an implicit barrier at the end of a parallel region
• Master thread continues execution after this point
The parallel directive
• Some common clauses include:
– if (expression)
– private (list)
– shared (list)
– num_threads (integer-expression)
How Many Threads?
• The number of threads in a parallel region is determined
by the following factors, in order of precedence:
1. Evaluation of the if clause
2. Setting of the NUM_THREADS clause
3. Use of the omp_set_num_threads( ) library function
4. Setting of the OMP_NUM_THREADS environment
variable
5. Implementation default: Usually the number of CPUs on
a node
• Threads are numbered from 0 (master thread) to N-1
IF clause
• Execute in parallel if expression is true
• Otherwise serial execution
NUM_THREADS clause
#pragma omp parallel if(np > 1) num_threads(np)
{
…
}
• Execute in parallel if expression is true
• Executes using np number of threads
omp_set_num_threads( ) function
#define TOTAL_THREADS 8
int main( )
{
omp_set_num_threads(TOTAL_THREADS);
#pragma omp parallel
{
. . .
}
. . .
• Execute in parallel using 8 threads
OMP_NUM_THREADS – Environment Variable
$ export OMP_NUM_THREADS=4
$ echo $OMP_NUM_THREADS
• Sets and displays the value of the environment
variable OMP_NUM_THREADS
Execution Status in Parallel Region
int omp_in_parallel()
• Returns non-zero: if execution is in parallel region
• Returns zero: if execution in non-parallel region
Demo: PRegion.c
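• A minimal sketch (not from the original slides) showing omp_in_parallel() inside and outside a parallel region:
#include <omp.h>
#include <stdio.h>
int main(void)
{
    printf("outside: in parallel? %d\n", omp_in_parallel());     // prints 0
    #pragma omp parallel num_threads(2)
    {
        printf("inside:  in parallel? %d\n", omp_in_parallel()); // prints non-zero
    }
    return 0;
}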
Shared and Private Data
• Shared data are accessible by all threads
• A reference a[5] to a shared array accesses the
same address in all threads
• Private data are accessible only by a thread
– Each thread has its own copy
• The default is shared
Shared and Private Data
int main(int argc, char* argv[])
{
int threadData = 10;
// Beginning of parallel region
#pragma omp parallel private(threadData)
{
threadData =200;
}
// Ending of parallel region
printf("Value: %d\n", threadData);
}
Demo: SPData.c
Shared and Private Data
#pragma omp parallel shared(list)
• Default behavior
• List will be shared
• Each thread accesses the same memory location
• The initial value inside the region is the value the variable had
before the region
• The final value after the region is whatever the last thread wrote
• Problems: Data Race
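• A minimal sketch (not from the original slides) of the data-race problem on a shared variable; with several threads the final count is often lower than expected:
int count = 0;                        // shared by default
#pragma omp parallel num_threads(8)
{
    count = count + 1;                // read-modify-write is not atomic: a data race
}
printf("count = %d\n", count);        // may print anything from 1 to 8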
Shared and Private Data
• Data local to thread
• You should not rely on its initial value inside the region, nor on its
value after the parallel region has executed
• Separate “Stack Memory” for each thread’s private data
• No storage associated with original object (even with
same name for data-items)
• Use firstprivate and/or lastprivate clause to override
Shared and Private Data
• firstprivate (list)
  – Variables in list are private
  – Initialized with the value the variable had before entering the construct
• lastprivate (list)
  – Used with "for" loops
  – Variables in list are private
  – The thread that executes the final iteration of the loop copies its
    value back to the original variable after the construct
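• A minimal sketch (not from the original slides) of firstprivate and lastprivate in a parallel for loop:
int x = 5, last = -1, i;
#pragma omp parallel for firstprivate(x) lastprivate(last)
for (i = 0; i < 8; i++) {
    last = x + i;                 // every thread starts with its own x == 5
}
printf("last = %d\n", last);      // value from the final iteration: 5 + 7 = 12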
Shared and Private Data
#pragma omp parallel default(shared) private(list)
#pragma omp parallel default(none) private(list) shared(list)
• default(private) is available in Fortran only (see the clause list earlier)
• Alter the default data-sharing behavior
• To implement customized access behavior
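• A minimal sketch (not from the original slides) of default(none), which forces every variable used in the region to be listed explicitly:
int a = 1, b = 2;
#pragma omp parallel default(none) shared(a) private(b)
{
    b = a + omp_get_thread_num();   // a must appear in shared(), b in private()
}
// referring to a variable not listed in any clause is a compile-time error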
Shared and Private Data – Examples (1/4 – 4/4)
Demo: SPDE1.c
Getting ID of Current Thread
int main(int argc, char* argv[])
{
    int iam, nthreads;
    #pragma omp parallel private(iam, nthreads) num_threads(2)
    {
        iam = omp_get_thread_num();
        nthreads = omp_get_num_threads();
        printf("ThreadID %d, out of %d threads\n", iam, nthreads);
        if (iam == 0)
            printf("Here is the Master Thread.\n");
        else
            printf("Here is another thread.\n");
    }
    return 0;
}
Demo: CTID.c
Work-Sharing Constructs
• If all the threads are doing the same thing, what is the
advantage then?
• Within each “Team” threads are assigned IDs, with master
thread assigned ID 0
– omp_get_thread_num() //to get thread number
Can we use this to distribute tasks amongst the
“team” members?
• Work-sharing constructs distribute the specified work to
all threads within the current team
Do/For Work-Sharing Construct
• DO / for – shares the iterations of a loop across the team
#pragma omp for [clause ...]
• There is an implicit barrier (synchronization) at the end of a
#pragma omp for construct
Do/For Work-Sharing Construct
• SCHEDULE clause describes how iterations of the
loop are divided among the threads in the team
– static: chunks of the specified size are assigned to threads round-robin
– dynamic: chunks of the specified size are handed out from a shared work
queue as each thread finishes its previous chunk
Do/For Work-Sharing Construct
int main(int argc, char* argv[])
{
int i, a[10];
#pragma omp parallel num_threads(2)
{
#pragma omp for schedule(static, 2)
for ( i=0; i<10;i++)
a[i] = omp_get_thread_num();
}
for ( i=0; i<10;i++)
printf("%d",a[i]);
}
Demo: ForConst.c
Do/For Work-Sharing Construct
int main(int argc, char* argv[])
{
    int sum = 0, counter, inputList[6] = {11, 45, 3, 5, 12, -3};
    #pragma omp parallel num_threads(2)
    {
        #pragma omp for schedule(static, 3)
        for (counter = 0; counter < 6; counter++) {
            printf("%d adding %d to the sum\n",
                   omp_get_thread_num(), inputList[counter]);
            sum += inputList[counter];   // sum is shared: updates may race
        } //end of for
    } //end of parallel section
    printf("The summed up Value: %d\n", sum);
}
Demo: ForConst2.c
Do/For Work-Sharing Construct
int main(int argc, char* argv[])
{
    int max = 0, counter, inputList[6] = {11, 45, 3, 5, 12, -3};
    #pragma omp parallel num_threads(2)
    {
        #pragma omp for schedule(static, 3)
        for (counter = 0; counter < 6; counter++) {
            if (max < inputList[counter])
                max = inputList[counter];   // max is shared: updates may race
        } //end of for
    } //end of parallel section
    printf("The maximum Value: %d\n", max);
}
Demo: ForConst2.c
For Work-Sharing – Synchronized
For Work-Sharing – Non-Synchronized
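• A minimal sketch (not from the original slides) of a non-synchronized work-sharing loop using the nowait clause, which removes the implicit barrier at the end of the for construct; work_a and work_b are hypothetical functions:
#pragma omp parallel num_threads(4)
{
    #pragma omp for nowait           // threads do not wait for each other here
    for (int i = 0; i < 100; i++)
        work_a(i);                   // hypothetical function
    #pragma omp for                  // implicit barrier at the end of this loop
    for (int i = 0; i < 100; i++)
        work_b(i);                   // hypothetical function
}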
Problems with Static Scheduling
• What happens if loop iterations do not take the same
amount of time?
▪ Load imbalance
Dynamic Scheduling
• Fixed-size chunks are assigned on the fly
• Threads take the next chunk from a shared work queue as they finish
• Disadvantage: more scheduling overhead compared to static
Demo: LoopSched.c
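• A minimal sketch (not from the original slides) of dynamic scheduling, useful when iterations take unequal time; variable_cost_work is a hypothetical function:
#pragma omp parallel for schedule(dynamic, 4) num_threads(4)
for (int i = 0; i < 1000; i++) {
    variable_cost_work(i);   // hypothetical function whose cost depends on i
}
// chunks of 4 iterations are handed out as threads become free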
ThreadCount: OpenMP Implementation
int main(int argc, char* argv[])
{
int threadCount=0;
#pragma omp parallel num_threads(100)
{
        int myLocalCount = threadCount;   // read the shared counter
        sleep(1);                         // delay makes the lost-update race easy to observe
        myLocalCount++;
        threadCount = myLocalCount;       // write back: updates from other threads may be lost
}
printf("Total Number of Threads: %d\n", threadCount);
}
Demo: TCount1.c
Critical-Section (CS) Problem
• n processes all competing to use some shared data
• Each process has a code segment, called the critical section,
in which the shared data is accessed
• Problem: ensure that
– No two processes are allowed to execute in their critical
sections at the same time
– Access to the critical section is an atomic (mutually exclusive) action
Critical Section
[Timeline figure: Process A enters its critical section at T1 and leaves at T3;
Process B attempts to enter at T2, blocks while A is inside, enters at T3, and leaves at T4.]
Mutual Exclusion
At any given time, only one process is in the critical section
OpenMP - Synchronization Constructs
• The CRITICAL directive specifies a region of code that
must be executed by only one thread at a time
• If a thread is currently executing inside a CRITICAL region
and another thread attempts to execute it, it will block
until the first thread exits that CRITICAL region.
#pragma omp critical [ name ]
…
… back to threadCount
int main(int argc, char* argv[])
{
int threadCount = 0;
#pragma omp parallel num_threads(5)
{
#pragma omp critical
{
int myLocalCount = threadCount;
sleep(1);
myLocalCount++;
threadCount = myLocalCount;
}
}
printf("Total Number of Threads: %d\n", threadCount);
}
Demo: TCount2.c
OpenMP - Synchronization Constructs
• The MASTER directive specifies a region that is to
be executed only by the master thread of the
team
• All other threads on the team skip this section of
code
#pragma omp master
…
Demo: MasterOnly.c
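• A minimal sketch (not from the original slides) of the MASTER directive inside a parallel region; do_work is a hypothetical function:
#pragma omp parallel num_threads(4)
{
    do_work(omp_get_thread_num());   // hypothetical function: executed by all threads
    #pragma omp master
    {
        printf("only the master (thread 0) prints this\n");
    }
    // note: MASTER has no implied barrier on entry or exit
}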
OpenMP - Synchronization Constructs
• When a BARRIER directive is reached, a thread will wait
at that point until all other threads have reached that
barrier
• All threads then resume executing in parallel the code
that follows the barrier.
#pragma omp barrier
…
Barrier Synchronization
[Figure: threads wait at the barrier until all have arrived]
Demo: Barrier.c
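• A minimal sketch (not from the original slides) using an explicit barrier between two phases:
int data[4];
#pragma omp parallel num_threads(4)
{
    int id = omp_get_thread_num();
    data[id] = id * id;              // phase 1: each thread fills its own slot
    #pragma omp barrier              // wait until every slot has been written
    printf("thread %d sees neighbour value %d\n", id, data[(id + 1) % 4]);
}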
Reduction (Data-sharing Attribute Clause)
• The REDUCTION clause performs a reduction operation on
the variables that appear in the list
• A private copy for each list variable is created and initialized
for each thread
• At the end of the construct, the private copies are combined using the
operator and the result is written to the shared reduction variable
#pragma omp … reduction (operator : list)
…
• operator can be +, -, *, &&, ||, max, min, …
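• A minimal sketch (not from the original slides) of a sum reduction over a loop:
int i, sum = 0;
#pragma omp parallel for reduction(+:sum) num_threads(4)
for (i = 0; i < 100; i++) {
    sum += i;                 // each thread accumulates into its private copy
}
printf("sum = %d\n", sum);    // prints 4950; the copies are combined at the end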
Reduction (Data-sharing Attribute Clause)
int main(int argc, char* argv[])
{
srand(time(NULL));
int winner = 0;
#pragma omp parallel reduction(max:winner) num_threads(10)
{
winner = (rand() % 1000) + omp_get_thread_num();
printf("Thread: %d has Chosen: %d\n",
omp_get_thread_num(),winner);
}
printf("Winner: %d\n", winner);
Demo: Reduction.c
Any Questions?