
Shared-Memory Programming: OpenMP

National Tsing-Hua University

2017, Summer Semester
What’s OpenMP
 OpenMP == Open specification for Multi-Processing
 An API for multi-threaded, shared-memory parallelism
 Portable: the API is specified for C/C++ and Fortran
 Fork-Join model: the master thread forks a specified number of slave threads and divides the task among them
 Compiler-directive based: the compiler takes care of generating code that forks/joins threads and divides tasks among the threads

2
Example
 Add two data arrays in parallel by specifying compiler directives:
 Slave threads are forked and each thread works on different iterations

#include <omp.h>

// Serial code
int A[10], B[10], C[10];

// Beginning of parallel section: fork a team of threads.
#pragma omp parallel for num_threads(10)
for (int i = 0; i < 10; i++)
    A[i] = B[i] + C[i];
/* All threads join the master thread and terminate */

3
OpenMP Directives
 C/C++ Format:
#pragma omp directive-name [clause, ...] newline
    directive-name: required; a valid OpenMP directive (e.g. parallel, do, for)
    [clause, ...]: optional; clauses can be in any order and repeated as necessary
    newline: required
 Example:
 #pragma omp parallel default(shared) private(beta,pi)
    (directive-name: parallel; clauses: default(shared) and private(beta,pi))

 General Rules:
 Case sensitive
 Only one directive-name may be specified per directive
 Each directive applies to at most one succeeding statement,
which must be a structured block
4
OpenMP Outline
 Parallel Region Construct
 Parallel Directive

 Work-Sharing Construct
 DO/for Directive
 SECTIONS Directive
 SINGLE Directive

 Synchronization Construct
 Data Scope Attribute Clauses
 Run-Time Library Routines
5
Parallel Region Constructs --- Parallel Directive
 A parallel region is a block of code executed by multiple threads

#pragma omp parallel [clause ...]
    clauses include: if (scalar_expression), num_threads (integer-expression)
  structured_block

 Overview:
 When PARALLEL is reached, a team of threads is created
 The parallel region code is duplicated and executed by all threads
 There is an implied barrier at the end of a parallel section
 If one thread terminates, all threads terminate
 Limitations:
 A parallel region must be a structured block that does not span multiple routines or code files
 It is illegal to branch (goto) into or out of a parallel region, but you can call other functions within a parallel region
6
Parallel Region --- How Many Threads
 The number of threads in a parallel region is determined by the following precedence, in order:
 Evaluation of the if clause
If it evaluates to FALSE, the region is executed serially by the master thread
E.g.: #pragma omp parallel if(para == true)
 Setting of the num_threads clause
E.g.: #pragma omp parallel num_threads(10)
 Use of the omp_set_num_threads() library function
Called BEFORE the parallel region
 Setting of the OMP_NUM_THREADS environment variable
Set BEFORE the program is executed
 By default: usually the number of CPUs on a node (see the sketch below)
7
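 A minimal sketch (not from the slides) illustrating this precedence: the num_threads clause overrides a prior omp_set_num_threads() call, which in turn overrides OMP_NUM_THREADS:

#include <stdio.h>
#include <omp.h>

int main(void) {
    omp_set_num_threads(4);                  /* requests 4 threads for later regions */

    #pragma omp parallel                     /* uses the setting above: 4 threads */
    {
        #pragma omp single
        printf("first region: %d threads\n", omp_get_num_threads());
    }

    #pragma omp parallel num_threads(2)      /* the clause takes precedence: 2 threads */
    {
        #pragma omp single
        printf("second region: %d threads\n", omp_get_num_threads());
    }
    return 0;
}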
Nested Parallel Region
// With nesting enabled, a total of 6 "hello world!" messages are printed
#pragma omp parallel num_threads(2)
{
    #pragma omp parallel num_threads(3)
    {
        printf("hello world!\n");
    }
}

 To check if nested parallel regions are enabled:
 omp_get_nested()
 To enable/disable nested parallel regions:
 omp_set_nested(int)
 Setting of the OMP_NESTED environment variable
 If nesting is not supported or not enabled:
 Only one thread is created for the nested parallel region code (see the sketch below)
8
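 A minimal sketch (not from the slides) of enabling nesting at run time; with nesting disabled only 2 messages appear, with it enabled 2 x 3 = 6:

#include <stdio.h>
#include <omp.h>

int main(void) {
    omp_set_nested(1);                           /* enable nested parallel regions */
    printf("nested enabled? %d\n", omp_get_nested());

    #pragma omp parallel num_threads(2)          /* outer team: 2 threads */
    {
        #pragma omp parallel num_threads(3)      /* each outer thread forks 3 inner threads */
        printf("hello world! (inner thread %d)\n", omp_get_thread_num());
    }
    return 0;
}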
OpenMP Outline
 Parallel Region Construct
 Parallel Directive

 Work-Sharing Construct
 DO/for Directive
 SECTIONS Directive
 SINGLE Directive

 Synchronization Construct
 Data Scope Attribute Clauses
 Run-Time Library Routines
9
Work-Sharing Constructs
 Definition:
 A work-sharing construct divides the execution of the
enclosed code region among the threads that
encounter it
 Work-sharing constructs DO NOT launch new threads
 There is no implied barrier upon entry to a work-sharing construct; however, there is an implied barrier at the end of a work-sharing construct

10
Type of Work-Sharing Constructs
 DO / for - shares iterations of a loop across the team. Represents a type of "data parallelism".
 SECTIONS - breaks work into separate, discrete sections of code. Each section is executed by a thread.
 SINGLE - serializes a section of code by running it with a single thread.

 Notice:
 Work-sharing constructs should be enclosed within a parallel region to achieve parallelism
11
DO / for Directive
 Purpose: indicates that the iterations of the loop immediately following it must be executed in parallel by the team of threads

#pragma omp for [clause ...]
    clauses include: schedule (type [, chunk]), ordered, nowait, collapse (n)
  for_loop
 Do/for Directive Specific Clauses:
 nowait: Do not synchronize threads at the end of the loop
 schedule: Describes how iterations are divided among threads
 ordered: Iterations must be executed as in a serial program
 collapse: Specifies how many loops in a nested loop should be
collapsed into one large iteration space and divided according
to the schedule clause
12
DO / for Directive --- Schedule Clause
 STATIC
 Loop iterations are divided into chunks
 If chunk is not specified, the iterations are evenly (if possible)
divided contiguously among the threads
 Then statically assigned to threads

 DYNAMIC: when a thread finishes one chunk (default size: 1), it is dynamically assigned another
 GUIDED: similar to DYNAMIC, except the chunk size decreases over time (better load balancing)
 RUNTIME: the scheduling decision is deferred until run time via the environment variable OMP_SCHEDULE (see the sketch below)
 AUTO: the scheduling decision is delegated to the compiler and/or runtime system
13
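 A minimal sketch (not from the slides) of RUNTIME scheduling: the schedule is taken from the OMP_SCHEDULE environment variable, e.g. export OMP_SCHEDULE="guided,4":

#include <stdio.h>
#include <omp.h>

int main(void) {
    #pragma omp parallel for schedule(runtime)   /* schedule chosen by OMP_SCHEDULE at run time */
    for (int i = 0; i < 16; i++)
        printf("iter %d on thread %d\n", i, omp_get_thread_num());
    return 0;
}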
Scheduling Examples
 A for loop with 100 iterations and 4 threads:
 schedule(static, 10)
Thread0: Iter 0-9, Iter 40-49, Iter 80-89
Thread1: Iter 10-19, Iter 50-59, Iter 90-99
Thread2: Iter 20-29, Iter 60-69
Thread3: Iter 30-39, Iter 70-79
 schedule(dynamic, 10) - one possible assignment
Thread0: Iter 0-9, Iter 70-79, Iter 80-89, Iter 90-99
Thread1: Iter 10-19, Iter 50-59
Thread2: Iter 20-29, Iter 60-69
Thread3: Iter 30-39, Iter 40-49
14
Scheduling Examples
 A for loop with 100 iterations and 4 threads:
 schedule(guided, 10) - one possible assignment (chunk sizes shrink over time)
Thread0: Iter 0-9, Iter 40-49, Iter 80-84
Thread1: Iter 10-19, Iter 50-59, Iter 85-89
Thread2: Iter 20-29, Iter 60-69, Iter 90-94
Thread3: Iter 30-39, Iter 70-79, Iter 95-99

15
DO / for Directive --- Example
#include <omp.h>
#define NUM_THREADS 2
#define CHUNKSIZE 100
#define N 1000

int main() {
    int i, a[N], b[N], c[N];
    /* Some initializations */
    for (i = 0; i < N; i++) a[i] = b[i] = i;
    int chunk = CHUNKSIZE;
    int nthreads = NUM_THREADS;

    /* a, b, c are shared among threads; i is private to each thread */
    #pragma omp parallel num_threads(nthreads) shared(a,b,c) private(i)
    {
        #pragma omp for schedule(dynamic,chunk) nowait
        for (i = 0; i < N; i++) c[i] = a[i] + b[i];
    } /* end of parallel section */
}
16
DO / for Directive --- Ordered
Without the ordered clause, iterations complete in arbitrary order:

#pragma omp parallel for
for (int i = 0; i < 10; i++)
    printf("i=%d, thread = %d\n", i, omp_get_thread_num());

i=2, thread = 0
i=0, thread = 1
i=1, thread = 2
i=3, thread = 1
i=4, thread = 0
i=8, thread = 2
i=5, thread = 1
i=6, thread = 2
i=9, thread = 1
i=7, thread = 1

With the ordered clause (and an ordered region around the printf), output follows the serial iteration order:

#pragma omp parallel for ordered
for (int i = 0; i < 10; i++) {
    #pragma omp ordered
    printf("i=%d, thread = %d\n", i, omp_get_thread_num());
}

i=0, thread = 0
i=1, thread = 1
i=2, thread = 2
i=3, thread = 1
i=4, thread = 0
i=5, thread = 2
i=6, thread = 1
i=7, thread = 2
i=8, thread = 1
i=9, thread = 1
17
DO / for Directive --- Collapse
Without collapse, only the 3 outer iterations are distributed, so at most 3 of the 6 threads do work:

#pragma omp parallel num_threads(6)
#pragma omp for schedule(dynamic)
for (int i = 0; i < 3; i++)
    for (int j = 0; j < 3; j++)
        printf("i=%d, j=%d, thread = %d\n", i, j, omp_get_thread_num());

i=1, j=0, thread = 1
i=2, j=0, thread = 2
i=0, j=0, thread = 0
i=1, j=1, thread = 1
i=2, j=1, thread = 2
i=0, j=1, thread = 0
i=1, j=2, thread = 1
i=2, j=2, thread = 2
i=0, j=2, thread = 0

With collapse(2), the 3x3 nested loops form a single iteration space of 9, divided among all 6 threads:

#pragma omp parallel num_threads(6)
#pragma omp for schedule(dynamic) collapse(2)
for (int i = 0; i < 3; i++)
    for (int j = 0; j < 3; j++)
        printf("i=%d, j=%d, thread = %d\n", i, j, omp_get_thread_num());

i=0, j=0, thread = 0
i=0, j=2, thread = 1
i=1, j=0, thread = 2
i=2, j=0, thread = 4
i=0, j=1, thread = 0
i=1, j=2, thread = 3
i=2, j=2, thread = 5
i=1, j=1, thread = 2
i=2, j=1, thread = 4
18
SECTIONS Directive
 A non-iterative work-sharing construct
 It specifies that the enclosed section(s) of CODE are to be
divided among the threads in the team
 Independent SECTION directives are nested within a
SECTIONS directive
 Each SECTION is executed ONCE by ONE thread
 The mapping between threads and sections is decided by the
library implementation
#pragma omp sections [clause ...]
{
    #pragma omp section
        structured_block

    #pragma omp section
        structured_block
}
19
SECTIONS Directive --- Example
int N = 1000;
int a[N], b[N], c[N], d[N], i;
#pragma omp parallel num_threads(2) shared(a,b,c,d) private(i)
{
    #pragma omp sections            /* specify sections */
    {
        #pragma omp section         /* 1st section */
        {
            for (i = 0; i < N; i++) c[i] = a[i] + b[i];
        }
        #pragma omp section         /* 2nd section */
        {
            for (i = 0; i < N; i++) d[i] = a[i] + b[i];
        }
    } /* end of sections */
} /* end of parallel section */
20
SINGLE Directive
 The SINGLE directive specifies that the enclosed code is to be
executed by only one thread in the team.
 May be useful when dealing with sections of code that are
not thread safe (such as I/O)
 Threads in the team that do not execute the SINGLE directive wait at the end of the enclosed code block, unless a nowait clause is specified
 Example:

int input;
#pragma omp parallel num_threads(10) shared(input)
{
    // computing code that can be processed in parallel
    #pragma omp single              /* serialized section */
    {
        scanf("%d", &input);
    } /* end of serialized I/O call */

    printf("input is %d", input);
} /* end of parallel section */
21
OpenMP Outline
 Parallel Region Construct
 Parallel Directive

 Work-Sharing Construct
 DO/for Directive
 SECTIONS Directive
 SINGLE Directive

 Synchronization Construct
 Data Scope Attribute Clauses
 Run-Time Library Routines
22
Synchronization Constructs
 For synchronization purpose among threads
#pragma omp synchronization_directive [clause ...]
  structured_block
 Synchronization Directives
 master: only executed by the master thread
No implicit barrier at the end
More efficient than SINGLE directive
 critical: must be executed by only one thread at a time
Threads will be blocked until the critical section is clear
 barrier: blocked until all threads reach the call
 atomic: the memory location must be updated atomically
provides a mini-critical section (see the sketch below)
23
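 A minimal sketch (not from the slides) of the atomic directive protecting a shared counter; the loop bound of 1000000 is arbitrary:

#include <stdio.h>
#include <omp.h>

int main(void) {
    int count = 0;
    #pragma omp parallel for
    for (int i = 0; i < 1000000; i++) {
        #pragma omp atomic          /* mini-critical section: only this update is protected */
        count++;
    }
    printf("count = %d\n", count);  /* always 1000000 */
    return 0;
}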
LOCK OpenMP Routine
 void omp_init_lock(omp_lock_t *lock)
 Initializes a lock associated with the lock variable
 void omp_destroy_lock(omp_lock_t *lock)
 Disassociates the given lock variable from any locks
 void omp_set_lock(omp_lock_t *lock)
 Forces the calling thread to wait until the specified lock is available
 void omp_unset_lock(omp_lock_t *lock)
 Releases the lock held by the executing thread
 int omp_test_lock(omp_lock_t *lock)
 Attempts to set a lock, but does NOT block if it is unavailable (see the sketch below)

24
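 A minimal sketch (not from the slides) of omp_test_lock: each thread polls the lock and counts its failed attempts instead of blocking in omp_set_lock:

#include <stdio.h>
#include <omp.h>

int main(void) {
    omp_lock_t lock;
    int counter = 0;
    omp_init_lock(&lock);

    #pragma omp parallel num_threads(4)
    {
        int attempts = 0;
        while (!omp_test_lock(&lock))    /* returns nonzero once the lock is acquired */
            attempts++;                  /* a real program could do useful work here */
        counter++;                       /* protected by the lock */
        omp_unset_lock(&lock);
        printf("thread %d acquired the lock after %d failed attempts\n",
               omp_get_thread_num(), attempts);
    }

    omp_destroy_lock(&lock);
    printf("counter = %d\n", counter);
    return 0;
}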
Example & Comparison
 Advantages of using critical over locks:
 no need to declare, initialize and destroy a lock
 the extent of the critical section is always explicit (the structured block)
 less overhead, with compiler assistance

Using the critical directive:

#include <omp.h>
int main() {
    int count = 0;
    #pragma omp parallel
    #pragma omp critical
    count++;
}

Using the lock routines:

#include <omp.h>
int main() {
    int count = 0;
    omp_lock_t lock;
    omp_init_lock(&lock);
    #pragma omp parallel
    {
        omp_set_lock(&lock);
        count++;
        omp_unset_lock(&lock);
    }
    omp_destroy_lock(&lock);
}
25
OpenMP Outline
 Parallel Region Construct
 Parallel Directive

 Work-Sharing Construct
 DO/for Directive
 SECTIONS Directive
 SINGLE Directive

 Synchronization Construct
 Data Scope Attribute Clauses
 Run-Time Library Routines
26
OpenMP Data Scope
 It is critical to understand the scope of each data item
 OpenMP is based on the shared memory programming model
 Most variables are shared by default
 Global shared variables:
 File scope variables, static variables
 Private non-shared variables:
 Loop index variables
 Stack variables in subroutines called from parallel regions (see the sketch after this slide)
 Data scope can be explicitly defined by clauses…
 PRIVATE , SHARED, FIRSTPRIVATE, LASTPRIVATE
 DEFAULT, REDUCTION, COPYIN
27
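 A minimal sketch (not from the slides) of these default scoping rules:

#include <stdio.h>
#include <omp.h>

int global_v = 0;                     /* file-scope variable: shared by default */

int main(void) {
    int outer_v = 1;                  /* declared before the region: shared by default */
    #pragma omp parallel num_threads(4)
    {
        int local_v = omp_get_thread_num();   /* declared inside the region: private */
        printf("thread %d: local_v=%d outer_v=%d global_v=%d\n",
               local_v, local_v, outer_v, global_v);
    }
    return 0;
}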
Data Scope Attribute Clauses
 PRIVATE (var_list):
 Declares variables in its list to be private to each thread;
variable value is NOT initialized & will not be maintained
outside the parallel region
 SHARED (var_list):
 Declares variables in its list to be shared among all threads
 By default, all variables in the work sharing region are
shared except the loop iteration counter.
 FIRSTPRIVATE (var_list):
 Same as PRIVATE clause, but the variable is INITIALIZED
according to the value of their original objects prior to
entry into the parallel region
 LASTPRIVATE (var_list)
 Same as PRIVATE clause, with a copy from the LAST loop
iteration or section to the original variable object
28
Examples
 firstprivate (var_list)

int var1 = 10;
#pragma omp parallel firstprivate(var1)
{
    printf("var1:%d", var1);   // every thread starts with its private var1 == 10
}

 lastprivate (var_list)

int var1 = 10;
#pragma omp parallel for lastprivate(var1) num_threads(10)
for (int i = 0; i < 10; i++) {
    sleep(i);                  // needs <unistd.h>
    var1 = i;
}
printf("var1:%d", var1);       // var1 holds the value from the last iteration (9)
29
Data Scope Attribute Clauses
 DEFAULT (PRIVATE | FIRSTPRIVATE | SHARED | NONE)
 Allows the user to specify a default scope for ALL variables in the parallel region
 COPYIN (var_list)
 Assigns the same variable value to all threads, based on the instance from the master thread
 COPYPRIVATE (var_list)
 Broadcasts a value acquired by a single thread directly to the corresponding instances in all other threads
 Associated with the SINGLE directive (see the sketch below)
 REDUCTION (operator: var_list)
 A private copy of each list variable is created for each thread
 Performs a reduction over all variable instances
 Writes the final result to the global shared copy
30
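 A minimal sketch (not from the slides) of COPYPRIVATE with SINGLE: one thread reads a value and it is broadcast to the private copies of all other threads:

#include <stdio.h>
#include <omp.h>

int main(void) {
    int input;
    #pragma omp parallel private(input) num_threads(4)
    {
        #pragma omp single copyprivate(input)
        scanf("%d", &input);                     /* executed by exactly one thread */

        /* every thread now sees the same value in its private copy */
        printf("thread %d sees input = %d\n", omp_get_thread_num(), input);
    }
    return 0;
}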
Reduction Clause Example
#include <omp.h>
#include <stdio.h>

int main() {
    int i, n, chunk, a[100], b[100], result;
    n = 10; chunk = 2; result = 0;
    for (i = 0; i < n; i++) a[i] = b[i] = i;

    #pragma omp parallel for default(shared) private(i) \
            schedule(static,chunk) reduction(+:result)
    for (i = 0; i < n; i++)
        result = result + (a[i] * b[i]);

    printf("Final result = %d\n", result);
}
 Reduction operators:
 +, *, &, |, ^, &&, ||
31
OpenMP Clause Summary
Clause          PARALLEL   DO/for   SECTIONS   SINGLE
IF                 V
PRIVATE            V          V         V         V
SHARED             V          V
DEFAULT            V
FIRSTPRIVATE       V          V         V         V
LASTPRIVATE                   V         V
REDUCTION          V          V         V
COPYIN             V
COPYPRIVATE                                       V
SCHEDULE                      V
ORDERED                       V
NOWAIT                        V         V

 Synchronization Directives DO NOT accept clauses


32
OpenMP Outline
 Parallel Region Construct
 Parallel Directive

 Work-Sharing Construct
 DO/for Directive
 SECTIONS Directive
 SINGLE Directive

 Synchronization Construct
 Data Scope Attribute Clauses
 Run-Time Library Routines
33
Run-Time Library Routines
 void omp_set_num_threads(int num_threads)
 Sets the number of threads that will be used in the next parallel region
 int omp_get_num_threads(void)
 Returns the number of threads currently executing for the parallel region
 int omp_get_thread_num(void)
 Returns the thread number of the thread, within the team, making this call
 The master thread of the team is thread 0
 int omp_get_thread_limit (void)
 Returns the maximum number of OpenMP threads available to a program
 int omp_get_num_procs(void)
 Returns the number of processors that are available to the program
 int omp_in_parallel(void)
 Determines whether the section of code that is executing is parallel or not
Many others are available for more complicated usage (see the sketch below)
34
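 A minimal sketch (not from the slides) combining the routines above:

#include <stdio.h>
#include <omp.h>

int main(void) {
    printf("procs = %d, in parallel? %d\n", omp_get_num_procs(), omp_in_parallel());

    omp_set_num_threads(4);
    #pragma omp parallel
    {
        printf("thread %d of %d (in parallel? %d)\n",
               omp_get_thread_num(), omp_get_num_threads(), omp_in_parallel());
    }
    return 0;
}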
Reference
 Textbook:
 Parallel Computing Chap8

 OpenMP Tutorial
 https://computing.llnl.gov/tutorials/openMP/

 OpenMP API
 http://gcc.gnu.org/onlinedocs/libgomp.pdf

35
