
Parallel and Distributed Computing

CS3006

Lecture 10
OpenMP-III
6th April 2022

Dr. Rana Asif Rehman



Review of OpenMP Clause List

- private
- firstprivate, lastprivate
- shared
- default
  - private, shared, none
- reduction
- if clause
- schedule
  - static, dynamic, guided, runtime
- nowait



Synchronization in OpenMP

Barrier Directive

- On encountering this directive, all threads in a team wait until the others have caught up, and then they are released.

#pragma omp barrier
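
A minimal two-phase sketch (the results array, assumed large enough for the team, and the helper functions produce and consume are assumptions for illustration): each thread writes its own slot, and the barrier guarantees every slot is filled before any thread reads a neighbour's slot.

#pragma omp parallel
{
    int id = omp_get_thread_num();
    results[id] = produce(id);                  // phase 1: each thread fills its own slot
    #pragma omp barrier                         // no thread proceeds until all have arrived
    consume(results[(id + 1) % omp_get_num_threads()]);   // phase 2: safe to read other slots
}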



Single Directive

- A single directive specifies a structured block that is executed by a single (arbitrary) thread in the parallel region.
- There is an implicit barrier at the end of the construct.

#pragma omp single [clause list]
    structured block
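
For instance, a minimal sketch (buf, init_buffer, and process are assumptions): one arbitrary thread performs the initialization, and the implicit barrier ensures no thread uses the buffer before it is ready.

#pragma omp parallel
{
    #pragma omp single
    init_buffer(buf);                      // executed by exactly one (arbitrary) thread
    // implicit barrier here: the team waits until buf is initialized
    process(buf, omp_get_thread_num());    // executed by every thread
}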



Master Directive

- The master directive is a specialization of the single directive in which only the master thread executes the structured block.
- There is no implicit barrier.

#pragma omp master
    structured block
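
A minimal sketch for comparison (uses only stdio and the OpenMP runtime): thread 0 reports the team size while the other threads continue past the construct without waiting.

#pragma omp parallel
{
    #pragma omp master
    printf("team of %d threads\n", omp_get_num_threads());   // thread 0 only
    // no implicit barrier: the other threads do not wait here
}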



Critical Sections
(#pragma omp critical)

- A critical section is a code segment that accesses a shared variable and needs to be executed as an atomic action.
- This means that in a group of cooperating processes/threads, at any given point in time only one may be executing its critical section.
- It forces threads to be mutually exclusive (mutex): only one thread at a time executes the given code section.

double area, pi, x;
int i, n;
...
area = 0.0;
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;             // can be calculated independently
    area += 4.0 / (1.0 + x * x);   // requires a mutex lock when parallelized
}
pi = area / n;



Critical Sections
(#pragma omp critical)

- If we simply parallelize the loop, a race condition may occur:

double area, pi, x;
int i, n;
...
area = 0.0;
#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;
    area += 4.0 / (1.0 + x * x);   // not atomic
}
pi = area / n;



Critical Sections
(#pragma omp critical)

- Race condition:

Value of area    Thread A           Thread B
11.667           reads 11.667
                 + 3.765            reads 11.667
15.432           writes 15.432      + 3.563
15.230                              writes 15.230

- Thread A reads the value of area first
- Thread B reads the value of area before A can update it
- Thread A updates the value of area
- Thread B ignores A's update and writes its incorrect value to area
Critical Sections
(#pragma omp critical)

Race Condition
- A race condition is created when one process may "race ahead" of another and overwrite the change made by the first process to the shared variable.

[Figure: both threads execute area += 4.0/(1.0 + x*x); Thread A computes 15.432 and Thread B computes 15.230; area ends up as 15.230, but the answer should be 18.995.]


Critical Sections
(#pragma omp critical)

- Critical section: a portion of code that only one thread at a time may execute.
- We denote a critical section by putting the pragma
  #pragma omp critical [(name)]
  around it. The optional identifier name can be used to identify a critical region (an example follows below).
- This solves the problem but, since only one thread at a time may execute the guarded statement, that statement effectively becomes sequential code.

double area, pi, x;
int i, n;
...
area = 0.0;
#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;
    #pragma omp critical
    area += 4.0 / (1.0 + x * x);
}
pi = area / n;
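
The optional name matters when a program has several unrelated shared updates; a hedged sketch (sum_a, sum_b, f, and g are assumptions for illustration): differently named critical sections use different locks and can run concurrently, whereas all unnamed critical sections share one global lock and would serialize against each other.

#pragma omp parallel for
for (i = 0; i < n; i++) {
    #pragma omp critical (update_a)
    sum_a += f(i);     // serializes only against other update_a sections
    #pragma omp critical (update_b)
    sum_b += g(i);     // separate lock: does not block update_a
}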
Atomic Directive

- The atomic directive specifies that a single memory location update should be performed as an atomic operation.

#pragma omp atomic
    update instruction, e.g., x++;
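
Applied to the running pi example (reusing area, x, i, and n from the critical-section slides), a sketch of atomic as an alternative to the critical section: the update fits the x binop= expr form that atomic accepts, and it is typically cheaper than a critical section because the compiler can map it to a hardware atomic instruction.

#pragma omp parallel for private(x)
for (i = 0; i < n; i++) {
    x = (i + 0.5) / n;
    #pragma omp atomic
    area += 4.0 / (1.0 + x * x);   // single memory update, performed atomically
}
pi = area / n;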



Environment Variables in OpenMP

- OpenMP provides additional environment variables that help control the execution of parallel programs:
  - OMP_NUM_THREADS
  - OMP_DYNAMIC
  - OMP_SCHEDULE
  - OMP_NESTED



Environment Variables in OpenMP

OMP_NUM_THREADS
- Specifies the default number of threads created upon entering a parallel region.
- The number of threads can be changed at run time using:
  - the omp_set_num_threads(int threads) routine, or
  - the num_threads clause, e.g., num_threads(4)

- Setting OMP_NUM_THREADS to 4 using bash (see the sketch below):
  export OMP_NUM_THREADS=4
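
A minimal self-contained sketch of the two run-time overrides (the thread counts are chosen arbitrarily): the routine sets the default for subsequent parallel regions, while the clause overrides it for one specific region.

#include <omp.h>
#include <stdio.h>

int main(void) {
    omp_set_num_threads(4);               // overrides OMP_NUM_THREADS for later regions
    #pragma omp parallel
    {
        #pragma omp single
        printf("region 1: %d threads\n", omp_get_num_threads());   // typically 4
    }
    #pragma omp parallel num_threads(2)   // clause overrides the routine for this region
    {
        #pragma omp single
        printf("region 2: %d threads\n", omp_get_num_threads());   // typically 2
    }
    return 0;
}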



Environment Variables in OpenMP

OMP_DYNAMIC
- When set to TRUE, allows the number of threads to be adjusted at runtime: OpenMP will use its dynamic adjustment algorithm to create the number of threads that may optimize system performance.
  - In case of TRUE, the total number of threads generated may not equal the number requested via the omp_set_num_threads() function or the num_threads clause.
  - In case of FALSE, the total number of threads generated in a parallel region is usually exactly as requested by the num_threads clause.
- OpenMP routines for setting/getting the dynamic status:
  - void omp_set_dynamic(int flag);   // disables dynamic adjustment if flag == 0
    - Should be called from outside a parallel region
  - int omp_get_dynamic();   // returns the current dynamic status
Environment Variables in OpenMP
OMP_DYNAMIC [dynamic.c]

workers = omp_get_max_threads();   // could also use omp_get_num_procs()
printf("%d maximum allowed threads\n", workers);
printf("total number of allocated cores: %d\n", omp_get_num_procs());
omp_set_dynamic(1);                // enable dynamic adjustment
omp_set_num_threads(8);            // request 8 threads
printf("total number requested when dynamic is true: %d\n", 8);
#pragma omp parallel
{
    #pragma omp single nowait
    printf("total threads in parallel region1 = %d\n", omp_get_num_threads());
    #pragma omp for
    for (i = 0; i < mult; i++)
    { a = complex_func(); }
}



Environment Variables in OpenMP
OMP_DYNAMIC [dynamic.c]

omp_set_dynamic(0);                // disable dynamic adjustment
omp_set_num_threads(8);            // request 8 threads
printf("total number requested when dynamic is false: %d\n", 8);
#pragma omp parallel
{
    #pragma omp single nowait
    printf("total threads in parallel region2 = %d\n", omp_get_num_threads());
    #pragma omp for
    for (i = 0; i < mult; i++)
    { a = complex_func(); }
}



Environment Variables in OpenMP

OMP_SCHEDULE
- Controls the assignment of iteration spaces associated with for directives that use the runtime scheduling class.
- Possible values: static, dynamic, and guided.
- An optional chunk size can also be given.
  - If the chunk size is not specified, a default chunk size of 1 is used.

- Setting OMP_SCHEDULE to guided with a minimum chunk size of 4 in an Ubuntu-based terminal:
  export OMP_SCHEDULE="guided,4"
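
On the code side, a sketch of the matching loop (process and n are assumptions): schedule(runtime) defers the scheduling decision to OMP_SCHEDULE, so the loop can be re-tuned without recompiling.

// run with: export OMP_SCHEDULE="guided,4"
#pragma omp parallel for schedule(runtime)
for (i = 0; i < n; i++)
    process(i);   // iterations distributed according to OMP_SCHEDULE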
Environment Variables in OpenMP

OMP_NESTED
- Default value is FALSE:
  - A parallel pragma nested inside another parallel region is then executed without creating a new thread team, so the nested region is effectively serialized.
- When TRUE:
  - Enables nested parallelism.
  - A parallel pragma nested inside another makes a new team of threads to execute the nested region.
- Use omp_set_nested(int val) with a non-zero value to set this variable to TRUE.
  - When called with 0 as the argument, it sets the variable to FALSE.
Environment Variables in OpenMP
OMP_NESTED [nested.c]

omp_set_nested(0);   // nested parallelism disabled
#pragma omp parallel num_threads(2)
{
    #pragma omp single
    printf("Level 1: number of threads in the team: %d\n", omp_get_num_threads());

    #pragma omp parallel num_threads(4)
    {
        #pragma omp single
        printf("Level 2: number of threads in the team: %d\n", omp_get_num_threads());
    }
}



Environment Variables in OpenMP
OMP_NESTED [nested.c]

omp_set_nested(1);   // nested parallelism enabled
#pragma omp parallel num_threads(2)
{
    #pragma omp single
    printf("Level 1: number of threads in the team: %d\n", omp_get_num_threads());

    #pragma omp parallel num_threads(4)
    {
        #pragma omp single
        printf("Level 2: number of threads in the team: %d\n", omp_get_num_threads());
    }
}
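
Expected behaviour, assuming the implementation honours the requests: with omp_set_nested(0), each of the two outer threads serializes its inner region, so "Level 2" reports a team of 1 (printed once per outer thread); with omp_set_nested(1), each outer thread forms its own inner team of 4, so "Level 2" reports 4 threads and eight threads run in total.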



Example
Computing Pi using the Monte Carlo method

Preliminary Idea:

    Pi = 4 x (points in circle / points in square)

- Here the circle has center (a, b) = (0.5, 0.5) and radius r = 0.5, i.e., it is inscribed in the unit square. Its area is pi*r^2 = pi/4 of the square's area, so the fraction of random points that land in the circle, multiplied by 4, approximates Pi.


Computing Pi using the Monte Carlo method

Steps
For all the random points:
1. Count the total points that fall inside the circle.
2. Divide the number of points inside the circle by the number of points inside the square.
   - The total number of random points is also the total number of points inside the square.
3. Multiply this fraction by 4.

- As the number of random points increases, the value of Pi approaches the real value (i.e., 3.14159...).



Computing Pi using the Monte Carlo method
Sequential Implementation

int niter = 100000000;   // 100 million points
count = 0;
srandom(time(0));        // seed the random number generator
for (i = 0; i < niter; ++i)
{
    // get a random point in the unit square
    x = (double)random() / RAND_MAX;
    y = (double)random() / RAND_MAX;
    z = ((x - 0.5) * (x - 0.5)) + ((y - 0.5) * (y - 0.5));
    // check whether the point is inside the circle (r^2 = 0.25)
    if (z < 0.25)
    {
        ++count;
    }
}
pi = ((double)count / (double)niter) * 4.0;   // pi = 4(m/n)
printf("Seq_Pi: %f\n", pi);



Computing Pi using the Monte Carlo method
(Parallel construct [parallel_pi.c])

#pragma omp parallel shared(niter) private(i, x, y, z, chunk_size, seed) reduction(+ : count)
{
    num_threads = omp_get_num_threads();
    chunk_size = niter / num_threads;
    seed = omp_get_thread_num();   // per-thread seed (unsigned int) for rand_r
    #pragma omp master
    { printf("chunk_size=%ld\n", chunk_size); }

    count = 0;
    for (i = 0; i < chunk_size; i++)
    {
        // get a random point in the unit square
        x = (double)rand_r(&seed) / (double)RAND_MAX;
        y = (double)rand_r(&seed) / (double)RAND_MAX;
        z = ((x - 0.5) * (x - 0.5)) + ((y - 0.5) * (y - 0.5));
        // check whether the point is inside the circle
        if (z < 0.25)
        {
            ++count;
        }
    }
}
pi = ((double)count / (double)niter) * 4.0;
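
Note that chunk_size = niter / num_threads truncates, so when niter is not divisible by the number of threads the loops generate slightly fewer than niter points while the final division still uses niter. For the values used here the error is negligible, but a robust version would divide by num_threads * chunk_size or distribute the remainder iterations.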
Parallelizing linked lists

Consider the following code:

current = head;
while (current->next != NULL) {
    complex_func(current->key);   // complex consumer function
    current = current->next;
}

- Assume that the complex function can be computed for each key value independently.
- The code can't be parallelized directly because:
  - We don't have OpenMP constructs to parallelize while loops, and the equivalent for loop doesn't have the canonical form, since we don't know the number of iterations in advance.
  - If we simply put an 'omp parallel' pragma before the while loop, the program semantics will not be preserved.
Parallelizing linked lists
[Naïve idea 1, with a logical error]

Consider the following code:

current = head;
#pragma omp parallel firstprivate(current)
{
    while (current->next != NULL) {
        complex_func(current->key);   // complex consumer function
        current = current->next;
    }
}

- Creates a team of threads, each with its own private 'current' variable.
- Each thread therefore executes the loop for all the nodes in the list.
- This means every thread performs work equal to the sequential code.
- No speedup is achieved; this will rather increase execution time.


Parallelizing linked lists
[Naïve idea 2, with a logical error]

Consider the following code:

current = head;
#pragma omp parallel shared(current)
{
    while (current->next != NULL) {     // line 1
        complex_func(current->key);     // complex consumer function
        current = current->next;        // line 3
    }
}

- Creates a team of threads sharing the same 'current' variable.
- In the first while iteration, complex_func may be called by each thread with the same key value.
- Semantics/atomicity will not be ensured (i.e., multiple threads executing line 3 can change the line 1 result for other threads).
- So the output may not be as assumed.
Parallelizing linked lists
[Naïve but correct parallelization]

Observations:
1. We don't know in advance the number of nodes in the list.
2. We also don't know how to access all the nodes in parallel, because a linked list can only be traversed sequentially.
3. We can parallelize it using the following steps:
   1. Count the number of nodes in the list; call it 'C'.
   2. Allocate a dynamic array of pointers-to-list of size 'C'. Using a loop, copy the address of the i-th node to the i-th element of the pointer array.
   3. Use a for loop that iterates over this array of pointers. Furthermore, this for loop can be parallelized.


Parallelizing linked lists
[Naïve but correct parallelization]

1. Count the number of nodes in the list; call it 'C'.

// struct LIST { int key; LIST* next; };
// Here assume head is a pointer to the start of the list.
int C = 0;
LIST *p = head;
while (p != NULL) {
    p = p->next;
    C++;
}



Parallelizing linked lists
[Naïve but correct parallelization]

2. Allocate a dynamic array of pointers-to-list of size 'C' and, using a loop, copy the address of the i-th node to the i-th element of the pointer array.

LIST **Parray = new LIST*[C];
int i = 0;
p = head;
while (p != NULL) {
    Parray[i++] = p;   // copy the address of the i-th node
    p = p->next;
}



Parallelizing linked lists
[Naïve but correct parallelization]

3. Now we can use a for loop that iterates over this array of pointers. Furthermore, this for loop can be parallelized.

#pragma omp parallel for schedule(static, 1)
for (i = 0; i < C; i++) {
    complex_func(Parray[i]->key);
}

- This method can yield speedups only if the tasks are complex enough to overcome the data-movement costs.
- Usually, data movements are more costly than the computations.
- So we need to devise another solution.



Parallelizing linked lists
[A relatively better implementation]

// omptask.c and tasktime.c   // compile with g++ (add -fopenmp)
#pragma omp parallel
{
    #pragma omp single   // a single thread traverses the list
    {
        current = head;
        while (current->next != NULL) {
            // the following pragma creates a task and adds it to the logical task pool;
            // the other threads in the team pick up and execute the queued tasks
            #pragma omp task firstprivate(current)
            complex_func(current->key);

            current = current->next;
        }
    }
}

Test run: total threads = 4, 100 million iterations per complex_func call, list size = 10 nodes.
Parallelizing linked lists
[omp task illustration]

[Figure: illustration of omp task creation and execution by the thread team was here in the original slides.]


Questions



