
BACH KHOA UNIVERSITY OF TECHNOLOGY

FACULTY of COMPUTER SCIENCE & ENGINEERING

Course: Parallel Processing


Lab #2 – Multithreads and OpenMP

Thin Nguyen

Goal: This lab helps students revise their knowledge of multithreading and learn how to use OpenMP.


Contents

1 Multithreads
  1.1 POSIX Threads - Linux
  1.2 Examples

2 Multithread Programming with OpenMP
  2.1 Motivation
  2.2 Examples

3 Exercises


1 Multithreads
1.1 POSIX Threads - Linux
What is Pthreads?
Historically, hardware vendors implemented their own proprietary versions of threads. These implementations differed substantially from each other, making it difficult for programmers to develop portable threaded applications. A standardized C language threads interface was therefore specified by the IEEE POSIX 1003.1c (1995) standard, known as Pthreads; the POSIX standard has continued to evolve and undergo revisions since.

Pthreads is defined as a set of C language programming types and procedure calls, implemented with a pthread.h header/include file and a thread library - though in some implementations this library may be part of another library, such as libc.

Figure 3: Shared Memory Model

All threads have access to the same global, shared memory. Threads also have their own private data.
Programmers are responsible for synchronizing access (protecting) globally shared data.

1.2 Examples
Compiling Threaded Programs: several examples of compile commands for Pthreads code are listed in Table 1 below.

Table 1: Command lines for compiling threaded programs

Compiler / Platform       Compiler Command    Description
INTEL Linux               icc  -pthread       C
                          icpc -pthread       C++
PGI Linux                 pgcc -lpthread      C
                          pgCC -lpthread      C++
GNU Linux, Blue Gene      gcc  -pthread       GNU C
                          g++  -pthread       GNU C++

Example 1: Pthread Creation and Termination


This simple example code creates 10 threads with the pthread_create() routine. Each thread prints a "Hello World!" message and then terminates with a call to pthread_exit().
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>   /* for exit() */
#define NUM_THREADS 10

// user-defined thread function
void *user_def_func(void *threadID){
    long TID;
    TID = (long) threadID;
    printf("Hello World! from thread #%ld\n", TID);
    pthread_exit(NULL);
}

int main( int argc, char *argv[]){

    pthread_t threads[NUM_THREADS];
    int create_flag;
    long i;
    for(i = 0; i < NUM_THREADS; i++){
        printf("In main: creating thread %ld\n", i);
        create_flag = pthread_create(&threads[i], NULL, user_def_func, (void *)i);
        if (create_flag){
            printf("ERROR: return code from pthread_create() is %d\n", create_flag);
            exit(-1);
        }
    }

    /* Let main exit without terminating the threads it created */
    pthread_exit(NULL);
}
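Assuming the program is saved as hello_pthreads.c (an illustrative file name), it can be compiled and run with the GNU command from Table 1:

$ gcc -pthread hello_pthreads.c -o hello_pthreads
$ ./hello_pthreads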

Example 2: Thread Argument Passing


This code fragment demonstrates how to pass a simple integer to each thread. The calling thread uses a unique data structure for each thread, ensuring that each thread's argument remains intact throughout the program.

...
/* Thread Argument Passing */
// case-study 1
long taskids[NUM_THREADS];

// case-study 2
...

int main ( int argc, char *argv[]){

    pthread_t threads[NUM_THREADS];
    int creation_flag;
    long i;
    for(i = 0; i < NUM_THREADS; i++){
        // pass arguments
        taskids[i] = i;
        printf("In main: creating thread %ld\n", i);
        creation_flag = pthread_create(&threads[i], NULL, user_def_func, (void *)taskids[i]);
        ...
    }

    ...
}

Question: how do you set up and pass multiple arguments via a structure?
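One common approach (a minimal sketch, not part of the provided lab code; the struct and field names are illustrative) is to collect all arguments in a structure, fill one instance per thread, and pass its address:

#include <pthread.h>
#include <stdio.h>
#define NUM_THREADS 10

typedef struct {
    long thread_id;   /* the thread's logical index */
    int  start;       /* e.g. the start of a work range */
    char *message;    /* any extra per-thread data */
} thread_data_t;

/* One struct per thread, so each thread's arguments remain intact */
thread_data_t thread_data[NUM_THREADS];

void *user_def_func(void *arg){
    thread_data_t *data = (thread_data_t *) arg;  /* cast back to the struct type */
    printf("Thread #%ld: start=%d, msg=%s\n", data->thread_id, data->start, data->message);
    pthread_exit(NULL);
}

/* In main, before each pthread_create() call:
 *   thread_data[i].thread_id = i;
 *   thread_data[i].start     = i * 100;
 *   thread_data[i].message   = "hello";
 *   pthread_create(&threads[i], NULL, user_def_func, (void *) &thread_data[i]);
 */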

Example 3: A joinable state for portability purposes

This example demonstrates how to explicitly create pthreads in a joinable state, for portability purposes. It also shows how to use the pthread_exit() status parameter.
// Note: link the math library when compiling, e.g. gcc -pthread file.c -lm
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <math.h>

// Define CONSTANTS
#define NUM_THREADS 4
#define NUM_LOOPS 1000000

// user-defined thread function
void *user_def_func(void *threadID){
    long TID;
    TID = (long) threadID;
    int i;
    double result = 0.0;
    printf("Thread %ld starting...\n", TID);
    for(i = 0; i < NUM_LOOPS; i++){
        result = result + sin(i) * tan(i);
    }

    printf("Thread %ld done. Result = %e\n", TID, result);

    pthread_exit((void*) threadID);
}

int main ( int argc, char *argv[]){

    pthread_t threads[NUM_THREADS];
    pthread_attr_t attr; // attribute of threads
    int creation_flag, join_flag;
    long i;
    void *status; // exit status of threads

    /* Initialize and set thread detached attribute */
    pthread_attr_init(&attr);
    pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_JOINABLE);
    for(i = 0; i < NUM_THREADS; i++){
        printf("In main: creating thread %ld\n", i);
        creation_flag = pthread_create(&threads[i], &attr, user_def_func, (void *)i);
        if (creation_flag){
            printf("ERROR: return code from pthread_create() is %d\n", creation_flag);
            exit(-1);
        }
    }

    /* Free the attribute and wait for the other threads */
    pthread_attr_destroy(&attr);
    for(i = 0; i < NUM_THREADS; i++){
        join_flag = pthread_join(threads[i], &status);
        if (join_flag){
            printf("ERROR: return code from pthread_join() is %d\n", join_flag);
            exit(-1);
        }
        printf("Main: completed join with thread %ld having a status of %ld\n", i, (long)status);
    }

    printf("Main: program completed. Exiting.\n");

    pthread_exit(NULL);
}

Example 4: Race condition

This example uses a mutex variable to protect the global sum while each thread updates it. Race conditions are an important problem in parallel programming.
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>

/* Define global data where every thread can see it */
#define NUMTHRDS 8
#define VECLEN 100000
pthread_mutex_t mutexsum;
int *a, *b;
long sum = 0;

void *dotprod(void *arg)
{
    /* Each thread works on a different set of data.
     * The offset is specified by the arg parameter. The size of
     * the data for each thread is indicated by VECLEN.
     */
    int i, start, end, offset, len;
    long tid;
    tid = (long)arg;
    offset = tid;
    len = VECLEN;
    start = offset*len;
    end = start + len;

    /* Perform my section of the dot product */
    printf("thread: %ld starting. start=%d end=%d\n", tid, start, end-1);
    for (i = start; i < end; i++) {
        pthread_mutex_lock(&mutexsum);    /* protect the shared sum */
        sum += (a[i] * b[i]);
        pthread_mutex_unlock(&mutexsum);
    }
    printf("thread: %ld done. Global sum now is=%li\n", tid, sum);
    pthread_exit((void*) 0);
}

int main ( int argc, char *argv[])
{
    long i;
    void *status;
    pthread_t threads[NUMTHRDS];
    pthread_attr_t attr;

    /* Assign storage and initialize values */
    a = (int *) malloc (NUMTHRDS*VECLEN*sizeof(int));
    b = (int *) malloc (NUMTHRDS*VECLEN*sizeof(int));
    for (i = 0; i < VECLEN*NUMTHRDS; i++)
        a[i] = b[i] = 1;

    /* Initialize mutex variable */
    pthread_mutex_init(&mutexsum, NULL);

    /* Create threads as joinable, each of which will execute the dot product
     * routine. Their offset into the global vectors is specified by passing
     * the "i" argument in pthread_create().
     */
    pthread_attr_init(&attr);
    pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_JOINABLE);
    for(i = 0; i < NUMTHRDS; i++)
        pthread_create(&threads[i], &attr, dotprod, (void *)i);
    pthread_attr_destroy(&attr);

    /* Wait on the other threads for the final result */
    for(i = 0; i < NUMTHRDS; i++) {
        pthread_join(threads[i], &status);
    }

    /* After joining, print out the results and clean up */
    printf("Final Global Sum=%li\n", sum);
    free(a);
    free(b);

    pthread_mutex_destroy(&mutexsum);
    pthread_exit(NULL);
}
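A note on the design: taking the mutex on every iteration makes the protected update easy to see, but it also serializes the loop. A common refinement (a sketch only, not the version used in this lab; mysum is an illustrative name) is to accumulate a private partial sum and lock just once per thread:

void *dotprod(void *arg)
{
    long tid = (long) arg;
    int i, start = tid * VECLEN, end = start + VECLEN;
    long mysum = 0;                    /* private partial sum: no lock needed */
    for (i = start; i < end; i++)
        mysum += a[i] * b[i];
    pthread_mutex_lock(&mutexsum);     /* one critical section per thread */
    sum += mysum;
    pthread_mutex_unlock(&mutexsum);
    pthread_exit((void *) 0);
}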


2 Multithread Programming with OpenMP


2.1 Motivation
What is OpenMP?

• An Application Program Interface (API) that may be used to explicitly direct multithreaded,
shared memory parallelism.
• Comprised of three primary API components:
– Compiler Directives
– Runtime Library Routines
– Environment Variables
Goals of OpenMP

• Standardization
• Lean and Mean
• Ease of Use
• Portability
Shared Memory Model: OpenMP is designed for multi-processor/core, shared memory machines. The underlying architecture can be shared memory UMA or NUMA.

Figure 4: Shared Memory Model for OpenMP

2.2 Examples

Compiling OpenMP Programs: OpenMP support is enabled with a compiler flag rather than a separate library, for example gcc/g++ -fopenmp for the GNU compilers, icc -qopenmp for Intel, or pgcc -mp for PGI.
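For example, with the GNU compilers (hello_omp.c is an illustrative file name):

$ gcc -fopenmp hello_omp.c -o hello_omp
$ ./hello_omp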

Example 1: Simple "Hello World" program. Every thread executes all code enclosed in the parallel region. OpenMP library routines are used to obtain thread identifiers and the total number of threads.

#include <omp.h>
#include <stdio.h>

int main( int argc, char *argv[]) {

    int nthreads, tid;

    /* Fork a team of threads with each thread having a private tid variable */
    #pragma omp parallel private(tid)
    {
        /* Obtain and print the thread id */
        tid = omp_get_thread_num();
        printf("Hello World from thread = %d\n", tid);

        /* Only the master thread does this */
        if (tid == 0)
        {
            nthreads = omp_get_num_threads();
            printf("Number of threads = %d\n", nthreads);
        }

    } /* All threads join the master thread and terminate */

    return 0;
}
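The size of the thread team can also be controlled at run time with the OMP_NUM_THREADS environment variable, for example:

$ export OMP_NUM_THREADS=4
$ ./hello_omp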


Example 2: Work-Sharing Constructs - DO / for Directive. The DO / for directive specifies that the iterations of the loop immediately following it must be executed in parallel by the team. This assumes a parallel region has already been initiated; otherwise the loop executes serially on a single processor.

#include <omp.h>
#include <stdio.h>

/* Define some values */
#define N 1000
#define CHUNKSIZE 100
#define OMP_NUM_THREADS 10
#define MAX_THREADS 48

int main( int argc, char **argv){
    int i, chunk;
    float a[N], b[N], c[N];

    /* Some initializations */
    for(i = 0; i < N; i++){
        a[i] = b[i] = i * 1.0; // values = i with float type
    }

    chunk = CHUNKSIZE;
    /* Set the team size before entering the parallel region; calling
     * omp_set_num_threads() inside the region does not affect the
     * region that is already running. */
    omp_set_num_threads(OMP_NUM_THREADS);
    #pragma omp parallel shared(a,b,c,chunk) private(i)
    {
        #pragma omp for schedule(dynamic,chunk) nowait
        for(i = 0; i < N; i++){
            int tid = omp_get_thread_num();
            printf("Iter %d running from thread %d\n", i, tid);
            c[i] = a[i] + b[i];
        }
    }


    /* Validation */
    printf("Vector c: \n");
    for(i = 0; i < 10; i++){
        printf("%f ", c[i]);
    }
    printf("...\n");

    return 0;
}
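Two details of the directive are worth noting. schedule(dynamic,chunk) hands out blocks of chunk iterations to threads on a first-come, first-served basis, which balances the load when iteration costs vary; nowait removes the implied barrier at the end of the for construct, so threads do not wait for each other before leaving the loop.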

Example 3: Work-Sharing Constructs - SECTIONS Directive. The SECTIONS directive divides the enclosed section blocks among the threads in the team; each section is executed exactly once, by one of the threads, so the two loops below may run concurrently.

#include <omp.h>
#include <stdio.h>

/* Define some values */
#define N 1000
#define CHUNKSIZE 100
#define OMP_NUM_THREADS 12
#define MAX_THREADS 48

/* Global variables */
int count[MAX_THREADS];

int main( int argc, char **argv){
    int i, chunk;
    float a[N], b[N], c[N], d[N];

    /* Some initializations */
    for(i = 0; i < N; i++){
        a[i] = i * 1.0;
        b[i] = i + 2.0;
    }
    for(i = 0; i < OMP_NUM_THREADS; i++){
        count[i] = 0;
    }

    chunk = CHUNKSIZE;
    /* Set the team size before entering the parallel region */
    omp_set_num_threads(OMP_NUM_THREADS);
    #pragma omp parallel shared(a,b,c,d) private(i)
    {
        #pragma omp sections nowait
        {
            #pragma omp section
            for(i = 0; i < N; i++){
                int tid_s1 = omp_get_thread_num();
                printf("\tIter %d running from thread %d\n", i, tid_s1);
                c[i] = a[i] + b[i];
                // Increase the per-thread iteration count
                count[tid_s1]++;
            }

            #pragma omp section
            for(i = 0; i < N; i++){
                int tid_s2 = omp_get_thread_num();
                printf("\tIter %d running from thread %d\n", i, tid_s2);
                d[i] = a[i] * b[i];
                // Increase the per-thread iteration count
                count[tid_s2]++;
            }
        }
    }

    /* Validation */
    printf("Vector c: \n\t");
    for(i = 0; i < 10; i++){
        printf("%f ", c[i]);
    }
    printf("...\n");
    printf("Vector d: \n\t");
    for(i = 0; i < 10; i++){
        printf("%f ", d[i]);
    }
    printf("...\n");

    /* Statistics */
    printf("Num of iter with thread:\n");
    for(i = 0; i < MAX_THREADS; i++){
        if (count[i] != 0)
            printf("\tThread %d ran %d iter.\n", i, count[i]);
    }

    return 0;
}

Example 4: THREADPRIVATE Directive. The THREADPRIVATE directive is used to make global file-scope variables (C/C++) or common blocks (Fortran) local and persistent to a thread across the execution of multiple parallel regions.

#include <omp.h>
#include <stdio.h>

/* Define some values */
#define N 1000
#define CHUNKSIZE 10
#define MAX_THREADS 48
#define NUM_THREADS 4

/* Global variables: a and x are made threadprivate below */
int count[MAX_THREADS];
int a, b, i, tid;
float x;
#pragma omp threadprivate(a, x)

int main( int argc, char **argv){
    /* Explicitly turn off dynamic threads */
    omp_set_dynamic(0);
    omp_set_num_threads(NUM_THREADS);

    printf("1st Parallel Region:\n");
    #pragma omp parallel private(b,tid)
    {
        tid = omp_get_thread_num();
        a = tid;
        b = tid;
        x = 1.1 * tid + 1.0;
        printf("Thread %d: a, b, x = %d, %d, %f\n", tid, a, b, x);
    }

    printf("************************************ \n");
    printf("Master thread doing serial work here\n");
    printf("************************************ \n");
    printf("2nd Parallel Region:\n");
    /* a and x keep their per-thread values from the first region;
     * b does not, since it was only private, not threadprivate. */
    #pragma omp parallel private(tid)
    {
        tid = omp_get_thread_num();
        printf("Thread %d: a, b, x = %d, %d, %f\n", tid, a, b, x);
    }
    return 0;
}

3 Exercises
1. Matrix multiplication with Pthreads: implement a parallel version of the given source code with POSIX threads. Students need to complete the //TODO part in the source code. When you have finished, run the program with matrix sizes 10, 100, 1000, 10000, 20000 (at least up to 10000) and record the execution time with the command:

// For example:
$ time ./mul_mat_pthread_output 1000 1

Finally, plot a graph comparing the performance of the Serial Version (already provided in graph.py) and the Pthreads Version, as in Figure 5. You may modify variables or data types in the source code if needed. Note: you can plot the graph on your machine with Python; search online for how to set up Python and plot the graph (the Matplotlib library is recommended).

2. OpenMP: pi will be computed with a Riemann integral (http://mathworld.wolfram.com/RiemannIntegral.html) over half a circle, as in Figure 6. Since the area of a circle with radius 1 is equal to π, this integral yields π/2. The algorithm implemented is:
• Create an array rect containing the indices 0 to numsteps
• Create an array midPt that contains the middle points of all the rectangles
• Create an array area that contains the area of all the rectangles
• Sum over area and multiply by 2 (yielding π)
The code is already prepared in the files pi_simple.cpp and pi_simple.h.
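As a starting point, here is a minimal OpenMP sketch of the computation (the exercise skeleton uses the explicit arrays rect, midPt and area; the sketch below collapses them into a single reduction loop, and the variable names are illustrative):

#include <omp.h>
#include <math.h>
#include <stdio.h>

/* Compile with: gcc -fopenmp pi_sketch.c -lm */
int main(void){
    const long num_steps = 10000000;          /* number of rectangles (illustrative) */
    const double width = 2.0 / num_steps;     /* the half circle spans x in [-1, 1] */
    double half_area = 0.0;
    long i;

    /* Each rectangle's height is sqrt(1 - x^2) at its midpoint;
     * the reduction clause sums the areas without a race. */
    #pragma omp parallel for reduction(+:half_area)
    for (i = 0; i < num_steps; i++){
        double x = -1.0 + (i + 0.5) * width;  /* midpoint of rectangle i */
        half_area += sqrt(1.0 - x * x) * width;
    }

    printf("pi ~ %.10f\n", 2.0 * half_area);  /* the half-circle area is pi/2 */
    return 0;
}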


3. OpenMP: Matrix multiplication is a standard problem in HPC. This computation is exemplified by the Basic Linear Algebra Subprograms (BLAS) function SGEMM, and many libraries contain highly optimized code for it. In this exercise we define 3 matrices A, B and C of dimension N x N. All elements of matrix A are equal to 1, and all values in B are set to 2. Every element of the resulting matrix C should therefore equal 1*2*N = 2N.
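A minimal OpenMP sketch of this computation (a naive triple loop, not the optimized SGEMM; the small N and the flat row-major arrays are illustrative choices):

#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

#define N 512   /* illustrative size; the exercise uses larger values */

int main(void){
    /* Heap allocation avoids stack overflow for large N */
    float *A = malloc(N * N * sizeof(float));
    float *B = malloc(N * N * sizeof(float));
    float *C = malloc(N * N * sizeof(float));
    int i, j, k;

    for (i = 0; i < N * N; i++){ A[i] = 1.0f; B[i] = 2.0f; C[i] = 0.0f; }

    /* Parallelize the outer loop: each thread owns distinct rows of C, so there is no race */
    #pragma omp parallel for private(j, k)
    for (i = 0; i < N; i++)
        for (j = 0; j < N; j++)
            for (k = 0; k < N; k++)
                C[i * N + j] += A[i * N + k] * B[k * N + j];

    printf("C[0][0] = %f (expected %f)\n", C[0], 2.0f * N);
    free(A); free(B); free(C);
    return 0;
}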

4. OpenMP: Cholesky Decomposition Algorithm (Bonus). A standard problem in HPC is solving a system of linear equations. What values do you need for a, b, c and d to fulfill these equations?

2a + b + 2c + 5d = 24
a + 3b + c + 4d = 15
2a + b + 4c + 7d = 28
5a + 4b + 7c + 3d = -21

One solution method is so-called matrix decomposition. In many cases these problems lead to a symmetric (and positive definite) matrix, which can be efficiently decomposed with the Cholesky decomposition algorithm. Note: the parallel versions of this Cholesky implementation do not scale very well with the number of CPUs.
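For reference, the decomposition itself can be sketched as follows (a serial Cholesky-Banachiewicz reference, independent of the provided source code; the function name and row-major layout are illustrative). It computes a lower-triangular matrix L with A = L * L^T:

#include <math.h>

/* Decompose the symmetric positive-definite n x n matrix A (row-major)
 * into a lower-triangular L with A = L * L^T.
 * L is assumed zero-initialized; entries above the diagonal stay zero. */
void cholesky(const double *A, double *L, int n){
    int i, j, k;
    for (i = 0; i < n; i++){
        for (j = 0; j <= i; j++){
            double s = 0.0;
            for (k = 0; k < j; k++)
                s += L[i * n + k] * L[j * n + k];
            if (i == j)
                L[i * n + j] = sqrt(A[i * n + i] - s);            /* diagonal entry */
            else
                L[i * n + j] = (A[i * n + j] - s) / L[j * n + j]; /* below the diagonal */
        }
    }
}

Once L is known, the system is solved by forward substitution with L followed by back substitution with L^T.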

Note: Students just need to modify the given source code. All exercises come with a .py file that plots a graph for evaluating performance across scales and problem sizes, so you need to record the results and plot the graph. The number of threads and the problem sizes are declared in the .py files.
