0% found this document useful (0 votes)

7 views4 pages

Tp2 - Openmp (Introduction) : Imad Kissami

The document outlines a series of exercises for an OpenMP course, focusing on parallel programming techniques. Exercises include creating OpenMP programs for thread management, parallelizing PI calculations, matrix multiplication, and the Jacobi method. Each exercise emphasizes the importance of understanding shared versus private variables and performance analysis through various threading configurations.

Uploaded by

Mohi Gpt4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views4 pages

Tp2 - Openmp (Introduction) : Imad Kissami

Uploaded by

Mohi Gpt4

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Mohammed VI Polytechnic University

TP2 - OpenMP (Introduction)

Imad Kissami
February 16, 2025

Exercise 1:
In this very simple exercise, you need to :

1. Write an OpenMP program displaying the number of threads used for the execution and
the rank of each of the threads.

2. Compile the code manually to create a monoprocessor executable and a parallel executable.

3. Test the programs obtained with different numbers of threads for the parallel program,
without submitting in batch.

Output example for the parallel program with 4 threads :

Hello from the rank 2 thread
Hello from the rank 1 thread
Hello from the rank 3 thread
Hello from the rank 0 thread
Parallel execution of hello_world with 4 threads

Exercise 2: Parallelizing of PI calculation

static long num_steps = 100000;
double step;
int main ()
{
int i; double x, pi , sum = 0.0;
step = 1.0/( double) num_steps;
for (i=0;i< num_steps; i++){
x = (i+0.5)* step;
sum = sum + 4.0/(1.0+x*x);
}
pi = step * sum;
}

1. Create a parallel version of the pi program using a parallel construct.

2. Don’t use #pragma parallel for

3. Pay close attention to shared versus private variables.

4. use double omp_get_wtime() to calculate the CPU time.

Exercise 3: Pi with loops

• Go back to the serial pi program and parallelize it with a loop construct

• Your goal is to minimize the number of changes made to the serial program (add only 1
line)
2

Exercise 4: Parallelizing Matrix Multiplication with OpenMP

// Allocate memory dynamically

double *a = (double *) malloc(m * n * sizeof(double ));
double *b = (double *) malloc(n * m * sizeof(double ));
double *c = (double *) malloc(m * m * sizeof(double ));

// Initialize matrices
for (int i = 0; i < m; i++) {
for (int j = 0; j < n; j++) {
a[i * n + j] = (i + 1) + (j + 1); // Access via 1D indexing
}
}

for (int i = 0; i < n; i++) {

for (int j = 0; j < m; j++) {
b[i * m + j] = (i + 1) - (j + 1);
}
}

for (int i = 0; i < m; i++) {

for (int j = 0; j < m; j++) {
c[i * m + j] = 0;
}
}

// Matrix multiplication
for (int i = 0; i < m; i++) {
for (int j = 0; j < m; j++) {
for (int k = 0; k < n; k++) {
c[i * m + j] += a[i * n + k] * b[k * m + j];
}
}
}

The code calculates the matrix product:

C =A×B

• In this exercise, you must:

1. Insert the appropriate OpenMP directives and analyze the code performance.
2. Use Collapse directive to parallelize this matrix multiplication code.
3. Run the code using 1, 2, 4, 8, 16 threads and plot the speedup and eﬀiciency.
4. Test the loop iteration repartition modes (STATIC, DYNAMIC, GUIDED) and vary the
chunk sizes.

Exercise 5: Parallelizing of Jacobi Method with OpenMP

The program solves a general linear system using the Jacobi iterative method.
# include <stdio.h>
# include <stdlib.h>
# include <string.h>
# include <float.h>
# include <math.h>
# include <sys/time.h>
# include <omp.h> // Replaces time.h

// Default matrix size

# ifndef VAL_N
# define VAL_N 120
#endif
# ifndef VAL_D
# define VAL_D 80
#endif

// Random initialization of an array

void random_number(double* array , int size) {

for (int i = 0; i < size; i++) {
array[i] = (double)rand () / (double )( RAND_MAX - 1);
}
}

int main () {
int n = VAL_N , diag = VAL_D;
int i, j, iteration = 0;
double norme;

// Correct 2D matrix allocation

double *a = (double *) malloc(n * n * sizeof(double ));
double *x = (double *) malloc(n * sizeof(double ));
double *x_courant = (double *) malloc(n * sizeof(double ));
double *b = (double *) malloc(n * sizeof(double ));

if (!a || !x || !x_courant || !b) {

fprintf(stderr , "Memory␣allocation␣failed !\n");
exit(EXIT_FAILURE );
}

// Time measurement variables

struct timeval t_elapsed_0 , t_elapsed_1;
double t_elapsed;

double t_cpu_0 , t_cpu_1 , t_cpu;

// Matrix and RHS initialization

srand (421); // For reproducibility
random_number(a, n * n);
random_number(b, n);

// Strengthening the diagonal

for (i = 0; i < n; i++) {
a[i * n + i] += diag; // Corrected indexing
}

// Initial solution
for (i = 0; i < n; i++) {
x[i] = 1.0;
}

// Start timing
t_cpu_0 = omp_get_wtime ();
gettimeofday (& t_elapsed_0 , NULL );

// Jacobi Iteration
while (1) {
iteration ++;

for (i = 0; i < n; i++) {

x_courant[i] = 0;
for (j = 0; j < i; j++) {
x_courant[i] += a[j * n + i] * x[j]; // Corrected indexing
}
for (j = i + 1; j < n; j++) {
x_courant[i] += a[j * n + i] * x[j]; // Corrected indexing
}
x_courant[i] = (b[i] - x_courant[i]) / a[i * n + i]; // Corrected indexing
}

// Convergence test
double absmax = 0;
for (i = 0; i < n; i++) {
double curr = fabs(x[i] - x_courant[i]);
if (curr > absmax)
absmax = curr;
}
norme = absmax / n;

if (( norme <= DBL_EPSILON) || (iteration >= n)) break;

// Copy x_courant to x
memcpy(x, x_courant , n * sizeof(double ));
}
4

// End timing
gettimeofday (& t_elapsed_1 , NULL );
t_elapsed = (t_elapsed_1.tv_sec - t_elapsed_0.tv_sec) +
(t_elapsed_1.tv_usec - t_elapsed_0.tv_usec) / 1e6;

t_cpu_1 = omp_get_wtime ();

t_cpu = t_cpu_1 - t_cpu_0;

// Print result
fprintf(stdout , "\n\n"
"␣␣␣System␣size␣␣␣␣␣␣␣␣␣:␣%5d\n"
"␣␣␣Iterations␣␣␣␣␣␣␣␣␣␣:␣%4d\n"
"␣␣␣Norme␣␣␣␣␣␣␣␣␣␣␣␣␣␣␣:␣%10.3E\n"
"␣␣␣Elapsed␣time␣␣␣␣␣␣␣␣:␣%10.3E␣sec.\n"
"␣␣␣CPU␣time␣␣␣␣␣␣␣␣␣␣␣␣:␣%10.3E␣sec.\n",
n, iteration , norme , t_elapsed , t_cpu
);

// Free allocated memory

free(a);
free(x);
free(x_courant );
free(b);

return EXIT_SUCCESS;
}

A×x=b

1. In this exercice, you must solve the system in parallel.

2. Run the code using 1, 2, 4, 8, 16 threads and plot the speedup and eﬀiciency.

Parallel Computing Lab Manual PDF
100% (1)
Parallel Computing Lab Manual PDF
51 pages
Lab Pratice First Lab Manual
No ratings yet
Lab Pratice First Lab Manual
81 pages
Lab Programs
No ratings yet
Lab Programs
18 pages
Programming - AutoCAD 2004 Activex and VBA Developer's Guide
100% (4)
Programming - AutoCAD 2004 Activex and VBA Developer's Guide
398 pages
SPPU Pattern2019 Fds Unit 2
No ratings yet
SPPU Pattern2019 Fds Unit 2
31 pages
Campus Recruitment System - Mini Project Report
No ratings yet
Campus Recruitment System - Mini Project Report
17 pages
PASHA ICT Computer Packages
100% (3)
PASHA ICT Computer Packages
109 pages
All 1313 Combined 2
No ratings yet
All 1313 Combined 2
114 pages
Boeing 777 Manual
No ratings yet
Boeing 777 Manual
11 pages
IBM DS8900F Performance Best Practices and Monitoring
No ratings yet
IBM DS8900F Performance Best Practices and Monitoring
294 pages
Report On The Physical Count of Inventories Office Equipment
No ratings yet
Report On The Physical Count of Inventories Office Equipment
13 pages
A11 Manual
100% (1)
A11 Manual
53 pages
Cloud Security For Dummies Webinar
100% (1)
Cloud Security For Dummies Webinar
14 pages
CP4252 Multicore Architecture and Programming Lab Manual
No ratings yet
CP4252 Multicore Architecture and Programming Lab Manual
26 pages
Multicore Architecture and Programming Lab Manual
No ratings yet
Multicore Architecture and Programming Lab Manual
29 pages
Vector Addition: Exercise 1 (Openmp-I) Scenario - I
100% (1)
Vector Addition: Exercise 1 (Openmp-I) Scenario - I
15 pages
EapSimAka - Seek-For-Android - Support For EAP-SIM and EAP-AKA in Android
No ratings yet
EapSimAka - Seek-For-Android - Support For EAP-SIM and EAP-AKA in Android
5 pages
Coding Sites - : Hackerearth or Hackerrank Interviewbit
No ratings yet
Coding Sites - : Hackerearth or Hackerrank Interviewbit
4 pages
Omp Hands On SC08 PDF
No ratings yet
Omp Hands On SC08 PDF
153 pages
Vsphere Esxi Vcenter Server 65 Security Guide
No ratings yet
Vsphere Esxi Vcenter Server 65 Security Guide
234 pages
PC File
No ratings yet
PC File
57 pages
Day 2 1 Advanced-Openmp
No ratings yet
Day 2 1 Advanced-Openmp
52 pages
Minecraft Single Player Commands Mod Command List
No ratings yet
Minecraft Single Player Commands Mod Command List
5 pages
Excelente
No ratings yet
Excelente
64 pages
Coal 1
No ratings yet
Coal 1
55 pages
Omp Hands On SC08
No ratings yet
Omp Hands On SC08
153 pages
Mgate 5119 Series User'S Manual: Version 1.0, February 2022
No ratings yet
Mgate 5119 Series User'S Manual: Version 1.0, February 2022
67 pages
Yuma Dev Manual
No ratings yet
Yuma Dev Manual
97 pages
PDC-Lab 21BCE10419
No ratings yet
PDC-Lab 21BCE10419
20 pages
Lab 3
No ratings yet
Lab 3
23 pages
Omp Exercises
No ratings yet
Omp Exercises
81 pages
Dell Unity-Pools-Config
No ratings yet
Dell Unity-Pools-Config
78 pages
E 3 (Openmp - Iii) : Matrix Multiplication
No ratings yet
E 3 (Openmp - Iii) : Matrix Multiplication
10 pages
Xe 62011 Open MP
No ratings yet
Xe 62011 Open MP
46 pages
Exercise 1 (Openmp-I)
No ratings yet
Exercise 1 (Openmp-I)
10 pages
Lab Manual
No ratings yet
Lab Manual
31 pages
HPC Programs
No ratings yet
HPC Programs
19 pages
Mercury Performance Test Tool
No ratings yet
Mercury Performance Test Tool
77 pages
20bce2126 PDC Lab Da 3
No ratings yet
20bce2126 PDC Lab Da 3
11 pages
Embedded Microprocessors and Its Applications
No ratings yet
Embedded Microprocessors and Its Applications
33 pages
MPC LAB Manual New
No ratings yet
MPC LAB Manual New
24 pages
Gauravkumar 221it027@it301 Lab2
No ratings yet
Gauravkumar 221it027@it301 Lab2
28 pages
Cp4292 Multicore Lab Multicore Lab Removed
No ratings yet
Cp4292 Multicore Lab Multicore Lab Removed
37 pages
CP 4292 MCP Lab Manual
No ratings yet
CP 4292 MCP Lab Manual
20 pages
MAP Lab Mannual
No ratings yet
MAP Lab Mannual
24 pages
4 Performance.4x
No ratings yet
4 Performance.4x
14 pages
Parallel and Distributed Computing Lab Digital Assignment - 3
No ratings yet
Parallel and Distributed Computing Lab Digital Assignment - 3
10 pages
HPC Codes-2
No ratings yet
HPC Codes-2
15 pages
LSMW TCP As91new1
No ratings yet
LSMW TCP As91new1
12 pages
Programming Assignment: On Openmp
No ratings yet
Programming Assignment: On Openmp
19 pages
HPC Project Report
No ratings yet
HPC Project Report
10 pages
Openmp Lab: Antonio Gómez-Iglesias Agomez@Tacc - Utexas.Edu Texas Advanced Computing Center
No ratings yet
Openmp Lab: Antonio Gómez-Iglesias Agomez@Tacc - Utexas.Edu Texas Advanced Computing Center
17 pages
OpenMP Basics
No ratings yet
OpenMP Basics
47 pages
MAP Lab Completed
No ratings yet
MAP Lab Completed
29 pages
Lab # 2 by Akram
No ratings yet
Lab # 2 by Akram
14 pages
MPC LAB Manual New
No ratings yet
MPC LAB Manual New
23 pages
Mcap-Lab Manual 1
No ratings yet
Mcap-Lab Manual 1
19 pages
Question 1 - Serial: Output
No ratings yet
Question 1 - Serial: Output
9 pages
(Serial)
No ratings yet
(Serial)
8 pages
CP4292 Multicore Architecture Lab Manual
No ratings yet
CP4292 Multicore Architecture Lab Manual
36 pages
Class XI (As Per CBSE Board) : Informatics Practices
No ratings yet
Class XI (As Per CBSE Board) : Informatics Practices
9 pages
1 Introduction p3
No ratings yet
1 Introduction p3
10 pages
Multicore
No ratings yet
Multicore
23 pages
CP4292 Mcap
No ratings yet
CP4292 Mcap
24 pages
OpenMP Programs
No ratings yet
OpenMP Programs
4 pages
HPC Lab Manual 2317 Merged Organized
No ratings yet
HPC Lab Manual 2317 Merged Organized
35 pages
MC Openmp
No ratings yet
MC Openmp
10 pages
Assignment No. 02 Semester: Spring 2021 CS304-Object Oriented Programming Total Marks: 20
No ratings yet
Assignment No. 02 Semester: Spring 2021 CS304-Object Oriented Programming Total Marks: 20
7 pages
Assignment 04
No ratings yet
Assignment 04
16 pages
Lab3 PAP
No ratings yet
Lab3 PAP
14 pages
U1 Programa4 S12021
No ratings yet
U1 Programa4 S12021
6 pages
Homework #4 El6201 - Parallel System: 1 Openmp Matrix Addition
No ratings yet
Homework #4 El6201 - Parallel System: 1 Openmp Matrix Addition
6 pages
UiPath RPAv1
No ratings yet
UiPath RPAv1
5 pages
Part 1
No ratings yet
Part 1
48 pages
Appendix A: PS/2 Mouse Commands: Commands Sent by Host To Mouse Code Description and Mouse Action
No ratings yet
Appendix A: PS/2 Mouse Commands: Commands Sent by Host To Mouse Code Description and Mouse Action
5 pages
OpenMP Matrix
No ratings yet
OpenMP Matrix
6 pages
Part 7
No ratings yet
Part 7
41 pages
BAIT3273 Tutorial 5: Core Cloud Services - Manage Services With The Azure Portal
No ratings yet
BAIT3273 Tutorial 5: Core Cloud Services - Manage Services With The Azure Portal
4 pages
Part 2
No ratings yet
Part 2
33 pages
AMD Instinct™ MI100 Microarchitecture - ROCm Documentation
No ratings yet
AMD Instinct™ MI100 Microarchitecture - ROCm Documentation
4 pages
Tp3 - Openmp (Parallel Sections, Single, Master, Synchronization)
No ratings yet
Tp3 - Openmp (Parallel Sections, Single, Master, Synchronization)
3 pages
Lab 7
No ratings yet
Lab 7
3 pages
OpenMP 2
No ratings yet
OpenMP 2
3 pages
Lab 2
No ratings yet
Lab 2
2 pages
Prob pp2
No ratings yet
Prob pp2
2 pages
Practice OpenMP
No ratings yet
Practice OpenMP
2 pages
Inf3380 Oblig2 2011
No ratings yet
Inf3380 Oblig2 2011
3 pages
Mubashir CV
No ratings yet
Mubashir CV
3 pages
BP 30C25Z BP 30C25ZT Brochure
No ratings yet
BP 30C25Z BP 30C25ZT Brochure
2 pages
Java Practise Exercise
No ratings yet
Java Practise Exercise
3 pages
HHJGHH
No ratings yet
HHJGHH
1 page
C Programming
From Everand
C Programming
Netra
No ratings yet
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Amazing Java: Learn Java Quickly
From Everand
Amazing Java: Learn Java Quickly
Andrei Besedin
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet

Tp2 - Openmp (Introduction) : Imad Kissami

Uploaded by

Tp2 - Openmp (Introduction) : Imad Kissami

Uploaded by

Mohammed VI Polytechnic University

TP2 - OpenMP (Introduction)

Output example for the parallel program with 4 threads :

Exercise 2: Parallelizing of PI calculation

1. Create a parallel version of the pi program using a parallel construct.

2. Don’t use #pragma parallel for

3. Pay close attention to shared versus private variables.

4. use double omp_get_wtime() to calculate the CPU time.

Exercise 3: Pi with loops

Exercise 4: Parallelizing Matrix Multiplication with OpenMP

// Allocate memory dynamically

for (int i = 0; i < n; i++) {

for (int i = 0; i < m; i++) {

The code calculates the matrix product:

• In this exercise, you must:

Exercise 5: Parallelizing of Jacobi Method with OpenMP

// Default matrix size

// Random initialization of an array

void random_number(double* array , int size) {

// Correct 2D matrix allocation

if (!a || !x || !x_courant || !b) {

// Time measurement variables

double t_cpu_0 , t_cpu_1 , t_cpu;

// Matrix and RHS initialization

// Strengthening the diagonal

for (i = 0; i < n; i++) {

if (( norme <= DBL_EPSILON) || (iteration >= n)) break;

t_cpu_1 = omp_get_wtime ();

// Free allocated memory

1. In this exercice, you must solve the system in parallel.

You might also like