Program Execution ExpFinal
Program 1
#include <stdio.h>
#include <omp.h>
int main() {
    #pragma omp parallel
    printf("Hello from thread %d\n", omp_get_thread_num()); // each thread prints its own ID
    return 0;
}
Output
C:\Users\PC\Documents\HPC-College>a.exe
Explanation:
• #include <omp.h>: This includes the OpenMP header file, which provides the declarations of the OpenMP
functions and directives.
• #pragma omp parallel: This directive tells the compiler to create a parallel region in which multiple
threads execute the code in the block.
• printf("Hello from thread %d\n", omp_get_thread_num());: Inside the parallel region, each thread prints a
message showing its own thread number.
• To compile this program, you need a compiler that supports OpenMP. With GCC, you must use the
-fopenmp flag. Here's how to do that:
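gcc -fopenmp hello.c -o a.exe
Here, hello.c stands for the source file (the name is only illustrative) and a.exe is the produced executable.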
Running:
• When you execute the compiled program, "Hello from thread X" is printed, where X is the thread
number assigned by OpenMP. How many threads are actually created depends on the environment
settings or on the default configuration.
• Environment Variables:
• The number of threads used in the parallel region can be influenced through the
OMP_NUM_THREADS environment variable. For instance:
export OMP_NUM_THREADS=4
Thread Safety:
The OpenMP #pragma omp parallel directive marks the part of your code that should run in parallel.
Below is an explanation of how the code works and what it does.
1. Parallel Execution:
The #pragma omp parallel directive serves primarily to create a parallel region in
which multiple threads can execute code concurrently. This means that the block of code following
the directive is executed simultaneously by several threads, making parallel processing
possible.
2. Thread Management:
Whenever #pragma omp parallel is encountered, OpenMP handles thread creation
and destruction automatically. You do not need to create or synchronize threads yourself; OpenMP
manages these issues.
3. Block of Code Execution:
Each thread executes the code within the parallel region independently. This enables the
programmer to split the work among several threads, which in most cases reduces the total
execution time of the program if the tasks can be run in parallel.
Here,
• printf is called from within this parallel region by all threads. Since omp_get_thread_num() returns
the ID of the thread that executes the current code, each thread prints a different ID.
Important Features
1.Number of Threads:
The number of threads in the parallel region is decided by the OpenMP runtime system. You
can override this number using environment variables like OMP_NUM_THREADS or
programmatically specify this with omp_set_num_threads().
2. Data Sharing:
Variables in a parallel region can either be private to each thread or shared between
threads. Variables are shared by default unless otherwise specified through additional OpenMP clauses,
for example private, firstprivate, and lastprivate. A small illustrative sketch of these clauses follows
after this list.
3. Synchronization:
The basic synchronization issues are handled implicitly by OpenMP. For more complex
synchronization needs, such as avoiding race conditions, you may need additional OpenMP
constructs such as critical sections or atomic operations (a second sketch after this list shows an
atomic update).
4. Scalability:
The #pragma omp parallel directive allows you to write parallel code that scales with
the number of available processors or cores. By using multiple cores, you can gain substantial
speedup in computational tasks.
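As referenced in point 2 above, here is a small illustrative sketch of the private and firstprivate clauses. It is not part of the original program, and the variable names are hypothetical:
#include <stdio.h>
#include <omp.h>
int main() {
    int x = 10, y = 10;
    #pragma omp parallel private(x) firstprivate(y) num_threads(2)
    {
        // x is private and starts uninitialized; y is private but copied from the original value 10
        x = omp_get_thread_num();
        printf("thread %d: x=%d y=%d\n", omp_get_thread_num(), x, y);
    }
    printf("after the region: x=%d y=%d\n", x, y); // the original x and y are unchanged (both 10)
    return 0;
}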
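And, as referenced in point 3, a second sketch (again not from the original program) showing how an atomic update avoids a race condition when every thread increments the same shared counter:
#include <stdio.h>
#include <omp.h>
int main() {
    int counter = 0;
    #pragma omp parallel for
    for (int i = 0; i < 1000; i++) {
        #pragma omp atomic      // the increment becomes an indivisible update
        counter++;
    }
    printf("counter = %d\n", counter); // always 1000; without atomic the result could be wrong
    return 0;
}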
If you want to explicitly set the number of threads, you can use:
#include <omp.h>
#include <stdio.h>
int main() {
    omp_set_num_threads(4); // request 4 threads for the following parallel region
    #pragma omp parallel
    printf("Hello from thread %d\n", omp_get_thread_num());
    return 0;
}
This example is modified to force the number of threads to 4 with the call to
omp_set_num_threads(). The output will now contain messages from 4 threads, provided the system
supports that many threads.
Conclusion
The #pragma omp parallel directive is one of the basic OpenMP constructs for parallelizing code
for execution on multiple threads; it exploits multi-core processors for a performance benefit.
Here, the modification restricts the region to 4 threads, so the program executes with exactly
4 threads when the system can provide them.
#include <omp.h>
#include <stdio.h>
int main() {
    #pragma omp parallel
    #pragma omp single
    { // Create tasks: one thread creates a task for every loop iteration
        for (int i = 0; i < 10; i++) {
            #pragma omp task firstprivate(i)
            { // Process task i: any available thread may execute this block
              printf("Task %d processed by thread %d\n", i, omp_get_thread_num()); }
        }
        #pragma omp taskwait // wait for all created tasks to complete
    }
    return 0;
}
Output
C:\Users\PC\Documents\HPC-College>a.exe
Explanation of Code
1. Parallel Region:
#pragma omp parallel: This is the directive that initiates a parallel region where multiple threads
are created.
2. Single Directive:
#pragma omp single: This ensures that only one thread in the whole parallel region executes the
enclosed code block. It is mainly used so that only one thread creates the tasks or performs the
initialization.
3. Task Creation:
The block inside #pragma omp single contains a for loop running from 0 to 9, and every iteration
creates a new task with #pragma omp task.
4. Processing of Tasks:
Each task, represented by the block of code inside #pragma omp task, will be executed
asynchronously by any available thread in the parallel region.
5. Every line of the output indicates which thread has processed which task; the output as a whole
shows how the tasks are distributed among the threads.
6. Conclusion
With these modifications, the code now creates and executes tasks correctly inside a parallel
region. The #pragma omp single directive ensures only one thread creates tasks, while #pragma omp
taskwait ensures that all the tasks are completed before the parallel region is exited.
#include <omp.h>
#include <stdio.h>
#define N 1000
int main() {
    int array[N];
    #pragma omp parallel for
    for (int i = 0; i < N; i++) { array[i] = i; }                 // initialization: fill with 0..N-1
    #pragma omp parallel for simd
    for (int i = 0; i < N; i++) { array[i] = array[i] * 2; }      // parallelized and vectorized doubling
    for (int i = 0; i < 10; i++) printf("array[%d] = %d\n", i, array[i]); // print the first 10 elements
    return 0;
}
Output
C:\Users\PC\Documents\HPC-College>a.exe
array[0] = 0
array[1] = 2
array[2] = 4
array[3] = 6
array[4] = 8
array[5] = 10
array[6] = 12
array[7] = 14
array[8] = 16
array[9] = 18
Explanation of Directives
1. #pragma omp parallel for:
This directive informs the compiler to parallelize the for loop across multiple threads. A portion
of the loop's iterations is handled by each thread, and the iterations are distributed across threads
for concurrent execution.
2. #pragma omp simd:
This directive instructs the compiler to vectorize the loop using SIMD (Single Instruction,
Multiple Data). SIMD operations perform the same operation on many data points simultaneously
with a single instruction, which can greatly improve performance for vectorizable operations.
How It Works
1. Parallelization:
The #pragma omp parallel for directive spreads the iterations of the loop over the available threads, so
many iterations may be executed in parallel. The OpenMP runtime is responsible for spawning
threads and distributing the parallel work among them.
2. Vectorization:
The #pragma omp simd directive tells the compiler that it should produce SIMD instructions for the
loop. It allows the loop to exploit a modern processor's SIMD hardware capabilities, such as SSE or
AVX instructions on x86 architectures.
Let's take N = 1000. We create an array and initialize it with the values 0 to 999; the following
parallelized and vectorized loop then doubles every element of the array.
Initialization: array[i] = i; fills the array with the values 0 to 999.
Loop execution: array[i] = array[i] * 2; doubles each element of array[]. The loop is parallelized and
vectorized for better performance.
Practical Considerations
1. Compiler Support:
Make sure your compiler supports OpenMP and SIMD directives. Most modern compilers,
such as GCC, Clang, and Intel's ICC, do.
2. Flags for the Compiler:
While compiling, use flags to enable OpenMP and optimization. With GNU GCC and Clang,
for instance, the -fopenmp flag turns OpenMP on, and -O2 turns on optimizations, including SIMD
vectorization.
3. Data Alignment:
Ensure that your data is aligned in memory for SIMD operations. Some compilers and processors
need the data to be aligned on certain boundaries for the best SIMD performance (a short sketch is
shown after the next paragraph).
In this output, it can be seen that each element has been doubled, which shows that the
parallelized and vectorized loop worked as designed.
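As an optional, hedged sketch of the alignment point above (not taken from the original program, and aligned_alloc may not be available on every toolchain): C11's aligned_alloc provides 32-byte-aligned storage, and the aligned clause on the simd construct lets the compiler assume that alignment.
#include <stdio.h>
#include <stdlib.h>
#define N 1000
int main() {
    int *a = aligned_alloc(32, N * sizeof(int)); // 4000 bytes, a multiple of the 32-byte alignment
    #pragma omp parallel for simd aligned(a:32)
    for (int i = 0; i < N; i++)
        a[i] = i * 2;                            // compiler may emit aligned SIMD loads/stores
    printf("a[10] = %d\n", a[10]);               // prints 20
    free(a);
    return 0;
}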
#include <stdio.h>
int main() {
    int i, sum = 0;
    for (i = 1; i <= 100; i++)
        sum = sum + i;
    printf("Sum is %d\n", sum);
    return 0;
}
Output
Sum is 5050
#include <stdio.h>: Includes the standard input-output library necessary for using printf.
int main(): The entry point of the program. In C, main should return an int, so it's best to declare it
as int main().
int i, sum = 0;: Declares i for the loop counter and sum to accumulate the total.
sum = sum + i;: Adds the current value of i to sum in each iteration.
printf("Sum is %d\n", sum);: Prints the final sum after the loop finishes.
Now computing the sum with parallel programming and threads > Example program
#include <stdio.h>
#include <omp.h>
int main() {
    int sum = 0, tsum[4], i;   // sum holds the final result; tsum[] holds per-thread partial sums
    #pragma omp parallel num_threads(4) private(i)
    {
        int id = omp_get_thread_num();
        tsum[id] = 0;          // each thread clears its own partial sum
        #pragma omp for
        for (i = 0; i <= 100; i++)
            tsum[id] += i; }
    // After the parallel region, sum up the partial sums from each thread
    for (i = 0; i < 4; i++) {  // print each thread's partial sum and add it to the total
        printf("tsum[%d] = %d\n", i, tsum[i]);
        sum += tsum[i]; }
    // Print the final sum which is the total sum of numbers from 0 to 100
    printf("sum=%d\n", sum);
    return 0;
}
Output
sum=5050
Key Points of the Program
1. Initialization:
o int sum=0, tsum[4], i; initializes the variables. sum will store the final sum of all
integers from 0 to 100. tsum[4] is an array to store partial sums from each thread.
2. OpenMP Setup:
o Each thread initializes its local tsum[id] to 0, where id is the thread number.
3. Parallel Loop:
o #pragma omp for divides the for-loop iterations among the threads. Each thread
calculates its partial sum in tsum[id].
4. Aggregation:
o After the parallel region, the main thread prints the partial sum computed by each
thread and aggregates these into sum.