SWE2017 - Lab Assignment 1pages-7

program mpi code

Uploaded by

ccannavar

SWE2017 - Parallel Programming Lab Assignment – 6

alternative strategies like atomic operations or reduction techniques might be preferable, as
they can reduce the overhead associated with critical sections.

5.
Code:
#include <stdio.h>
#include <omp.h>

int main(void)
{
    int counter = 0;
    int total_transactions = 100;
    int thread_count = 5;
    int transactions_per_thread = total_transactions / thread_count; // 20 each

    #pragma omp parallel num_threads(thread_count) // Create 5 threads
    {
        for (int i = 0; i < transactions_per_thread; i++)
        {
            // Critical section to safely update the shared counter variable
            #pragma omp critical
            {
                counter++;
                printf("Thread %d processed a transaction. Current counter: %d\n",
                       omp_get_thread_num(), counter);
            }
        }
    }

    printf("Final counter value after all transactions: %d\n", counter);

    return 0;
}

Output:

The #pragma omp critical directive is essential in this scenario because it ensures that
only one thread at a time can access and update the counter variable. Without this
protection, multiple threads might read the same initial counter value simultaneously,
leading to race conditions. For example, two threads could read the same value (say 10),
both increment it, and both write 11 back to counter, resulting in a missed update and an
incorrect final count.

Performance Consideration
The use of #pragma omp critical serializes access to counter, meaning only one thread
can update it at any given moment. This can lead to thread contention and performance
degradation, especially if many threads frequently enter the critical section. In
high-throughput applications, alternative strategies like atomic operations (e.g.,
#pragma omp atomic) might be more efficient: an atomic update protects only a single
memory operation and can usually be mapped to a hardware atomic instruction, so it tends
to carry less overhead than a full critical section. A reduction clause, which gives each
thread a private copy of the counter and combines the copies at the end, avoids the
shared update altogether.
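
The two alternatives can be sketched as follows. This is a minimal illustration rather than part of the submitted answer: the helper names count_with_atomic and count_with_reduction are hypothetical, and the per-transaction printf is left out because an atomic construct can protect only a single update.

```c
#include <assert.h>

/* Hypothetical helper: count n transactions using an atomic increment. */
int count_with_atomic(int n)
{
    assert(n >= 0);
    int counter = 0;

    #pragma omp parallel for
    for (int i = 0; i < n; i++)
    {
        /* Only this single increment is protected, not a whole block */
        #pragma omp atomic
        counter++;
    }
    return counter;
}

/* Hypothetical helper: same count using a reduction clause. */
int count_with_reduction(int n)
{
    assert(n >= 0);
    int counter = 0;

    /* Each thread increments a private copy; copies are summed at the end */
    #pragma omp parallel for reduction(+:counter)
    for (int i = 0; i < n; i++)
    {
        counter++;
    }
    return counter;
}
```

Both helpers should return n for any non-negative n, for example count_with_atomic(100) for the 100 transactions above. Which one is faster depends on the thread count and the hardware, so it is worth measuring.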

6.
Code:
#include <mpi.h>
#include <stdio.h>

// Returns 1 if the six-digit number passes all checks below, otherwise 0
int is_valid(int number)
{
    int digits[6];
    int sum = 0;

    // Split the number into digits, most significant digit first
    for (int i = 5; i >= 0; --i)
    {
        digits[i] = number % 10;
        number /= 10;
    }

    // The leading digit must not be zero
    if (digits[0] == 0)
        return 0;

    // No two adjacent digits may be equal
    for (int i = 0; i < 5; ++i)
    {
        if (digits[i] == digits[i + 1])
            return 0;
    }

    // The digit sum must not be 7, 11, or 13
    for (int i = 0; i < 6; ++i)
    {
        sum += digits[i];
    }
    if (sum == 7 || sum == 11 || sum == 13)
        return 0;

    return 1;
}

int main(int argc, char **argv)
{
    int rank, size, local_count = 0, global_count = 0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    // Split [100000, 1000000) into contiguous blocks; the last rank also
    // takes the remainder when size does not divide 900000 evenly
    int chunk = 900000 / size;
    int start = 100000 + rank * chunk;
    int end = (rank == size - 1) ? 1000000 : start + chunk;

    for (int i = start; i < end; i++)
    {
        if (is_valid(i))
        {
            local_count++;
        }
    }

    // Sum the per-process counts on rank 0
    MPI_Reduce(&local_count, &global_count, 1, MPI_INT, MPI_SUM, 0,
               MPI_COMM_WORLD);

    if (rank == 0)
    {
        printf("Total valid identifiers: %d\n", global_count);
    }

    MPI_Finalize();
    return 0;
}
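
When the number of processes does not divide the 900000 candidates evenly, the leftover values still have to be assigned to some rank. The following is a minimal sketch of one balanced way to do that, written without MPI calls so it can be checked on its own. The helper name block_range is hypothetical and not part of the assignment.

```c
#include <assert.h>

/*
 * Hypothetical helper: compute the half-open range [*start, *end) that
 * `rank` should process when [lo, hi) is split across `size` ranks.
 * The first (total % size) ranks each receive one extra element.
 */
void block_range(int lo, int hi, int rank, int size, int *start, int *end)
{
    assert(size > 0 && rank >= 0 && rank < size && hi >= lo);

    int total = hi - lo;
    int base = total / size;   /* elements every rank receives */
    int extra = total % size;  /* leftover elements to spread out */

    *start = lo + rank * base + (rank < extra ? rank : extra);
    *end = *start + base + (rank < extra ? 1 : 0);
}
```

In the program above, a call such as block_range(100000, 1000000, rank, size, &start, &end) could replace the manual computation of start and end.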

Output:

7.

In a real-time traffic surveillance system that requires high-speed lane detection, optimizing
the Hough Transform is essential. Below is a refined approach using OpenMP for parallel
processing, workload division, atomic operations, and quantization, alongside an
explanation of adaptive edge detection thresholding.

1. OpenMP and MPI

OpenMP is more suitable than MPI for this application because it uses a shared-memory
model, allowing efficient real-time, frame-by-frame processing on a single multi-core
system. OpenMP’s thread-based parallelism reduces latency by distributing tasks across CPU
cores, which is critical in real-time settings. Conversely, MPI is optimized for distributed
systems and would introduce network communication overhead, slowing down the frame
processing required for real-time responsiveness in high-resolution video streams.

2. Workload Division for High-Resolution Frames (e.g., 4K)

In OpenMP, dividing a high-resolution frame (like 4K) into regions—either strips or tiles—
lets each thread process a smaller portion independently. This approach minimizes memory
contention and optimizes cache usage, as each thread performs edge detection and Hough
Transform operations only on its designated region. For example, each thread could handle
edge detection and voting in the Hough space for a strip of the image, then contribute its
results to a shared accumulator array, enabling the detection of lane markings across the
entire frame.
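
The strip-based division can be sketched as below. This is an illustration under assumptions: the edge map is taken to be a row-major array of bytes in which non-zero means "edge pixel", and the helper names strip_rows and count_edge_pixels are hypothetical. Counting edge pixels stands in for the real per-strip work.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: rows [*row_begin, *row_end) of strip s out of n_strips. */
void strip_rows(int height, int n_strips, int s, int *row_begin, int *row_end)
{
    assert(n_strips > 0 && s >= 0 && s < n_strips);
    *row_begin = (int)((long long)height * s / n_strips);
    *row_end = (int)((long long)height * (s + 1) / n_strips);
}

/* Count edge pixels, with each loop iteration handling one horizontal strip. */
long count_edge_pixels(const unsigned char *edges, int width, int height,
                       int n_strips)
{
    long total = 0;

    /* Strips are independent, so they can be handed out to threads statically */
    #pragma omp parallel for reduction(+:total) schedule(static)
    for (int s = 0; s < n_strips; s++)
    {
        int r0, r1;
        strip_rows(height, n_strips, s, &r0, &r1);

        for (int y = r0; y < r1; y++)
            for (int x = 0; x < width; x++)
                if (edges[(size_t)y * width + x])
                    total++;
    }
    return total;
}
```

For a 3840 x 2160 frame split into 8 strips, each strip covers 270 rows. Contiguous rows keep each thread's memory accesses sequential, which is the cache benefit described above.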

3. Using #pragma omp atomic for Shared Accumulator Array

The Hough Transform requires an accumulator array, where each element represents a line
parameter (angle and distance) and stores votes from detected edges. The #pragma omp
atomic directive helps avoid race conditions in this array by ensuring that each increment
operation is atomic, which means only one thread can update a specific accumulator cell at
a time. This ensures consistency when multiple threads vote on the same line, preventing
data loss or overwriting in the shared array.

To further reduce contention, threads could maintain private accumulator arrays during
processing, later combining them into the main accumulator array.
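
Both voting strategies can be sketched side by side. To keep the example short, it assumes the accumulator cell index for every vote has already been computed and stored in an array. The helper names vote_atomic and vote_private are hypothetical.

```c
#include <assert.h>
#include <stdlib.h>

/*
 * Hypothetical helper: add one vote per entry of `cells` into the shared
 * accumulator `acc`. Each cells[i] is assumed to lie in [0, n_cells).
 */
void vote_atomic(const int *cells, int n_votes, int *acc, int n_cells)
{
    #pragma omp parallel for
    for (int i = 0; i < n_votes; i++)
    {
        assert(cells[i] >= 0 && cells[i] < n_cells);
        /* Atomic increment: concurrent votes for the same cell are not lost */
        #pragma omp atomic
        acc[cells[i]]++;
    }
}

/* Same result, but each thread votes into a private array and merges once. */
void vote_private(const int *cells, int n_votes, int *acc, int n_cells)
{
    #pragma omp parallel
    {
        int *local = calloc((size_t)n_cells, sizeof *local);
        if (local == NULL)
            abort(); /* keep the sketch simple: give up on allocation failure */

        /* No synchronization needed here: `local` belongs to one thread */
        #pragma omp for nowait
        for (int i = 0; i < n_votes; i++)
            local[cells[i]]++;

        /* Merge this thread's votes into the shared accumulator */
        #pragma omp critical
        {
            for (int c = 0; c < n_cells; c++)
                acc[c] += local[c];
        }
        free(local);
    }
}
```

The private-array variant trades extra memory (one accumulator per thread) for far fewer synchronized updates. Whether that pays off depends on the accumulator size and on how many votes land in the same cells.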

4. Role of theta_quantize and r_quantize Functions

Quantizing the angle and distance values reduces the computational load, at the cost of
some resolution in the detected line parameters:

• theta_quantize: Discretizes the angle (θ) into a fixed number of bins, which limits the
number of orientations evaluated for each edge pixel and so saves processing time.
• r_quantize: Discretizes the distance (r) from the origin to the line, grouping similar
values into the same bin. This reduces the resolution of the Hough space and speeds
up the accumulation process; the bins should stay fine enough that distinct lane
markings still fall into different bins.
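
The source does not show how these two functions are implemented, so the following is only a sketch under assumptions: θ is taken to lie in [0, π), r in [-r_max, r_max], and the signatures, bin counts and the HOUGH_PI constant are hypothetical.

```c
#include <assert.h>

#define HOUGH_PI 3.14159265358979323846

/* Map an angle in [0, pi) to one of n_theta equal-width bins. */
int theta_quantize(double theta, int n_theta)
{
    assert(n_theta > 0);
    int bin = (int)(theta / HOUGH_PI * n_theta);
    if (bin < 0)
        bin = 0;
    if (bin >= n_theta)
        bin = n_theta - 1; /* clamp values at or beyond the upper edge */
    return bin;
}

/* Map a signed distance in [-r_max, r_max] to one of n_r equal-width bins. */
int r_quantize(double r, double r_max, int n_r)
{
    assert(n_r > 0 && r_max > 0.0);
    int bin = (int)((r + r_max) / (2.0 * r_max) * n_r);
    if (bin < 0)
        bin = 0;
    if (bin >= n_r)
        bin = n_r - 1; /* clamp values at or beyond the upper edge */
    return bin;
}
```

With 180 angle bins, each bin spans one degree. The pair (theta_quantize(θ, n_theta), r_quantize(r, r_max, n_r)) then identifies the accumulator cell that receives a vote.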
