0% found this document useful (0 votes)

61 views5 pages

Problem Statement

1) The problem is to write a program that takes a matrix as input and calculates the sum of each column to output in a result vector. 2) The program is implemented sequentially using a for loop to iterate through the matrix and add column elements to the result vector. 3) A parallel implementation is also created using threads, with one thread calculating the sum for each column concurrently.

Uploaded by

ash1205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

61 views5 pages

Problem Statement

Uploaded by

ash1205

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Problem Statement:

Given a matrix M of size n m, write a program using C++ that computes the sum of
each column so
that the result vector V of size m is defined like so:
xample:
!he matrix has size "x# hence n$" and m$#
the resultant vector of size m$# is sum of columns such that
V%&' with & in %(,#)
for &$(
V%('
sum from i$( to n*+ of M,i,&)
i-e- sum from i$( to . of M,i,() :-here &$( and n$"
i-e- M,(,() + M,+,() + M,/,() + M,.,()
i-e- / + 0 + .+ 1 $+0-
V%(' $ +0-
for &$+
V%+'
sum from i$+ to n*+ of M,i,&)
i-e- sum from i$+ to . of M,i,+) :-here &$+ and n$"
i-e- M,(,+) + M,+,+) + M,/,+) + M,.,+)
i-e- . + # + "+ + $+2-
V%+' $ +2-
similarl3 all the # values ,from ( through 4) can 5e computed as shown in the figure-
Task 1: Sequential and scalar implementation.
6artial sum is implemented in c++ with floating point values of single precision using
float data t3pe- 7 two dimensional matrix M of n x m is used to store the input and a
one dimensional arra3 of size m is used to store the output- !he choice of data
structures is to keep the solution simple-
!he program is written for and targeted at a 2" 5it 8inux with intel core i. 9ehalem
,the code*name for an :ntel processor micro*architecture) architecture-
!he program has three parts:
+- accept input-
:nput is stored in a two dimensional arra3 named M of size rowlength x collength-
7ll the elements are floats- Values are stored from ( through row*+ and ( through col*+
last row and col are empt3- :ndex starts from ( therefore +
st
row +
st
col element is given
53 %('%(' and so forth as indicated in ta5le 5elow-
4
,(,()
1
,(,+)
.
,(,/)
(
,(,.)
(
,+,()
.
,+,+)
/
,+,/)
+
,+,.)
/
,/,()
+
,/,+)
+
,/,/)
(
,/,.)
/- calculate partial sum-
!his is the computation part of algorithm- :t solves the pro5lem se;uentiall3 53
computing values of V[j] for & from ( through collength.
for (int i = 0; i < rowlength; ++i)
{
for (int j = 0; j < collength; ++j)
{
V[j] += M[i][j];
}
}
!he loop works as follows:
first i is set to ( and & loops from ( through . ,as collength is " in our example)
when j < collength evaluated as false, the inner loop exits and : is incremented-
9ow, i is set to + and & loops again from ( through .- !his continues till the outer loop
condition fails-
!he Vector V during the execution is as follows:
:nitiall3 when it is declared, V is initiated to all zeroes- !here fore 5efore loop is
entered V$ <(,(,(,(=-
when inner loop is executed with i$( fixed, V is same as all first row elements, ,(,(),
,(,+), ,(,/), ,(,.)- V$<(,(,(,(=+<4,1,.,(=$<4,1,.,(=-
when inner loop is executed with i$+ fixed, V is changed to second row elements added
to first row elements, ,+,(), ,+,+), ,+,/), ,+,.)- V$<4,1,.,(=+<(,.,/,+=$<4,#,1,+=
and so on-
:n other words,
i$(
&$( v%('$ v%('+M%('%('$ (+4 $4
&$+ v%+'$ v%+'+M%('%+'$ (+1 $1
&$/ v%/'$ v%/'+M%('%/'$ (+. $.
&$. v%.'$ v%.'+M%('%.'$ (+( $(
i$+
&$( v%('$ v%('+M%+'%('$ 4+( $4
&$+ v%+'$ v%+'+M%+'%+'$ 1+. $#
&$/ v%/'$ v%/'+M%+'%/'$ .+/ $1
&$. v%.'$ v%.'+M%+'%.'$ (++ $+
i$/
&$( v%('$ v%('+M%/'%('$ 4+/ $0
&$+ v%+'$ v%+'+M%/'%+'$ #++ $0
&$/ v%/'$ v%/'+M%/'%/'$ 1++ $2
&$. v%.'$ v%.'+M%/'%.'$ ++( $+
7ll these steps are performed se;uentiall3 one after the other-
.-printing the output-
!his is simpl3 done 53 looping through the output vector V from ( through m-
sample output:
Task 2: Parallel Implementation
7 parallel implementation of partial sum is done using pthreads 5uilt in c++>s newest
standard std++-
7dvantages:
6rallel processing can easil3 5e achieved using <thread> header file- Calling
thread (funP,arguments) function creates a thread which executes fun6 function
parallel to parent thread executing main,) function-
8imitations:
c++++ standard thread li5rar3 is not implemented on all compilers- :t is there in GCC
on 8inux, 5ut not on M:9G? on windows- !@M*GCC is the one of few if not onl3
compiler on windows to implements this li5rar3-
:f a two dimentional arra3 needs to passed as an argument, its size should 5e known
and specified 5efore compilation- 7 workaround was to define the M and V as glo5al
varia5les-
!he data structure for holding Matrix M is changed from two dimentioanl arra3 to a
single dimentional arra3 of size nAm- M%i'%&' for /@ is e;uivalent of M%iAcol+&'-
6arallel computation part of algorithm:
:nstead of se;uentiall3 computing value of V%&' one after another, we have a thread
each for ever3 element in V-
:f there are m columns in matrix, we create m threads each to derive the partial sum
for that column-
for (int j = 0; j < collength; ++j)
{
thread T(addcol, j, rowlength, collength);
T.join();
}
!his loop creates a thread each for each column and calls addcols function- Boin,) is
used so that main,) waits till all the threads complete their execution- !his is to ensure
that main does not exit prematurel3-
void addcol(int j, int rowlength, int collength)
{
for(int i=0; i< rowlength; i++)
{
V[j] += M[i*collength+j];
}
cout<< "thread" << j << "computed V[" << j << "] as"<< V[j] << endl;
}
!he addcolfunction adds each column values and gets value of one element of V-
!he long output is to demonstrate that the values are generated 53 individual threads-
?hen n is large, there is a huge advantage of parallel computing as compared to
se;uential solution- ?hen 5oth n and m are small, the se;uential solution is 5etter
than overhead of creating threads- Cor moderate size inputs, 5oth solutions> run time is
compara5le-
Dample output:

Lec03 1 Program Optimizations
No ratings yet
Lec03 1 Program Optimizations
43 pages
Dsa File
No ratings yet
Dsa File
60 pages
Lecture 7 - Optimizations - A 2025
No ratings yet
Lecture 7 - Optimizations - A 2025
55 pages
2D Array Lab Manual
No ratings yet
2D Array Lab Manual
6 pages
CP4292 Mcap
No ratings yet
CP4292 Mcap
24 pages
Daa 2
No ratings yet
Daa 2
9 pages
L18 L19 2DArray
No ratings yet
L18 L19 2DArray
30 pages
Reaseacrch Papers
No ratings yet
Reaseacrch Papers
1 page
Business Attire Bill
No ratings yet
Business Attire Bill
1 page
Comp2011 s2024 Midterm Questions
No ratings yet
Comp2011 s2024 Midterm Questions
18 pages
RECORDeditable
No ratings yet
RECORDeditable
33 pages
I Bcom Ca C PRG
No ratings yet
I Bcom Ca C PRG
17 pages
Soumya Pandey - Freelance Agreement PDF
No ratings yet
Soumya Pandey - Freelance Agreement PDF
4 pages
MCS-011 2024-2025 - From DHANBAD
No ratings yet
MCS-011 2024-2025 - From DHANBAD
24 pages
5.user Defined Functions
No ratings yet
5.user Defined Functions
2 pages
Graded Lab 3
No ratings yet
Graded Lab 3
3 pages
Flowchart and Guidelines For Non-Degree Applications 2025 Via Google Form
No ratings yet
Flowchart and Guidelines For Non-Degree Applications 2025 Via Google Form
2 pages
FCP Assigment
No ratings yet
FCP Assigment
32 pages
Student Name Student Registration Number Class &section: AIML Study Level: UG/PG Year &term: Subject Name Name of The Assessment Date of Submission
No ratings yet
Student Name Student Registration Number Class &section: AIML Study Level: UG/PG Year &term: Subject Name Name of The Assessment Date of Submission
13 pages
Finance Services.3
No ratings yet
Finance Services.3
10 pages
AbhinavSingh Amfile 02
No ratings yet
AbhinavSingh Amfile 02
36 pages
12 - Ayush Ahirrao - Assignment 2
No ratings yet
12 - Ayush Ahirrao - Assignment 2
10 pages
Matrix Chain
No ratings yet
Matrix Chain
5 pages
Java News April 2025 Scribd Ready
No ratings yet
Java News April 2025 Scribd Ready
3 pages
Programming For Problem Solving: Experiment - 7
No ratings yet
Programming For Problem Solving: Experiment - 7
4 pages
NM Record
No ratings yet
NM Record
87 pages
Lab Report 8
No ratings yet
Lab Report 8
28 pages
C++ Code Assessment
No ratings yet
C++ Code Assessment
26 pages
Greedy
No ratings yet
Greedy
17 pages
Lab 6
No ratings yet
Lab 6
6 pages
NM & S Record - Cce Correction
No ratings yet
NM & S Record - Cce Correction
35 pages
Tiny Project 1
No ratings yet
Tiny Project 1
2 pages
C++ Program
No ratings yet
C++ Program
17 pages
Diagnostic Test 15 Dependent Prepositions
No ratings yet
Diagnostic Test 15 Dependent Prepositions
1 page
Role of Principal
No ratings yet
Role of Principal
3 pages
Yashaswini (DBMS)
No ratings yet
Yashaswini (DBMS)
8 pages
Rock Cycle - Metamorphic Rocks
No ratings yet
Rock Cycle - Metamorphic Rocks
33 pages
LEC12-Optimization and New Trends
No ratings yet
LEC12-Optimization and New Trends
23 pages
All DS Questions GEEKY
No ratings yet
All DS Questions GEEKY
255 pages
Multicore Architecture and Programming Lab Manual
No ratings yet
Multicore Architecture and Programming Lab Manual
29 pages
Arrays Dsa
No ratings yet
Arrays Dsa
13 pages
Lab6 - Linear Algebra in C On A Microcontroller
No ratings yet
Lab6 - Linear Algebra in C On A Microcontroller
8 pages
DS Record
No ratings yet
DS Record
26 pages
Experiment 4
No ratings yet
Experiment 4
3 pages
Lab
No ratings yet
Lab
22 pages
3 RD Sem Results
No ratings yet
3 RD Sem Results
2 pages
Mat Multipli
No ratings yet
Mat Multipli
4 pages
Lab 7
No ratings yet
Lab 7
4 pages
Mid Term 1 - Solution
No ratings yet
Mid Term 1 - Solution
4 pages
OpenAcc Assignment Questions
No ratings yet
OpenAcc Assignment Questions
11 pages
F-22 Paper Model Template Craft
No ratings yet
F-22 Paper Model Template Craft
1 page
Chalukya Exp Second Ac (2A) : Electronic Reservation Slip (ERS)
No ratings yet
Chalukya Exp Second Ac (2A) : Electronic Reservation Slip (ERS)
3 pages
Worksheet 6 - Solution
No ratings yet
Worksheet 6 - Solution
4 pages
F22 PF Final+Solution
No ratings yet
F22 PF Final+Solution
15 pages
Arrays 2D Lecture 17
No ratings yet
Arrays 2D Lecture 17
20 pages
Notice of Recurrence: U.S. Department of Labor
No ratings yet
Notice of Recurrence: U.S. Department of Labor
4 pages
09 Pointers Arrays
No ratings yet
09 Pointers Arrays
34 pages
Final Exam DC AC Machinery May 23
No ratings yet
Final Exam DC AC Machinery May 23
5 pages
Engineering Foundation 2020-2021
No ratings yet
Engineering Foundation 2020-2021
5 pages
Unit 2 Basic Optimization Techniques For Serial Code
No ratings yet
Unit 2 Basic Optimization Techniques For Serial Code
31 pages
C++ Implementation
No ratings yet
C++ Implementation
2 pages
Lab - 03 Task On 2D Array
No ratings yet
Lab - 03 Task On 2D Array
3 pages
Lab3
No ratings yet
Lab3
11 pages
6 Internship Contract Agreement f2f
No ratings yet
6 Internship Contract Agreement f2f
2 pages
Lab 7
No ratings yet
Lab 7
3 pages
CEO Database
100% (2)
CEO Database
176 pages
Business Data Analysis Using Excel, 2010 (David Whigham) PDF
75% (4)
Business Data Analysis Using Excel, 2010 (David Whigham) PDF
315 pages
Lab 7
No ratings yet
Lab 7
3 pages
Double Dimensional Array: Initialization Method Method-1 Method - 2
No ratings yet
Double Dimensional Array: Initialization Method Method-1 Method - 2
8 pages
Devesh
No ratings yet
Devesh
22 pages
Ejercicio I:: Int Main Int Matriz
No ratings yet
Ejercicio I:: Int Main Int Matriz
10 pages
C++ Matric Calculator Backup
No ratings yet
C++ Matric Calculator Backup
7 pages
Spa - For Companies
No ratings yet
Spa - For Companies
2 pages
Package Desire': R Topics Documented
No ratings yet
Package Desire': R Topics Documented
22 pages
C Programs Final
No ratings yet
C Programs Final
37 pages
Parallel and Distributed Computing: Ansh Goyal 17BCE1278
No ratings yet
Parallel and Distributed Computing: Ansh Goyal 17BCE1278
4 pages
Avaya 9641GS IP Deskphone: Phones & Devices
No ratings yet
Avaya 9641GS IP Deskphone: Phones & Devices
4 pages
Msme Tool Room, Indore: Bio-Data
No ratings yet
Msme Tool Room, Indore: Bio-Data
2 pages
Car Safety Comprehension
100% (1)
Car Safety Comprehension
9 pages
Illycaffe: The Starbucks Threat: Marketing Strategy
No ratings yet
Illycaffe: The Starbucks Threat: Marketing Strategy
12 pages
Employee Engagement Survey - Proposal
No ratings yet
Employee Engagement Survey - Proposal
7 pages
MX SB RO: User Manual
No ratings yet
MX SB RO: User Manual
23 pages
Job Analysis The Process and Its Uses
No ratings yet
Job Analysis The Process and Its Uses
13 pages
CP4252 Multicore Architecture and Programming Lab Manual
No ratings yet
CP4252 Multicore Architecture and Programming Lab Manual
26 pages
Computer Science Discrete Mathematics
No ratings yet
Computer Science Discrete Mathematics
11 pages
University of Toronto Faculty of Applied Science and Engineering Aps106 Midterm Ii - March 27, 2014
No ratings yet
University of Toronto Faculty of Applied Science and Engineering Aps106 Midterm Ii - March 27, 2014
5 pages
Academic Full Length Practice Test
0% (1)
Academic Full Length Practice Test
25 pages
Vasu Resume
No ratings yet
Vasu Resume
2 pages
A 32nm Fully Integrated Reconfigurable Switched-Capacitor DC-DC Converter Delivering 0.55W/mm2 at 81% Efficiency
No ratings yet
A 32nm Fully Integrated Reconfigurable Switched-Capacitor DC-DC Converter Delivering 0.55W/mm2 at 81% Efficiency
3 pages
Waste Hierarchy
No ratings yet
Waste Hierarchy
4 pages
Judiciary Handbook
No ratings yet
Judiciary Handbook
5 pages
On-Load Tap-Changers For Power Transformers: MR Publication
100% (2)
On-Load Tap-Changers For Power Transformers: MR Publication
24 pages

Problem Statement

Uploaded by

Problem Statement

Uploaded by

Problem Statement:

You might also like