Problem Statement

Uploaded by

The Gamer Last night

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views2 pages

Problem Statement

Uploaded by

The Gamer Last night

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Assignment 1: Due on Feb 15, 2024

In this assignment, you will develop two parallel implementations of LU decomposition that use Gaussian elimination to factor a dense N x N matrix into an
upper-triangular one and a lower-triangular one. In matrix computations, pivoting involves finding the largest magnitude value in a row, column, or both and
then interchanging rows and/or columns in the matrix for the next step in the algorithm. The purpose of pivoting is to reduce round-off error, which enhances
numerical stability. In your assignment, you will use row pivoting, a form of pivoting involves interchanging rows of a trailing submatrix based on the largest
value in the current column. To perform LU decomposition with row pivoting, you will compute a permutation matrix P such that PA = LU. The permutation
matrix keeps track of row exchanges performed.

Below is pseudocode for a sequential implementation of LU decomposition with row pivoting.

inputs: a(n,n)
outputs: π(n), l(n,n), and u(n,n)

initialize π as a vector of length n

initialize u as an n x n matrix with 0s below the diagonal
initialize l as an n x n matrix with 1s on the diagonal and 0s above the diagonal
for i = 1 to n
π[i] = i
for k = 1 to n
max = 0
for i = k to n
if max < |a(i,k)|
max = |a(i,k)|
k' = i
if max == 0
error (singular matrix)
swap π[k] and π[k']
swap a(k,:) and a(k',:)
swap l(k,1:k-1) and l(k',1:k-1)
u(k,k) = a(k,k)
for i = k+1 to n
l(i,k) = a(i,k)/u(k,k)
u(k,i) = a(k,i)
for i = k+1 to n
for j = k+1 to n
a(i,j) = a(i,j) - l(i,k)*u(k,j)

Here, the vector π is a compact representation of a permutation matrix p(n,n),

which is very sparse. For the ith row of p, π(i) stores the column index of
the sole position that contains a 1.
You will write two shared-memory parallel programs that perform LU decomposition using row pivoting. You will develop one solution using the Pthreads
programming model and one using OpenMP.

Each LU decomposition implementation should accept two arguments: n - the size of a matrix, followed by t - the number of threads. Your programs will
allocate an n x n matrix a of double precision (64-bit) floating point variables. You should initialize the matrix with uniform random numbers computed using a
suitable random number generator, such as drand48, drand48_r, or the C++11 facilities for pseudo-random number generation. (Note: if you are generating
random numbers in parallel, you will need to use a reentrant random number generator and seed the random number generator for each thread differently.) Apply
LU decomposition with partial pivoting to factor the matrix into an upper-triangular one and a lower-triangular one.

To check your answer, compute the sum of Euclidean norms of the columns of the residual matrix (this sum is known as the L2,1 norm) computed as PA-LU.
Print the value of the L2,1 norm of the residual. (It should be very small.)

The verification step need not be parallelized. Have your program time the LU decomposition phase by reading the real-time clock before and after and printing
the difference.

The formal components of the assignment are listed below:

Write a shared-memory parallel program that uses OpenMP to perform LU decomposition with partial pivoting.

Write a shared-memory parallel program that uses Pthreads to perform LU decomposition with partial pivoting.

Write a document that describes how your programs work. This document should not include your programs, though it may include figures containing
pseudo-code that sketch the key elements of your parallelization strategy for each implementation. Explain how your program partitions the data, work
and exploits parallelism. Justify your implementation choices. Explain how the parallel work is synchronized.

Use problem size n = 8000 to evaluate the performance of your implementations. If your sequential running time is too long for the interactive queue, you
may base your timing measurements on n=7000. Prepare a table that includes your timing measurements for the LU decomposition phase of your
implementations on 1, 2, 4, 8, and 16 threads. Plot graphs of the parallel efficiency of your program executions. Plot a point for each of the executions.
The x axis should show the number of processors. The Y axis should show your measured parallel efficiency for the execution. Construct your plot so that
the X axis of the graph intersects the Y axis at Y=0.

In this assignment, reading and writing shared data will account for much of the execution cost. Accordingly, you should pay attention to how you lay out the
data and how your parallelizations interact with your data layout. You should consider whether you want to use a contiguous layout for the array, or whether you
want to represent the array as a vector, of n pointers to n-element data vectors. You should explicitly consider how false sharing might arise and take appropriate
steps to minimize its impact on performance.

Co 2
No ratings yet
Co 2
22 pages
Mth643 Quize
No ratings yet
Mth643 Quize
15 pages
CS021 - Assessment 10 2213686117142407
100% (1)
CS021 - Assessment 10 2213686117142407
3 pages
Assignment 1 ME502
0% (1)
Assignment 1 ME502
4 pages
Bisection Method
100% (1)
Bisection Method
4 pages
Block Lu Factorization
No ratings yet
Block Lu Factorization
22 pages
RG2 ParallelizationPrinciples HPCAI Jan2020
No ratings yet
RG2 ParallelizationPrinciples HPCAI Jan2020
40 pages
Endsem With Sol PDF
No ratings yet
Endsem With Sol PDF
16 pages
COMP422 - Assignment2 Report
No ratings yet
COMP422 - Assignment2 Report
5 pages
Efficient Parallel Algorithm For
No ratings yet
Efficient Parallel Algorithm For
12 pages
Comp 372 Assignment 3
No ratings yet
Comp 372 Assignment 3
11 pages
2011 Quiz 4 Sol
No ratings yet
2011 Quiz 4 Sol
17 pages
24 FEM Lecture 8 On 7th Oct 2019 (79) Rayleigh Ritz Method
No ratings yet
24 FEM Lecture 8 On 7th Oct 2019 (79) Rayleigh Ritz Method
79 pages
SIT315 M2 - S2P TaskSheet
No ratings yet
SIT315 M2 - S2P TaskSheet
1 page
Exercise 9
No ratings yet
Exercise 9
5 pages
Optimizer Methods HYSYS PDF
No ratings yet
Optimizer Methods HYSYS PDF
9 pages
8 Week Report
No ratings yet
8 Week Report
23 pages
Project 3
No ratings yet
Project 3
5 pages
EE 242 Numerical Methods For Electrical Engineering Project 1: Gaussian Elimination With Partial Pivoting
No ratings yet
EE 242 Numerical Methods For Electrical Engineering Project 1: Gaussian Elimination With Partial Pivoting
3 pages
Parallel Algorithm Merged
No ratings yet
Parallel Algorithm Merged
76 pages
Assignment No. 2 PDC 21L-1786
No ratings yet
Assignment No. 2 PDC 21L-1786
6 pages
Excelente
No ratings yet
Excelente
64 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
CS124PSET1
No ratings yet
CS124PSET1
12 pages
DS226 Assgn 02
No ratings yet
DS226 Assgn 02
3 pages
Assignment Definition - Matrix Maths C Programming
No ratings yet
Assignment Definition - Matrix Maths C Programming
4 pages
410A Week 5
No ratings yet
410A Week 5
23 pages
LAB03 Report
No ratings yet
LAB03 Report
8 pages
Tp2 - Openmp (Introduction) : Imad Kissami
No ratings yet
Tp2 - Openmp (Introduction) : Imad Kissami
4 pages
Parallel Distributed Computing Assignment 2
No ratings yet
Parallel Distributed Computing Assignment 2
2 pages
Par - 1 In-Term Exam - Course 2018/19-Q2
No ratings yet
Par - 1 In-Term Exam - Course 2018/19-Q2
9 pages
Assignement 2
No ratings yet
Assignement 2
3 pages
a9e53af944a8707be5d2cf40d808e480707a1a99344781e96aa63d95233ac249
No ratings yet
a9e53af944a8707be5d2cf40d808e480707a1a99344781e96aa63d95233ac249
12 pages
Final Quiz 2 3
No ratings yet
Final Quiz 2 3
4 pages
Assignment of Algorithm
No ratings yet
Assignment of Algorithm
9 pages
Assignment 3 - COMP2129
No ratings yet
Assignment 3 - COMP2129
4 pages
Simulating Ocean Currents
No ratings yet
Simulating Ocean Currents
35 pages
Example Euler Method
No ratings yet
Example Euler Method
11 pages
Problem of Parallelization of Matrix Multiplication (C Project)
No ratings yet
Problem of Parallelization of Matrix Multiplication (C Project)
3 pages
A3
No ratings yet
A3
1 page
Par - 1 In-Term Exam - Course 2017/18-Q2
No ratings yet
Par - 1 In-Term Exam - Course 2017/18-Q2
7 pages
Selina Concise Mathematics Class 7 ICSE Solutions For Chapter 11 - Fundamental Concepts
No ratings yet
Selina Concise Mathematics Class 7 ICSE Solutions For Chapter 11 - Fundamental Concepts
146 pages
CS4961: Parallel Programming Midterm Exam October 20, 2011
No ratings yet
CS4961: Parallel Programming Midterm Exam October 20, 2011
4 pages
Tiny Project 1
No ratings yet
Tiny Project 1
2 pages
Practice Questions
No ratings yet
Practice Questions
3 pages
Ass Parallel
No ratings yet
Ass Parallel
11 pages
Parallel and Distributed Computing Lab Digital Assignment - 3
No ratings yet
Parallel and Distributed Computing Lab Digital Assignment - 3
10 pages
Practical-8 AIM:-To Perform LU Decomposition On A Given Matrix in MATLAB
No ratings yet
Practical-8 AIM:-To Perform LU Decomposition On A Given Matrix in MATLAB
14 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Mathematical Lab PDF
No ratings yet
Mathematical Lab PDF
29 pages
Sheet #6 Ensemble + Neural Nets + Linear Regression + Backpropagation + CNN
No ratings yet
Sheet #6 Ensemble + Neural Nets + Linear Regression + Backpropagation + CNN
4 pages
HPC Programs
No ratings yet
HPC Programs
19 pages
Question 1 - Serial: Output
No ratings yet
Question 1 - Serial: Output
9 pages
Essay Algebra 2023
No ratings yet
Essay Algebra 2023
3 pages
Vertical and Horizontal Assymptote of Rational Functions
No ratings yet
Vertical and Horizontal Assymptote of Rational Functions
13 pages
(Serial)
No ratings yet
(Serial)
8 pages
Parallel Random Access Machine (PRAM) : Control
No ratings yet
Parallel Random Access Machine (PRAM) : Control
9 pages
Polynomials Ix 23-24 - QP
No ratings yet
Polynomials Ix 23-24 - QP
6 pages
COL333/671: Introduction To AI
No ratings yet
COL333/671: Introduction To AI
37 pages
L03 Problem Solving As Search I
No ratings yet
L03 Problem Solving As Search I
66 pages
2022 Mid 1
No ratings yet
2022 Mid 1
4 pages
FEM Introduction: Solving ODE - BVP Using The Least Squares Method
No ratings yet
FEM Introduction: Solving ODE - BVP Using The Least Squares Method
12 pages
H2 MBT Revision Package Complex Numbers Solutions
No ratings yet
H2 MBT Revision Package Complex Numbers Solutions
3 pages
COL334 Assignment 2 Final
No ratings yet
COL334 Assignment 2 Final
5 pages
Solution For The Laplace Equation
No ratings yet
Solution For The Laplace Equation
10 pages
SCILAB Solver NMOP PDF
No ratings yet
SCILAB Solver NMOP PDF
11 pages
Bicubic Interpolation Wiki PDF
No ratings yet
Bicubic Interpolation Wiki PDF
4 pages
Study Material: Free Master Class Series
No ratings yet
Study Material: Free Master Class Series
43 pages
Soft Computing Perceptron Neural Network in MATLAB
No ratings yet
Soft Computing Perceptron Neural Network in MATLAB
8 pages
Artificial Neural Network in Matlab: Hany Ferdinando
No ratings yet
Artificial Neural Network in Matlab: Hany Ferdinando
13 pages
AP PreCalc - Tutorial 1 - Teacher
No ratings yet
AP PreCalc - Tutorial 1 - Teacher
11 pages
Dr. Meenakshi Sood Associate Professor, NITTTR Chandigarh: Meenkashi@nitttrchd - Ac.in
No ratings yet
Dr. Meenakshi Sood Associate Professor, NITTTR Chandigarh: Meenkashi@nitttrchd - Ac.in
39 pages
MBA 19 PAT 302 DS Unit 1.3.3 VAM
No ratings yet
MBA 19 PAT 302 DS Unit 1.3.3 VAM
25 pages
Euler Method
No ratings yet
Euler Method
36 pages
COL333/671: Introduction To AI
No ratings yet
COL333/671: Introduction To AI
17 pages
Cnvtol: Nonlinear Options
No ratings yet
Cnvtol: Nonlinear Options
3 pages
Digital Signal Processing Mid Term Examination Sub Code:-Etec 306 Time: 1Hr Max Marks:30
No ratings yet
Digital Signal Processing Mid Term Examination Sub Code:-Etec 306 Time: 1Hr Max Marks:30
5 pages
Groebner Bases
No ratings yet
Groebner Bases
9 pages
Thank You
No ratings yet
Thank You
1 page
Numerical Methods and Differential Equations: Code: EE3T1
No ratings yet
Numerical Methods and Differential Equations: Code: EE3T1
3 pages
Soft COmputing Dsadsadasdsad
No ratings yet
Soft COmputing Dsadsadasdsad
3 pages
Programming Fundamentals
No ratings yet
Programming Fundamentals
9 pages
3-SAT Notes
No ratings yet
3-SAT Notes
7 pages
Single Image Super-Resolution Using Deep Learning
No ratings yet
Single Image Super-Resolution Using Deep Learning
1 page
Worked Examples in Mechanical Vibrations using MATLAB
From Everand
Worked Examples in Mechanical Vibrations using MATLAB
Eric Okoth Ogur
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Trifocal Tensor: Exploring Depth, Motion, and Structure in Computer Vision
From Everand
Trifocal Tensor: Exploring Depth, Motion, and Structure in Computer Vision
Fouad Sabry
No ratings yet
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet
Direct Linear Transformation: Practical Applications and Techniques in Computer Vision
From Everand
Direct Linear Transformation: Practical Applications and Techniques in Computer Vision
Fouad Sabry
No ratings yet
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
From Everand
Line Drawing Algorithm: Mastering Techniques for Precision Image Rendering
Fouad Sabry
No ratings yet
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
From Everand
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
Fouad Sabry
No ratings yet
Exercises of Numerical Analysis
From Everand
Exercises of Numerical Analysis
Simone Malacrida
No ratings yet
Introduction to Numerical Analysis
From Everand
Introduction to Numerical Analysis
Simone Malacrida
No ratings yet
Ordered Weighted Averaging Aggregation Operator: Fundamentals and Applications
From Everand
Ordered Weighted Averaging Aggregation Operator: Fundamentals and Applications
Fouad Sabry
No ratings yet
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet

Problem Statement

Uploaded by

Problem Statement

Uploaded by

Assignment 1: Due on Feb 15, 2024

Below is pseudocode for a sequential implementation of LU decomposition with row pivoting.

initialize π as a vector of length n

Here, the vector π is a compact representation of a permutation matrix p(n,n),

The formal components of the assignment are listed below:

You might also like