
PARALLEL ALGORITHM ANALYSIS
PARALLEL ALGORITHM
An algorithm is a sequence of steps that takes inputs from the user and, after some computation, produces an output. A parallel algorithm is an algorithm that can execute several instructions simultaneously on different processing devices and then combine all the individual outputs to produce the final result.

Concurrent processing is essential where a task involves processing a huge bulk of complex data. Examples include accessing large databases, aircraft testing, astronomical calculations, atomic and nuclear physics, biomedical analysis, economic planning, image processing, robotics, weather forecasting, web-based services, etc.
Parallelism is the process of executing several sets of instructions simultaneously, which reduces the total computational time. Parallelism can be implemented by using parallel computers, i.e. computers with many processors. Parallel computers require parallel algorithms, programming languages, compilers, and operating systems that support multitasking.
🡪 Sequential Algorithm − An algorithm in which consecutive steps of instructions are executed in chronological order to solve a problem.
🡪 Parallel Algorithm − The problem is divided into sub-problems, which are executed in parallel to get individual outputs. Later on, these individual outputs are combined to get the final desired output (a minimal sketch of this pattern follows below).
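
To make the divide-execute-combine pattern concrete, here is a minimal Python sketch (the chunking scheme and names are illustrative, not part of the original text): the sequential version processes the sub-problems one after another, while the parallel version farms them out to worker processes and combines the partial results.

    from concurrent.futures import ProcessPoolExecutor

    def partial_sum(chunk):
        # Solve one sub-problem independently.
        return sum(chunk)

    def sequential_sum(chunks):
        # Consecutive steps executed in chronological order.
        return sum(partial_sum(c) for c in chunks)

    def parallel_sum(chunks):
        # Sub-problems executed in parallel; the individual
        # outputs are combined to produce the final result.
        with ProcessPoolExecutor() as pool:
            return sum(pool.map(partial_sum, chunks))

    if __name__ == "__main__":
        data = list(range(100_000))
        chunks = [data[i::4] for i in range(4)]  # divide into 4 sub-problems
        assert sequential_sum(chunks) == parallel_sum(chunks) == sum(data)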
ANALYSIS OF PARALLEL ALGORITHM
Parallel algorithms are designed to improve the computation speed of a computer. For analyzing a parallel algorithm, we normally consider the following parameters:
🡪 Time complexity (execution time),
🡪 Speedup of the algorithm,
🡪 Total number of processors used, and
🡪 Total cost.
Speedup of an Algorithm
The speedup of a parallel algorithm over a corresponding sequential algorithm is the ratio of the compute time for the sequential algorithm to the compute time for the parallel algorithm. More formally, speedup is defined as the ratio of the worst-case execution time of the fastest known sequential algorithm for a particular problem to the worst-case execution time of the parallel algorithm:

Speedup = (worst-case execution time of the fastest known sequential algorithm for the problem) / (worst-case execution time of the parallel algorithm)
Equivalently, it is the ratio between the time needed for the most efficient sequential algorithm to perform a computation and the time needed to perform the same computation on a machine incorporating parallelism:

Sp = Ts / Tp

where Ts and Tp are the times needed with a single processor and with P processors, respectively.

Speedup depends strongly on the number of processors involved in the computation: in general, increasing the number of processors reduces the time to compute the parallel algorithm.
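
As a minimal sketch in Python (the timings are assumed, illustrative numbers), speedup follows directly from the definition:

    def speedup(ts, tp):
        # Sp = Ts / Tp: single-processor time over parallel time.
        return ts / tp

    # A task taking 12.0 s sequentially and 3.2 s on 4 processors
    # gives Sp = 3.75, slightly below the linear ideal of 4.
    print(speedup(12.0, 3.2))  # 3.75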
The goal of any scalable parallel system is to achieve a linear speedup, although this is not easy:
🡪 It may not be possible to parallelize all parts of an application program. Parts that are sequential take the same time to execute irrespective of the number of processors used, so the achievable speedup is limited by these sequential parts.
🡪 Overhead is caused by initiation, synchronization, and communication among processors. This overhead tends to increase as the number of processors in the system increases, and it puts an upper limit on the speedup that can be achieved. A simple model of this effect is sketched below.
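
A hedged sketch of the overhead effect, using a toy model in which per-processor overhead grows linearly with the number of processors (the model and its constants are assumptions for illustration, not measurements):

    def modeled_speedup(ts, p, overhead_per_proc=0.05):
        # Parallel time = work split p ways, plus initiation,
        # synchronization and communication overhead growing with p.
        tp = ts / p + overhead_per_proc * p
        return ts / tp

    # Speedup rises, peaks, then falls once overhead dominates.
    for p in (1, 2, 4, 8, 16, 32, 64):
        print(p, round(modeled_speedup(10.0, p), 2))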

Number of Processors Used
🡪 The number of processors used is an important factor in analyzing the efficiency of a parallel algorithm.
🡪 The cost to buy, maintain, and run the computers is part of this analysis.
🡪 The larger the number of processors an algorithm uses to solve a problem, the more costly the obtained result becomes.
🡪 On the other hand, more processors may speed up the computation.
Total Cost
🡪 The cost of solving a problem on a parallel system is defined as the product of the run time and the number of processors used (see the sketch after this list).
🡪 A cost-optimal parallel system solves a problem with a cost proportional to the execution time of the fastest known sequential algorithm on a single processor.
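
A brief sketch of the cost definition and the cost-optimality check (the run times are illustrative assumptions):

    def parallel_cost(p, tp):
        # Cost = run time x number of processors.
        return p * tp

    # Cost-optimal: cost proportional to Ts, the fastest
    # known sequential time on a single processor.
    ts = 12.0
    cost = parallel_cost(4, 3.5)   # 14.0
    print(cost, cost / ts)         # ratio ~1.17, close to cost-optimal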
Efficiency
🡪 Efficiency measures how effectively the processors in a parallel model are utilized in solving a problem.
🡪 It is defined as the ratio of speedup to the number of processors, and it measures the fraction of time for which a processor is usefully utilized (a small numeric sketch follows this list).
🡪 If processors are idle for long periods of time, or if the overhead due to inter-processor communication is high, then efficiency is low.
Ep = Sp / p = Ts / (p × Tp)
🡪 Sp is the speedup of the system with p processors.
🡪 The value of Ep ranges from 0 to 1.
🡪 A parallel system with ideal speedup has an efficiency of 1, meaning the system is utilized in the best possible way.
🡪 Another way of expressing the maximum speedup is Amdahl's law, which formulates speedup as a function of the amount of parallelism and of the computation that is inherently sequential.
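
A small numeric sketch of the efficiency formula (the times are illustrative assumptions):

    def efficiency(ts, tp, p):
        # Ep = Sp / p = Ts / (p * Tp); the value ranges from 0 to 1.
        return ts / (p * tp)

    # Ideal speedup (Tp = Ts / p) yields Ep = 1.0; idle processors
    # and communication overhead push it lower.
    print(efficiency(12.0, 3.0, 4))  # 1.0  (ideal)
    print(efficiency(12.0, 4.0, 4))  # 0.75 (overhead present)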
Scalability
The scalability of a parallel algorithm on a parallel architecture is a measure of its capacity to effectively utilize an increasing number of processors. Equivalently, the scalability of a parallel system is a measure of its capability to increase speedup in proportion to the number of processors.
Scalability, the ability to proportionally increase overall system capacity by adding hardware, is important for clusters. Software scalability is also called parallelization efficiency: the ratio between the actual speedup achieved and the ideal speedup for a given number of processors.
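
As a hedged sketch, parallelization efficiency can be computed as the ratio of achieved to ideal speedup at each processor count (the timing table is a hypothetical example):

    # Hypothetical measured parallel times for a job with Ts = 64 s.
    ts = 64.0
    measured = {1: 64.0, 2: 33.0, 4: 17.5, 8: 10.0, 16: 6.5}

    for p, tp in measured.items():
        actual = ts / tp                    # achieved speedup
        ideal = p                           # linear (ideal) speedup
        print(p, round(actual / ideal, 2))  # parallelization efficiency

Efficiency drifting downward as the processor count grows (here from 1.0 toward roughly 0.62) is the signature of limited scalability.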
Amdahl’s Law
Amdahl's Law (1967): the speedup of a program using multiple processors in parallel computing is limited by the time needed for the serial fraction of the problem.
Amdahl's Law evaluates the predicted system speedup if one component is enhanced.
The basic idea behind Amdahl's Law is to identify the portion of a program that can be parallelized and the portion
that must be executed sequentially. The speedup achievable by parallelizing a program is limited by the
non-parallelizable portion. The law is expressed by the following formula:

Speedup = 1 / (f + (1 − f) / p)

where:
● Speedup is the potential speedup of the parallelized system,
● f is the fraction of the program that must be executed sequentially (the non-parallelizable part),
● 1 − f is the fraction of the program that can be parallelized, and
● p is the number of processors.
The formula shows that as the number of processors p increases, the speedup approaches a limit of 1/f determined by the non-parallelizable fraction. In other words, even if you add more processors to a system, the overall speedup will be limited by the sequential part of the program.
Amdahl's Law highlights the importance of identifying and optimizing the critical path in a program to achieve
meaningful speedup when parallelizing applications. It also serves as a cautionary reminder that not all programs
can be parallelized effectively, and improvements in performance may be limited by inherent sequential
dependencies.
Let's consider an example to illustrate Amdahl's Law:
Suppose we have a program with two parts: Part A, which represents 30% of the total execution time
and cannot be parallelized (sequential part), and Part B, which represents 70% of the total execution
time and can be parallelized.
f = 0.3 (portion that cannot be parallelized)
1 − f = 0.7 (portion that can be parallelized)
Using Amdahl's Law formula: Speedup = 1 / (0.3 + 0.7/p)
Let's calculate the speedup for different values of p (the number of processors):
If we have only one processor (p = 1): Speedup = 1 / (0.3 + 0.7/1) = 1
This means that with one processor there is no speedup, because the sequential part still takes the same amount of time.

If we have four processors (p = 4): Speedup = 1 / (0.3 + 0.7/4) = 1 / 0.475 ≈ 2.11

In this case, the speedup is approximately 2.11, meaning that the program could run more than twice as fast with four processors compared to a single processor.
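
A minimal Python sketch that reproduces the worked example above:

    def amdahl_speedup(f, p):
        # Speedup = 1 / (f + (1 - f) / p), with f the sequential
        # fraction and p the number of processors.
        return 1.0 / (f + (1.0 - f) / p)

    print(round(amdahl_speedup(0.3, 1), 2))      # 1.0
    print(round(amdahl_speedup(0.3, 4), 2))      # 2.11
    # As p grows without bound, the speedup approaches 1/f:
    print(round(amdahl_speedup(0.3, 10**6), 2))  # ~3.33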
What is Amdahl's law used for?
Amdahl's law is a formula that is used to calculate the theoretical speedup in latency of a
system when part of the system is improved. It is used to determine the maximum performance
improvement that can be achieved by optimising a specific portion of the system.
Drawbacks
Amdahl's law has a few drawbacks. These are as follows:
● Scaling falls off as the number of processors increases. This is due to synchronization barriers (locks) and memory collisions.
● It is not easy to compute the value of f, the serial fraction. The serial part occurs not only in the code but also in the kernel and the hardware, and careful profiling is an essential part of estimating it.
● The consistency of the private data caches in multiprocessor systems must also be considered.
