Chapter 1: PARALLEL PROGRAMMING
Parallelism
Parallel, concurrent and distributed systems are types of computing systems that involve
multiple components or processes that can execute simultaneously or asynchronously.
These systems can offer advantages such as higher performance, scalability, reliability,
and fault tolerance over traditional sequential or centralized systems.
Parallel systems are systems that use multiple processors or cores to execute multiple
tasks or subtasks of a single problem at the same time. Parallel systems can be classified
according to Flynn's taxonomy of computer architecture [1], which is based on the number
of instruction streams and data streams in the system. The four categories are:
SISD (Single Instruction, Single Data): A system that executes a single instruction on a
single data element at a time. This is the simplest and most common type of system,
such as a single-core CPU.
SIMD (Single Instruction, Multiple Data): A system that executes a single instruction on
multiple data elements at the same time. This type of system can exploit data
parallelism, which is when the same operation can be applied to different parts of the
data. An example of this type of system is a GPU, which can perform graphics operations
on many pixels or vertices in parallel.
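As a minimal sketch of data parallelism in C (an illustration, not tied to any particular GPU), the following loop applies the same addition to every element of two arrays; a vectorizing compiler such as gcc with -O3 can typically map it to SIMD instructions that process several elements per instruction.

    #include <stdio.h>

    #define N 1024

    int main(void) {
        float a[N], b[N], c[N];

        /* Initialize the input arrays. */
        for (int i = 0; i < N; i++) {
            a[i] = (float)i;
            b[i] = (float)(N - i);
        }

        /* The same operation is applied to every element, so a
           vectorizing compiler can execute several iterations at
           once using SIMD instructions. */
        for (int i = 0; i < N; i++)
            c[i] = a[i] + b[i];

        printf("c[0] = %.1f, c[N-1] = %.1f\n", c[0], c[N - 1]);
        return 0;
    }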
MISD (Multiple Instruction, Single Data): A system that executes multiple instructions on
a single data element at a time. This type of system is rare and mostly theoretical, as few
problems benefit from it. One often-cited example is a fault-tolerant system that uses
multiple processors to perform the same computation on the same data and compare
the results for consistency.
MIMD (Multiple Instruction, Multiple Data): A system that executes multiple instructions
on multiple data elements at a time. This type of system can exploit both data
parallelism and task parallelism, which is when different operations can be applied to
different parts of the data or different subproblems. An example of this type of system is
a multiprocessor or a multicore CPU, which can run multiple threads or processes in
parallel.
Concurrent systems are systems that use multiple processes or threads to execute
multiple tasks or subtasks of a single or multiple problems in an interleaved or
overlapping manner. Concurrent systems may or may not be parallel, depending on
whether the processes or threads can run simultaneously on multiple processors or
cores, or whether they have to share a single processor or core and switch between
them. Concurrent systems can be classified according to the parallel programming
models [2], which are abstractions of parallel hardware and software that define how
parallel processes communicate and synchronize. The common models are:
Shared memory model: A model that assumes that all processes share a common
address space and can access the same variables or data structures. This model provides
a unified and convenient way of communication and data sharing, but also poses
challenges such as memory consistency, cache coherence, synchronization, and
scalability. An example of this model is a shared memory multiprocessor or a multicore
CPU, which can use locks, semaphores, monitors, or atomic operations to coordinate the
access to shared data.
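As a minimal sketch of this model (assuming a POSIX system with pthreads), several threads increment a counter in the shared address space, and a mutex lock serializes the updates so none are lost.

    #include <pthread.h>
    #include <stdio.h>

    #define NUM_THREADS 4
    #define INCREMENTS 100000

    long counter = 0;  /* shared data in the common address space */
    pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    void *worker(void *arg) {
        (void)arg;
        for (int i = 0; i < INCREMENTS; i++) {
            pthread_mutex_lock(&lock);   /* coordinate access to shared data */
            counter++;
            pthread_mutex_unlock(&lock);
        }
        return NULL;
    }

    int main(void) {
        pthread_t threads[NUM_THREADS];
        for (int i = 0; i < NUM_THREADS; i++)
            pthread_create(&threads[i], NULL, worker, NULL);
        for (int i = 0; i < NUM_THREADS; i++)
            pthread_join(threads[i], NULL);
        /* Without the lock, updates could be lost and the final
           value would likely fall short of the expected total. */
        printf("counter = %ld (expected %d)\n", counter,
               NUM_THREADS * INCREMENTS);
        return 0;
    }

Compile with, for example, gcc -pthread; removing the lock calls demonstrates the synchronization challenge mentioned above.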
Message passing model: A model that assumes that each process has its own address
space and can only communicate with other processes by sending and receiving
messages. This model provides a scalable and fault-tolerant way of communication and
data sharing, but also poses challenges such as latency, bandwidth, load balancing, and
synchronization. An example of this model is a distributed memory multiprocessor or a
cluster, which can use MPI, PVM, or sockets to exchange messages between processes.
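As a minimal sketch of this model using MPI (assuming an MPI implementation is installed; compile with mpicc and launch with mpirun using at least two processes), each process has its own address space, and rank 1 sends an integer to rank 0.

    #include <mpi.h>
    #include <stdio.h>

    int main(int argc, char **argv) {
        int rank, size, value;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);

        if (size < 2) {
            if (rank == 0) printf("Run with at least 2 processes.\n");
            MPI_Finalize();
            return 0;
        }

        if (rank == 1) {
            value = 42;  /* this variable exists only in this process */
            MPI_Send(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);
        } else if (rank == 0) {
            MPI_Recv(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            printf("rank 0 received %d from rank 1\n", value);
        }

        MPI_Finalize();
        return 0;
    }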
Threads model: A model that assumes that each process can create multiple threads
that share the same address space and resources of the process, but can execute
independently and concurrently. This model provides a way of exploiting both
concurrency and parallelism within a single process, but also poses challenges such as
thread management, synchronization, and deadlock. An example of this model is a
multithreaded program that can run on a single or multiple processors or cores, which
can use pthreads, Java threads, or OpenMP to create and manage threads.
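As a minimal pthreads sketch of this model, the main thread creates workers that share the process's address space; each sums a disjoint slice of a shared array, so no locking is needed, and the main thread joins them and combines the partial results.

    #include <pthread.h>
    #include <stdio.h>

    #define N 1000
    #define NUM_THREADS 4

    int data[N];                /* shared by all threads of the process */
    long partial[NUM_THREADS];  /* one slot per thread: no lock needed */

    void *sum_slice(void *arg) {
        int id = *(int *)arg;
        int chunk = N / NUM_THREADS;
        long sum = 0;
        for (int i = id * chunk; i < (id + 1) * chunk; i++)
            sum += data[i];
        partial[id] = sum;
        return NULL;
    }

    int main(void) {
        pthread_t threads[NUM_THREADS];
        int ids[NUM_THREADS];
        for (int i = 0; i < N; i++)
            data[i] = 1;

        for (int i = 0; i < NUM_THREADS; i++) {
            ids[i] = i;
            pthread_create(&threads[i], NULL, sum_slice, &ids[i]);
        }

        long total = 0;
        for (int i = 0; i < NUM_THREADS; i++) {
            pthread_join(threads[i], NULL);
            total += partial[i];
        }
        printf("total = %ld\n", total);  /* prints 1000 */
        return 0;
    }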
Data parallel model: A model that assumes that the same computation can be applied
to different parts of a large data set in parallel. This model provides a way of exploiting
data parallelism without explicitly managing the communication and synchronization
between processes, but also poses challenges such as data distribution, load balancing,
and scalability. An example of this model is a data parallel program that can run on a
SIMD array processor or a vector processor, which can use CUDA, OpenCL, or Fortran 90
to express data parallel operations.
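The examples above name CUDA, OpenCL, and Fortran 90; as a minimal stand-in sketch in C, OpenMP's parallel-for directive expresses the same idea: the same computation is applied to different parts of the data set, and the runtime distributes the iterations across threads without explicit communication code (compile with, for example, gcc -fopenmp).

    #include <stdio.h>

    #define N 1000000

    static double x[N];

    int main(void) {
        for (int i = 0; i < N; i++)
            x[i] = (double)i;

        /* The same operation is applied to every element; the OpenMP
           runtime distributes the iterations across threads. */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            x[i] = 2.0 * x[i];

        printf("x[N-1] = %.1f\n", x[N - 1]);
        return 0;
    }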
The need for parallelism arises from the increasing demand for higher performance,
scalability, reliability, and fault tolerance in computing systems. Parallelism can improve
these aspects by exploiting the inherent parallelism in a problem or application, by using
multiple processors or cores to execute tasks or subtasks simultaneously or in an
interleaved manner, and by using multiple nodes or computers that communicate and
coordinate to achieve a common goal. However, parallelism also introduces new
challenges and complexities in the design, development, analysis, and evaluation of
parallel, concurrent, and distributed systems, which require appropriate hardware,
software, and tools to support them.
Array Processors
An array processor is a parallel system that consists of multiple processing elements
operating on arrays of data in parallel. There are two types: the SIMD array processor
and the vector processor. Examples include GPUs, DSPs, and the Cray-1.
Parallel Programming Models
A parallel programming model is an abstraction of parallel hardware and software that
defines how parallel processes communicate and synchronize. It covers two aspects:
process interaction and problem decomposition. Common models include the shared
memory model, the message passing model, the threads model, and the data parallel
model.
Processes and Threads
Processes are independent units of execution that have their own address space and
resources, while threads are lightweight units of execution that share the address space
and resources of a process; the sketch below illustrates the difference. Using processes
and threads brings benefits such as concurrency, parallelism, modularity, and
responsiveness.
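As a minimal POSIX sketch of the address-space difference (illustrative only), a child created with fork() gets its own copy of a variable, so its update is invisible to the parent, while a thread created in the same process shares the variable and its update is visible.

    #include <pthread.h>
    #include <stdio.h>
    #include <sys/types.h>
    #include <sys/wait.h>
    #include <unistd.h>

    int shared = 0;

    void *thread_fn(void *arg) {
        (void)arg;
        shared = 1;  /* same address space: the write is visible */
        return NULL;
    }

    int main(void) {
        pid_t pid = fork();
        if (pid == 0) {    /* child process: separate address space */
            shared = 99;   /* modifies the child's private copy only */
            _exit(0);
        }
        wait(NULL);
        printf("after fork:   shared = %d\n", shared);  /* still 0 */

        pthread_t t;
        pthread_create(&t, NULL, thread_fn, NULL);
        pthread_join(t, NULL);
        printf("after thread: shared = %d\n", shared);  /* now 1 */
        return 0;
    }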
Amdahl's Law
Amdahl's law relates the speedup of a parallel program to the fraction of the program
that can be parallelized:
Speedup = 1 / ((1 - p) + p / n)
where p is the fraction of the program that can be parallelized and n is the number of
processors. The implication is that the speedup is limited by the sequential part of the
program: even with unlimited processors, the speedup cannot exceed 1 / (1 - p).
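For example, with p = 0.95 and n = 8, Speedup = 1 / (0.05 + 0.95 / 8) = 1 / 0.16875 ≈ 5.93,
and no number of processors can push the speedup past 1 / (1 - p) = 20.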
Gustafson's Law
Gustafson's law relates the scaled speedup of a parallel program to the fraction of the
program that is sequential:
Speedup = n - s * (n - 1)
where s is the fraction of the program that is sequential and n is the number of
processors. The implication is that the speedup can be increased by increasing the
problem size, because the parallel portion grows with the problem while the sequential
portion stays roughly fixed.
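For example, with s = 0.05 and n = 8, Speedup = 8 - 0.05 * (8 - 1) = 7.65, much closer to
the ideal value of 8 than Amdahl's law predicts for a fixed-size problem.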