ICS 2410 Advanced Topics in Computer Science
Chapter 5: Parallel and Concurrent Systems
Some Definitions
Concurrent - Events or processes which seem to occur or progress at the same time.
Parallel - Events or processes which actually occur or progress at the same time.
Parallel programming (also, unfortunately, sometimes called concurrent programming) is a computer programming technique that provides for the parallel execution of operations, either within a single parallel computer or across a number of systems.
In the latter case, the term distributed computing is used.
Flynn’s Taxonomy
Best known classification scheme for parallel computers.
Depends on the parallelism a computer exhibits in its
Instruction stream
Data stream
A sequence of instructions (the instruction stream) manipulates a sequence of operands (the data stream).
The instruction stream (I) and the data stream (D) can each be either single (S) or multiple (M).
Four combinations: SISD, SIMD, MISD, MIMD
SISD
Single Instruction, Single Data
Single-CPU systems, i.e., uniprocessors
Note: co-processors don’t count as additional processors
Concurrent processing allowed
Instruction prefetching
Pipelined execution of instructions
Concurrent execution allowed
That is, independent concurrent tasks can execute different sequences of operations.
Most Important Example: a PC
SIMD
Single instruction, multiple data
One instruction stream is broadcast to all processors.
Each processor, also called a processing element (or PE), is usually simplistic and logically is essentially an ALU.
PEs do not store a copy of the program nor have a program control unit.
Individual processors can remain idle during execution of segments of the program (based on a data test).
SIMD (cont.)
All active processors execute the same instruction synchronously, but on different data.
Technically, on a memory access, all active processors must access the same location in their local memory.
The data items form an array (or vector), and an instruction can act on the complete array in one cycle.
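To make the lock-step model concrete, here is a minimal sketch using x86 SSE intrinsics (an illustrative choice; the slides do not name a particular instruction set). One add instruction operates on four data elements at once, and a mask computed from a data test selects which lanes keep the new result, mimicking PEs that sit idle.

/* SIMD sketch with SSE intrinsics: one instruction, four data lanes.
   Compile with: gcc -msse4.1 simd_sketch.c                           */
#include <immintrin.h>
#include <stdio.h>

int main(void)
{
    /* Four data elements processed by one instruction (one lane per "PE"). */
    __m128 a   = _mm_set_ps(4.0f, 3.0f, 2.0f, 1.0f);
    __m128 b   = _mm_set1_ps(10.0f);
    __m128 sum = _mm_add_ps(a, b);                 /* same add, all lanes at once */

    /* Data test: only lanes where a > 2 are "active"; the others keep
       their old value, like de-activated PEs.                          */
    __m128 mask   = _mm_cmpgt_ps(a, _mm_set1_ps(2.0f));
    __m128 result = _mm_blendv_ps(a, sum, mask);   /* requires SSE4.1 */

    float out[4];
    _mm_storeu_ps(out, result);
    printf("%g %g %g %g\n", out[0], out[1], out[2], out[3]);   /* 1 2 13 14 */
    return 0;
}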
How to View a SIMD Machine
Think of soldiers all in a unit.
The commander selects certain soldiers as active – for example, the first row.
The commander barks out an order to all the active soldiers, who execute the order synchronously.
The remaining soldiers do not execute orders until they are re-activated.
MIMD
Multiple instruction, multiple data
Processors are asynchronous, since they can independently execute different programs on different data sets.
Communications are handled either through shared memory (multiprocessors) or by use of message passing (multicomputers).
MIMDs have been considered by most researchers to include the most powerful and least restricted computers.
MIMD (cont. 2/4)
Have very major communication costs when compared to SIMDs
Internal ‘housekeeping activities’ are often overlooked
Maintaining distributed memory & distributed databases
Synchronization or scheduling of tasks
Load balancing between processors
One method for programming MIMDs is for all processors to execute the same program.
Execution of tasks by processors is still asynchronous
Called the SPMD method (single program, multiple data)
Usual method when the number of processors is large.
Considered to be a “data parallel programming” style for MIMDs.
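As a sketch of the SPMD style (assuming MPI is available; the index split used here is just an illustration), every process below executes the same program, but each rank works on its own share of the data:

/* SPMD sketch: every process runs this same program on different data.
   Compile with mpicc, run with: mpirun -np 4 ./spmd                    */
#include <mpi.h>
#include <stdio.h>

#define N 1000

int main(int argc, char *argv[])
{
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    /* Same code, different data: each rank sums its own subset of indices. */
    long local = 0, total = 0;
    for (int i = rank; i < N; i += size)
        local += i;

    /* Combine the partial results on rank 0. */
    MPI_Reduce(&local, &total, 1, MPI_LONG, MPI_SUM, 0, MPI_COMM_WORLD);
    if (rank == 0)
        printf("sum of 0..%d = %ld\n", N - 1, total);

    MPI_Finalize();
    return 0;
}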
MIMD (cont. 3/4)
A more common technique for programming MIMDs is to use multi-tasking:
The problem solution is broken up into various tasks.
Tasks are distributed among processors initially.
If new tasks are produced during execution, these may be handled by the parent processor or distributed.
Each processor can execute its collection of tasks concurrently.
If some of its tasks must wait for results from other tasks or new data, the processor will focus on the remaining tasks.
Larger programs usually run a load balancing algorithm in the background that re-distributes the tasks assigned to the processors during execution
Either dynamic load balancing or called at specific times
Dynamic scheduling algorithms may be needed to assign a higher execution priority to time-critical tasks
E.g., on critical path, more important, earlier deadline, etc.
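A small sketch of this multi-tasking style using OpenMP tasks (one possible runtime; the slides do not prescribe an implementation): the tasks are created once, and the runtime’s scheduler distributes them among the worker threads, a simple form of the load balancing described above.

/* Multi-tasking sketch with OpenMP tasks.  Compile with: gcc -fopenmp tasks.c */
#include <omp.h>
#include <stdio.h>

static void do_task(int id)
{
    printf("task %d handled by thread %d\n", id, omp_get_thread_num());
}

int main(void)
{
    #pragma omp parallel          /* team of worker threads ("processors") */
    #pragma omp single            /* one thread creates the pool of tasks  */
    {
        for (int id = 0; id < 8; id++) {
            #pragma omp task firstprivate(id)
            do_task(id);          /* tasks may run concurrently on any thread */
        }
    }                             /* all tasks complete at this barrier */
    return 0;
}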
Multiprocessors (Shared Memory MIMDs)
All processors have access to all memory locations.
Two types: UMA and NUMA
UMA (uniform memory access)
Frequently called symmetric multiprocessors or SMPs
Similar to a uniprocessor, except additional, identical CPUs are added to the bus.
Each processor has equal access to memory and can do anything that any other processor can do.
SMPs have been and remain very popular.
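A minimal shared-memory sketch (using POSIX threads as a stand-in for the CPUs of an SMP; this example is not from the slides): every thread can read and write the same memory location, so access is guarded by a lock.

/* Shared-memory sketch: all threads access the same location.
   Compile with: gcc -pthread shared.c                          */
#include <pthread.h>
#include <stdio.h>

static long counter = 0;                          /* one location, visible to all */
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *worker(void *arg)
{
    (void)arg;
    for (int i = 0; i < 100000; i++) {
        pthread_mutex_lock(&lock);                /* guard the shared location */
        counter++;
        pthread_mutex_unlock(&lock);
    }
    return NULL;
}

int main(void)
{
    pthread_t t[4];
    for (int i = 0; i < 4; i++) pthread_create(&t[i], NULL, worker, NULL);
    for (int i = 0; i < 4; i++) pthread_join(t[i], NULL);
    printf("counter = %ld\n", counter);           /* 400000 */
    return 0;
}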
Multiprocessors (cont.)
NUMA (non-uniform memory access)
Has a distributed memory system.
Each memory location has the same address for all processors.
Access time to a given memory location varies considerably for different CPUs.
Normally, fast cache is used with NUMA systems to reduce the problem of different memory access times for PEs.
This creates the problem of ensuring that all copies of the same data in different memory locations are identical.
Multicomputers (Message-Passing MIMDs)
Processors are connected by a network
An interconnection network is one possibility
Also, may be connected by Ethernet links or a bus.
Each processor has a local memory and can only access its own local memory.
Data is passed between processors using messages, when specified by the program.
Message passing between processors is controlled by a message-passing library (typically MPI)
The problem is divided into processes or tasks that can be executed concurrently on individual processors. Each processor is normally assigned multiple processes.
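A minimal message-passing sketch in MPI: each process owns only its local memory, and data moves between processes only through explicit send and receive calls (this two-rank exchange is illustrative, not taken from the slides).

/* Message-passing sketch: data is copied between local memories via messages.
   Compile with mpicc, run with: mpirun -np 2 ./msg                            */
#include <mpi.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    int rank;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {                  /* sender: the value lives in rank 0's memory */
        double x = 3.14;
        MPI_Send(&x, 1, MPI_DOUBLE, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {           /* receiver: gets its own copy of the value */
        double x;
        MPI_Recv(&x, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        printf("rank 1 received %g\n", x);
    }

    MPI_Finalize();
    return 0;
}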
Multiprocessors vs Multicomputers
Programming disadvantages of message-passing
Programmers must make explicit message-passing calls in the code
This is low-level programming and is error prone.
Data is not shared between processors but copied, which increases the total data size.
Data integrity problem: difficulty of maintaining the correctness of multiple copies of a data item.
Multiprocessors vs Multicomputers (cont.)
Programming advantages of message-passing
No problem with simultaneous access to data.
Allows different PCs to operate on the same data independently.
Allows PCs on a network to be easily upgraded when faster processors become available.
Mixed “distributed shared memory” systems exist
Lots of current interest in clusters of SMPs.
Easier to build systems with a very large number of processors.
Seeking Concurrency
Several Different Ways Exist
Data parallelism
Task parallelism
Sometimes called control parallelism or functional parallelism.
Pipelining
Data Parallelism
All tasks (or processors) apply the same set of operations to different data.
Example:
  for i ← 0 to 99 do
    a[i] ← b[i] + c[i]
  endfor
Operations may be executed concurrently
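Written in C with an OpenMP parallel-for (one common way to express this loop; the slides give only pseudocode), the example becomes:

/* Data-parallel loop: each thread applies the same operation to a
   different part of the arrays.  Compile with: gcc -fopenmp vadd.c */
void vector_add(double a[100], const double b[100], const double c[100])
{
    #pragma omp parallel for
    for (int i = 0; i < 100; i++)
        a[i] = b[i] + c[i];          /* same operation, different data */
}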
Data Parallelism Features
Each processor performs the same computation on different data sets
Computations can be performed either synchronously or asynchronously
Defn: Grain size is the average number of computations performed between communication or synchronization steps
Task/Functional/Control/Job Parallelism
Independent tasks apply different operations to different data elements
  a ← 2
  b ← 3
  m ← (a + b) / 2
  s ← (a² + b²) / 2
  v ← s - m²
First and second statements may execute concurrently
Third and fourth statements may execute concurrently
Normally, this type of parallelism deals with concurrent execution of tasks, not statements
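One way to render the statement sequence above in C is with OpenMP sections (an illustrative choice; the slides do not specify a mechanism): statements with no mutual dependence are placed in parallel sections.

/* Task-parallel sketch: independent statements run in parallel sections.
   Compile with: gcc -fopenmp sections.c                                  */
#include <stdio.h>

int main(void)
{
    double a, b, m, s, v;

    #pragma omp parallel sections
    {
        #pragma omp section
        a = 2;                        /* first statement  */
        #pragma omp section
        b = 3;                        /* second statement */
    }

    #pragma omp parallel sections
    {
        #pragma omp section
        m = (a + b) / 2;              /* third statement  */
        #pragma omp section
        s = (a * a + b * b) / 2;      /* fourth statement */
    }

    v = s - m * m;                    /* depends on both, so it runs last */
    printf("v = %g\n", v);
    return 0;
}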
Control Parallelism Features
Problem is divided into different non-identical tasks
Tasks are divided between the processors so that their workload is roughly balanced
Parallelism at the task level is considered to be coarse grained parallelism
Pipelining
Divide a process into stages
Produce several items simultaneously
Compute Partial Sums
Consider the for loop:
  p[0] ← a[0]
  for i ← 1 to 3 do
    p[i] ← p[i-1] + a[i]
  endfor
This computes the partial sums:
  p[0] = a[0]
  p[1] = a[0] + a[1]
  p[2] = a[0] + a[1] + a[2]
  p[3] = a[0] + a[1] + a[2] + a[3]
The loop is not data parallel as there are dependencies.
However, we can stage the calculations in order to achieve some parallelism.
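For example, the partial sums can be computed in a logarithmic number of stages in which every addition within a stage is independent (a Hillis-Steele style scan; this particular staging is an assumption, since the slides do not spell out the method):

/* Staged partial sums: within each stage, the additions are independent
   and could run in parallel (e.g. under an OpenMP parallel-for).        */
#include <stdio.h>
#include <string.h>

#define N 4

int main(void)
{
    double a[N] = {1, 2, 3, 4};
    double p[N], tmp[N];
    memcpy(p, a, sizeof p);

    for (int stride = 1; stride < N; stride *= 2) {
        memcpy(tmp, p, sizeof p);
        for (int i = stride; i < N; i++)     /* independent within the stage */
            p[i] = tmp[i] + tmp[i - stride];
    }

    for (int i = 0; i < N; i++)
        printf("p[%d] = %g\n", i, p[i]);     /* 1 3 6 10 */
    return 0;
}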
SIMD Machines
An early SIMD computer designed for vector and matrix processing was the Illiac IV computer
Initial development at the University of Illinois, 1965-70
Moved to NASA Ames, completed in 1972 but not fully functional until 1976.
The MPP, DAP, the Connection Machines CM-1 and CM-2, and MasPar’s MP-1 and MP-2 are examples of SIMD computers
The CRAY-1 and the Cyber-205 use pipelined arithmetic units to support vector operations and are sometimes called pipelined SIMDs
Today’s SIMDs
SIMD functionality is sometimes embedded in sequential machines.
Others are being built as part of hybrid architectures.
Some SIMD and SIMD-like features are included in some multi/many-core processing units
Some SIMD-like architectures have been built as special-purpose machines, although some of these could be classified as general purpose.
Advantages of SIMDs
Less hardware than MIMDs as they have only one control unit.
Control units are complex.
Less memory needed than MIMD
Only one copy of the instructions needs to be stored
Allows more data to be stored in memory.
Much less time required for communication between PEs and data movement.
Advantages of SIMDs (cont.)
Single instruction stream and synchronization of PEs make SIMD applications easier to program, understand, & debug.
Similar to sequential programming
Control flow operations and scalar operations can be executed on the control unit while PEs are executing other instructions.
Less complex hardware in SIMDs since no message decoder is needed in the PEs
MIMDs need a message decoder in each PE.
SIMD Shortcoming Claims
Claim 1: SIMDs have a data-parallel orientation, but not all problems are data-parallel
Claim 2: Speed drops for conditionally executed branches
Claim 3: Don’t adapt to multiple users well.
Claim 4: Do not scale down well to “starter” systems that are affordable.
Claim 5: Requires customized VLSI for processors and expense of control units in