Parallel and Distributed Computing
CS 3006 (BCS-7A | BDS-7A)
Lecture 3
Danyal Farhat
FAST School of Computing
NUCES Lahore
Flynn’s Classical Taxonomy
and Processor-to-Memory
Connection Strategies
Hardware Architecture Classifications
• Flynn’s Classification
Differentiates multiprocessor computers according to the dimensions of
instruction and data streams
• Feng's Classification
Based mainly on the degree of serial versus parallel processing in the
computer system
• Handler's Classification
Based on the degree of parallelism and pipelining at various levels of
the system
Flynn’s Classical Taxonomy
• The most widely used classification of parallel computers
• Differentiates multiprocessor computers according to the
dimensions of instruction and data streams
Instruction stream: sequence of instructions fetched from memory to the control unit
Data stream: sequence of data moved between memory and the processing unit
• SISD: Single Instruction stream, Single Data stream
• SIMD: Single Instruction stream, Multiple Data stream
• MISD: Multiple Instruction stream, Single Data stream
• MIMD: Multiple Instruction stream, Multiple Data stream
Processor Organizations
SISD
• A serial (non-parallel) computer
• Single instruction: only one instruction stream is acted on per cycle
• Single data: only one data stream is used as input per cycle
• Simple, deterministic execution
Example:
• Single-CPU workstations
• Most workstations from HP, IBM, and SGI are SISD
machines
SISD (Cont.)
• Performance of a processor can be measured by its MIPS rate:
MIPS rate = f × IPC
Millions of instructions per second (MIPS) is an approximate measure of a
computer's raw processing power
f: processor clock frequency (in MHz); IPC: average instructions completed per cycle
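For example (hypothetical figures): a processor clocked at 2 GHz (f = 2000 MHz) that sustains an IPC of 2 achieves 2000 × 2 = 4000 MIPS.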
How can the performance of a uniprocessor be increased?
• Multithreading
• Increasing the clock frequency
• Increasing the number of instructions completed during a
processor cycle (multiple pipelines in a superscalar
architecture and/or out-of-order execution)
SISD – Multithreading
• Run multiple threads on the same core concurrently
• Context switching is implemented in hardware
• Minimum hardware support: replicate the architectural state
Every running thread must have its own context:
Multiple register sets in the core
Multiple state registers, such as:
Program Counter (PC)
Memory Address Register (MAR)
Accumulator Register (ACC)
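Hardware multithreading itself is invisible to software, but the notion of per-thread context can be sketched with ordinary OS threads. A minimal C++ sketch (the function and variable names are illustrative, not from the slides); each thread keeps its own program counter, stack, and register values while sharing the core:

    #include <iostream>
    #include <string>
    #include <thread>

    // Each thread has its own architectural context (program counter,
    // stack, register values); the hardware/OS interleaves the threads
    // without the threads having to coordinate.
    void worker(int id) {
        long local_sum = 0;                  // lives in this thread's own context
        for (int i = 0; i < 5; ++i)
            local_sum += i * id;
        std::cout << ("thread " + std::to_string(id) +
                      " sum " + std::to_string(local_sum) + "\n");
    }

    int main() {
        std::thread t1(worker, 1);           // two threads, two contexts
        std::thread t2(worker, 2);
        t1.join();
        t2.join();
    }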
SISD – Multithreading (Cont.)
Implicit Multithreading
• Concurrent execution of multiple threads extracted from a
single sequential program
• Managed entirely by the processor hardware
• Improves the performance of an individual application
Explicit Multithreading
• Concurrent execution of instructions from different explicit
threads, either by interleaving instructions from different
threads or by parallel execution on parallel pipelines
SISD-Explicit Multithreading
• Four approaches to explicit multithreading:
Interleaved multithreading (fine-grained): switching can occur at each
clock cycle; with only a few active threads, performance degrades
Blocked multithreading (coarse-grained): events such as a cache miss
trigger a switch
Simultaneous multithreading (SMT): the execution units of a superscalar
processor receive instructions from multiple threads
Chip multiprocessing: e.g., a dual-core processor (not SISD)
• Architectures like IA-64 use a Very Long Instruction Word (VLIW),
which packs multiple instructions (to be executed in parallel) into a
single word
SISD-Explicit Multithreading (Cont.)
Interleaved Multithreading (fine-grained):
• Instructions are fetched from different threads in consecutive cycles
• In every clock cycle an instruction is fetched for a different thread,
i.e., switching occurs at each clock cycle
• With only a few active threads, performance degrades
SISD-Explicit Multithreading (Cont.)
Blocked Multithreading (coarse-grained):
• Another thread starts when the current thread blocks
• Events such as a cache miss or waiting for I/O trigger the switch
• The processor switches to a different thread when a long-latency
event (e.g., an L2 cache miss) occurs
SISD-Explicit Multithreading (Cont.)
Simultaneous Multithreading (SMT):
• Instructions are fetched from different threads in a single cycle
• The execution units of a superscalar processor receive instructions
from multiple threads
• A superscalar processor is a CPU that implements instruction-level
parallelism within a single processor
Intel’s Hyper Threading Technology
• A single physical processor appears as two logical processors
by applying a two-threaded SMT approach
Example: Intel Pentium 4 in 2002
• Each logical processor maintains a complete copy of the architectural
state (general-purpose registers, control registers, …)
• Logical processors share nearly all other resources such as
caches, execution units, branch predictors, control logic and
buses
Intel’s Hyper Threading Technology (Cont.)
• Partitioned resources are recombined when only one thread is
active
• Adds less than 5% to the total chip area
• Improves performance by 16% to 28%
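Because each logical processor is visible to the operating system as a schedulable CPU, the effect of SMT can be observed from software. A minimal C++ sketch (the value reported depends on the OS and the machine):

    #include <iostream>
    #include <thread>

    int main() {
        // On a Hyper-Threading CPU this typically reports twice the
        // number of physical cores, i.e. the count of logical processors.
        unsigned n = std::thread::hardware_concurrency();
        std::cout << "logical processors: " << n << '\n';
    }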
SIMD
• Homogeneous processing units / processing elements (PEs)
• Single instruction: all processing units execute the same
instruction at any given time
• Multiple data: each processing unit can operate on a different
data element
Example: add A and B, C and D, and X and Z, all in parallel
SIMD (Cont.)
• Each processing element has an associated data memory,
so that each instruction is executed on a different set of data by the
different processors
• Used by vector and array processors
Suitable for vector and matrix calculations
• Vector processors act on arrays of similar data (only when
executing in vector mode), and in this case they are several
times faster than when executing in scalar mode
Example: NEC SX-8 processors run at 2 GHz for vector operations and 1 GHz
for scalar operations
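The SIMD idea can be sketched in C++ with an element-wise loop; assuming the compiler auto-vectorizes it (typically at -O3 on GCC or -O2 on Clang; this is compiler behaviour, not guaranteed), the single logical "add" is mapped onto SIMD instructions that process several floats at once:

    #include <cstdio>
    #include <vector>

    int main() {
        const int N = 8;
        std::vector<float> a(N, 1.0f), b(N, 2.0f), c(N);
        // One logical operation applied to many data elements:
        // an auto-vectorizing compiler emits SIMD adds here,
        // processing several floats per instruction.
        for (int i = 0; i < N; ++i)
            c[i] = a[i] + b[i];
        for (int i = 0; i < N; ++i)
            std::printf("%.1f ", c[i]);
        std::printf("\n");
    }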
SIMD - Example
• A good example is the processing of pixels on screen
• A sequential processor would examine each pixel one at a time
and apply the processing instruction
• An array or vector processor can process all the elements of an
array simultaneously
• Game consoles and graphics cards make heavy use of such
processors to shift those pixels
• Such designs are usually dedicated to a particular application
and are not commonly marketed for general-purpose computing
MISD
• A single data stream is transmitted to a set of processors, each
of which executes a different instruction sequence
• Each processing unit operates on the data independently via
independent instruction stream
• This structure is not commercially implemented
• An example of use could be multiple cryptography algorithms
attempting to crack a coded message
MISD (Cont.)
• Example: three processors execute three different instructions on the
same data set
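Although no commercial MISD machine exists, the cryptanalysis idea can be imitated with threads: one data stream is handed to several different instruction streams. A minimal C++ sketch (the three checksum functions are illustrative assumptions, not real cryptography):

    #include <iostream>
    #include <string>
    #include <thread>

    // Three different "instruction streams" applied to the SAME data.
    unsigned long sum_bytes(const std::string& s) {
        unsigned long r = 0;
        for (unsigned char c : s) r += c;
        return r;
    }
    unsigned long xor_bytes(const std::string& s) {
        unsigned long r = 0;
        for (unsigned char c : s) r ^= c;
        return r;
    }
    unsigned long djb2(const std::string& s) {
        unsigned long r = 5381;
        for (unsigned char c : s) r = r * 33 + c;
        return r;
    }

    int main() {
        const std::string data = "coded message";   // single data stream
        unsigned long r1, r2, r3;
        std::thread t1([&]{ r1 = sum_bytes(data); });
        std::thread t2([&]{ r2 = xor_bytes(data); });
        std::thread t3([&]{ r3 = djb2(data); });
        t1.join(); t2.join(); t3.join();
        std::cout << r1 << ' ' << r2 << ' ' << r3 << '\n';
    }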
MIMD
• Multiple instruction: Every processor may execute a different
instruction stream
• Multiple data: Every processor may work with a different data
stream
Examples:
• Most current supercomputers
• Grid computers
• Networked parallel computers
• Symmetric Multiprocessor (SMP) computers
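By contrast with the MISD sketch, MIMD pairs different instruction streams with different data streams. A minimal C++ sketch (illustrative functions and data, not from the slides):

    #include <iostream>
    #include <numeric>
    #include <thread>
    #include <vector>

    int main() {
        std::vector<int> xs = {1, 2, 3, 4};
        std::vector<int> ys = {5, 6, 7, 8};
        long sum = 0, prod = 1;
        // Different instruction streams operating on different data streams.
        std::thread t1([&]{ sum = std::accumulate(xs.begin(), xs.end(), 0L); });
        std::thread t2([&]{ for (int y : ys) prod *= y; });
        t1.join(); t2.join();
        std::cout << "sum=" << sum << " prod=" << prod << '\n';
    }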
MIMD (Cont.)
MIMD systems are mainly:
Shared Memory (SM) systems:
• Multiple CPUs, all of which share the same address space
(there is only one memory)
Distributed Memory (DM) systems:
• Each CPU has its own associated memory
• CPUs are connected by some network (e.g., clusters)
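A distributed-memory sketch using MPI (assuming an MPI implementation such as MPICH or Open MPI is installed; compile with mpic++ and launch with mpirun -np 2): each process owns a private memory and exchanges data only through messages.

    #include <cstdio>
    #include <mpi.h>

    int main(int argc, char** argv) {
        MPI_Init(&argc, &argv);
        int rank, size;
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);   // this process's id
        MPI_Comm_size(MPI_COMM_WORLD, &size);   // number of processes
        int value = rank * 10;                  // lives in private memory
        if (rank == 0 && size > 1) {
            int received;
            // No shared address space: data must be sent explicitly.
            MPI_Recv(&received, 1, MPI_INT, 1, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            std::printf("rank 0 received %d from rank 1\n", received);
        } else if (rank == 1) {
            MPI_Send(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);
        }
        MPI_Finalize();
    }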
MIMD - Shared Memory
• All processors have access to all memory as a global address
space
Uniform Memory Access (UMA)
• The data access time from every processing unit to the shared
memory is constant
• Mostly represented by Symmetric Multiprocessor (SMP)
machines
Non-Uniform Memory Access (NUMA)
• The data access time to the shared memory is not constant; it
depends on which processing unit accesses which part of memory
Shared Memory Interconnection Network
• The main problem is how to interconnect the CPUs with each
other and with the memory
There are three main network topologies available:
• Crossbar: n² connections; each datapath is dedicated (no sharing)
• Omega (Ω) network: n log2 n connections; log2 n switching stages,
with sharing along a path
• Central data bus: 1 connection, shared by all n processors
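For example, with n = 8 processors: a crossbar needs 8² = 64 connections (no sharing), an Ω-network needs 8 × log2 8 = 24 connections arranged in log2 8 = 3 switching stages (paths shared), and a central bus needs a single connection shared by all 8 processors.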
Thank You!