
Principles of Parallel Computing

(Mastering Cloud Computing: Chapter#2)

Rashmi Kansakar
Computing Eras

➢ Sequential - 1940s+

➢ Parallel and Distributed - 1960s+

➔ But… CS curricula typically teach sequential programming


➔ Parallel programming is hard!
➔ Moore’s Law trends in modern CPUs now require us to write
parallel programs.
Computing Eras…
Von Neumann Architecture
What is Serial Computing?

Traditionally, software has been written for serial computation:

✓ Runs on a single computer having a single Central Processing Unit (CPU)

✓ A problem is broken into a discrete series of instructions

✓ Instructions are executed one after another

✓ Only one instruction may execute at any moment in time
What is Parallel Computing?
Parallel computing is the simultaneous use of
multiple compute resources to solve a computational problem

✓ The problem is broken into independent parts

✓ Each processing element executes its part of the algorithm simultaneously with the others

✓ The computational problem is solved in less time with multiple compute resources than with a single compute resource

✓ The compute resources can be a single computer with multiple processors or several networked computers
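
As an illustration only (not from the slides), here is a minimal Python sketch of this idea: a summation problem is broken into independent chunks, and a pool of worker processes computes the partial sums simultaneously.

    # Minimal sketch: split a summation across worker processes.
    from multiprocessing import Pool

    def partial_sum(chunk):
        # Each processing element works on its own part of the data.
        return sum(chunk)

    if __name__ == "__main__":
        data = list(range(1_000_000))
        n_workers = 4
        size = len(data) // n_workers  # assumes len(data) divides evenly
        chunks = [data[i * size:(i + 1) * size] for i in range(n_workers)]
        with Pool(n_workers) as pool:
            # Each worker runs simultaneously with the others.
            partials = pool.map(partial_sum, chunks)
        print(sum(partials))  # same result as sum(data)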
Parallel Vs. Distributed
➢ Parallel - tightly coupled system
○ Computation is divided among processors sharing common memory
○ Homogeneous components: each processor is of the same type and capacity
○ The definition has loosened with InfiniBand and distributed memory

➢ Distributed - architecture in which computation is broken down into units and executed concurrently
○ Parallel is a subtype; distributed is the more general term
○ Different nodes, processors, or cores
○ Heterogeneous components
○ E.g., grid computing, Internet computing systems
Parallel Computing

Renewed interest…
➢ Larger computation tasks
➢ CPUs have reached physical limits
➢ Hardware features (pipelining, superscalar execution, etc.) require
complex compilers and have also reached their limits
➢ Vector processing is effective, but its applicability is limited to specific workloads
➢ Networking technology has matured
Hardware Architectures

➢ Single instruction, single data (SISD)

➢ Single instruction, multiple data (SIMD)

➢ Multiple instruction, single data (MISD)

➢ Multiple instruction, multiple data (MIMD)


Single Instruction Single Data (SISD)

§ Sequential computers (no parallel instruction/data streams)
§ Single instruction: only one instruction is acted on by the CPU in one clock cycle
§ Single data: only one data stream is used as input during any one clock cycle
§ Older-generation mainframes and minicomputers
§ ‘Normal’ computers: modern-day PCs and Macs
§ The programming style typically taught in CS1, CS2, and DS
Single Instruction Multiple Data (SIMD)

§ Multiple data streams are processed against a single instruction stream
§ Multiple data: each processing unit can operate on a different data element
§ Vector processors: one instruction operates on 1-D arrays of data called vectors
§ Scientific workloads, vector and matrix operations
§ GPUs (CUDA), Sony PS3 Cell processor (1, 2), Cray’s vector processor
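
As a hedged illustration (not from the slides), NumPy shows this SIMD style in Python: a single vectorized operation is applied across entire arrays, and NumPy's compiled kernels can use the CPU's SIMD instructions underneath.

    import numpy as np

    a = np.arange(1_000_000, dtype=np.float32)
    b = np.arange(1_000_000, dtype=np.float32)

    # One logical instruction ("add") applied to many data elements at once,
    # instead of an explicit element-by-element Python loop.
    c = a + b

    # Equivalent scalar (SISD-style) loop, far slower:
    # c = np.empty_like(a)
    # for i in range(len(a)):
    #     c[i] = a[i] + b[i]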
Multiple Instruction Single Data (MISD)

§ Each processing unit operates on the data independently via separate instruction streams, e.g. y = sin(x) + cos(x) + tan(x)
§ A single data stream is fed into multiple processing units
§ No commercial machines exist, though CPU superscalar execution and pipelining have a similar feel
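
A hedged Python sketch of the slide's example: the same single data stream x is fed to several workers, each executing a different instruction stream.

    import math
    from concurrent.futures import ThreadPoolExecutor

    def misd_style(x):
        # The same datum x goes to multiple "processing units",
        # each applying a different instruction stream.
        with ThreadPoolExecutor(max_workers=3) as pool:
            futures = [pool.submit(f, x) for f in (math.sin, math.cos, math.tan)]
            return sum(f.result() for f in futures)

    print(misd_style(1.0))  # y = sin(x) + cos(x) + tan(x)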
Multiple Instruction Multiple Data (MIMD)

§ Multiple autonomous processors simultaneously execute different instructions on different data
§ Multiple Instruction: Every processor may be executing a different instruction stream
§ Multiple Data: Every processor may be working with a different data stream
§ Asynchronous transfers are generally faster than synchronous transfers
§ Most supercomputers, networked parallel clusters, grids, clouds, multi-core PCs
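
An illustrative sketch (assumption: Python processes standing in for autonomous processors): each worker below executes a different instruction stream on a different data stream, simultaneously.

    from multiprocessing import Process

    def count_evens(data):   # one instruction stream...
        print("evens:", sum(1 for x in data if x % 2 == 0))

    def total(data):         # ...and a different instruction stream
        print("total:", sum(data))

    if __name__ == "__main__":
        # Different instructions on different data, running simultaneously.
        procs = [Process(target=count_evens, args=(range(0, 100),)),
                 Process(target=total, args=(range(100, 200),))]
        for p in procs:
            p.start()
        for p in procs:
            p.join()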
Memory Architectures in MIMD

Shared memory
- All PEs are connected to a single global memory and have access to it
- Tightly coupled multiprocessor systems
- Communication happens through the shared memory
- Shared-memory MIMD is easier to program, but less tolerant of failure and harder to scale
- A failure can affect the entire system

Distributed memory
- All PEs have local memory; loosely coupled multiprocessor systems
- Cost-effective: can use commodity, off-the-shelf processors
- Failures can be isolated, which makes this design popular
- Each processor can access its own memory without interference
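
A hedged sketch of the two styles using Python's multiprocessing module: the shared-memory worker communicates through a common memory location, while the distributed-memory worker has no shared state and communicates by message passing.

    from multiprocessing import Process, Queue, Value

    def shared_worker(counter):
        # Shared memory: communicate through a common location (needs locking).
        with counter.get_lock():
            counter.value += 1

    def distributed_worker(q):
        # Distributed memory: no shared state; send a message instead.
        q.put("partial result")

    if __name__ == "__main__":
        counter, q = Value("i", 0), Queue()
        ps = [Process(target=shared_worker, args=(counter,)),
              Process(target=distributed_worker, args=(q,))]
        for p in ps:
            p.start()
        msg = q.get()  # receive the message before joining
        for p in ps:
            p.join()
        print(counter.value, msg)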
Hardware Architectures
Flynn's taxonomy: based on the number of concurrent instruction
and data streams available in the architecture

➢ SISD: uniprocessors
➢ SIMD: vector processors, parallel processing
➢ MISD: arguably pipelined computers
➢ MIMD: multi-computers, multi-processors
Parallel Processor Architectures

§ The computers of today, and tomorrow, have tremendous processing power that requires parallel programming to fully utilize.
§ There are significant differences between sequential and parallel programming that can be challenging.
§ With early exposure to these differences, students are capable of achieving performance improvements with multicore programming.
How to Program in Parallel?

➢ Problem specific!
➢ Approaches:
○ Data parallelism
■ MapReduce (data based); see the sketch after this list
○ Process parallelism
■ Game/Cell Processor (code based)
○ Farmer-and-worker model
■ Web serving (Apache) (thread based)
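
As a hedged sketch of the data-parallel (MapReduce-style) approach above: input chunks are mapped to partial word counts in parallel, then reduced into one result.

    from collections import Counter
    from functools import reduce
    from multiprocessing import Pool

    def map_chunk(lines):
        # Map: count words in one chunk of the input, independently.
        return Counter(word for line in lines for word in line.split())

    if __name__ == "__main__":
        text = ["the quick brown fox", "the lazy dog", "the end"]
        chunks = [text[0:1], text[1:2], text[2:3]]
        with Pool(3) as pool:
            partial_counts = pool.map(map_chunk, chunks)
        # Reduce: merge the independent partial results.
        total = reduce(lambda a, b: a + b, partial_counts)
        print(total.most_common(3))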
Level of Parallelism
➢ Goal?
○ Never have a processor idle!
○ ‘Grain size’ important
■ How you break up the problem.
Grain Size         Code Item                         Parallelized By
Large (task)       Separate (heavyweight process)    Programmer
Medium (control)   Function or procedure (thread)    Programmer
Fine               Loop or instruction block         Compiler
Very Fine          Instruction                       Processor or OS


Level of Parallelism…
[Diagram: parallelism is handled at the programmer, compiler, and processor & OS levels]
Limits to Parallelism (so far)
Linear speedup not possible
➢ Doubling # cores doesn’t double speed
■ Communication overhead
➢ General guidelines:
■ Computation speed grows roughly as sqrt(system cost): the faster a system
becomes, the more expensive it is to make it faster
■ The speed of a parallel computer increases roughly as log(n), where
n is the number of processors
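
One standard way to formalize this limit (an addition, not from the slides) is Amdahl's law: if a fraction s of a program is inherently serial, the speedup on n processors is bounded no matter how many processors are added.

\[
\mathrm{Speedup}(n) = \frac{1}{s + \frac{1-s}{n}}, \qquad
\lim_{n \to \infty} \mathrm{Speedup}(n) = \frac{1}{s}
\]

For example, with s = 0.1 (10% of the work serial), even infinitely many processors give at most a 10x speedup, which is why doubling the number of cores does not double the speed.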
Parallel Overhead
➢ A sequential program runs on a single processor and has a single line of control
➢ Parallel programming means making many processors collectively work on a single
program
➢ Parallel overhead:
○ The amount of time required to coordinate parallel tasks, as opposed to doing useful
work.

➢ Parallel overhead can include factors such as:
○ Task start-up time
○ Synchronizations
○ Data communications
○ Software overhead imposed by parallel languages, libraries, the operating system, etc.
○ Task termination time
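
A hedged sketch of measuring this overhead in Python: for a deliberately tiny task, process start-up and data communication usually make the parallel version slower than the serial one.

    import time
    from multiprocessing import Pool

    def square(x):
        return x * x

    if __name__ == "__main__":
        data = list(range(10_000))  # small on purpose: overhead dominates

        t0 = time.perf_counter()
        serial = [square(x) for x in data]
        t1 = time.perf_counter()

        with Pool(4) as pool:  # task start-up + data communication costs
            parallel = pool.map(square, data)
        t2 = time.perf_counter()

        print(f"serial: {t1 - t0:.4f}s, parallel: {t2 - t1:.4f}s")
        # On most machines the parallel run is slower here: coordinating
        # the tasks costs more than the useful work being done.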
Why Use Parallel Computing?
➢ Save time and money
- In theory, throwing more resources at a task will shorten its time to completion
- Parallel computers can be built from cheap, commodity components
➢ Solve larger problems
- Many problems are so large and complex that it is impractical or impossible to
solve them on a single computer, especially given limited computer memory
➢ Provide concurrency
- A single compute resource can only do one thing at a time
- Multiple compute resources can be doing many things simultaneously
➢ Limits to serial computing
- It is increasingly expensive to make a single processor faster
- Using a larger number of moderately fast commodity processors to achieve
the same or better performance is less expensive
The Future
Trends indicated by ever-faster networks, distributed systems, and multi-processor
computer architectures clearly show that PARALLELISM is the future of
computing.
➢ There has been a greater than 1000x increase in supercomputer
performance, with no end currently in sight
➢ The race is already on for exascale computing!

1 exaFLOPS = 10^18 FLOPS (floating-point operations per second);
a 1-FLOPS machine performs one floating-point operation per second.
Moore’s Law and beyond
Moore's Law originated around 1970. A simplified
version of the law states that processor speeds,
or overall processing power for computers, will
double every two years. Moore's Law is no longer
holding, and GPUs are advancing at a faster pace
than CPUs.
Improving parallel processing: CPU vs. GPU
➢ The CPU is sometimes called the brains of a
computer, while a GPU acts as a specialized
microprocessor.
➢ A CPU is good at handling multiple tasks, but
a GPU can handle a few specific tasks very fast.
➢ A GPU (graphics processing unit) is a
programmable processor designed to quickly
render high-resolution images and video.
➢ CUDA cores are an Nvidia GPU's equivalent of
CPU cores. They are optimized for running a
large number of calculations simultaneously.
➢ GPUs render billions of triangles per second
➢ Parallel processing on steroids!

Intel XEON PLATINUM 9282       NVIDIA TITAN V
CPU cores: 56                  CUDA cores: 5,120; Tensor cores: 640
Retail: $50,000+               Retail: $2,999
Transistors: 8 billion         Transistors: 21 billion
Beyond GPU to dedicated ML silicon
#   CPU                                                 GPU
1   Stands for Central Processing Unit                  Stands for Graphics Processing Unit
2   Consumes or needs more memory than a GPU            Consumes or requires less memory than a CPU
3   Lower speed than a GPU                              Faster than a CPU
4   Contains a few powerful cores                       Contains many weaker cores
5   Suitable for serial instruction processing          Not suitable for serial instruction processing
6   Not suitable for parallel instruction processing    Suitable for parallel instruction processing
7   Emphasizes low latency                              Emphasizes high throughput
Comparing GPU & CPU

MythBusters hosts Adam and Jamie paint the Mona Lisa in 80 milliseconds!

https://www.youtube.com/watch?v=WmW6SD-EHVY
Why Deep Learning uses GPUs

• Artificial intelligence with PyTorch and CUDA
• CUDA cores are an Nvidia GPU's equivalent of CPU cores; they are
optimized for running a large number of calculations simultaneously
• The video below discusses how CUDA fits in with PyTorch and, more
importantly, why we use GPUs in neural network programming

https://www.youtube.com/watch?v=6stDhEA0wFQ
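
A minimal PyTorch sketch (an illustration, not from the slides) of how CUDA fits in: tensors are moved to the GPU device, where thousands of CUDA cores execute the arithmetic in parallel.

    import torch

    # Use the GPU if CUDA is available; otherwise fall back to the CPU.
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

    # A large matrix multiply: on a GPU, thousands of CUDA cores run the
    # multiply-accumulate operations simultaneously.
    a = torch.randn(4096, 4096, device=device)
    b = torch.randn(4096, 4096, device=device)
    c = a @ b
    print(c.device)  # prints cuda:0 when a GPU is present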
