Lec1 and 2
Lecture 1
Why Parallel Computing?
INTRODUCTION
WEEK 01
Course Objectives
◦Learn how to program parallel processors and systems
◦Learn how to think in parallel and write correct parallel programs
◦Achieve performance and scalability through an understanding of architecture and software mapping
◦Gain significant hands-on programming experience: develop real applications on real hardware
◦Discuss the current parallel computing context: the drivers that make this course timely, contemporary programming models and architectures, and where the field is going
Why is this Course Important?
The multi-core and many-core era is here to stay. Why? Technology trends.
Learn how to put these vast machine resources to the best use!
Useful for:
◦Joining industry
◦Graduate school
Our focus: teaching core concepts
Roadmap
Why we need ever-increasing performance.
Why we’re building parallel systems.
Why we need to write parallel programs.
How do we write parallel programs?
What we’ll be doing.
Concurrent, parallel, distributed!
Parallel and Distributed
Computing
Parallel computing (processing):
the use of two or more processors (computers), usually within a single system, working in combination to solve a single problem
Parallel programming:
the human process of developing programs that express what computations should be executed in parallel
Parallel Computing
A problem is broken up so it can be run using multiple CPUs:
◦The problem is broken into discrete parts that can be solved concurrently
◦Each part is further broken down into a series of instructions
◦Instructions from each part execute simultaneously on different CPUs
Parallel Computing
The simultaneous use of multiple compute resources to solve a
computational problem.
Compute Resources
The compute resources can include:
◦A single computer with multiple processors/cores
◦An arbitrary number of computers connected by
a network
◦A combination of both
Why we need ever-increasing
performance
Computational power is increasing, but so are our
computation problems and needs.
Problems we never dreamed of have been
solved because of past increases, such as
decoding the human genome.
More complex problems are still waiting to be
solved.
Climate modeling
National Oceanic and Atmospheric Administration
(NOAA) has more than 20PB of data and processes
80TB/day
Climate modeling
One processor computes one part of the model grid while another processor computes a different part in parallel.
Data analysis
CERN’s Large Hadron Collider (LHC) produces about 15PB per year
High-energy physics workflows involve a range of both data-intensive and
compute-intensive activities.
The collision data from the detectors on the LHC needs to be filtered to select a
few thousand interesting collisions from as many as one billion that may take
place each second.
The WLCG produces a massive sample of billions of simulated beam crossings,
trying to predict the response of the detector and compare it to known physics
processes and potential new physics signals.
Drug discovery
Computational drug discovery and design (CDDD) based on HPC combines pharmaceutical chemistry, computational chemistry, and biology on supercomputers, and has become a critical technology in drug research and development.
Why Parallel Computing?
The Real World is Massively Parallel:
◦Parallel computing attempts to emulate the natural world
◦Many complex, interrelated events happening at the same time, yet within a
sequence.
Why Parallel Computing?
To solve larger, more complex problems:
◦Numerical simulations of complex systems and "Grand Challenge Problems" such as climate modeling, high-energy physics data analysis, and drug discovery
To provide Concurrency:
Commercial applications require the processing of large amounts of data
in sophisticated ways.
Why Parallel Computing?
◦ To save time
◦ To solve larger problems
◦ To provide concurrency
Who and What?
Top500.org provides statistics on parallel computing
users.
The Future?
During the past 20 years, the trends indicated by ever faster
networks, distributed systems, and multi-processor
architectures clearly show that parallelism is the future of
computing.
In this same time period, there has been a greater than
500,000x increase in supercomputer performance, with no
end currently in sight.
The race is already on for Exascale Computing!
Exaflop = 10^18 calculations per second
Towards parallel hardware
Why we’re building parallel
systems
Up to now, performance increases have been
attributable to increasing density of transistors.
A little physics lesson
Smaller transistors = faster processors.
Faster processors = increased power
consumption.
Increased power consumption =
increased heat.
Increased heat = unreliable processors.
Evolution of processors in the last 50 years
How small is 5nm?
https://www.tsmc.com/english/dedicatedFoundry/technology/logic/l_5nm
An intelligent solution
Instead of designing and building faster
microprocessors, put multiple processors on a
single integrated circuit.
Move away from single-core systems to
multicore processors.
Introducing parallelism!!!
Basic Computer Architecture
Old computers had one unit (core) to execute instructions; new computers have 4 or more CPU cores.
Memory Cache
L1 Cache
Size is up to 2MB
Typically 100 times faster than RAM
L2 Cache
Size is typically between 256KB to 8MB
Typically 25 times faster than RAM
L3 Cache
Size is up to 64MB
L3 cache is a general memory pool that the entire chip can make use of
Basic Concepts
Task
◦A logically discrete (independent) section of computational work.
◦A task is typically a program or set of instructions executed by a
processor.
Basic Concepts
Parallel Task
◦A task that can be executed by multiple processors safely (yields
correct results)
Basic Concepts
Parallel Program
◦A program which consists of multiple tasks running on multiple
processors, simultaneously.
Basic Concepts
Serial Execution
◦Execution of a program sequentially, one statement at a time. All
parallel tasks will have sections that must be executed serially.
Parallel Execution
◦Execution of a program by more than one task, with each task being
able to execute the same or different statement at the same moment
in time (simultaneously).
Basic Concepts
Node
◦A standalone "computer in a box".
◦Usually comprised of multiple processors/cores, memory, network
interfaces, etc.
Basic Concepts
Communications
◦The data exchange between parallel tasks.
◦There are several ways this can be accomplished, such as through a
shared memory bus or over a network.
Basic Concepts
Synchronization
◦The coordination of parallel tasks in real time, very often associated
with communications.
◦Synchronization usually involves waiting by at least one task, and can
therefore cause a parallel application's execution time to increase.
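As an illustrative sketch of this idea (not from the slides — the counter and thread count are assumptions), a lock coordinates tasks that update shared data; the waiting each task does at the lock is exactly the synchronization overhead described above:

```python
import threading

counter = 0
lock = threading.Lock()

def worker(n_increments):
    global counter
    for _ in range(n_increments):
        # A task must wait here whenever another task holds the lock:
        # this waiting is what makes synchronization add to execution time.
        with lock:
            counter += 1

threads = [threading.Thread(target=worker, args=(100_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(counter)  # 400000 — correct only because the tasks synchronized
```

Without the lock, the unsynchronized read-modify-write of `counter` could lose updates; with it, the result is correct but the tasks serialize at the critical section.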
Basic Concepts
Massively Parallel
◦Refers to the hardware that comprises a parallel system - having
many processors.
◦The meaning of many keeps increasing (up to 6 digits!!!!).
Basic Concepts
Parallel Computing System
◦Consists of multiple processors having direct (usually bus based)
access to common physical memory.
◦All processors communicate with each other through the shared memory.
Basic Concepts
Distributed System
◦Contains multiple processors connected by a communication
network.
◦Refers to network based memory access for physical memory that is
not common.
Memory Models
There are three common kinds of parallel
memory models
Shared
Distributed
Hybrid
Shared Memory Model
All cores share the same pool of memory
HPC Architecture – we talked about the
memory available on one node
Any memory change is seen by all processors
Benefits and Drawback
Benefit:
Data sharing is fast
Drawback:
Adding more processors may lead to performance
issues when accessing the same shared memory
resource (memory contention)
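A minimal sketch of the shared-memory model (an illustration, not from the slides — the list and thread layout are assumptions): two threads in one process write into the same list, so a change made by one is immediately visible to the other.

```python
import threading

shared = [0] * 8  # one pool of memory visible to every thread

def fill(start, stop):
    for i in range(start, stop):
        shared[i] = i * i  # writes land in the common memory

# Each thread fills a disjoint half, so no lock is needed here.
t1 = threading.Thread(target=fill, args=(0, 4))
t2 = threading.Thread(target=fill, args=(4, 8))
t1.start(); t2.start()
t1.join(); t2.join()
print(shared)  # both threads' changes are visible: [0, 1, 4, 9, 16, 25, 36, 49]
```

Data sharing is this cheap precisely because everything lives in one memory; the flip side is that many threads hammering the same locations contend for it.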
Distributed Memory Model
In a distributed memory model, each core has its own
memory
Processors communicate only through a network connection and/or a communication protocol (e.g., MPI)
Changes to local memory associated with processor do not
have an impact on other processors
Remote-memory access must be explicitly managed by the
programmer
Benefits and Drawbacks
Biggest benefit is scalability
Adding more processors doesn’t result in resource
contention as far as memory is concerned
Biggest Drawback
Can be tedious to program for distributed memory
models
All data relocation must be programmed by hand
Hybrid Memory Model
As the name implies, the hybrid memory model is a
combination of the shared and distributed memory
models
Most large, fast clusters today use a hybrid memory model
A certain number of cores share the memory on one
node, but are connected to the cores sharing memory
on other nodes through a network
Benefits and Drawbacks
Benefit:
Scalability
Drawback
Must know how to program communication
between nodes (e.g., MPI)