Parallel and Distributed Computing

Chapter 1: Introduction to Parallel Computing

Jun Zhang
Department of Computer Science
University of Kentucky
Lexington, KY 40506

1.1a: von Neumann Architecture

• Common machine model for over 70 years
• Stored-program concept
• CPU executes a stored program
• A sequence of read and write operations on the memory
• Order of operations is sequential

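To make the stored-program idea concrete, here is a minimal C sketch (not from the slides; the four-opcode instruction set is invented purely for illustration) of the fetch-decode-execute cycle, with program and data sharing one memory:

```c
#include <stdio.h>

/* Hypothetical opcodes, for illustration only. */
enum { LOAD, ADD, STORE, HALT };

int main(void) {
    /* Program and data share one memory, as in the von Neumann model. */
    int memory[16] = {0};
    memory[10] = 5;                          /* data */
    memory[11] = 7;                          /* data */

    /* Each instruction is a pair: {opcode, memory address}. */
    int program[][2] = {
        {LOAD, 10},                          /* acc = memory[10]  */
        {ADD, 11},                           /* acc += memory[11] */
        {STORE, 12},                         /* memory[12] = acc  */
        {HALT, 0},
    };

    int pc = 0, acc = 0, running = 1;
    while (running) {                        /* strictly sequential: one instruction at a time */
        int op = program[pc][0], addr = program[pc][1];
        pc++;                                /* fetch, then advance the program counter */
        switch (op) {                        /* decode and execute */
            case LOAD:  acc = memory[addr];  break;   /* read from memory */
            case ADD:   acc += memory[addr]; break;
            case STORE: memory[addr] = acc;  break;   /* write to memory  */
            case HALT:  running = 0;         break;
        }
    }
    printf("memory[12] = %d\n", memory[12]); /* prints 12 */
    return 0;
}
```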
1.1b: A More Detailed Architecture Based on the von Neumann Model
1.1c: Old von Neumann Computer

1.1d: CISC von Neumann Computer

• CISC stands for Complex Instruction Set Computer, with a single bus system
• The Harvard (RISC) architecture utilizes two buses: a separate instruction bus and data bus
• RISC stands for Reduced Instruction Set Computer
• Both are SISD machines: Single Instruction stream, Single Data stream
1.1e: Personal Computer

1.1f: John von Neumann

• December 28, 1903 – February 8, 1957
• Hungarian mathematician
• Mastered calculus at 8
• Graduate-level math at 12
• Got his Ph.D. at 23
• His proposal to his 1st wife: “You and I might be able to have some fun together, seeing as how we both like to drink.”

1.2a: Motivations for Parallel Computing

• Fundamental limits on single-processor speed
• Heat dissipation from CPU chips
• Disparity between CPU and memory speeds
• Distributed data communications
• Need for very large scale computing platforms

1.2b: Fundamental Limits – Cycle Speed

• Intel 8080           2 MHz      1974
• ARM 2                8 MHz      1986
• Intel Pentium Pro    200 MHz    1996
• AMD Athlon           1.2 GHz    2000
• Intel QX6700         2.66 GHz   2006
• Intel Core i7 3770K  3.9 GHz    2013
• Speed of light: light travels 30 cm in 1 ns
• An electrical signal on a chip travels about 10 times slower
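A quick back-of-the-envelope consequence (the 3 GHz clock is an illustrative figure, not from the slides):

$$
t_{\mathrm{cycle}} = \frac{1}{3\,\mathrm{GHz}} \approx 0.33\,\mathrm{ns}, \qquad
d_{\max} = 30\,\mathrm{cm/ns} \times 0.33\,\mathrm{ns} \approx 10\,\mathrm{cm}
$$

in vacuum; at one tenth of that, an on-chip signal covers only about 1 cm per clock cycle, roughly the size of a chip, so clock rates cannot keep climbing without shrinking distances.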
1.2c: High-End CPU is Expensive

• Price for high-end CPUs rises sharply

(Figure: Intel processor price/performance.)
1.2d: Moore’s Law
• Moore’s observation in 1965: the number of transistors per square inch on integrated circuits had doubled every year since the integrated circuit was invented
• Moore’s revised observation in 1975: the pace had slowed a bit, but data density had doubled approximately every 18 months
• How about the future? (Does the price of computing power fall by half every 18 months?)
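In formula form (a sketch; the 10-year horizon is only an illustrative choice), a doubling period of $T = 18$ months gives

$$
N(t) = N_0 \cdot 2^{t/T}, \qquad N(10\,\mathrm{yr}) = N_0 \cdot 2^{120/18} \approx 100\,N_0,
$$

so doubling every 18 months compounds to roughly a 100-fold gain per decade; the same exponent applied to price halving is what makes computing power per dollar fall so fast.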
1.2e: Moore’s Law – Held for Now

1.3: Power Wall Effect in Computer Architecture
• Too many transistors in a given chip die area
• Tremendous increase in power density
• Increased chip temperature
• High temperature slows down the transistor switching rate and the overall speed of the computer
• Chip may melt down if not cooled properly
• Efficient cooling systems are expensive

1.3: Cooling Computer Chips

Some people suggest putting computer chips in liquid nitrogen to cool them.

1.3: Solutions
• Use multiple inexpensive processors
• A processor with multiple cores

1.3: A Multi-core Processor

1.3a: CPU and Memory Speeds

• In 20 years, CPU speed (clock rate) has increased by a factor of 1000
• DRAM speed has increased only by a factor of less than 4
• How to feed data fast enough to keep the CPU busy?
• CPU cycle time: 1–2 ns
• DRAM access time: 50–60 ns
• Cache access time: 10 ns

1.3b: Memory Access and CPU Speed

1.3b: CPU, Memory, and Disk Speed

1.3c: Possible Solutions

• A hierarchy of successively faster memory devices (multilevel caches)
• Locality of data reference in code (see the sketch below)
• Efficient programming can be an issue
• Parallel systems may provide
  1) larger aggregate cache
  2) higher aggregate bandwidth to the memory system

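To make the locality point concrete, here is a small C sketch (illustrative, not from the slides): both loops sum the same N x N matrix, but the row-major loop walks consecutive addresses and uses every byte of each fetched cache line, while the column-major loop strides N doubles per access and typically runs several times slower on a cached machine.

```c
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define N 4096

int main(void) {
    /* One contiguous N x N matrix; C stores it row by row (row-major). */
    double *a = malloc((size_t)N * N * sizeof *a);
    if (!a) return 1;
    for (size_t k = 0; k < (size_t)N * N; k++) a[k] = 1.0;

    double sum = 0.0;
    clock_t t0 = clock();
    for (int i = 0; i < N; i++)          /* row-major: consecutive addresses,  */
        for (int j = 0; j < N; j++)      /* so each cache line is fully used   */
            sum += a[(size_t)i * N + j];
    double t_row = (double)(clock() - t0) / CLOCKS_PER_SEC;

    t0 = clock();
    for (int j = 0; j < N; j++)          /* column-major: stride of N doubles, */
        for (int i = 0; i < N; i++)      /* so nearly every access misses      */
            sum += a[(size_t)i * N + j];
    double t_col = (double)(clock() - t0) / CLOCKS_PER_SEC;

    printf("sum=%.0f  row-major: %.3fs  column-major: %.3fs\n", sum, t_row, t_col);
    free(a);
    return 0;
}
```

The gap grows with matrix size: once a row no longer fits in cache, nearly every strided access is a miss.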
1.3f: Multilevel Hierarchical Cache

1.4a: Distributed Data Communications

• Data may be collected and stored at different locations
• It is expensive to bring them to a central location for processing
• Many computing assignments may be inherently parallel
• Privacy issues in data mining and other large scale commercial database manipulations

1.4b: Distributed Data Communications

1.4c: Move Computation to Data
(CS626: Large Scale Data Science)

1.5a: Why Use Parallel Computing

• Save time: reduce wall clock time, since many processors work together (see the worked example below)
• Solve larger problems: larger than one processor’s CPU and memory can handle
• Provide concurrency: do multiple things at the same time, e.g., online access to databases, search engines
• Google’s 4,000 PC servers were one of the largest clusters in the world (a server farm)
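In the ideal case (a simplification that ignores communication and other overheads), $p$ processors cut the wall clock time of a perfectly parallelizable job from $T_1$ to

$$
T_p = \frac{T_1}{p},
$$

so a 10-hour single-processor run would take about 6 minutes on 100 processors. Real programs fall short of this bound because some work is sequential or spent on coordination.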
1.4b: Google’s Data Center

1.5b: Other Reasons for Parallel Computing
• Taking advantage of non-local resources: using computing resources on a wide area network, or even the Internet (grid or cloud computing)
• Cost savings: using multiple “cheap” computing resources instead of a high-end CPU
• Overcoming memory constraints: for large problems, using the memories of multiple computers may overcome the memory constraint obstacle
1.6a: Need for Large Scale Modeling

• Long-term weather forecasting
• Large-scale ocean modeling
• Oil reservoir simulations
• Car and airplane manufacturing
• Semiconductor simulation
• Pollution tracking
• Large-scale commercial databases
• Aerospace (NASA microgravity modeling)
1.6b: Semiconductor Simulation

• Before 1975, an engineer had to make several runs through the fabrication line until a successful device was fabricated
• Device dimensions shrink below 0.1 micrometer
• A fabrication line costs one billion dollars to build
• A design must be thoroughly verified before it is committed to silicon
• A realistic simulation of one diffusion process may take days or months to run on a workstation
• Chip price drops quickly after entering the market

1.4b: Semiconductor Diffusion Process

1.6c: Drug Design

• Most drugs work by binding to a specific site, called a receptor, on a protein
• A central problem is to find molecules (ligands) with high binding affinity
• Need to accurately and efficiently estimate electrostatic forces in molecular and atomic interactions (a sketch of this computation follows below)
• Calculate drug-protein binding energies from quantum mechanics, statistical mechanics, and simulation techniques
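To give a flavor of the core computation (an illustrative sketch, not the method used in real drug-design codes; the units and test data are made up), here is a pairwise Coulomb energy sum in C. The double loop visits all n(n-1)/2 atom pairs, so the cost grows as O(n^2), which is exactly the kind of work that gets spread across many processors in practice.

```c
#include <math.h>
#include <stdio.h>

/* Illustrative point charge: position (x, y, z) and charge q. */
typedef struct { double x, y, z, q; } Atom;

/* Coulomb energy: U = k * sum over pairs i<j of q_i*q_j / r_ij.
   The constant k is folded to 1.0 here; real codes use proper units. */
double coulomb_energy(const Atom *atoms, int n) {
    double u = 0.0;
    for (int i = 0; i < n; i++) {            /* every distinct pair (i, j) */
        for (int j = i + 1; j < n; j++) {    /* -> n*(n-1)/2 interactions  */
            double dx = atoms[i].x - atoms[j].x;
            double dy = atoms[i].y - atoms[j].y;
            double dz = atoms[i].z - atoms[j].z;
            double r = sqrt(dx*dx + dy*dy + dz*dz);
            u += atoms[i].q * atoms[j].q / r;
        }
    }
    return u;
}

int main(void) {
    Atom demo[3] = { {0, 0, 0, +1}, {1, 0, 0, -1}, {0, 1, 0, +1} };
    printf("U = %f\n", coulomb_energy(demo, 3));  /* three toy atoms */
    return 0;
}
```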
1.6d: Computing Protein Binding

1.6d: Computer Aided Drug Design

1.7: Issues in Parallel Computing

• Design of parallel computers
• Design of efficient parallel algorithms
• Methods for evaluating parallel algorithms
• Parallel computer languages
• Parallel programming tools
• Portable parallel programs
• Automatic programming of parallel computers
• Education in the parallel computing philosophy
1.8 Eclipse Parallel Tools Platform

• A standard, portable parallel integrated development environment that supports a wide range of parallel architectures and runtime systems (IBM)
• A scalable parallel debugger
• Support for the integration of a wide range of parallel tools
• An environment that simplifies end-user interaction with parallel systems
1.9 Message Passing Interface (MPI)

• We will use MPI for hands-on experience in this class
• The Message Passing Interface can be downloaded from an online website (MPICH)
• Parallel computing can be simulated on your own computer (Microsoft MPI)
• The University of Kentucky (UK) can provide distributed computing services for research purposes, for free

1.10 Cloud Computing
• Cloud computing is a style of computing in which dynamically scalable and often virtualized resources are provided over the Internet
• Users need not have knowledge of, expertise in, or control over the technology infrastructure in the “cloud” that supports them
• Compare to: grid computing (a cluster of networked, loosely coupled computers),
• utility computing (packaging of computing resources as a metered service),
• and autonomic computing (computer systems capable of self-management)
1.11 Cloud Computing

(Figure: cloud computing diagram by Sam Johnston, Wikimedia Commons.)

1.12 Cloud Computing

A means to increase computing capacity or add computing capabilities at any time, without investing in new infrastructure, training new personnel, or licensing new software.
1.13 Cloud Computing

1.14 Cloud Computing

1.15 Cloud Computing

1.16 Top Players in Cloud Computing
• Amazon.com (IaaS, most comprehensive)
• VMware (vCloud, software to build clouds)
• Microsoft (PaaS, IaaS)
• Salesforce.com (SaaS)
• Google (IaaS, PaaS, Google App Engine)
• Rackspace (IaaS)
• IBM (OpenStack)
• Citrix (competes with VMware; free cloud operating system)
• Joyent (competes with VMware, OpenStack, Citrix)
• SoftLayer (web hosting service provider)
1.17 Parallel and Distributed Computing (CS621)
• This class is about parallel and distributed computing algorithms and applications
• Algorithms for communication between processors
• Algorithms to solve scientific computing problems
• Hands-on experience with the Message Passing Interface (MPI) to write parallel programs to solve some problems

• This class is NOT about parallel data processing (CS626)

*** Hands-on Experience
• You can download and install a copy of Microsoft MPI (Message Passing Interface) at

https://www.microsoft.com/en-us/download/details.aspx?id=57467

You also need MS Visual Studio to work with it.

https://visualstudio.microsoft.com/

*** Hands-on Experience
• Here is a video demonstrating how to set up MS Visual Studio for MS MPI programming

https://www.youtube.com/watch?v=IW3SKDM_yEs&t=330s

• Please make sure that you have installed MS Visual Studio and MS MPI on your computer

• Please run the sample MPI “Hello World” program (a minimal version is sketched below)

https://mpitutorial.com/tutorials/mpi-hello-world/
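For reference, here is a minimal MPI “Hello World” in C along the lines of the mpitutorial.com example (build details vary: mpicc with MPICH on Linux, or a Visual Studio project linked against MS-MPI):

```c
#include <mpi.h>
#include <stdio.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);                      /* start the MPI runtime       */

    int world_size, world_rank;
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);  /* total number of processes   */
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);  /* this process's id (0-based) */

    char name[MPI_MAX_PROCESSOR_NAME];
    int name_len;
    MPI_Get_processor_name(name, &name_len);     /* host the process runs on    */

    printf("Hello from processor %s, rank %d out of %d processes\n",
           name, world_rank, world_size);

    MPI_Finalize();                              /* shut the runtime down       */
    return 0;
}
```

Run it with, for example, mpiexec -n 4 ./hello (MPICH) or mpiexec -n 4 hello.exe (MS-MPI); each of the four processes prints its own rank.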

*** Hands-on Experience

• If you have serious research projects that need a lot of computing resources, you can also request an account on a supercomputer at the UK Center for Computational Science

• Go to the Center for Computational Science, get a form to fill out, and ask your advisor to sign it

• https://www.ccs.uky.edu/

