
Lecture 01

Naveen Mathew Nathan S.


8/27/2019

Introduction
Book:
Computational science:
• simulations: obtaining numerical solutions for experimental settings that arise from theory
• data science: arises from observations
Numerical methods: e.g., converting differential equations into algebraic equations.
Example: given observed pressure, temperature, and velocity, use the governing differential equations and solve numerically.
This did not work because the time step was too large.
Equations: ∂e/∂t + v · ∇e = −(P/ρ) ∇ · v + h.o.t., ∂v/∂t = . . . + h.o.t. (h.o.t. = higher-order terms)
Memory hierarchy: CPU processor registers → CPU cache (levels 1, 2, and 3) → physical memory (RAM) → solid-state memory (non-volatile, flash-based) → virtual memory (file-based storage)
Writing code to maximize the use of cache can speed up computation.
Pipelining:
CPUs don’t execute just one instruction at a time: while one instruction is partway through, the next one starts, so several are in flight in overlapping stages (process some of a task, start another, process, start, . . .).
The compiler (by ordering instructions) and the hardware handle pipelining.
Vectorization:
Adding two double vectors, c = a + b with len(a) = 8: done naively, this is 8 separate 64-bit load-add-store operations.
With vectorization, the additions are performed in chunks: all 8 elements are added in one wide SIMD operation.
Heat dissipation becomes a problem as the FLOP rate of a processor increases. Alternative: more threads
of execution (greater concurrency). Also: hybrid parallelism.
Multiplying 2 floats ~ 20 pJ; reading an operand from on-chip memory at the far end of the chip ~ 1 nJ; reading an operand
from off-chip RAM ~ 16 nJ.
In the future, FLOPs will be essentially free, but storage and communication will be expensive.

Parallel computing models

Shared memory (Eg: OpenMP)

• Requires special hardware; each process can see all memory; parallelism implemented via compiler
directives

!$OMP PARALLEL SHARED(A, B) PRIVATE(i)
!$OMP DO SCHEDULE(STATIC)
do i = 1, 199
  A(i) = B(i) + . . .
enddo
!$OMP END DO
!$OMP END PARALLEL

Message passing (Eg: MPI)

• Each process has private memory, parallelism implemented via explicit transfers, can work on any
networked CPUs
call mpi_init(err)
call mpi_comm_rank(MPI_COMM_WORLD, me, err)
call mpi_sendrecv(parameters)
do i = 1, 199
  A(i) = B(i) + . . .
enddo
call mpi_finalize(err)

Concepts

• Speedup: S_N = t_1 / t_N, where t_i is the time on i processors


• Parallel efficiency: speedup/N (efficiency = 1 => perfect parallel performance)
• Scaling: strong-scaling: speedup ∝ N for fixed workload, weak-scaling: speedup constant for workload
∝ N (increasing workload and number of processors)
• Amdahl’s law: S_N = N / (B·N + (1 − B)), where B is the fraction of the algorithm that is serial and N is the number of processors

Important notes:
Where a GPU may not be best: when the pattern of computation differs from one data unit to the next. GPUs are
better for structured data.
A CPU can handle a varied mix of tasks: a previous gate decides whether a gate receives a signal (branching).
A GPU doesn’t allow pipelining, but allows a large number of processing units to be embedded.
