
High Performance Computing (HPC) – Lecture (1) Summary


Introduction to HPC
• HPC (High Performance Computing) refers to aggregating computing power to achieve much
higher performance than a typical desktop computer or workstation can deliver.
• Used to solve large-scale problems in science, engineering, or business.

Key Drivers of HPC


• Increasing data generation.
• Need for complex simulations and modeling (e.g., climate science, physics).
• Limits on single-core processor performance and on further clock-speed scaling.

Types of Computing
• Serial Computing: Executes one instruction at a time.
• Parallel Computing: Executes multiple calculations simultaneously by dividing large
problems into smaller, concurrent tasks (see the sketch below).
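
A minimal sketch of the contrast, assuming a CPU-bound task (summing squares over a range); the chunk boundaries, worker count, and function names are illustrative, not part of the lecture:

```python
# Serial vs. parallel computing on the same problem.
from concurrent.futures import ProcessPoolExecutor

def sum_squares(bounds):
    """Sum i*i over the half-open range [lo, hi)."""
    lo, hi = bounds
    return sum(i * i for i in range(lo, hi))

if __name__ == "__main__":
    n = 10_000_000

    # Serial computing: one instruction stream works through the whole range.
    serial = sum_squares((0, n))

    # Parallel computing: split the range into independent chunks and let
    # multiple processes compute them simultaneously.
    chunks = [(i * n // 4, (i + 1) * n // 4) for i in range(4)]
    with ProcessPoolExecutor(max_workers=4) as pool:
        parallel = sum(pool.map(sum_squares, chunks))

    assert serial == parallel
```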

HPC Application Areas


• Science:
  - Space Science: Astrophysics and astronomy.
  - Earth Science: Geological structure analysis, water resource modeling, seismic exploration.
  - Atmospheric Science: Climate and weather forecasting, air quality.
  - Life Science: Drug design, genome sequencing, protein folding.
  - Nuclear Science: Nuclear power, nuclear medicine, defense.
  - Nano Science: Semiconductor physics, microfabrication, molecular biology.
• Engineering:
  - Crash Simulation: Automobile and mechanical engineering.
  - Aerodynamics Simulation: Aeronautics and mechanical engineering.
  - Structural Analysis: Civil engineering and architecture.
• Multimedia & Animation:
  - Increased demand for high resolution (4K, 8K), complex visual effects, real-time
rendering, and large data processing for gaming and VR.

Parallel Processing
• Builds on the von Neumann architecture (program and data stored in the same memory).

• Flynn's Taxonomy:
  - SISD: Single Instruction Single Data (traditional uniprocessor).
  - SIMD: Single Instruction Multiple Data (data parallelism; see the sketch after this list).
  - MISD: Multiple Instruction Single Data (systolic arrays, pipelines).
  - MIMD: Multiple Instruction Multiple Data (shared/distributed memory).
• Pipelining: Overlapping the execution of successive instruction stages so that several
operations are in progress at once (a form of temporal parallelism).
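
A minimal sketch of the SIMD idea, using NumPy's vectorized operations as a stand-in for hardware SIMD (the array contents are illustrative):

```python
# SIMD: one (logical) instruction applied to multiple data elements at once.
import numpy as np

data = np.arange(8)      # multiple data: [0, 1, 2, ..., 7]
result = data * 2        # single instruction applied to every element
print(result)            # [ 0  2  4  6  8 10 12 14]
```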

Types of Parallelism
• Data Parallelism: Simultaneous processing of multiple data items.
• Functional Parallelism: Different independent modules run simultaneously.
• Overlapped/Temporal Parallelism: Tasks executed in an overlapped sequence, as in
pipelining (see the sketch below).
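
A minimal sketch of temporal parallelism: items flow through two pipelined stages connected by queues, so different stages work on different items at the same time. The stage functions and item count are illustrative assumptions:

```python
# A two-stage pipeline built from threads and queues. While stage 2
# processes item i, stage 1 can already be working on item i+1.
import threading
import queue

def stage(worker, inbox, outbox):
    """Pull items from inbox, apply worker, push results to outbox."""
    while True:
        item = inbox.get()
        if item is None:          # sentinel: shut the stage down
            outbox.put(None)
            break
        outbox.put(worker(item))

q1, q2, q3 = queue.Queue(), queue.Queue(), queue.Queue()
threads = [
    threading.Thread(target=stage, args=(lambda x: x + 1, q1, q2)),  # stage 1
    threading.Thread(target=stage, args=(lambda x: x * 2, q2, q3)),  # stage 2
]
for t in threads:
    t.start()

for i in range(5):                # feed the pipeline
    q1.put(i)
q1.put(None)                      # signal end of stream

results = []
while (r := q3.get()) is not None:
    results.append(r)
print(results)                    # [2, 4, 6, 8, 10]
for t in threads:
    t.join()
```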

Performance Issues and Metrics


• Challenges: parallel overhead, interprocessor communication costs, and load imbalance.
• Performance Metrics (expressed in code after this list):
  - Speedup (S): Ratio of single-processor execution time to n-processor execution time.
  - Efficiency (E): Speedup divided by the number of processors; equivalently, the useful
parallel time divided by the overall parallel time.
  - Throughput: Work done per unit of time.
  - Application-specific measures: e.g., particle interactions computed per unit of time.
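
A minimal sketch of these metrics as plain functions (the names are illustrative, not notation from the lecture):

```python
def speedup(t_serial, t_parallel):
    """S = T_serial / T_parallel: how many times faster the parallel run is."""
    return t_serial / t_parallel

def efficiency(t_serial, t_parallel, n_processors):
    """E = S / n: the fraction of the processors' time spent on useful work."""
    return speedup(t_serial, t_parallel) / n_processors

def throughput(total_operations, t_parallel):
    """Operations completed per unit of (parallel) time."""
    return total_operations / t_parallel
```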


Worked Example

The given expression is:

Y = (a * b) + (c / d) + e

We need to determine the sequential and parallel execution times, speedup, efficiency, and
throughput based on the operation dependency graph and the two-processor schedule described
below.

Step 1: Build the Dependency Graph

The dependency graph shows the order in which operations must be performed, based on their
data dependencies:

1. Cycle 1:
   - Compute a * b (multiplication node, left side of graph).
   - Compute c / d (division node, right side of graph).
   Both operations can be performed in parallel because they are independent of each other.

2. Cycle 2:
   - Sum the results of a * b and c / d (addition node in the middle).

3. Cycle 3:
   - Add e to the result from Cycle 2 to get the final result Y.

This structure shows that Y can be computed in 3 cycles using parallel processing (a runnable
sketch follows).
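
A minimal sketch of this 3-cycle schedule using two worker threads (the operand values are illustrative assumptions):

```python
# Evaluate Y = (a * b) + (c / d) + e following the dependency graph.
from concurrent.futures import ThreadPoolExecutor

a, b, c, d, e = 2.0, 3.0, 8.0, 4.0, 1.0

with ThreadPoolExecutor(max_workers=2) as pool:
    # Cycle 1: the two independent operations execute concurrently.
    mul = pool.submit(lambda: a * b)
    div = pool.submit(lambda: c / d)
    # Cycle 2: combine the intermediate results.
    partial = mul.result() + div.result()

# Cycle 3: final addition.
Y = partial + e
print(Y)   # 2*3 + 8/4 + 1 = 9.0
```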

Step 2: Sequential and Parallel Execution Times

- Sequential Time (Tsequential): the time to compute Y without any parallelism.
  The expression has four operations (one multiplication, one division, and two additions),
  so Tsequential = 4 cycles.

- Parallel Time (Tparallel): the time to compute Y with parallelism.
  As the dependency graph shows, only 3 cycles are needed to complete all operations with
  two processors, so Tparallel = 3 cycles.

Step 3: Calculating Speedup

Speedup is calculated as the ratio of the sequential time to the parallel time:
Speedup = Tsequential / Tparallel = 4 / 3 ≈ 1.33

Step 4: Calculating Efficiency

Efficiency measures how effectively the processors are being used. It is calculated by
dividing the speedup by the number of processors:
Efficiency = Speedup / Number of Processors = (4 / 3) / 2 = 2 / 3 ≈ 0.67, or about 67%

Step 5: Calculating Throughput

Throughput represents the number of operations performed per cycle in parallel execution.
This can be calculated as:
Throughput = Total Operations / Tparallel = 4 / 3 ≈ 1.33 operations per cycle

Summary of Results

• Sequential Time (Tsequential): 4 cycles
• Parallel Time (Tparallel): 3 cycles
• Speedup: 4/3 ≈ 1.33
• Efficiency: 2/3 ≈ 67%
• Throughput: 4/3 ≈ 1.33 operations per cycle
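
For completeness, these numbers follow directly from the metric functions sketched earlier:

```python
# Checking the worked example's results:
# 4 operations, Tsequential = 4 cycles, Tparallel = 3 cycles, 2 processors.
print(speedup(4, 3))         # 1.333... -> speedup of about 1.33
print(efficiency(4, 3, 2))   # 0.666... -> about 67% efficiency
print(throughput(4, 3))      # 1.333... operations per cycle
```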

Summary
• Importance and applications of HPC.
• Parallel processing approaches (Flynn's Taxonomy).
• Performance metrics for evaluating HPC systems.
