Computer Performance Measurement. Amdahl's Law

This document discusses measuring computer performance and Amdahl's law. It defines key terms like execution time, CPU time, clock cycles, and clock rate used to measure performance. Amdahl's law states that the maximum speedup from parallelizing a program is limited by the time needed for the sequential parts of the program. The more of a program that can be parallelized, the greater the potential speedup, but increasing processors yields diminishing returns as the sequential part remains.

Uploaded by

Naski Kuafni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views24 pages

Computer Performance Measurement. Amdahl's Law

Uploaded by

Naski Kuafni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Computer

performance
measurement.
Amdahl’s Law.
Outline
• Characterizing performance
• Performance and speed
• Basic terminology for measuring
• Performance and Execution Time
Characterizing Performance
• Not always obvious how to characterize
performance:
– motor cars
– football teams
– tennis players
– computers?
• Can lead to serious errors
– improve the processor to improve speed?
Performance and Speed
• Performance for a program on a particular
machine
1
– P e r fo r m a n c e ( X ) =
E x e c u tio n ( X )
P e r fo r m a n c e ( X ) E x e c u tio n (Y )
= = n
P e r fo r m a n c e (Y ) E x e c u tio n ( X )

– X is n times faster than Y

Measuring Time
• Execution time is the amount of time it takes
the program to execute in seconds.
• Time (computers do several tasks!)
– elapsed time based on a normal clock;
– CPU time is time spent executing this program
• excluding waiting for I/O, or other programs
Execution Time

Elapsed Time
(real time)

CPU time for this program I/O waiting

& other programs

Example (UNIX)
11.099u 8.659s 10:43.67 3.0%
(user) (system) (elapsed)
user system (seconds) (seconds) (min:secs)
(direct) (time in OS) CPU time = 3.0% of elapsed time
Measuring Amounts

• 1 bit
• 8 bits = 1 byte
• 1024bytes = 1 kilobyte = 1KByte = 1K = 210
• 1024KBytes = 1 megabyte = 1MB = 220
• 1024MB = 1 gigabyte = 1GB = 230
• 1024GB = 1 terabyte = 240
• and on to infinity
Intel or AMD ?
Measuring Times
• Duration
– 1 second
– 1/1000 second = 1 millisec = 1ms = 10-3 s
– 1/1,000,000 s = 1 microsec = 10-6 s
– 1/1,000,000,000s = 1 nanosec = 10-9 s
• Frequency
– 1 Hertz = 1 cycle per second
– 1 MHz = 1,000,000 cycles per sec
– 100MHz = 100,000,000 cycles per sec.
The type of processor a computer has not only affects
its overall performance, but it can also dictate what
type of software it uses.
Differences
32-bit and 64-bit

• number of calculations per second

• on the amount of RAM
– 32-bit computers maximum of 3-4GB
– 64-bit computer over 4 GB.
• The first smartphone with a 64-bit chip (Apple
A7) was the iPhone 5s.
Computer Clock Times
• Computers run according to a clock that runs
at a steady rate
• The time interval is called a clock cycle (eg,
10ns).
• The clock rate is the reciprocal of clock cycle -
a frequency, how many cycles per sec (eg,
100MHz).
– 10 ns = 1/100,000,000 (clock cycle), same as:-
– 1/10ns = 100,000,000 = 100MHz (clock rate).
Purchasing Decision
• Computer A has a 100MHz processor
• Computer B has a 300MHz processor
• So, B is faster, right?

NOPE!
• Now, let’s get it right…..
Measuring Performance

• The only important question: “HOW FAST

WILL MY PROGRAM RUN?”
• CPU execution time for a program
– = CPU clock cycles * cycle time
– (= CPU clock cycles / Clock rate)
• In computer design, trade-off between:
– clock cycle time, and
– number of cycles required for a program
Cycles Per Instruction
• The execution time of a program clearly must
depend on the number of instructions
– but different instructions take different times
• An expression that includes this is:-
– CPU clock cycles = N * CPI
• N = number of instructions
• CPI = average clock cycles per instruction
Example
• Machine A • Machine B
• clock cycle time • clock cycle time
– 10ns/cycle – 30ns/cycle
• CPI = 2.0 for prog X • CPI = 1.0 for prog X

Let I = number of instructions in the program.

CPU clock cycles (A) = I * 2.0 CPU clock cycles (B) = I * 1.0
CPU time (A) = CPU clock cycles * CPU time (B) = CPU clock cycles *
clock cycle time clock cycle time
= I * 2.0 * 10 = I * 1.0 * 30
= I * 20 ns = I * 30 ns

Execution(B) / Execution(A) = 30 / 20 = 1.5

Basic Performance Equation
• CPU Time = I * CPI * T
– I = number of instructions in program
– CPI = average cycles per instruction
– T = clock cycle time
• CPU Time = I * CPI / R
– R = 1/T the clock rate
• T or R are usually published as performance measures for a processor
• I requires special profiling software
• CPI depends on many factors (including memory).
Amdahl’s law
Amdahl’s Law
• Amdahl's law, named after computer architect
Gene Amdahl, is used to find the maximum
expected improvement to an overall system
when only part of the system is improved.
• Amdahl’s law can be interpreted more
technically, but in simplest terms it means that
it is the algorithm that decides the speedup
not the number of processors.
Amdahl’s Law
• A program (or algorithm) which can be
parallelized can be split up into two parts:
– A part which cannot be parallelized
– A part which can be parallelized
Introduction
• If F is the fraction of a program that is
sequential, and (1-F) is the fraction of program
or algorithm that can be parallelized, then the
maximum speed-up that can be achieved by
using P processors is:
Examples
• If 90% of a calculation can be parallelized (i.e.
10% is sequential) then the maximum speed-
up which can be achieved on 5 processors is
1/(0.1+(1-0.1)/5) or roughly 3.6 (i.e. the
program can theoretically run 3.6 times faster
on five processors than on one)
Examples
• If 90% of a calculation can be parallelized then the
maximum speed-up on 10 processors is 1/(0.1+(1-
0.1)/10) or 5.3 (i.e. investing twice as much
hardware speeds the calculation up by about
50%).
• If 90% of a calculation can be parallelized then the
maximum speed-up on 20 processors is 1/(0.1+(1-
0.1)/20) or 6.9 (i.e. doubling the hardware again
speeds up the calculation by only 30%).
Examples
• If 90% of a calculation can be parallelized then
the maximum speed-up on 1000 processors is
1/(0.1+(1-0.1)/1000) or 9.9 (i.e. throwing an
absurd amount of hardware at the calculation
results in a maximum theoretical (i.e. actual
results will be worse) speed-up of 9.9 vs a
single processor).

Cs23402 - Computer Architecture - Unit - 1
No ratings yet
Cs23402 - Computer Architecture - Unit - 1
161 pages
Lec 2
No ratings yet
Lec 2
31 pages
CS-3006 10 PerformanceAnalysis
No ratings yet
CS-3006 10 PerformanceAnalysis
52 pages
1aca L1
No ratings yet
1aca L1
35 pages
Chapter 1
No ratings yet
Chapter 1
51 pages
CS-3006 4 PerformanceAnalysis
No ratings yet
CS-3006 4 PerformanceAnalysis
62 pages
Lec 2
No ratings yet
Lec 2
31 pages
Quatitative Principle
No ratings yet
Quatitative Principle
56 pages
SEN307 Lecture 5
No ratings yet
SEN307 Lecture 5
34 pages
2 CPU Performance
No ratings yet
2 CPU Performance
35 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
2 Week
No ratings yet
2 Week
35 pages
Computer Architecture Unit1
No ratings yet
Computer Architecture Unit1
20 pages
Performance Measures For Computers
No ratings yet
Performance Measures For Computers
53 pages
Measuring Computer Performance
No ratings yet
Measuring Computer Performance
26 pages
Computer Architecture
No ratings yet
Computer Architecture
26 pages
2 - Computer Organization and Architecture
No ratings yet
2 - Computer Organization and Architecture
21 pages
TLE-CSS10 - 11 - q2 - wk4 - Install Operating System and Drivers For Peripherals Devices - v3
No ratings yet
TLE-CSS10 - 11 - q2 - wk4 - Install Operating System and Drivers For Peripherals Devices - v3
28 pages
B38DF LS2b Performance
No ratings yet
B38DF LS2b Performance
20 pages
Lec 3
No ratings yet
Lec 3
21 pages
Computer Performance
No ratings yet
Computer Performance
35 pages
Computer Architecture Measurement
No ratings yet
Computer Architecture Measurement
26 pages
Da Ci
No ratings yet
Da Ci
13 pages
Computer Performance
No ratings yet
Computer Performance
17 pages
Sam E70/s70/v70/v71
No ratings yet
Sam E70/s70/v70/v71
1,943 pages
Week 10 Part 02 - Processor Performance (Answers)
No ratings yet
Week 10 Part 02 - Processor Performance (Answers)
35 pages
Lecture # 2
No ratings yet
Lecture # 2
33 pages
Computer Performance
No ratings yet
Computer Performance
22 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
Week 2 - Lecture 2 - Performance Measurement
No ratings yet
Week 2 - Lecture 2 - Performance Measurement
25 pages
Ak 500 User Manual
No ratings yet
Ak 500 User Manual
27 pages
Module 3.3 - Problems On Performance
No ratings yet
Module 3.3 - Problems On Performance
54 pages
05 Performance
No ratings yet
05 Performance
16 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
17 pages
4 Performance
No ratings yet
4 Performance
27 pages
L5-L6-Performance Issues
No ratings yet
L5-L6-Performance Issues
47 pages
CA Lecture1
No ratings yet
CA Lecture1
9 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
Computer Architecture Unit 1 - Phase 2 PDF
No ratings yet
Computer Architecture Unit 1 - Phase 2 PDF
26 pages
Intro
No ratings yet
Intro
14 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
Introduction To Computer Organization
No ratings yet
Introduction To Computer Organization
66 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
Computer Architecture Unit 1
No ratings yet
Computer Architecture Unit 1
12 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
Lect 1
No ratings yet
Lect 1
56 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
Lect 1
No ratings yet
Lect 1
54 pages
Computer Architecture 2
No ratings yet
Computer Architecture 2
17 pages
Week 13 14 - Performance Evaluation
No ratings yet
Week 13 14 - Performance Evaluation
19 pages
Comp Org Notes On Measuring Cpu Performance
No ratings yet
Comp Org Notes On Measuring Cpu Performance
4 pages
Computer Architecture Measuring Performance
No ratings yet
Computer Architecture Measuring Performance
33 pages
Data Loading 810D 840D
100% (1)
Data Loading 810D 840D
12 pages
Lec 2 Performance
No ratings yet
Lec 2 Performance
28 pages
Error Log
No ratings yet
Error Log
302 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
13 pages
OSY Notes Vol 2 (6th Chapter) - Ur Engineering Friend
No ratings yet
OSY Notes Vol 2 (6th Chapter) - Ur Engineering Friend
23 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
Measuring Performance: Chris Clack B261 Systems Architecture
No ratings yet
Measuring Performance: Chris Clack B261 Systems Architecture
19 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
HRT-711 Usermanual Eng v1.08
No ratings yet
HRT-711 Usermanual Eng v1.08
126 pages
CSE Lec5
No ratings yet
CSE Lec5
65 pages
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
No ratings yet
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
47 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
18 pages
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
No ratings yet
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
31 pages
BHT904BB UsersManual E3
No ratings yet
BHT904BB UsersManual E3
218 pages
The System Unit: Motherboard
No ratings yet
The System Unit: Motherboard
13 pages
8089 Programming HSC 2025
No ratings yet
8089 Programming HSC 2025
19 pages
Physical Characteristics of Disks
No ratings yet
Physical Characteristics of Disks
28 pages
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
No ratings yet
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
28 pages
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
No ratings yet
Advanced Algorithms & Data Structures: Lecturer: Karimzhan Nurlan Berlibekuly
47 pages
Assignment
No ratings yet
Assignment
10 pages
Метод.указ. по ОИС (лаб 2) - eng
No ratings yet
Метод.указ. по ОИС (лаб 2) - eng
8 pages
Embedded System Question Paper
No ratings yet
Embedded System Question Paper
8 pages
Backup Memory Analogue Mega SG Settings: Consumer Info Available at
No ratings yet
Backup Memory Analogue Mega SG Settings: Consumer Info Available at
2 pages
HDD RAW Fix Partition !
No ratings yet
HDD RAW Fix Partition !
4 pages
Reseller
No ratings yet
Reseller
9 pages
Microprogramming PDF
No ratings yet
Microprogramming PDF
5 pages
Kazakhstan On The Way To Independence: The Phase of Development and Nation-Building Ideas
No ratings yet
Kazakhstan On The Way To Independence: The Phase of Development and Nation-Building Ideas
20 pages
Cultural Revolution in Soviet Union
No ratings yet
Cultural Revolution in Soviet Union
32 pages
Asterx-M3 Transition From
No ratings yet
Asterx-M3 Transition From
16 pages
Lecture #3 1 WW, Revolutions
No ratings yet
Lecture #3 1 WW, Revolutions
24 pages
Information Systems Modeling
No ratings yet
Information Systems Modeling
23 pages
Ewaste GR Dated 24.12.2014 PDF
No ratings yet
Ewaste GR Dated 24.12.2014 PDF
7 pages
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
No ratings yet
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
19 pages
Dxdiag - My PC Info
No ratings yet
Dxdiag - My PC Info
28 pages
Software Requirements Specification: Discipline "Fundamentals of Information Systems"
No ratings yet
Software Requirements Specification: Discipline "Fundamentals of Information Systems"
20 pages
Метод.указ. по ОИС (лаб1) -eng
No ratings yet
Метод.указ. по ОИС (лаб1) -eng
18 pages
Kazakhstan at The Beginning of The 20 Century (Baigabylova A)
No ratings yet
Kazakhstan at The Beginning of The 20 Century (Baigabylova A)
18 pages
Software Requirements Specification: Discipline "Fundamentals of Information Systems"
No ratings yet
Software Requirements Specification: Discipline "Fundamentals of Information Systems"
18 pages
Work Order Form: Maintenance
No ratings yet
Work Order Form: Maintenance
2 pages
Метод.указ. по ОИС (лаб3) - eng
No ratings yet
Метод.указ. по ОИС (лаб3) - eng
14 pages
mcp4725 Use Code
No ratings yet
mcp4725 Use Code
10 pages
Unit - Iv
No ratings yet
Unit - Iv
3 pages
Phrasal Verbs Section 4 Developed by B. Jolamanova Assignments
100% (1)
Phrasal Verbs Section 4 Developed by B. Jolamanova Assignments
2 pages
Satellite - L305D-S5974 Specs
No ratings yet
Satellite - L305D-S5974 Specs
3 pages
Sector Size Converter V1.0: User Manual
No ratings yet
Sector Size Converter V1.0: User Manual
7 pages
H6804102015 PDF
No ratings yet
H6804102015 PDF
2 pages
LCD Interfacing With ATMEGA2561: Sabina Batyrkhanovna
No ratings yet
LCD Interfacing With ATMEGA2561: Sabina Batyrkhanovna
5 pages
Guide of The Firmware Toolkit: Please Note
No ratings yet
Guide of The Firmware Toolkit: Please Note
3 pages
Prevent Overheating: Configurable TDP
No ratings yet
Prevent Overheating: Configurable TDP
4 pages
Laboratory Work 6 1. Individual Task!
No ratings yet
Laboratory Work 6 1. Individual Task!
2 pages
Using The Uploaded Data, Try To Answer The Following Questions
No ratings yet
Using The Uploaded Data, Try To Answer The Following Questions
1 page
Risks Type Probability Impact
No ratings yet
Risks Type Probability Impact
1 page
Final Version - End of Term Speaking
No ratings yet
Final Version - End of Term Speaking
1 page
Final Speaking Topics IELTS 2020
No ratings yet
Final Speaking Topics IELTS 2020
1 page
Phrasal Verbs Part 3
No ratings yet
Phrasal Verbs Part 3
1 page
Image For Dos
No ratings yet
Image For Dos
1 page

Computer Performance Measurement. Amdahl's Law

Uploaded by

Computer Performance Measurement. Amdahl's Law

Uploaded by

Computer

– X is n times faster than Y

CPU time for this program I/O waiting

• number of calculations per second

• The only important question: “HOW FAST

Let I = number of instructions in the program.

Execution(B) / Execution(A) = 30 / 20 = 1.5

You might also like