0% found this document useful (0 votes)

16 views

Measuring Computer Performance

Uploaded by

cse.20201016

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

Measuring Computer Performance

Uploaded by

cse.20201016

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Measuring Computer Performance

Performance
• Purchasing Perspective: given a collection of
machines (or upgrade options), which has the
• best performance ?
• least cost ?
• best performance / cost ?

• Computer Designer Perspective: faced with

design options, which has the
• best performance improvement ?
• least cost ?
• best performance / cost ?

• All require basis for comparison and metric for

evaluation
–Solid metrics lead to solid progress!
Two Notions of “Performance”
DC to Top Passen- Throughput
Plane
Paris Speed gers (pmph)
6.5 610
Boeing 747 470 286,700
hours mph

BAD/Sud 1350
3 hours 132 178,200
Concorde mph

•Which has higher performance?

•Time to deliver 1 passenger?
•Time to deliver 400 passengers?
•In a computer, time for 1 job called
Response Time or Execution Time
•In a computer, jobs per day or in unit-time called
Throughput or Bandwidth
Definitions
Performance is in units of things per sec
bigger is better
If we are primarily concerned with response time

performance(x) = 1
execution_time(x)

" F(ast) is n times faster than S(low) " means…

performance(F) execution_time(S)
n= =
performance(S) execution_time(F)
Example of Response Time v. Throughput
• Time of Concorde vs. Boeing 747?
• Concord is 6.5 hours / 3 hours = 2.2 times faster

• Throughput of Boeing vs. Concorde?

• Boeing 747: 286,700 pmph / 178,200 pmph = 1.6
times faster

• Boeing is 1.6 times (“60%”) faster in terms of throughput

• Concord is 2.2 times (“120%”) faster in terms of flying time

(response time)

We will focus primarily on execution time for a single job

Confusing Wording on Performance
• Will (try to) stick to “n times faster”; its less
confusing than “m % faster”

• As faster means both increased performance

and decreased execution time, to reduce
confusion we will (and you should) use
“improve performance” or
“improve execution time”
What is Time?
• Straightforward definition of time:
–Total time to complete a task, including disk accesses,
memory accesses, I/O activities, operating system overhead,
...
–“real time”, “response time” or “elapsed time”

• Alternative: just time processor (CPU)

is working only on your program (since multiple processes
running at same time)
–“CPU execution time” or “CPU time”
–Often divided into system CPU time (in OS) and user CPU
time (in user program)
How to Measure Time?

• User Time Þ seconds

• CPU Time: Computers constructed using a clock that runs at a

constant rate and determines when events take place in the
hardware
–These discrete time intervals called
clock cycles (or informally clocks or cycles)

–Length of clock period: clock cycle time

(e.g., 2 nanoseconds or 2 ns) and clock rate (e.g., 500
megahertz, or 500 MHz), which is the inverse of the clock
period; use these!
Measuring Time using Clock Cycles

CPU execution time for a progra

= Clock Cycles for a program x Clock Cycle Time

• Or = Clock Cycles for a program

Clock Rate
Measuring Time using Clock Cycles
• One way to define clock cycles:
Clock Cycles for program
= Instructions for a program (called “Instruction Count”)
x Average Clock cycles Per Instruction (abbreviated “CPI”)

• CPI is one way to compare two machines with same

instruction set, since Instruction Count would be the same
Performance Calculation
• CPU execution time for program
= Clock Cycles for program x Clock Cycle Time

• Substituting for clock cycles:

CPU execution time for program
= (Instruction Count x CPI)
x Clock Cycle Time
= Instruction Count x CPI x Clock Cycle Time
Performance Calculation (2/2)
CPU time = Instructions x Cycles x Seconds
Program Instruction Cycle
CPU time = Instructions x Cycles x Seconds
Program Instruction Cycle
CPU time = Instructions x Cycles x Seconds
Program Instruction Cycle
CPU time = Seconds
Program
• Product of all 3 terms: if missing a term, can’t
predict time, the real measure of performance
How to Calculate the 3 Components?
• Clock Cycle Time: in specification of computer (Clock Rate in
advertisements)
• Instruction Count:
–Count instructions in loop of small program
–Use simulator to count instructions
–Hardware counter in spec. register
• (Pentium II,III,4)

CPI:
Calculate: Execution Time / Clock cycle time
Instruction Count
Hardware counter in special register (PII,III,4)
CPU Performance
Calculating CPI Another Way
• First calculate CPI for each individual instruction
(add, sub, and, etc.)
• Next calculate frequency of each individual
instruction
• Finally multiply these two for each instruction
and add them up to get final CPI (the weighted
sum)
n Ij
CPI = å CPI j ´ Fj where Fj =
j =1 Instruction Count
Example (RISC processor)
Op Freqi CPIi Prod (% Time)
ALU 50% 1 .5 (23%)
Load 20% 5 1.0 (45%)
Store 10% 3 .3 (14%)
Branch 20% 2 .4 (18%)
2.2 (Where time spent)
Instruction Mix

How much faster would the machine be if a better data cache reduced the
average load time to 2 cycles?
Load à 20% x 2 cycles = .4
Total CPI 2.2 à 1.6
Relative performance is 2.2 / 1.6 = 1.38

How does this compare with reducing the branch instruction to 1 cycle?
Branch à 20% x 1 cycle = .2
Total CPI 2.2 à 2.0
Relative performance is 2.2 / 2.0 = 1.1
Amdahl's “Law”
• Speedup due to enhancement E:
• ExTime w/o E Performance w/ E
• Speedup(E) = -------------------- = ---------------------
• ExTime w/ E Performance w/o E

• Suppose that enhancement E accelerates a fraction F of the task

• by a factor S and the remainder of the task is unaffected then,
Performance improvement
• ExTime(with E) = ((1-F) + F/S) X ExTime(without E) is limited by how much the
improved feature is used à
Invest resources where
• Speedup(with E) = ExTime(without E) ÷ time is spent.
((1-F) + F/S) X ExTime(without E)
Example of Amdahl’s Law
• Floating point instructions are improved to run twice as fast,
but only 10% of the time was spent on these instructions
originally. How much faster is the new machine?
executionTimeold 1
Speedup = =
executionTimenew (1 - fraction fractionenhanced
enhanced ) +
Speedupenhanced
• The new machine is 1.053 times as fast, or 5.3% faster.
1 1.00
Speedup = = » 1.053 times faster
0.1 0.95
(1 - 0.1) +
2
• How much faster would the new machine be if floating point
instructions become 100 times faster?
Estimating Performance Improvements
• A state-of-the art processor currently requires 10 seconds to
execute a program.
• Processor performance improves by 50 percent per year.
• Assuming only processor performance is at issue, by what factor
does processor performance improve in 5 years?
newPerf = (1 + increase/year)numYears = (1+0.5)5 = 7.6

• How long will it take a processor to execute the program after 5

years?
executionTimenew = 10/7.59 = 1.32 seconds

• How many year will it take until the program can be executed in
1 second.
Performance Example

• Computers M1 and M2 are two implementations of the same

instruction set.
– M1 has a clock rate of 1600 MHz and M2 has a clock rate of 2.4 GHz.
– M1 has a CPI of 2.8 and M2 has a CPI of 3.2 for a given program.

• How many times faster is M2 than M1 for this program?

numInstr(M1) ´ CPI(M1) 2.8 cyc
executionTime(M1) clockRate(M1) 1600 ´106 cyc/sec
= = » 1.3
executionTime(M2) numInstr(M2) ´ CPI(M2) 3.2 cyc
clockRate(M2) 2.4 ´109 cyc/sec

• What would the clock rate of M1 have to be for them to have the
same execution time?
Marketing Metrics
• MIPS = Instruction Count /(Time * 10^6)
= Clock Rate / (CPI * 10^6)
– machines with different instruction sets ?
– programs with different instruction mixes ?
– dynamic frequency of instructions
– uncorrelated with performance

• MFLOP/s = FP Operations / (Time * 10^6)

– machine dependent
– often not where time is spent
Benchmarks

• Ideally run typical programs with typical

input before purchase, or before even build
machine
–Called a “workload”; For example:
–Engineer uses compiler, spreadsheet
–Author uses word processor, drawing program,
compression software

• In some situations it’s hard to do

–Don’t have access to machine to “benchmark”
before purchase
–Don’t know workload in future
Benchmarks
• Obviously, apparent speed of processor depends
on code used to test it

• Need industry standards so that different

processors can be fairly compared

• Companies exist that create these benchmarks:

“typical” code used to evaluate systems

• Need to be changed every 2 or 3 years since

designers could (and do!) target for these standard
benchmarks
Example Standardized Benchmarks
• Standard Performance Evaluation Corporation
(SPEC) SPEC CPU2000
–CINT2000 12 integer (gzip, gcc, crafty, perl, ...)
–CFP2000 14 floating-point (swim, mesa, art, ...)
–All relative to base machine
Sun 300MHz 256Mb-RAM Ultra5_10, which gets
score of 100
– www.spec.org/osg/cpu2000/
–They measure
• System speed (SPECint2000)
• System throughput (SPECint_rate2000)
Example Standardized Benchmarks
• SPEC
–Benchmarks distributed in source code
–Members of consortium select workload
• 30+ companies, 40+ universities
–Compiler, machine designers target benchmarks, so
try to change every 3 years
–The last benchmark released was SPEC 2006
“And in conclusion…”

• Good benchmarks, such as the SPEC benchmarks,

can provide an accurate basis for evaluating and
comparing computer performance.
• MIPS and MFLOPS are easy to use, but inaccurate
indicators of performance.
• Amdahl’s law provides an efficient method for
determining speedup due to an enhancement.
• Make the common case fast!

Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
80% (5)
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
118 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
Chapter 1 Lecture 2 & 3 - Computer Performance
No ratings yet
Chapter 1 Lecture 2 & 3 - Computer Performance
37 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
4 Performance
No ratings yet
4 Performance
27 pages
Module 3.3 - Problems On Performance
No ratings yet
Module 3.3 - Problems On Performance
54 pages
Chapter 1 Lecture 2 & 3 - Performance
No ratings yet
Chapter 1 Lecture 2 & 3 - Performance
36 pages
Lec10 Performance
No ratings yet
Lec10 Performance
22 pages
Computer Performance
No ratings yet
Computer Performance
27 pages
COMP 303 Computer Architecture
No ratings yet
COMP 303 Computer Architecture
34 pages
Performance of Processor1
No ratings yet
Performance of Processor1
9 pages
C A Lecture-3
No ratings yet
C A Lecture-3
41 pages
Performance
No ratings yet
Performance
12 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
Performance Matrices
No ratings yet
Performance Matrices
14 pages
Performance
No ratings yet
Performance
51 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
Computer Architecture Measuring Performance
No ratings yet
Computer Architecture Measuring Performance
33 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
IT401 Computer Organization and Architecture: Prasun Ghosal
No ratings yet
IT401 Computer Organization and Architecture: Prasun Ghosal
30 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
ComputerOrganization Chapter4 Performance Color
No ratings yet
ComputerOrganization Chapter4 Performance Color
37 pages
02 Performance
No ratings yet
02 Performance
23 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
Lecture4 Performance Evaluation 2011
No ratings yet
Lecture4 Performance Evaluation 2011
34 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
28 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
Computer Architecture Unit 1 - Phase 2 PDF
No ratings yet
Computer Architecture Unit 1 - Phase 2 PDF
26 pages
CSE 332 L4 - 14 Nov 2020
No ratings yet
CSE 332 L4 - 14 Nov 2020
41 pages
Week 10 Part 02 - Processor Performance (Answers)
No ratings yet
Week 10 Part 02 - Processor Performance (Answers)
35 pages
Computer Performance
No ratings yet
Computer Performance
18 pages
2 CPU Performance
No ratings yet
2 CPU Performance
35 pages
Lecture 3
No ratings yet
Lecture 3
19 pages
Lecture: Metrics To Evaluate Performance
No ratings yet
Lecture: Metrics To Evaluate Performance
15 pages
Week 13 14 - Performance Evaluation
No ratings yet
Week 13 14 - Performance Evaluation
19 pages
Lecture 3
No ratings yet
Lecture 3
21 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
13 pages
DA_CI
No ratings yet
DA_CI
13 pages
Performance Chap4
No ratings yet
Performance Chap4
20 pages
Chapter 8 - CPU Performance
No ratings yet
Chapter 8 - CPU Performance
40 pages
Quatitative Principle
No ratings yet
Quatitative Principle
56 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
Lecture 2: Performance/Power, MIPS Instructions
No ratings yet
Lecture 2: Performance/Power, MIPS Instructions
28 pages
A Constant Clock Rate:: - Most Computers Run Synchronously Utilizing A CPU Clock Running at
No ratings yet
A Constant Clock Rate:: - Most Computers Run Synchronously Utilizing A CPU Clock Running at
45 pages
Lecture 02 CH01 Performance Power
No ratings yet
Lecture 02 CH01 Performance Power
76 pages
10
No ratings yet
10
76 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
Lesson 3 - Computing For Performance
No ratings yet
Lesson 3 - Computing For Performance
38 pages
Computer Performance
No ratings yet
Computer Performance
17 pages
02 Performance
No ratings yet
02 Performance
13 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
18 pages
Lecture 3: Performance/Power, MIPS Instructions
No ratings yet
Lecture 3: Performance/Power, MIPS Instructions
18 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
Computer Performance Measurement. Amdahl's Law
No ratings yet
Computer Performance Measurement. Amdahl's Law
24 pages
550 12 6 2011 PDF
No ratings yet
550 12 6 2011 PDF
45 pages
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Memory 2
No ratings yet
Memory 2
31 pages
Memory Hierarchy 1
No ratings yet
Memory Hierarchy 1
44 pages
Gmail - Associate Engineer-Trainee hiring– Entry-Level Opportunity at Innofied Solutions
No ratings yet
Gmail - Associate Engineer-Trainee hiring– Entry-Level Opportunity at Innofied Solutions
2 pages
Tlp
No ratings yet
Tlp
19 pages
Master 2 Exam Keys
No ratings yet
Master 2 Exam Keys
4 pages
Stardom Fcn-Rtu: Low Power Autonomous Controller
No ratings yet
Stardom Fcn-Rtu: Low Power Autonomous Controller
1 page
Allen Bradley 1756-CNB Control Net Bridge Module - Burn Only (103 Pages)
No ratings yet
Allen Bradley 1756-CNB Control Net Bridge Module - Burn Only (103 Pages)
203 pages
Multimeter TS-297/U TM 11-5500
100% (1)
Multimeter TS-297/U TM 11-5500
85 pages
User Manual Version 2.4: A Software Tool For The Graphing and Analysis of Large Complex Pedigree
No ratings yet
User Manual Version 2.4: A Software Tool For The Graphing and Analysis of Large Complex Pedigree
24 pages
C: Client-Independent: Abap Certification Questions
No ratings yet
C: Client-Independent: Abap Certification Questions
23 pages
devops ppt
No ratings yet
devops ppt
7 pages
IP Project
No ratings yet
IP Project
13 pages
NTC K.S - Laptops, Price-List, Models & Specs.
No ratings yet
NTC K.S - Laptops, Price-List, Models & Specs.
11 pages
E-Health and Nursing: Using Smartphones To Enhance Nursing Practice
No ratings yet
E-Health and Nursing: Using Smartphones To Enhance Nursing Practice
8 pages
Powerscan Pm9500 Models and Kits
No ratings yet
Powerscan Pm9500 Models and Kits
6 pages
Information Security Policy Final Goerge Washington Unversity
No ratings yet
Information Security Policy Final Goerge Washington Unversity
7 pages
HPE ProLiant DL380 Gen10 Plus Server
No ratings yet
HPE ProLiant DL380 Gen10 Plus Server
80 pages
CBS Computer Application
No ratings yet
CBS Computer Application
5 pages
Dbms MCQ Questions With Answers
No ratings yet
Dbms MCQ Questions With Answers
5 pages
One Shot Learning
No ratings yet
One Shot Learning
1 page
Zapanta MachineProblem#6
No ratings yet
Zapanta MachineProblem#6
12 pages
Brochure AlgoSec ASMS Foundations Training
No ratings yet
Brochure AlgoSec ASMS Foundations Training
2 pages
Data Analysis Assignment Help
No ratings yet
Data Analysis Assignment Help
3 pages
Lastexception 63724413576
No ratings yet
Lastexception 63724413576
158 pages
Data Communication and Networking
No ratings yet
Data Communication and Networking
15 pages
Sub Netting Tip Sheet
No ratings yet
Sub Netting Tip Sheet
1 page
Database Design and Development
No ratings yet
Database Design and Development
2 pages
Ijrcm 2 IJRCM 2 - Vol 3 - 2013 - Issue 4 - April
No ratings yet
Ijrcm 2 IJRCM 2 - Vol 3 - 2013 - Issue 4 - April
157 pages
Uses of Hexadecimal
No ratings yet
Uses of Hexadecimal
10 pages
4700L Service Manual
No ratings yet
4700L Service Manual
1,024 pages
DSSD Computer Education
No ratings yet
DSSD Computer Education
12 pages
Azure + Dynamics 365 + Online Services - IsO 22301 Recertification Assessment Report (4.24.2023)
No ratings yet
Azure + Dynamics 365 + Online Services - IsO 22301 Recertification Assessment Report (4.24.2023)
33 pages
devwin32
No ratings yet
devwin32
2,467 pages
ITG Company Profile 2022
No ratings yet
ITG Company Profile 2022
24 pages