0% found this document useful (0 votes)

53 views52 pages

L-2 (Computer Performance)

This document summarizes a lecture on computer performance. It discusses that performance is key to understanding hardware organization and factors like clock speed, instructions per cycle, and number of instructions affect overall execution time. Common metrics for measuring performance are response time and throughput. The document also covers concepts like clock cycles, instructions per cycle, factors that influence performance like pipeline stalls, and benchmarks for comparing systems.

Uploaded by

Imran Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views52 pages

L-2 (Computer Performance)

Uploaded by

Imran Khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 52

CSE 332

Computer Organization and Architecture

Lecture 2: Computer Performance

Tanzilur Rahman (Tnr)

North South University
Performance

• Performance is the key to understanding underlying motivation

for the hardware and its organization

• Why is some hardware better than others for different

programs?

• What factors of system performance are hardware related?

(e.g., do we need a new machine, or a new operating system?)
Performance

3
What do we measure?
Define performance….

• How much faster is the Concorde compared to the 747?

• How much bigger is the Boeing 747 than the Douglas DC-8?
Defining Performance

5
Computer Performance: TIME, TIME,
TIME!!!
• Response Time (elapsed time, latency):
• How long does it take to complete (start to finish) a task? Individual user
• Eg: how long must I wait for the database query? concerns…

Individual is more interested in response time. As a user of a smart phone/laptop,

the one that responds faster is better!

Response time (computer ): the total time required by computer to complete a

task including :
Disk access Memory access I/O activities OS overheads CPU exec. time etc
Computer Performance: TIME, TIME,
TIME!!!
• Throughput:
• Total work done per unit time……(per hr,day etc)
• how many jobs can the machine run at once? Systems manager
concerns…
• what is the average execution rate?
• how much work is getting done?
Response Time and Throughput

• If we upgrade a machine with a new processor what do we increase?

8
Relative Performance

9
Relative Performance

10
Execution Time
• Elapsed Time
• counts everything (disk and memory accesses, waiting for I/O, running other
programs, etc.) from start to finish
• a useful number, but often not good for comparison purposes
Elapsed time = CPU time + wait time (I/O, other programs, etc.)

• CPU time
• doesn't count waiting for I/O or time spent running other programs
• can be divided into user CPU time and system CPU time (OS calls)
CPU time = user CPU time + system CPU time

• Our focus:
• user CPU time (CPU execution time or, simply, execution time)
• time spent executing the lines of code that are in our program
• For easier writing, user CPU time has been termed simply as CPU time in rest of
the studies.
Execution Time
Summary of Execution Time
CPU Clocking

14
CPU Clocking

Clock

Processor
Transistor
s

15
CPU Time

Performance Equation - I

16
Example

• Our favorite program runs in 10 seconds on computer A, which has a

2Ghz. clock.
• We are trying to help a computer designer build a new machine B, that
will run this program in 6 seconds. The designer can use new (or
perhaps more expensive) technology to substantially increase the clock
rate, but has informed us that this increase will affect the rest of the
CPU design, causing machine B to require 1.2 times as many clock cycles
as machine A for the same program.

• What clock rate should we tell the designer to target?

CPU Time Example

18
CPU Time Example

19
CPU Time Example

20
No. of Clock Cycles

21
Instruction Count and CPI

Performance Equation - II 22
23
24
25
26
Factors Influencing Performance

27
Factors Influencing Performance

28
Factors Influencing Performance

29
Factors Influencing Performance

More to follow in ALU and Pipeline chapter

30
CPU Time Example

31
CPU Time Example

32
CPU Time Example

33
Self Help

• Suppose we have two implementations of the same instruction set

architecture (ISA). For some program:
• machine A has a clock cycle time of 10 ns. and a CPI of 2.0
• machine B has a clock cycle time of 20 ns. and a CPI of 1.2

• Which machine is faster for this program, and by how much?

• If two machines have the same ISA, which of our quantities (e.g., clock
rate, CPI, execution time, # of instructions, MIPS) will always be
identical?
CPI Example
• A compiler designer is trying to decide between two code sequences for
a particular machine.
• Based on the hardware implementation, there are three different
classes of instructions: Class A, Class B, and Class C,

• Which code sequence has the most instructions? Which sequence will be
faster? How much? What is the CPI for each sequence?
For different class of Instructions
CPI Example
Which code sequence has the most instructions?

Which sequence will be faster?

What is the CPI for each sequence

Self Help

• Two different compilers are being tested for a 500 MHz. machine with
three different classes of instructions: Class A, Class B, and Class C,
which require 1, 2 and 3 cycles (respectively). Both compilers are used
to produce code for a large piece of software.
• Compiler 1 generates code with 5 billion Class A instructions, 1 billion
Class B instructions, and 1 billion Class C instructions.
• Compiler 2 generates code with 10 billion Class A instructions, 1 billion
Class B instructions, and 1 billion Class C instructions.

• Which sequence will be faster according to MIPS?

• Which sequence will be faster according to execution time?
Example
Benchmarks

• Performance best determined by running a real application

• use programs typical of expected workload
• or, typical of expected class of applications
e.g., compilers/editors, scientific applications, graphics, etc.

• Benchmark suites
• Each vendor announces a SPEC rating for their system
• a measure of execution time for a fixed collection of programs
• is a function of a specific CPU, memory system, IO system, operating
system, compiler enables easy comparison of different systems

• The key is coming up with a collection of relevant programs

SPEC (System Performance Evaluation
Corporation)
• Sponsored by industry but independent and self-managed – trusted by
code developers and machine vendors
• Clear guides for testing, see www.spec.org
• Regular updates (benchmarks are dropped and new ones added
periodically according to relevance)
• Specialized benchmarks for particular classes of applications
SPEC CPU

• The 2006 version includes 12 integer and 17 floating-point applications

• The SPEC rating specifies how much faster a system is, compared to
a baseline machine – a system with SPEC rating 600 is 1.5 times
faster than a system with SPEC rating 400

• Note that this rating incorporates the behavior of all 29 programs – this
may not necessarily predict performance for your favorite program!

42
SPEC CPU

43
SPEC CPU

44
Summary

• Performance is specific to a particular program

• total execution time is a consistent summary of performance

• For a given architecture performance increases come from:

• increases in clock rate (without adverse CPI affects)
• improvements in processor organization that lower CPI
• compiler enhancements that lower CPI and/or instruction count
Important Trends

• Running out of ideas to improve single thread performance

• Power wall makes it harder to add complex features

• Power wall makes it harder to increase frequency

46
47
Power Wall
Power Wall

The energy of a pulse during the logic transition of 0 → 1 → 0 or 1 → 0 → 1

Energy ∝ Capacitive load X Voltage2
The energy of a single transition
Energy ∝ ½ X Capacitive load X Voltage2
The power required per transistor
Power ∝ ½ X Capacitive load XVoltage2 X Frequency switched 49
Power Wall
Power Wall

The energy of a pulse during the logic transition of 0 → 1 → 0 or 1 → 0 → 1

51
Power Wall

Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
80% (5)
Computer Organization & Design The Hardware/Software Interface, 2nd Edition Patterson & Hennessy
118 pages
CSE 332 L4 - 14 Nov 2020
No ratings yet
CSE 332 L4 - 14 Nov 2020
41 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
Computer Performance
No ratings yet
Computer Performance
22 pages
Module 2 (26-10-2024)
No ratings yet
Module 2 (26-10-2024)
50 pages
Lec10 Performance
No ratings yet
Lec10 Performance
22 pages
Cs23402 - Computer Architecture - Unit - 1
No ratings yet
Cs23402 - Computer Architecture - Unit - 1
161 pages
Performance
No ratings yet
Performance
51 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
Da Ci
No ratings yet
Da Ci
13 pages
Lecture 02 CH01 Performance Power
No ratings yet
Lecture 02 CH01 Performance Power
76 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
28 pages
Performance Measures For Computers
No ratings yet
Performance Measures For Computers
53 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
Chapter 01 RISC V
No ratings yet
Chapter 01 RISC V
30 pages
C A Lecture-3
No ratings yet
C A Lecture-3
41 pages
Lecture # 2
No ratings yet
Lecture # 2
33 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
17 pages
02 Performance
No ratings yet
02 Performance
23 pages
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
No ratings yet
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
52 pages
Inroduction and Performance Analysis
No ratings yet
Inroduction and Performance Analysis
29 pages
Computer Performance
No ratings yet
Computer Performance
18 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
Lecture4 Performance Evaluation
No ratings yet
Lecture4 Performance Evaluation
34 pages
Week 2 - Lecture 2 - Performance Measurement
No ratings yet
Week 2 - Lecture 2 - Performance Measurement
25 pages
CS5204/EE5364 - Advanced Computer Architecture - Performance
No ratings yet
CS5204/EE5364 - Advanced Computer Architecture - Performance
56 pages
Lesson 3 - Computing For Performance
No ratings yet
Lesson 3 - Computing For Performance
38 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
ACA Lec2 New
No ratings yet
ACA Lec2 New
44 pages
4 Performance
No ratings yet
4 Performance
67 pages
Computer Architecture Measurement
No ratings yet
Computer Architecture Measurement
26 pages
Chapter4 Performance
No ratings yet
Chapter4 Performance
36 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
2 RISC V Performance ISA
No ratings yet
2 RISC V Performance ISA
72 pages
CS3350B Computer Architecture CPU Performance and Profiling: Marc Moreno Maza
No ratings yet
CS3350B Computer Architecture CPU Performance and Profiling: Marc Moreno Maza
28 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
Lecture - 4 - Performance
No ratings yet
Lecture - 4 - Performance
31 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
Lecture Ch4 Performance
No ratings yet
Lecture Ch4 Performance
25 pages
DHXD - Chuong 8. Performance
No ratings yet
DHXD - Chuong 8. Performance
27 pages
Week2 Performance
No ratings yet
Week2 Performance
15 pages
CMP2008 L1
No ratings yet
CMP2008 L1
47 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
Computer Architecture 2
No ratings yet
Computer Architecture 2
17 pages
Unit 1
No ratings yet
Unit 1
68 pages
Lecture4 Performance Evaluation 2011
No ratings yet
Lecture4 Performance Evaluation 2011
34 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
Lecture 2: Performance/Power, MIPS Instructions
No ratings yet
Lecture 2: Performance/Power, MIPS Instructions
28 pages
CAO Fall 2024 Lecture 06 Design Metrics Performance Evaluation
No ratings yet
CAO Fall 2024 Lecture 06 Design Metrics Performance Evaluation
41 pages
CCE 131 Lecture1
No ratings yet
CCE 131 Lecture1
26 pages
Performance Measures
No ratings yet
Performance Measures
25 pages
Performance Numericals
No ratings yet
Performance Numericals
24 pages
09 Perf
No ratings yet
09 Perf
22 pages
Co Unit1 Part3
No ratings yet
Co Unit1 Part3
11 pages
05 Performance
No ratings yet
05 Performance
16 pages
Foundation Course for Advanced Computer Studies
From Everand
Foundation Course for Advanced Computer Studies
Franck Ismael Djédjé
No ratings yet
PHY107 Lecture Notes - Lecture #1, #2, #3, and #4
No ratings yet
PHY107 Lecture Notes - Lecture #1, #2, #3, and #4
28 pages
PHY107 Lecture Notes - Lecture #5 and #6
No ratings yet
PHY107 Lecture Notes - Lecture #5 and #6
19 pages
L98-Virtual Memory
No ratings yet
L98-Virtual Memory
41 pages
ER Diagram Tutorial Complete Guide To Entity Relationship Diagrams
No ratings yet
ER Diagram Tutorial Complete Guide To Entity Relationship Diagrams
11 pages
L - (Instruction Set Architecture)
No ratings yet
L - (Instruction Set Architecture)
37 pages
L5 More SQL
No ratings yet
L5 More SQL
55 pages
Intro Computer
No ratings yet
Intro Computer
21 pages

L-2 (Computer Performance)

Uploaded by

L-2 (Computer Performance)

Uploaded by

CSE 332

Computer Organization and Architecture

Lecture 2: Computer Performance

Tanzilur Rahman (Tnr)

• Performance is the key to understanding underlying motivation

• Why is some hardware better than others for different

• What factors of system performance are hardware related?

• How much faster is the Concorde compared to the 747?

Individual is more interested in response time. As a user of a smart phone/laptop,

Response time (computer ): the total time required by computer to complete a

• If we upgrade a machine with a new processor what do we increase?

• Our favorite program runs in 10 seconds on computer A, which has a

• What clock rate should we tell the designer to target?

More to follow in ALU and Pipeline chapter

• Suppose we have two implementations of the same instruction set

• Which machine is faster for this program, and by how much?

Which sequence will be faster?

What is the CPI for each sequence

• Which sequence will be faster according to MIPS?

• Performance best determined by running a real application

• The key is coming up with a collection of relevant programs

• The 2006 version includes 12 integer and 17 floating-point applications

• Performance is specific to a particular program

• For a given architecture performance increases come from:

• Running out of ideas to improve single thread performance

• Power wall makes it harder to add complex features

• Power wall makes it harder to increase frequency

The energy of a pulse during the logic transition of 0 → 1 → 0 or 1 → 0 → 1

The energy of a pulse during the logic transition of 0 → 1 → 0 or 1 → 0 → 1

You might also like