0% found this document useful (0 votes)

37 views41 pages

C A Lecture-3

Compuer Architecture Course Lecture -1

Uploaded by

Md. Nasimul Islam Nihal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views41 pages

C A Lecture-3

Compuer Architecture Course Lecture -1

Uploaded by

Md. Nasimul Islam Nihal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 41

Computer Architecture

Lecture-03

Shahanaz Islam Shaown

Lecturer, CSE, UU

1
Chapter - 2

• Discusses how to measure, report,

and summarize performance

• Describe the major factors that

determine the performance of a
computer.

2
Why examining performance is
important?
• Hardware performance is often key to the
effectiveness of an entire system.

3
Why assessing the performance is
challenging?
• The scale and intricacy of modern software
systems, together with the wide range of
performance improvement techniques employed
by hardware designers have made performance
assessment much more difficult.

• For different types of applications, different

performance metrics may be appropriate and
different aspects of a computer system may be
the most significant in determining overall
performance.
4
Measures

• Response Time / Execution Time :

the time between the start and
completion of a task.

• Throughput : the total amount of

work done in a given time.

5
Throughput and Response Time

• Do the following changes to a computer

system to increase throughput, decrease
response time, or both

– Replacing the processor in a computer with a

faster version – both.

– Adding additional processors to a system that

uses processors for separate tasks –
throughput (also response time).

6
Continuation
1
PerformanceX =
Execution timeX
Performance of X is greater than the performance
of Y

PerformanceX > PerformanceY

1 1
>
Execution timeX Execution timeY
Execution timey > Execution timex

X is faster than Y
7
Continuation

• X is n times faster than Y, it means,

PerformanceX
= n
PerformanceY

PerformanceX Execution time

= =n
y
PerformanceY Execution time x

8
Example
If machine A runs a program in 10 seconds
and machine B runs the same program in 15
seconds, how faster is A than B?

9
Solution

– A is n times faster than B if

PerformanceA
=n
PerformanceB
Execution timeB
=n
Execution timeA
15
= 1.5

10 than
– A is 1.5 times faster
B
10
Continuation

• We could also say that – Machine B is 1.5 times

slower than machine A. since

PerformanceA
=n
PerformanceB

PerformanceA
PerformanceB =
n

11
Measuring Performance

• Time is the measure of computer

performance.
• Program execution time is measured in
seconds per program.
• Wall-clock time / response time /
elapsed time / execution time – total
time to complete a task, including - disk
accesses, memory access, I/O activity,
OS overhead. 12
CPU execution time / CPU time

• is the time the CPU spends computing

for a task and does not include time
spent waiting for I/O or running other
programs.

CPU execution time / CPU time < Response time

13
Continuation
User CPU time
CPU time
System CPU time
• User CPU time – the CPU time spent in
the program

• System CPU time – the CPU time spent

in the OS performing tasks on behalf of
the program
14
Continuation

Execution Time
CPU time

For I/O User CPU System

and Others time CPU time

15
Continuation

• Example:
• Unix time command –
• 90.7u 12.9s 2:39 65%

User CPU time System CPU time Elapsed time

(90.7 seconds) (12.9 seconds) 2*60 + 39 =
(159 seconds)

90.7 + 12.9
= 0.65
16
159
Continuation

• System Performance – considering

elapsed time on an unloaded system

• CPU Performance – considering user

√
CPU time.

17
Continuation
• Clock cycle – Almost all computers are
constructed using a clock that
determines when events take place. These
discrete time intervals are called clock cycles
(ticks / clock ticks / clock periods / clocks /
cycles).

• Clock rate – Inverse of clock period.

18
Relating the Metrics

CPU execution time CPU clock cycle Clock cycle

= ×
for a program for a program time

CPU execution time CPU clock cycle for a program

=
for a program Clock rate

Hardware designer can improve performance

by reducing either the length of the clock
cycle or the number of clock cycles required for
a program.
19
Improving Performance
Our favorite program runs in 10 seconds on
computer A, which has a 400 MHz clock. We are
trying to help a computer designer build a machine
B, that will run this program in 6 seconds. The
designer has determined that a substantial increase
in the clock rate is possible, but this increase will
affect the rest of the CPU design, causing machine
B to require 1.2 times as many clock cycles as
machine A for this program. What clock rate
should we tell the designer to target?

20
Improving Performance (Cont.)
CPU clock cycleA
CPU timeA =
Clock rateA
CPU clock cycleA
10 Seconds =
400 × 106 cycles/sec
CPU clock cycleA = 10 seconds × 400 × 106 cycles/sec
= 4000 × 106 cycles
CPU clock cycleB
CPU timeB =
Clock rateB
1.2 × CPU clock cycleA
CPU timeB =
Clock rateB
21
Improving Performance (Cont.)
1.2 × 4000 × 106 cycles
6 seconds =
Clock rateB
1.2 × 4000 × 106 cycles
Clock rateB =
6 seconds
= 800 MHz

Machine B must therefore have twice the

clock rate of A to run the program in 6
seconds.

22
Example
Our favourite program runs in 10 seconds on computer A,
which has a 2 GHz clock. We are trying to help a computer
designer build a computer, B, which will run this program in 6
seconds. The designer has determined that a substantial
increase in the clock rate is possible, but this increase will affect
the rest of the CPU design, causing computer B to require 1.2
times as many clock cycles as computer A for this program.
What clock rate should we tell the designer to target?

23
24
Hardware Software Interface

• Since Machine had to execute the

instructions to run the program, the
execution time mustdepend on the
number of instructions in a program.
Average
CPU clock cycles Instructions
= × clock cycles
(for a program) for a
program per
instruction
CPI
25
Using the Performance Equation

• Suppose, we have two implementations of the

same instruction set architecture. Machine A has
a clock cycle time of 1 ns and a CPI of 2.0 for
some program, and machine B has a clock
cycle time of 2 ns and a CPI of 1.2 for the
same program. Which machine is faster for this
program, and by how much?

26
Continuation
Let the number of instructions of the program be I
CPU clock cyclesA = I × 2.0
CPU clock cyclesB = I × 1.2
CPU timeA = CPU clock cyclesA × Clock cycle timeA
= I × 2.0 × 1 ns = 2I ns
CPU timeB = I × 1.2 × 2 ns = 2.4I ns

CPU performanceA Execution timeB 2.4I ns

= × = 1.2
CPU performanceB Execution timeA 2I ns
A is 1.2 times faster than
27
B
Continuation
• Basic performance equation

CPU time = Instruction count × CPI × clock cycle time

Instruction count × CPI

CPU time =
Clock rate

28
Continuation

• It is possible to compute the CPU clock

cycles by looking at the different types of
instructions and using their individual
clock cycle counts.
• In such cases,
CPU clock cycles= summation of (CPIi*Ci)

29
Comparing Code Segments
• Example
– The hardware designer supplied:
Instruction Class CPI for this class
A 1
B 2
C 3

– Two code sequences requires the following:

Code Sequence Instruction Counts for instruction class
A B C
1 2 1 2
2 4 1 1

– Which code sequence executes the most instructions?

– Which will be faster?
– What is the CPI for each sequence? 30
Solution

• Sequence 1 executes 2 + 1 + 2 = 5
instructions.
• Sequence 2 executes 4 + 1 + 1 = 6
instructions.
• So sequence 2 executes most instructions.

31
Solution
• CPU clock cycles1 = (2×1) + (1×2)
+ (2×3) = 2 + 2 + 6 = 10 cycles

• CPU clock cycles2 = (4×1) + (1×2)

+ (1×3) = 4 + 2 + 3 = 9 cycles

• So code sequence 2 is faster.

32
Solution
CPU clock cycles1 10
CPI1 = = = 2
Instruction count1 5

CPU clock cycles2 9

CPI2 = = = 1.5
Instruction count2 6

When comparing two machines, we must look at all

three components, which combine to form execution
time.

33
Example

34
Processor Clock Rate CPI
P1 4GHz 1.25
P2 3GHz 0.75

Instruction count= 10^6

Prove the fallacy, “ Largest clock rate has largest performance”

Here,
CPU execution time , p1= (CPI * Instructions) / clock rate
= (1.25* 10^6)/ (4*10^9)

CPU execution time, p2 = (0.7510^6)/ (310^9)

35
Performance p1 : performance p2 =
((0.75*10^6)/ (3*10^9) ) / ((1.25* 10^6)/ (4*10^9) )
= 0.8

So, performance p1 = 0.8 * performance p2

Here,
P1 has highest clock rate but performance is higher.
So, the fallacy is true.

36
MIPS (Millions instructions per second)

A measurement of program
execution speed based on the
number of millions of
instructions.

Limitations of MIPS:

Firstly, MIPS specifies the instruction execution rate but does

not specify the capabilities of the instructions.

Secondly, MIPS varies between program on the same

computer. Thus, a machine should not have a same MIPS
ratings.
37
MIPS as a Performance Measure

38
39
40
Amdahl’s Law

Earlier version of Amdahl’s

law:

Latest version (second law) of Amdahl’s law:

Speed up = (Performance after improvement) / (Performance

before improvement)
= (Execution time before improvement) / Execution
time after improvement) 41

About Financial Accounting Volume 2 8th Doussy
100% (10)
About Financial Accounting Volume 2 8th Doussy
503 pages
List of Cities and Towns in Bangladesh - Wikipedia
No ratings yet
List of Cities and Towns in Bangladesh - Wikipedia
14 pages
CONSTI LAW I - Council of Teachers V Secretary of Education PDF
100% (6)
CONSTI LAW I - Council of Teachers V Secretary of Education PDF
4 pages
Chapter 4
No ratings yet
Chapter 4
53 pages
Building PYRTE - An Introduction PDF
No ratings yet
Building PYRTE - An Introduction PDF
14 pages
The Role of Performance: Chapter - 2
No ratings yet
The Role of Performance: Chapter - 2
40 pages
Cse - 321 - 2
No ratings yet
Cse - 321 - 2
37 pages
Unit 2 Performance
No ratings yet
Unit 2 Performance
6 pages
Performance Measures For Computers
No ratings yet
Performance Measures For Computers
53 pages
Computer Performance
No ratings yet
Computer Performance
22 pages
Computer Organization The Role of Performance
No ratings yet
Computer Organization The Role of Performance
45 pages
Chapter 1 Performance
No ratings yet
Chapter 1 Performance
32 pages
Module 2 (26-10-2024)
No ratings yet
Module 2 (26-10-2024)
50 pages
Lecture 4
No ratings yet
Lecture 4
37 pages
Lecture 02 CH01 Performance Power
No ratings yet
Lecture 02 CH01 Performance Power
76 pages
Lecture # 2
No ratings yet
Lecture # 2
33 pages
Performance
No ratings yet
Performance
51 pages
CH 02a-Computer Performance
No ratings yet
CH 02a-Computer Performance
22 pages
2 - Computer Organization and Architecture
No ratings yet
2 - Computer Organization and Architecture
21 pages
Lesson 3 - Computing For Performance
No ratings yet
Lesson 3 - Computing For Performance
38 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
52 pages
Computer Architecture Measuring Performance
No ratings yet
Computer Architecture Measuring Performance
33 pages
Performance
No ratings yet
Performance
23 pages
Defining Performance
No ratings yet
Defining Performance
6 pages
CSE 332 L4 - 14 Nov 2020
No ratings yet
CSE 332 L4 - 14 Nov 2020
41 pages
Da Ci
No ratings yet
Da Ci
13 pages
Lecture Ch4 Performance
No ratings yet
Lecture Ch4 Performance
25 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
52 pages
02 Performance
No ratings yet
02 Performance
23 pages
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Computer Evolution and Performance
28 pages
Lec10 Performance
No ratings yet
Lec10 Performance
22 pages
Week 10 Part 02 - Processor Performance (Answers)
No ratings yet
Week 10 Part 02 - Processor Performance (Answers)
35 pages
Performance Measures
No ratings yet
Performance Measures
25 pages
Computer Organization and Architecture (AT70.01)
No ratings yet
Computer Organization and Architecture (AT70.01)
29 pages
Puter Performance
No ratings yet
Puter Performance
15 pages
Week 13 14 - Performance Evaluation
No ratings yet
Week 13 14 - Performance Evaluation
19 pages
Computer Architecture 2
No ratings yet
Computer Architecture 2
17 pages
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
No ratings yet
CS322 - Computer Architecture (CA) : Spring 2019 Section V3
56 pages
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
No ratings yet
Performances of Computer Systems: CSE 675.02: Introduction To Computer Architecture
52 pages
IT401 Computer Organization and Architecture: Prasun Ghosal
No ratings yet
IT401 Computer Organization and Architecture: Prasun Ghosal
30 pages
4 Perfrmance
No ratings yet
4 Perfrmance
30 pages
Comp Org Notes On Measuring Cpu Performance
No ratings yet
Comp Org Notes On Measuring Cpu Performance
4 pages
Co Unit1 Part3
No ratings yet
Co Unit1 Part3
11 pages
L-2 (Computer Performance)
No ratings yet
L-2 (Computer Performance)
47 pages
Coa Unit 1 Problems
No ratings yet
Coa Unit 1 Problems
6 pages
Week 2 - Lecture 2 - Performance Measurement
No ratings yet
Week 2 - Lecture 2 - Performance Measurement
25 pages
Module 3.3 - Problems On Performance
No ratings yet
Module 3.3 - Problems On Performance
54 pages
Computer Performance
No ratings yet
Computer Performance
18 pages
Performance
No ratings yet
Performance
4 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
13 pages
Chapter 2 A: Performance
No ratings yet
Chapter 2 A: Performance
33 pages
COD Ch. 2 The Role of Performance
No ratings yet
COD Ch. 2 The Role of Performance
28 pages
M116C 1 M116C 1 Lect02-Performance
No ratings yet
M116C 1 M116C 1 Lect02-Performance
23 pages
Measuring Performance: Chris Clack B261 Systems Architecture
No ratings yet
Measuring Performance: Chris Clack B261 Systems Architecture
19 pages
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
No ratings yet
Week 10 Part 02 - Processor Performance (Q Only) - Tagged 2
23 pages
Outline of Lecture: 1. The Role of Computer Performance 2. Measuring Performance
No ratings yet
Outline of Lecture: 1. The Role of Computer Performance 2. Measuring Performance
14 pages
Assessing and Understanding Performance
No ratings yet
Assessing and Understanding Performance
31 pages
ACA Lec2 New
No ratings yet
ACA Lec2 New
44 pages
L14 Introduction To Performance Evaluation
No ratings yet
L14 Introduction To Performance Evaluation
48 pages
Computer Performance
No ratings yet
Computer Performance
17 pages
Computer Architecture Measurement
No ratings yet
Computer Architecture Measurement
26 pages
Measuring Computer Performance
No ratings yet
Measuring Computer Performance
26 pages
Quatitative Principle
No ratings yet
Quatitative Principle
56 pages
Cpu Performance
No ratings yet
Cpu Performance
13 pages
Pass Rate in Medical Entrance Exam Drops To 35pc - The Financial Express
No ratings yet
Pass Rate in Medical Entrance Exam Drops To 35pc - The Financial Express
8 pages
CSE, IT Graduates Getting More Jobs - The Daily Star
No ratings yet
CSE, IT Graduates Getting More Jobs - The Daily Star
9 pages
Assembly Language Project Report
No ratings yet
Assembly Language Project Report
3 pages
Group No 3 (7 Person)
No ratings yet
Group No 3 (7 Person)
11 pages
MP Project
No ratings yet
MP Project
10 pages
BSC DAY CLASS ROUTINE SPRING 25 V-1 ACTIVE FROM 13-01-25 - Updated Timing Slot
No ratings yet
BSC DAY CLASS ROUTINE SPRING 25 V-1 ACTIVE FROM 13-01-25 - Updated Timing Slot
4 pages
Exploring Early Learning Challenges in Children Ut
No ratings yet
Exploring Early Learning Challenges in Children Ut
27 pages
Business Jute Products in Bangladesh
No ratings yet
Business Jute Products in Bangladesh
12 pages
BSC in CSE Evening MSC Batches Final Examination Routine Fall 2024
No ratings yet
BSC in CSE Evening MSC Batches Final Examination Routine Fall 2024
6 pages
Mats 2024
No ratings yet
Mats 2024
2 pages
Class Lecture 11&12MOSFET
No ratings yet
Class Lecture 11&12MOSFET
19 pages
Inglés Examen
No ratings yet
Inglés Examen
12 pages
ESG DisclosuresRev1
No ratings yet
ESG DisclosuresRev1
5 pages
Air Filter Grades PDF
No ratings yet
Air Filter Grades PDF
2 pages
AUT International Scholarships - South Asia - Regulations S1 2025 Final Version
No ratings yet
AUT International Scholarships - South Asia - Regulations S1 2025 Final Version
5 pages
Sten Plans The Sten Mkii
100% (1)
Sten Plans The Sten Mkii
28 pages
The Use of Copper Shells by Twin Roll Strip Casters: TMS Light Metals March 2010
No ratings yet
The Use of Copper Shells by Twin Roll Strip Casters: TMS Light Metals March 2010
6 pages
New Misc Mod
No ratings yet
New Misc Mod
36 pages
Rooftop-Mounted Wind Turbine: Final Design Report: Client: Professor Upmanu Lall, EEE
No ratings yet
Rooftop-Mounted Wind Turbine: Final Design Report: Client: Professor Upmanu Lall, EEE
20 pages
2015 Dodge Challenger V6-3.6L Exterior Lights
No ratings yet
2015 Dodge Challenger V6-3.6L Exterior Lights
2 pages
Math5 - q2 - Mod4 - Multiply Decimals Up To 2 Decimal Places
No ratings yet
Math5 - q2 - Mod4 - Multiply Decimals Up To 2 Decimal Places
30 pages
The Tuckahoe Talker: Congratulations Marissa & Brandon
No ratings yet
The Tuckahoe Talker: Congratulations Marissa & Brandon
2 pages
Study Plan
No ratings yet
Study Plan
1 page
Aisi 5140 PDF
No ratings yet
Aisi 5140 PDF
2 pages
Standard Costing and Variance Analysis 1: Solutions To Chapter 18 Questions
No ratings yet
Standard Costing and Variance Analysis 1: Solutions To Chapter 18 Questions
8 pages
Visually Pleasing Composition Amount of Information With Respect To Principles of User Interface Design
No ratings yet
Visually Pleasing Composition Amount of Information With Respect To Principles of User Interface Design
9 pages
Midterm Exam. (ONLINE) Autumn 2021
No ratings yet
Midterm Exam. (ONLINE) Autumn 2021
9 pages
Reyes VS NLRC
No ratings yet
Reyes VS NLRC
2 pages
Shaping, Planning, and Slotting Machines - Principles, Specifications, and Comparisons
No ratings yet
Shaping, Planning, and Slotting Machines - Principles, Specifications, and Comparisons
12 pages
BAC GIANG - Đề thi chọn ĐT 2023 (chính thức)
No ratings yet
BAC GIANG - Đề thi chọn ĐT 2023 (chính thức)
19 pages
2016 CCNY Great Grads
No ratings yet
2016 CCNY Great Grads
16 pages
Aminu Final Draft-1
No ratings yet
Aminu Final Draft-1
86 pages
(Communication Electronic Circuits) Preface
No ratings yet
(Communication Electronic Circuits) Preface
2 pages
ELE 2 Module 1
No ratings yet
ELE 2 Module 1
4 pages
Centrifugal and Axial Compressor Appendix B
No ratings yet
Centrifugal and Axial Compressor Appendix B
21 pages
Datasheet For Steel Grades High Alloy Aquamet 22
No ratings yet
Datasheet For Steel Grades High Alloy Aquamet 22
3 pages
Simovert 106035
No ratings yet
Simovert 106035
4 pages

C A Lecture-3

Uploaded by

C A Lecture-3

Uploaded by

Computer Architecture

Shahanaz Islam Shaown

• Discusses how to measure, report,

• Describe the major factors that

• For different types of applications, different

• Response Time / Execution Time :

• Throughput : the total amount of

• Do the following changes to a computer

– Replacing the processor in a computer with a

– Adding additional processors to a system that

PerformanceX > PerformanceY

• X is n times faster than Y, it means,

PerformanceX Execution time

– A is n times faster than B if

• We could also say that – Machine B is 1.5 times

• Time is the measure of computer

• is the time the CPU spends computing

CPU execution time / CPU time < Response time

• System CPU time – the CPU time spent

For I/O User CPU System

User CPU time System CPU time Elapsed time

• System Performance – considering

• CPU Performance – considering user

• Clock rate – Inverse of clock period.

CPU execution time CPU clock cycle Clock cycle

CPU execution time CPU clock cycle for a program

Hardware designer can improve performance

Machine B must therefore have twice the

• Since Machine had to execute the

• Suppose, we have two implementations of the

CPU performanceA Execution timeB 2.4I ns

CPU time = Instruction count × CPI × clock cycle time

Instruction count × CPI

• It is possible to compute the CPU clock

– Two code sequences requires the following:

– Which code sequence executes the most instructions?

• CPU clock cycles2 = (4×1) + (1×2)

• So code sequence 2 is faster.

CPU clock cycles2 9

When comparing two machines, we must look at all

Instruction count= 10^6

CPU execution time, p2 = (0.75*10^6)/ (3*10^9)

So, performance p1 = 0.8 * performance p2

Firstly, MIPS specifies the instruction execution rate but does

Secondly, MIPS varies between program on the same

Earlier version of Amdahl’s

Latest version (second law) of Amdahl’s law:

Speed up = (Performance after improvement) / (Performance

You might also like

CPU execution time, p2 = (0.7510^6)/ (310^9)