Chap 2 Exercises With Solutions

The document contains a series of exercises and solutions related to computer architecture, focusing on performance metrics such as clock rates, CPI, execution times, and instruction counts for various processors and implementations. It explores concepts like speedup, the impact of compilers on performance, and the effects of parallelization on execution time. Additionally, it includes practical calculations for frame buffer size and network transmission times, as well as comparisons between different processors and instruction sets.

Uploaded by

abdelridaahmed92


Exercises for chapter 2, “Computer abstractions and technology”, with solutions

Ex. 1.4: Assume a color display using 8 bits for each of the primary colors (red, green, blue) per
pixel and a frame size of 1280 × 1024.
a. What is the minimum size in bytes of the frame buffer to store a frame?
b. How long would it take, at a minimum, for the frame to be sent over a 100 Mbit/s network?

Solution:
a. 1280 * 1024 pixels = 1,310,720 pixels => 1,310,720 * 3 = 3,932,160 bytes/frame.
b. 3,932,160 bytes * (8 bits/byte) /100E6 bits/second = 0.31 seconds
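
The arithmetic of Ex. 1.4 can be checked with a short sketch. The function names below are illustrative choices, not part of the exercise, and the transfer time ignores any protocol overhead, as the exercise does.

```python
def frame_buffer_bytes(width, height, bits_per_channel=8, channels=3):
    """Minimum frame buffer size in bytes for one frame."""
    total_bits = width * height * bits_per_channel * channels
    return total_bits // 8


def transfer_seconds(num_bytes, link_bits_per_second):
    """Minimum time to push num_bytes over a link of the given bit rate."""
    return num_bytes * 8 / link_bits_per_second


size = frame_buffer_bytes(1280, 1024)   # 3,932,160 bytes
seconds = transfer_seconds(size, 100e6)  # about 0.31 s
print(size, round(seconds, 2))
```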

Ex. 1.5
Consider three different processors P1, P2, and P3 executing the same instruction set. P1 has a
3 GHz clock rate and a CPI of 1.5. P2 has a 2.5 GHz clock rate and a CPI of 1.0. P3 has a 4.0 GHz
clock rate and has a CPI of 2.2.
a. Which processor has the highest performance expressed in instructions per second?
b. If the processors each execute a program in 10 seconds, find the number of cycles and the
number of instructions.
c. We are trying to reduce the execution time by 30% but this leads to an increase of 20% in the
CPI. What clock rate should we have to get this time reduction?
Solution:
a) Instructions per second = clock rate / CPI.
P1: 3 GHz clock rate = 3*10^9 cycles/s and CPI = 1.5, so 3*10^9 / 1.5 = 2*10^9 instructions per second.
P2: 2.5*10^9 / 1.0 = 2.5*10^9 instructions per second.
P3: 4*10^9 / 2.2 = 1.82*10^9 instructions per second.
P2 has the highest performance.
b) First method, for P1:
From a), P1 executes 2*10^9 instructions per second.
The execution time is 10 s, so the total number of instructions is 2*10^9 * 10 = 2*10^10.
The CPI of P1 is 1.5, so the total number of cycles is 1.5 * 2*10^10 = 3*10^10.
Second method, for P1:
The clock rate is 3 GHz = 3*10^9 Hz = 3*10^9 cycles per second.
The program runs for 10 s, so the total number of cycles is 3*10^9 * 10 = 30*10^9.
The total number of instructions is the total number of cycles divided by the CPI: 30*10^9 / 1.5 = 2*10^10 instructions.
The same calculation applies to P2 and P3:
Number of cycles: P2 = 25*10^9, P3 = 40*10^9.
Instruction count (IC): CPU time = (IC * CPI) / CR, so IC = (CPU time * CR) / CPI, the same formula for P1, P2 and P3.
P2: IC = 25*10^9 / 1.0 = 2.5*10^10. P3: IC = 40*10^9 / 2.2 = 1.82*10^10.
c) Clock rate = (IC * CPI) / CPU time.
For P1, the instruction count stays the same: 2*10^10.
The CPI increases by 20%, so the new CPI = 1.5 * 1.2 = 1.8.
The CPU time decreases by 30%, so the new CPU time = 10 * 0.7 = 7 s.
Clock rate = (2*10^10 * 1.8) / 7 = 5.14 GHz.
The same calculation applies to P2 and P3 (each clock rate is multiplied by 1.2 / 0.7 = 1.714):
Clock rate for P2 = 4.29 GHz
Clock rate for P3 = 6.86 GHz
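
The three parts of Ex. 1.5 follow from the single relation CPU time = IC * CPI / clock rate. The sketch below is one way to check the numbers; the dictionary layout and function names are illustrative assumptions.

```python
# (clock rate in Hz, CPI) for each processor, taken from the exercise statement
processors = {"P1": (3.0e9, 1.5), "P2": (2.5e9, 1.0), "P3": (4.0e9, 2.2)}


def instructions_per_second(clock_rate, cpi):
    return clock_rate / cpi


def cycles_and_instructions(clock_rate, cpi, seconds):
    cycles = clock_rate * seconds
    return cycles, cycles / cpi


def required_clock_rate(clock_rate, cpi, seconds, time_factor=0.7, cpi_factor=1.2):
    """Clock rate needed when the time shrinks to time_factor and the CPI grows by cpi_factor."""
    _, instructions = cycles_and_instructions(clock_rate, cpi, seconds)
    return instructions * (cpi * cpi_factor) / (seconds * time_factor)


rates = {name: instructions_per_second(cr, cpi) for name, (cr, cpi) in processors.items()}
fastest = max(rates, key=rates.get)
new_clock = {name: required_clock_rate(cr, cpi, 10.0) for name, (cr, cpi) in processors.items()}

print(fastest)
for name, value in new_clock.items():
    print(name, round(value / 1e9, 2), "GHz")
```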

Ex. 1.6
Consider two different implementations of the same instruction set architecture. The
instructions can be divided into four classes according to their CPI (class A, B, C, and D). P1 with
a clock rate of 2.5 GHz and CPIs of 1, 2, 3, and 3, and P2 with a clock rate of 3 GHz and CPIs of 2,
2, 2, and 2.
Given a program with a dynamic instruction count of 1.0E6 instructions divided into classes as
follows: 10% class A, 20% class B, 50% class C, and 20% class D.
a. Which implementation is faster?
Sol.: CPU time = (sum over the classes of IC_class * CPI_class) / clock rate.
CPU time for P1 = (10^6*0.1*1 + 10^6*0.2*2 + 10^6*0.5*3 + 10^6*0.2*3) / (2.5*10^9) = 2.6*10^6 / (2.5*10^9) = 10.4*10^-4 s
Same calculation for P2: CPU time = (10^6 * 2) / (3*10^9) = 6.67*10^-4 s
So P2 is faster.
b. What is the global CPI for each implementation?
CPU time = IC * global CPI / clock rate, so global CPI = CPU time * clock rate / IC.
Global CPI for P1 = (10.4*10^-4 * 2.5*10^9) / 10^6 = 2.6
Global CPI for P2 = 2.0
c. Find the clock cycles required in both cases.
General formula: total number of cycles = IC * CPI.
For P1: 10^6*0.1*1 + 10^6*0.2*2 + 10^6*0.5*3 + 10^6*0.2*3 = 2.6*10^6 cycles.
Equivalently, there are 10^6 instructions and the global CPI of P1 is 2.6, so each instruction needs 2.6 cycles on average and the program needs 10^6 * 2.6 = 2.6*10^6 cycles.
For P2: clock cycles needed = 10^6 * 2.0 = 20*10^5 cycles.
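
The weighted-CPI calculation of Ex. 1.6 can be sketched as follows. The variable and function names are illustrative; the instruction mix and CPIs come from the exercise statement.

```python
INSTRUCTION_COUNT = 1.0e6
MIX = [0.10, 0.20, 0.50, 0.20]  # fractions of classes A, B, C, D

implementations = {
    "P1": {"clock": 2.5e9, "cpis": [1, 2, 3, 3]},
    "P2": {"clock": 3.0e9, "cpis": [2, 2, 2, 2]},
}


def total_cycles(instruction_count, mix, cpis):
    """Sum of (instructions in class * CPI of class) over all classes."""
    return sum(instruction_count * share * cpi for share, cpi in zip(mix, cpis))


results = {}
for name, impl in implementations.items():
    cycles = total_cycles(INSTRUCTION_COUNT, MIX, impl["cpis"])
    results[name] = {
        "cycles": cycles,
        "global_cpi": cycles / INSTRUCTION_COUNT,
        "cpu_time": cycles / impl["clock"],
    }

faster = min(results, key=lambda name: results[name]["cpu_time"])
print(faster, results)
```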

Ex. 1.7
Compilers can have a profound impact on the performance of an application. Assume that for a
program, compiler A results in a dynamic instruction count of 1.0E9 and has an execution time
of 1.1 s, while compiler B results in a dynamic instruction count of 1.2E9 and an execution time
of 1.5 s.
a. Find the average CPI for each program given that the processor has a clock cycle time of 1 ns.
For compiler A:
CPU time = IC * CPI * cycle time, so CPI = CPU time / (IC * cycle time) = 1.1 / (10^9 * 10^-9) = 1.1
Same calculation for compiler B: CPI = 1.5 / (1.2*10^9 * 10^-9) = 1.25
b. Assume the compiled programs run on two different processors. If the execution times on
the two processors are the same, how much faster is the clock of the processor running
compiler A’s code versus the clock of the processor running compiler B’s code?
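CPU time for A = CPU time for B
IC_A * CPI_A / CR_A = IC_B * CPI_B / CR_B
CR_B / CR_A = (IC_B * CPI_B) / (IC_A * CPI_A) = (1.2*10^9 * 1.25) / (10^9 * 1.1) = 1.36
So the clock of the processor running compiler A's code is not faster: it runs at 1 / 1.36 = 0.73 times the rate of the clock of the processor running compiler B's code.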
c. A new compiler is developed that uses only 6.0E8 instructions and has an average CPI of 1.1.
What is the speedup of using this new compiler versus using compiler A or B on the original
processor?
T_A / T_new = (IC_A * CPI_A * cycle time) / (IC_new * CPI_new * cycle time) = (10^9 * 1.1) / (6*10^8 * 1.1) = 1.67
T_B / T_new = (1.2*10^9 * 1.25) / (6*10^8 * 1.1) = 2.27
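
The three results of Ex. 1.7 can be reproduced with the sketch below. The names are illustrative; the only inputs are the instruction counts, execution times and the 1 ns cycle time from the exercise statement.

```python
CYCLE_TIME = 1e-9  # seconds, from part a


def average_cpi(cpu_time, instruction_count, cycle_time=CYCLE_TIME):
    return cpu_time / (instruction_count * cycle_time)


# Part a: average CPI of each compiled program
cpi_a = average_cpi(1.1, 1.0e9)
cpi_b = average_cpi(1.5, 1.2e9)

# Part b: equal execution times imply CR_B / CR_A = (IC_B * CPI_B) / (IC_A * CPI_A)
clock_ratio_b_over_a = (1.2e9 * cpi_b) / (1.0e9 * cpi_a)

# Part c: new compiler on the original processor
t_new = 6.0e8 * 1.1 * CYCLE_TIME
speedup_vs_a = 1.1 / t_new
speedup_vs_b = 1.5 / t_new

print(cpi_a, cpi_b, clock_ratio_b_over_a, speedup_vs_a, speedup_vs_b)
```
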
Ex. 1.12
Consider the following two processors: P1 has a clock rate of 4 GHz, average CPI of 0.9, and
requires the execution of 5.0E9 instructions; P2 has a clock rate of 3 GHz, an average CPI of
0.75, and requires the execution of 1.0E9 instructions.
a- One usual fallacy is to consider the computer with the largest clock rate as having the largest
performance. Check if this is true for P1 and P2.
Sol.: CPU time for P1 = (IC * CPI) / CR = (5*10^9 * 0.9) / (4*10^9) = 1.125 s
CPU time for P2 = (1*10^9 * 0.75) / (3*10^9) = 0.25 s. So it is not true: P2 has a lower clock rate and is faster than P1.
b- Another fallacy is to consider that the processor executing the largest number of instructions
will need a larger CPU time. Considering that processor P1 is executing a sequence of 1.0E9
instructions and that the CPI of processors P1 and P2 do not change, determine the number of
instructions that P2 can execute in the same time that P1 needs to execute 1.0E9 instructions.
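Sol.: to execute 1*10^9 instructions, P1 needs (1*10^9 * 0.9) / (4*10^9) = 0.225 s
Same CPU time for P2: 0.225 = (IC * 0.75) / (3*10^9), so IC = 0.9*10^9 instructions. In the same CPU time P1 executes more instructions than P2, so a larger instruction count does not by itself mean a larger CPU time.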
c- A common fallacy is to use MIPS (millions of instructions per second) to compare the
performance of two different processors, and consider that the processor with the largest MIPS
has the largest performance.
Check if this is true for P1 and P2.
d- Another common performance figure is MFLOPS (millions of floating-point operations per
second), defined as MFLOPS = No. FP operations / (execution time × 1E6) but this figure has the
same problems as MIPS. Assume that 40% of the instructions executed on both P1 and P2 are
floating-point instructions. Find the MFLOPS figures for the programs.
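
No worked solution is given here for parts c and d. The sketch below is one possible way to compute the figures, assuming the usual definition MIPS = instruction count / (execution time × 1E6) and the MFLOPS definition quoted in part d. The names are illustrative.

```python
def cpu_time(instruction_count, cpi, clock_rate):
    return instruction_count * cpi / clock_rate


def mips(instruction_count, seconds):
    return instruction_count / (seconds * 1e6)


def mflops(instruction_count, fp_fraction, seconds):
    return instruction_count * fp_fraction / (seconds * 1e6)


machines = {
    "P1": {"ic": 5.0e9, "cpi": 0.9, "clock": 4.0e9},
    "P2": {"ic": 1.0e9, "cpi": 0.75, "clock": 3.0e9},
}

figures = {}
for name, m in machines.items():
    seconds = cpu_time(m["ic"], m["cpi"], m["clock"])
    figures[name] = {
        "time": seconds,
        "mips": mips(m["ic"], seconds),
        "mflops": mflops(m["ic"], 0.40, seconds),
    }

# P1 has the larger MIPS and MFLOPS figures, yet P2 finishes its program sooner.
print(figures)
```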

Ex. 1.14
Assume a program requires the execution of 50 × 10^6 FP instructions, 110 × 10^6 INT
instructions, 80 × 10^6 L/S instructions, and 16 × 10^6 branch instructions. The CPI for each type
of instruction is 1, 1, 4, and 2, respectively.
Assume that the processor has a 2 GHz clock rate.
1.14.1 By how much must we improve the CPI of FP instructions if we want the program to run
two times faster?
1.14.2 By how much must we improve the CPI of L/S instructions if we want the program to run
two times faster?
1.14.3 By how much is the execution time of the program improved if the CPI of INT and FP
instructions is reduced by 40% and the CPI of L/S and Branch is reduced by 30%?

Solution:
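No worked solution is written out for Ex. 1.14. The following sketch is one way to work through the three questions from the data in the statement; it is an illustration, not the author's solution, and all names are hypothetical.

```python
CLOCK_RATE = 2.0e9
COUNTS = {"FP": 50e6, "INT": 110e6, "LS": 80e6, "BR": 16e6}
CPIS = {"FP": 1.0, "INT": 1.0, "LS": 4.0, "BR": 2.0}


def total_cycles(cpis):
    return sum(COUNTS[kind] * cpis[kind] for kind in COUNTS)


def cpi_needed(kind, target_cycles):
    """CPI that `kind` must have so the whole program takes target_cycles."""
    others = sum(COUNTS[k] * CPIS[k] for k in COUNTS if k != kind)
    return (target_cycles - others) / COUNTS[kind]


base_cycles = total_cycles(CPIS)      # 512e6 cycles
base_time = base_cycles / CLOCK_RATE  # 0.256 s

# 1.14.1 and 1.14.2: running two times faster means half the cycles
fp_cpi = cpi_needed("FP", base_cycles / 2)  # negative, so impossible via FP alone
ls_cpi = cpi_needed("LS", base_cycles / 2)  # 0.8, down from 4

# 1.14.3: INT and FP CPI reduced by 40%, L/S and branch CPI reduced by 30%
reduced = {"FP": CPIS["FP"] * 0.6, "INT": CPIS["INT"] * 0.6,
           "LS": CPIS["LS"] * 0.7, "BR": CPIS["BR"] * 0.7}
new_time = total_cycles(reduced) / CLOCK_RATE  # 0.1712 s
speedup = base_time / new_time

print(base_time, fp_cpi, ls_cpi, new_time, round(speedup, 2))
```

A negative required CPI for FP instructions means that no improvement of FP instructions alone can make the program twice as fast.
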
Ex. 1.9
Assume for arithmetic, load/store, and branch instructions, a processor has CPIs of 1, 12, and 5,
respectively. Also assume that on a single processor a program requires the execution of 2.56E9
arithmetic instructions, 1.28E9 load/store instructions, and 256 million branch instructions.
Assume that each processor has a 2 GHz clock frequency.
Assume that, as the program is parallelized to run over multiple cores, the number of
arithmetic and load/store instructions per processor is divided by 0.7 x p (where p is the
number of processors) but the number of branch instructions per processor remains the same.
1.9.1 Find the total execution time for this program on 1, 2, 4, and 8 processors, and show the
relative speedup of the 2, 4, and 8 processor result relative to the single processor result.
1.9.2 If the CPI of the arithmetic instructions was doubled, what would the impact be on the
execution time of the program on 1, 2, 4, or 8 processors?
1.9.3 To what should the CPI of load/store instructions be reduced in order for a single
processor to match the performance of four processors using the original CPI values?

Solution:
1.9.1
P   Nb. of arith. inst.   Nb. of L/S inst.   Nb. of branch inst.   Cycles     Exec. time   Speedup
1   2.56E9                1.28E9             2.56E8                19.2E9     9.6 s        1
2   1.83E9                9.14E8             2.56E8                14.08E9    7.04 s       1.36
4   9.14E8                4.57E8             2.56E8                7.68E9     3.84 s       2.50
8   4.57E8                2.29E8             2.56E8                4.48E9     2.24 s       4.29

1.9.2
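With the CPI of arithmetic instructions doubled from 1 to 2, the per-processor arithmetic and load/store counts are still both divided by 0.7 x p:
P   Exec. time
1   (2.56E9*2+1.28E9*12+0.256E9*5)/2E9 = 10.88 s
2   (1.83E9*2+0.914E9*12+0.256E9*5)/2E9 = 7.95 s
4   (0.914E9*2+0.457E9*12+0.256E9*5)/2E9 = 4.30 s
8   (0.457E9*2+0.229E9*12+0.256E9*5)/2E9 = 2.47 s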

1.9.3
Exec. time for 1 processor with the new CPI for L/S = exec. time for 4 processors = 3.84 s
So (2.56E9*1+1.28E9*newcpi+0.256E9*5)/2E9 = 3.84 => newcpi = (3.84*2E9-0.256E9*5-2.56E9*1)/1.28E9 = 3
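
The whole of Ex. 1.9 can be recomputed with the sketch below. It assumes, as the single-processor row of the table does, that the 0.7 x p divisor applies only when p > 1. The function and constant names are illustrative.

```python
CLOCK_RATE = 2.0e9
ARITH, LOAD_STORE, BRANCH = 2.56e9, 1.28e9, 0.256e9


def exec_time(p, cpi_arith=1, cpi_ls=12, cpi_branch=5):
    """Execution time in seconds on p processors."""
    scale = 1.0 if p == 1 else 0.7 * p  # assumption: no scaling on one processor
    cycles = (ARITH / scale) * cpi_arith + (LOAD_STORE / scale) * cpi_ls + BRANCH * cpi_branch
    return cycles / CLOCK_RATE


# 1.9.1: times and speedups relative to one processor
times = {p: exec_time(p) for p in (1, 2, 4, 8)}
speedups = {p: times[1] / t for p, t in times.items()}

# 1.9.2: CPI of arithmetic instructions doubled
doubled = {p: exec_time(p, cpi_arith=2) for p in (1, 2, 4, 8)}

# 1.9.3: L/S CPI that lets one processor match four processors
new_ls_cpi = (times[4] * CLOCK_RATE - ARITH * 1 - BRANCH * 5) / LOAD_STORE

for p in (1, 2, 4, 8):
    print(p, round(times[p], 2), round(speedups[p], 2), round(doubled[p], 2))
print(round(new_ls_cpi, 2))
```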

Extra exercise (Amdahl’s law):

A machine executes a program consisting of 60% addition operations and 40% divide
operations. Both operations are assumed to have the same CPI. The original execution time
is 100 s.
a) What is the execution time after improvement if the divide operations can run 5 times
faster?
Sol.: the operations affected by the improvement are divide operations which are 40%
of the total operations so the execution time after improvement (divide operations 5
times faster) is (40/5) + 60=68s
b) What is the speedup of the improved machine relative to the original machine?
Sol.: speedup is 100 / 68 =1.47

Hint: We remind you about the Amdahl’s law formula:

Execution time after improvement = (Execution time affected by improvement)/(Amount of
Improvement) + Execution time unaffected
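
The formula above translates directly into code. The sketch below applies it to the extra exercise; the function name and argument names are illustrative.

```python
def time_after_improvement(affected, unaffected, improvement):
    """Amdahl's law: only the affected part of the time is divided by the improvement."""
    return affected / improvement + unaffected


original_time = 100.0                # seconds
affected = 0.40 * original_time      # divide operations, 40 s
unaffected = 0.60 * original_time    # addition operations, 60 s

new_time = time_after_improvement(affected, unaffected, 5.0)
speedup = original_time / new_time

print(new_time, round(speedup, 2))
```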
