0% found this document useful (0 votes)

109 views

Exercise 1 ComputerArchitecture

The document summarizes performance comparisons between 3 processors (P1, P2, P3) based on clock rate, CPI, and instructions per second. It determines that P2 has the highest performance as it can process the greatest number of instructions per second. It also provides formulas for calculating CPU time, number of instructions, and clock cycles for each processor. The document then discusses additional problems involving global CPI calculations, comparing CPU times between implementations, determining clock rate ratios, and calculating capacitive loads and static power percentages for different processors.

Uploaded by

khang nguyen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

109 views

Exercise 1 ComputerArchitecture

Uploaded by

khang nguyen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Computer Architecture – Thursday afternoon

Exercise 1
Hồ Hữu Hiệp - ITITIU20202
Nguyễn Duy Khang - ITITIU18057
Nguyễn Thanh Hiền ITITIU20142

Problem 1
P1 P2 P3
Clock rate 3.0 GHz 2.5 GHz 4.0 GHz
CPI 1.5 1.0 2.2

a) Considering the number of instructions here to be a constant a.

To compare the performance among those processors, we have to calculate
each’s CPU time.
Instruction count × CPI
CPU time =
Clock rate
Instruction count Clock rate
¿> =
CPU time CPI
Clock rate
¿> Instruction per second =
CPI

Processor P1:
9
3 ×10
Instruction per second ( P 1)= =2×10 9 (instructions /s)
1.5
Processor P2:
9
2.5 ×10 9
Instruction per second ( P 2)= =2.5× 10 (instructions /s)
1.0
Processor P3:
9
4.0 ×10 9
Instruction per second ( P 3)= =1.81 ×10 (instructions / s)
2.2
In the same amount of time (1 second), the P2 process the greatest number of
instructions among those three processors. Hence, P2 has the highest
performance.

b) Based on the formula calculating the CPU time above, the formula
calculating number of instructions is
CPU time × Clock rate
Instruction count=
CPI

Number of instructions that P1 executed:

10 ×3.0 ×10 9 10
Instruction count 1= =2.0× 10 (instructions)
1.5
Number of instructions that P2 executed:
10 ×2.5 ×10 9
Instruction count 2= =2.5× 1010 (instructions)
1.0
Number of instructions that P3 executed:
10 × 4.0 ×109
Instruction count 3= =1.8 ×10 10(instructions )
2.2

Formula calculating clock cycles:

Clock cycles=CPI × Instruction count

Processor P1:
10 10
Clock cycles 1=1.5 × 2.0× 10 =3 ×10 (cycles)

Processor P2:
Clock cycles 2=1.0× 2.5× 1010 =2.5 ×1010 (cycles)

Processor P3:
10 10
Clock cycles 3=2.2× 1.8× 10 =4.0× 10 (cycles)

c) CPI ' =1.2CPI

CPU time '=0.7 CPU time
'
Instruction count ×CP I
'
CPU tim e =
Clock rat e'
Instruction count ×1.2 CPI
¿> 0.7CPU time=
Clock rate'

Taking the ratio of CPU time over CPU time’:

' '
CPU time CPI ×Clock rat e 1 1× Clock rat e
= =¿ =
' '
CPU tim e CP I × Clock rate 0.7 1.2× Clock rate
12
¿>Clock rat e ' = Clock rate
7

Processor P1:
' 12
Clock rat e = ×3.0=5.1(GHz )
7

Processor P2:
' 12
Clock rat e = ×2.5=4.3 (GHz)
7

Processor P3:
' 12
Clock rat e = × 4.0=6.6(GHz)
7

Problem 2:
Class A: 106 ×10 %=105 (instructions)
Class B: 106 ×20 %=2 ×105 (instructions )
Class C: 106 ×50 %=5 ×105 (instructions )
Class D: 106 ×20 %=2 ×105 (instructions )
n
Instruction count i
b) global CPI=∑ (CPI i × ¿)¿
i=1 Instruction count
5
global CPI (1)=(10¿ ¿5 ×1)+(2× 2×10 )+¿ ¿ ¿ ¿
5
global CPI (2)=(10¿ ¿5 × 2)+(2× 2× 10 ) +¿ ¿ ¿ ¿
Instruction count × global CPI
a) CPU time=
Clock rate

106 × 2.6
CPU time ( 1 )= 9
=1.04 × 10−3 (s)
2.5 ×10

106 ×2.0
CPU time ( 2 )= 9
=0.66 × 10−3 (s)
3.0 ×10
Hence, the second implementation is faster.
c) Clock cycles=CPI × Instruction count
6
Clock cycles ( 1 )=2.6 × 10 (cycles)
6
Clock cycles ( 2 )=2.0× 10 (cycles)

Problem 3:
CPU time
a) CPI=
Clock cycle time × Instruction count
1.1
CPI ( A )= 9 −9
=1.1
10 ×10
1.5
CPI ( B )= =1.25
1.2×10 9 × 10−9
Instruction count × CPI
b) CPU time =
Clock rate
CPU time ( A ) Instruction count ( A ) CPI ( A ) Clock rate ( B )
¿> = × ×
CPU time ( B ) Instruction count ( B ) CPI ( B ) Clock rate ( A )
Clock rate ( A ) Instruction count ( A ) CPI ( A ) CPU time ( B )
= × ×
Clock rate ( B ) Instruction count ( B ) CPI ( B ) CPU time ( A )
Clock rate ( A ) 10
9
1.1
= × ×1=¿ Clock rate ( A ) =0.73 Clock rate ( B )
Clock rate ( B ) 1.2 ×10 1.25
9

1
Hence, the clock of the processor running compiler B’s code is =1.36 faster
0.73
than the clock of the processor running compiler A’s code.
c) CPU time ( new compiler )=CPI × Instruction count × Clock cycle time
¿ 1.1× 6 ×10 8 ×10−9
(Clock cyle time=10−9 ( s ) because of the same processor)
¿ 0.66( s)
CPU time ( A ) 1.1
= =1.67
CPU time ( new compiler ) 0.66
CPU time (B) 1.5
= =2.27
CPU time(new compiler ) 0.66

Therefore, the new compiler applied for that processor is faster than the
compiler A 1.67 times and also faster than B 2.27 times.

Problem 4:
2
Dynamic power=Capacitive load ×Voltage × Frequency
Dynamic power
Capacitive load =
Voltage 2 × Frequency

90 −8
a) Capacitive load( Pentinum 4 Prescott)= 2 9
=1.6 × 10 ( F)
1.25 ×3.6 ×10
40 −8
Capacitive load (Core i5 Ivy Bridge)= 2 9
=1.45× 10 (F )
0.9 ×3.4 × 10

static power
b) %static power=
dynamic power+ static power
10
Pentinum 4 Prescott :%static power= =10 %
90+ 10
30
Core i 5 Ivy Bridge :%static power= =42.86 %
40+ 30

Ratio of static power to dynamic power:

10 1
Pentinum 4 Prescott : = =0.11
90 9
30 3
Core i 5 Ivy Bridge : = =0.75
40 4

c) total power=dynamic power+ static power

2
¿ Capacitive load ×Voltage × Frequency +Voltage ×leakage current
2
total power−Capacitive load ×Voltage × Frequency
¿>leakage current =
Voltage

After the total power is reduced by 10 % :

2
total power '−Capacitive load ×(Voltage ') × Frequency
leakage current ' =
Voltage '

And the leakage current is unchanged:

2
total power−Capacitive load ×Voltage 2 × Frequency total power '−Capacitive load ×(Voltage ') × Frequen
=
Voltage Voltage '

Pentinum 4 Prescott:
2
(90+ 10)−1.6 ×10−8 ×1.252 ×3.6 ×10 9 (90+10)× 0.9−1.6× 10−8 × ( Voltag e ' ) × 3.6× 109
=
1.25 Voltage '
' 2
90−57.6 ( Voltag e )
¿> 8= '
Voltag e
2
¿>57.6 ( Voltag e ' ) + 8 Voltag e' −90=0

¿>Voltag e ' =1.182(V )

Percentage of voltage reduced:

Voltage−Voltage ' 1.25−1.182
= =5.44 %
Voltage 1.25

Core i5 Ivy Bridge:

2
( 40+30)−1.45 ×10−8 × 0.92 × 3.4 ×109 ( 40+30)× 0.9−1.45 ×10−8 × ( Voltag e ' ) ×3.4 × 109
=
0.9 Voltage '
' 2
63−49.3 × ( Voltag e )
¿>33.4= '
Voltag e
' 2
¿> 49.3 × ( Voltag e ) +33.4 Voltag e −63=0
'
'
¿>Voltag e =0.841(V )

Percentage of voltage reduced:

Voltage−Voltage ' 0.9−0.841
= =6.5 %
Voltage 0.9

Problem 5:
a)
We have the equation:
clock cycles = num of instruction × CPI
Because we have three types of instructions, so:
3

clock cycles = ∑ numof instruction of typei +CPI i

i=1

Hence, for only one processor, we have:

clock cycles ¿ ( 2.56 ×10 9 ) × 1+ ( 1.28 ×109 ) ×12+ ( 256 × 106 ) ×5
¿ 1.92× 10
10

Then,
clock cycles 1.92×10 10
execution time = clock rate = 9 = 9.6 (s)
2 ×10
Call p is the number of processor (p > 1). We have:
9 9
clock cycles p ¿ 2.56× 10 × 1+ 1.28 ×10 ×12+256 ×106 ×5
0.7 × p 0.7 × p
9
2.56× 10 9
¿ + 1.28× 10
p
hence,
9
2.56 ×10 9
+ 12.8× 10
clock cycles p
clock cycles p = =
clock rate 2 ×109
Finally, we’ll sketch the table:
p 1 2 4 8

execution time in seconds 9.6 7.04 3.84 2.24

speed-up (relative to 1 processor ) 1 1.36 2.5 4.29

b)
For one processor we have:
clock cycles = ( 2.56 ×10 9 ) × 2+ ( 1.28 ×109 ) ×12+ ( 256 × 106 ) ×5
¿ ( 2.18 ×10 10)
clock cycles ( 2.18 ×1010 )
execution time = clock rate = = 10.9 (s)
( 2 ×10 9)
Call p is the number of processor (p > 1). We have:
9 9
2.56 × 10 1.28 ×10 6
clock cycles p = ×2+ × 12+ 256 ×10 ×5
0.7 × p 0.7 × p
9
2.93× 10 9
¿ +1.28 × 10
p
hence,
2.93× 109 9
+12.8 × 10 9
clock cycles p 2.56 × 10 9 14.65
clock cycles p = = 9
= +1.28 ×10 = +0.64
clock rate 2× 10 p p
Finally, we’ll sketch the table:
p 1 2 4 8

execution time in seconds 10.9 7.965 4.303 2.47

speed-up (relative to 1 processor ) 1.13 1.13 1.12 1.1

c)
This mean that the execution time of one processor (with reduced CPI ) and of 2

four processors will be the same. So we have:

execution time = 3.84 (s) new

Because clock rate remains unchanged, and

clock cycles
execution time = clock rate
We have:
clock cycles2 GHz = 3.84 (s)
⇒ clock cycles = 2 ×10 ×3.84=7.68 ×109
new
9

Then,
clock cycles = ( 2.56 ×10 9 ) × 1+ ( 1.28 ×109 ) ×CPI 2 , new+ ( 256 ×106 ) ×5
new,

9 9 9
¿ 3.84 ×10 + 1.28× 10 × CPI 2 ,new =7.68× 10
Hence,
9 9
7.68 ×10 −3.84 ×10
CPI 2 , new = 9
=3
1.28 ×10
Problem 6:
a)
First, we obtian the die areas:
Wafer area1 π ( 7.5 )2 2
Die area1 ≈ = =2.104 (cm )
Die count 1 84
Wafer area2 π ( 10 )2 2
Die area2 ≈ = =π (cm )
Die count 2 100
Plug in to the yield euqation:
1 1
Yield 1= 2
= 2
=0.96
Die area1 2.104
(1+ Defect rate1 × ) (1+0.020 × )
2 2
1 1
Yield 2= 2
= =0.91
Die area2 π 2
(1+ Defect rate2 × ) (1+0.031 × )
2 2
b)
Cost per die:
Cost per wafer 1 12
Cost per die 1= = =0.149
Dies per wafer 1 ×Yield 1 84 ×0.96
Cost per wafer 2 15
Cost per die 2= = =0.165
Dies per wafer 2 × Yield2 100 × 0.91

c)
 number of dies per wafer is increased by 10%
Wafer area1 π ( 7.5 )2
=1.91 ( cm )
2
Die area1 ≈ =
Die count 1 84 ×1.1
2
Wafer area2 π ( 10 ) 2
Die area2 ≈ = =2.86( cm )
Die count 2 100 ×1.1

 the defects per area unit increases by 15%

1 1
Yield 1= 2
= =0.95
Die area1 2.104 2
(1+ Defect rate1 ×1.15 × ) (1+0.020 ×1.15 × )
2 2
1 1
Yield 2= 2
= 2
=0.91
Die area2 π
(1+ Defect rate2 ×1.15 × ) (1+0.031 ×1.15 × )
2 2
d) a die area is 200 mm2 = 2 cm 2
We find the yield is given by:
1 1 1
Yield= = =
(1+ Defect rate× Die2area ) ( 1+ Defect rate × 22 )
2 2
(1+ Defect rate )2

Solving for defect rate we have:

1
Defect rate= −1
√Yield
Previous: Defect rate =
1 1 defects
Defect rate= −1= −1=0.043( )
√Yield √ 0.92 cm
2

New :
1 1 defects
Defect rate= −1= −1=0.026 ( )
√Yield √ 0.95 cm2

Problem 7.
Instruction Execution Reference
count time time
2.389E12 750 s 9650 s

a.
- Clock cycle is 0.333ns find CPI.
- CPI = (execution time)/((instruction count) × (Clock cycle))
750
- 12 −9
=0.94
(2.389 ×10 )×(0.333 ×10 )
b.
9650
- Spec ratio = reference time /excecution time= 750 =12.87s
c.
Number of instruction count ×CPI
- CPU time = Clock rate
 Because CPU time is proportional to Instruction count . So increase 10%
of number of instruction count without affect clock rate and CPI will
increase the CPU time 10%.
d.
- CPU time after increase Intruction count 10% , CPI 5%:
( 1.1number of instruction ) ×(1.05 CPI )
- CPU time = =1.115CPU time (old )
Clock rate

So CPU time increase 15.5%

e.
- SPEC ratio = reference time/CPU time
Specratio(after) CPU time (before) 1
- = =
Specratio (before) CPU time(after ) 1.1555
=0.86 s

So the SPEC ratio is decreased by 14%.

f.
( CPU time ) × Clock rate 700 × 4 × 109
- CPI = = =1.37
Instruction count 0.85× 2389× 109
g.
AMD version Clock rate (GHz) CPI
Before 3 0.94
After 4 1.37
Clock rate (after ) 4
- The clock rate ratio between 2 version : Clock rate(before) = 3 =1.33
CPI (after ) 1.37
- The CPI ratio between 2 version : CPI (before) = 0.94 =1.46
 The increase in CPI is different from the increase in Clock rate because
the number of instructions has been reduced by 15%, the CPU time has
been reduced by a lower percentage.
h.
CPU time(after ) 700
- The percentage reduce on CPU time: CPU time(before) = 750 =0.933=6.7 %.
i.
Clock rate (GHz) CPI Instruction count CPU time (ns)
4 1.61 960
- CPU time after reduce 10% : 0.9 × 960=¿864 ns
CPU time× Clock rate 864 × 4 ×109
- Instruction count = = =2147 × 109.
CPI 1.61
j.
Instruction count × CPI
- Clock rate= CPU time
.
- To reduce CPU time 10% (0.9 time), Clock rate must increase
1 1
clock rate ( old ) = ×3 GHz=3.33 GHz .
0.9 0.9
k.
- CPI is reduced by 15 % = 0.85 CPI(old)
- CPU time is reduced by 20% =0.8 CPU time (old)
- New Clock rate =
Instruction count ×(0.85CPI ) 0.85 0.85
= Clock rate ( old )= ×3=3.1875 GHz
0.8CCPU time 0.8 0.8

Problem 8.
Clock Rate (GHz) Instruction CPI
Counts (E9)
P1 4 5 0.9
P2 3 1 0.75

a.
5× 109 ×0.9
- Execution Time P1: =1.125 second
4 ×10 9
1× 109 × 075
- Execution Time P2 9
=0.25 second
3× 10
 This fallacy is false although Processor 1 has larger clock rate than
Processor 2 but the execution time is smaller than processor 2.
b.
- The execution time of Processor 1 to process 1.0E9 instruction:
Instruction count ×CPI 1.0 ×109 ×0.9
CPU time = Clock rate
= = 0.225 s
4 ×10 9

- The number of instructions that Processor 2 can process in 0.25s :

9
CPU time × Clock rate 0.225× 3× 10
Instruction count= = = 9 ×10 8 instructions
CPI 0.75

Calculate the millions of instructions per second (MIPS) of 2 processor:

9
( 4 ×10 )
- MIPS of Processor 1 = =4444.44
0.9 ×106
9
(3× 10 )
- MIPS of Processor 2 = 6
=4000
0.75× 10

In the section (a) of this problem we have the performance ratio of 2

processor:
Performance ( p 1) Execution time( p 2) 0.25
= = =0.22
Performance ( p 2) Execution time( p 1) 1.125

So Processor 1’s performance is less than processor 2’s performance

=> Although Processor 1 has larger MIPS but we has determined that
Processor 2 has better performance in the section a.

d.
Number of FP operation
MFLOPS = 6
Execution time × 10

40 % × 5× 109
- MFLOPS of Processor 1 : 6
=1.7 ×103
1.125× 10
9
40 % × 1× 10 3
- MFLOPS of Processor 2 : 6
=1.6 ×10
0.25× 10

Problem 9:
a)
New time spend to run FP operation:
(1-0.2) x 70 = 56 (s)
Total time reduced by:
70 - 56 = 14 (s)
or (14 : 250) x 100 = 5.6%

b)
The total time is reduced by 20% ⇒ 250 x (1- 0.2) = 200 (s)
Then, the time for execute INT operations is : 200 -70 -85 - 40 = 5 (s)
When the actually time needed is : 250 - 70 - 85 - 40 = 55 (s)
5
Hence, the time for INT operations reduced by : 55 x100 = 91%

c)
Assume that we avoid using branch operations.
The time of execution is : 55 + 70 +85 = 210
210
So it’s reduction is : 1 - 250 = 0.16 = 16%
Hence, the total time cannot be reduced 20% only by decreasing time of
branch operations.

Problem 10:
a)
The execution of 50 ×106 FP instructions
110 ×10 INT instructions
6

80 ×10 L/S instructions

16 ×10 branch instructions

∑ num of instructions × CPI

executions time= i=1
clock rate
50× 106 × 1+110× 106 ×1+80 ×106 × 4+16 × 106 × 2
¿
2× 108
¿ 0.256(s)

b)
we want the program to run two times faster
0.256
⇒ the executions time = 2
= 0.128 (s)
4

∑ num of instructions × CPI

i=1
executions time=
clock rate
Solve for new CPI:
256× 106−462× 106
executions time= 6
=−4.12(cannot)
50 ×10
Therefore, it is impossible for the program to run two times faster.

c)
The CPI of INT and FP instructions reduced by 40%
CPI = (1 - 0.4) x 1 = 0.6
INT

CPI = (1 - 0.4) x 1 = 0.6

CPI of L/S and branch reduced by 30%

CPI = (1 - 0.3) x 4= 2.8
L/S

CPI = (1 - 0.3) x 2 = 1.4

BRANCH

Then,
4

∑ num of instructions × CPI

i=1
executions time=
clock rate
6 6 6 6
50× 10 × 0.6+110 ×10 × 0.6+80 ×10 ×2.8+ 16 ×10 ×1.4
¿ 8
2 ×10
¿ 0.1712( s)

Manual Solution For RISC-V Edition
100% (5)
Manual Solution For RISC-V Edition
100 pages
CEN468 Lab 3 V2
No ratings yet
CEN468 Lab 3 V2
14 pages
Assignment3 2021HT80531
100% (1)
Assignment3 2021HT80531
14 pages
2.IPTV User Manual
No ratings yet
2.IPTV User Manual
21 pages
Solution Manual COD
No ratings yet
Solution Manual COD
115 pages
ESD Assignment1
No ratings yet
ESD Assignment1
7 pages
Problem 1 A) Considering The Number of Instructions Here To Be A Constant A
No ratings yet
Problem 1 A) Considering The Number of Instructions Here To Be A Constant A
13 pages
Solution Chapter 1
91% (22)
Solution Chapter 1
2 pages
8086 Microprocessor
No ratings yet
8086 Microprocessor
19 pages
Assignment - 1
0% (1)
Assignment - 1
4 pages
Midterm Solution
No ratings yet
Midterm Solution
18 pages
Homework 1
No ratings yet
Homework 1
18 pages
Dsa Assignment: 1. Define Binary Tree
No ratings yet
Dsa Assignment: 1. Define Binary Tree
6 pages
Advanced Computer Architecture Test-1 Answer
No ratings yet
Advanced Computer Architecture Test-1 Answer
2 pages
Test 6 PracticeQuestion Cachememory 1
No ratings yet
Test 6 PracticeQuestion Cachememory 1
21 pages
Solutions To Set 7
100% (1)
Solutions To Set 7
20 pages
Computer Architecture Questions
No ratings yet
Computer Architecture Questions
1 page
Parry C Q
100% (1)
Parry C Q
636 pages
Instruction Set Architecture and Design
No ratings yet
Instruction Set Architecture and Design
27 pages
Computer Architecture and Organization Ch#2 Examples
No ratings yet
Computer Architecture and Organization Ch#2 Examples
6 pages
Chapter 4 (Processors and Memory Hierarchy)
100% (1)
Chapter 4 (Processors and Memory Hierarchy)
17 pages
Capgemini Technical Topicwise Sorted
No ratings yet
Capgemini Technical Topicwise Sorted
21 pages
CH 5 Answers
No ratings yet
CH 5 Answers
6 pages
Booths Algorithm
100% (1)
Booths Algorithm
24 pages
8-Bit Microprocessor: VLSI Architecture Project Report On
No ratings yet
8-Bit Microprocessor: VLSI Architecture Project Report On
35 pages
Pipeline Data Hazards: Example #1 - Write-Back Data Hazard
No ratings yet
Pipeline Data Hazards: Example #1 - Write-Back Data Hazard
6 pages
Midterm Exam Architecture
No ratings yet
Midterm Exam Architecture
2 pages
DSA Final Fall 2022
No ratings yet
DSA Final Fall 2022
2 pages
ECE 341 2013 in Class Midterm1
No ratings yet
ECE 341 2013 in Class Midterm1
9 pages
Data Structures
No ratings yet
Data Structures
7 pages
Multiprocessors Interconnection Networks
No ratings yet
Multiprocessors Interconnection Networks
32 pages
Chapter 05
No ratings yet
Chapter 05
19 pages
CCS CMCS 611-101 Advanced Computer Architecture Advanced Computer Architecture
100% (2)
CCS CMCS 611-101 Advanced Computer Architecture Advanced Computer Architecture
24 pages
ECE 341 Final Exam Solution: Problem No. 1 (10 Points)
No ratings yet
ECE 341 Final Exam Solution: Problem No. 1 (10 Points)
9 pages
Chapter 5 - CPU Scheduling
100% (1)
Chapter 5 - CPU Scheduling
41 pages
High Performance Computer Architecture (CS60003)
No ratings yet
High Performance Computer Architecture (CS60003)
2 pages
Introduction To Parallel Processing
No ratings yet
Introduction To Parallel Processing
23 pages
Polish Expression
No ratings yet
Polish Expression
20 pages
Ael Zg626 Ec-3r First Sem 2023-2024
No ratings yet
Ael Zg626 Ec-3r First Sem 2023-2024
5 pages
Introduction To TMS320C6713 DSP Starter Kit DSK)
No ratings yet
Introduction To TMS320C6713 DSP Starter Kit DSK)
18 pages
The University of The South Pacific: EE326 Embedded Systems
No ratings yet
The University of The South Pacific: EE326 Embedded Systems
2 pages
8085 Simulator: A User Manual On
No ratings yet
8085 Simulator: A User Manual On
41 pages
Assignment 2
No ratings yet
Assignment 2
12 pages
Lecture 24
No ratings yet
Lecture 24
41 pages
Expt-9 Interfacing of 8253 With 8086
No ratings yet
Expt-9 Interfacing of 8253 With 8086
2 pages
Data Mining Exercises - Solutions
No ratings yet
Data Mining Exercises - Solutions
5 pages
Ovsf Codes
100% (1)
Ovsf Codes
2 pages
8086 Assign2 Ans
100% (1)
8086 Assign2 Ans
6 pages
Digi QS Full 2
No ratings yet
Digi QS Full 2
30 pages
Homework3 Solution v2
No ratings yet
Homework3 Solution v2
41 pages
Model Checking
No ratings yet
Model Checking
6 pages
Embedded Systems Lab Manual
100% (1)
Embedded Systems Lab Manual
60 pages
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
No ratings yet
Microprocessor and Interfacing Techniques: (Course Code: CET208A) Credits-3
147 pages
12 .B. Closely and Loosely Coupled
No ratings yet
12 .B. Closely and Loosely Coupled
4 pages
Week 6: Assignment Solutions
No ratings yet
Week 6: Assignment Solutions
4 pages
DC Lab Exp6 17l238 Rep
No ratings yet
DC Lab Exp6 17l238 Rep
12 pages
Computer Architecture & Organization Assignment Based On Pipelining
No ratings yet
Computer Architecture & Organization Assignment Based On Pipelining
1 page
CH01 Solution PDF
No ratings yet
CH01 Solution PDF
8 pages
Solns HWSW MIPS Textbook
No ratings yet
Solns HWSW MIPS Textbook
99 pages
CompEng 361 - Homework 1 - Solutions(1)
No ratings yet
CompEng 361 - Homework 1 - Solutions(1)
4 pages
Open Book 331
No ratings yet
Open Book 331
33 pages
計組題目解答
No ratings yet
計組題目解答
34 pages
Final Examination: SUBJECT: Physics 3 (ID: PH015IU)
No ratings yet
Final Examination: SUBJECT: Physics 3 (ID: PH015IU)
3 pages
Dirt Bikes Financial and Sales Data
No ratings yet
Dirt Bikes Financial and Sales Data
7 pages
Final Entre
No ratings yet
Final Entre
1 page
Int F (Int& N) (N++ Return N ) Int X 1 X + F (X) + X //1
No ratings yet
Int F (Int& N) (N++ Return N ) Int X 1 X + F (X) + X //1
2 pages
ABI MATLAB For Electrical and Computer Engineering Students and Professionals With Simulink PDF
No ratings yet
ABI MATLAB For Electrical and Computer Engineering Students and Professionals With Simulink PDF
1 page
Solutions Manual For Besterfield Quality Improvement: - PDF - 92 Pages - 479.32 KB - 10 May, 2016
No ratings yet
Solutions Manual For Besterfield Quality Improvement: - PDF - 92 Pages - 479.32 KB - 10 May, 2016
3 pages
Lab1 NP PDF
No ratings yet
Lab1 NP PDF
2 pages
Cisco Wireless Solution Overview
No ratings yet
Cisco Wireless Solution Overview
4 pages
Establishing JDBC Connection in Java
No ratings yet
Establishing JDBC Connection in Java
4 pages
Quiz 1
No ratings yet
Quiz 1
6 pages
Non Conventional or Modern Form
No ratings yet
Non Conventional or Modern Form
16 pages
Computer Graphics Report
No ratings yet
Computer Graphics Report
26 pages
Travel and Tourism Management System2017dotnetproject
No ratings yet
Travel and Tourism Management System2017dotnetproject
5 pages
Creating A Ceph Storage Cluster Using Old Desktop Computers
100% (1)
Creating A Ceph Storage Cluster Using Old Desktop Computers
7 pages
Uccx 80 PBT Lab Guide
No ratings yet
Uccx 80 PBT Lab Guide
50 pages
Network Monitoring System
No ratings yet
Network Monitoring System
23 pages
TW100-BRF104: User's Guide
No ratings yet
TW100-BRF104: User's Guide
80 pages
Axxess IPdevices Install Manual
No ratings yet
Axxess IPdevices Install Manual
230 pages
ExtremeXOS Upgrading The BootROM
No ratings yet
ExtremeXOS Upgrading The BootROM
2 pages
CCS Ut2 QB
No ratings yet
CCS Ut2 QB
19 pages
Activity 2
No ratings yet
Activity 2
6 pages
CUDA C Programming Guide
No ratings yet
CUDA C Programming Guide
346 pages
SSRS Interview Questions PDF Download Basic Part 2
No ratings yet
SSRS Interview Questions PDF Download Basic Part 2
3 pages
Iface SDK Manual
100% (1)
Iface SDK Manual
86 pages
Wk-6 APC-MultiSIM Online LAB Part-1
No ratings yet
Wk-6 APC-MultiSIM Online LAB Part-1
4 pages
CM Ra2tr
No ratings yet
CM Ra2tr
2 pages
Configure Cisco Router
No ratings yet
Configure Cisco Router
9 pages
Audio Steganography Complete MATLAB Report-1
100% (2)
Audio Steganography Complete MATLAB Report-1
65 pages
Waltrmrtantplorzlid - Globe FDD L18 - L26 BFT 071621
No ratings yet
Waltrmrtantplorzlid - Globe FDD L18 - L26 BFT 071621
67 pages
PC 8 PDF
No ratings yet
PC 8 PDF
28 pages
Activity Sheets Computer1
No ratings yet
Activity Sheets Computer1
10 pages
DLP 32C3FB
No ratings yet
DLP 32C3FB
74 pages

Exercise 1 ComputerArchitecture

Uploaded by

Exercise 1 ComputerArchitecture

Uploaded by

Computer Architecture – Thursday afternoon

a) Considering the number of instructions here to be a constant a.

Number of instructions that P1 executed:

Formula calculating clock cycles:

c) CPI ' =1.2CPI

Taking the ratio of CPU time over CPU time’:

Ratio of static power to dynamic power:

c) total power=dynamic power+ static power

After the total power is reduced by 10 % :

And the leakage current is unchanged:

¿>Voltag e ' =1.182(V )

Percentage of voltage reduced:

Core i5 Ivy Bridge:

Percentage of voltage reduced:

clock cycles = ∑ numof instruction of typei +CPI i

Hence, for only one processor, we have:

execution time in seconds 9.6 7.04 3.84 2.24

speed-up (relative to 1 processor ) 1 1.36 2.5 4.29

execution time in seconds 10.9 7.965 4.303 2.47

speed-up (relative to 1 processor ) 1.13 1.13 1.12 1.1

four processors will be the same. So we have:

Because clock rate remains unchanged, and

 the defects per area unit increases by 15%

Solving for defect rate we have:

So CPU time increase 15.5%

So the SPEC ratio is decreased by 14%.

- The number of instructions that Processor 2 can process in 0.25s :

Calculate the millions of instructions per second (MIPS) of 2 processor:

In the section (a) of this problem we have the performance ratio of 2

So Processor 1’s performance is less than processor 2’s performance

80 ×10 L/S instructions

16 ×10 branch instructions

∑ num of instructions × CPI

∑ num of instructions × CPI

CPI = (1 - 0.4) x 1 = 0.6

CPI of L/S and branch reduced by 30%

CPI = (1 - 0.3) x 2 = 1.4

∑ num of instructions × CPI

You might also like