Lecture-14 CH-04 2
Lecture-14 CH-04 2
Edition
The Hardware/Software Interface
Chapter 4
The Processor
Performance Issues
◼ Longest delay determines clock period
◼ Critical path: load instruction
◼ Instruction memory → register file → ALU →
data memory → register file
◼ Not feasible to vary period for different
instructions
◼ Violates design principle
◼ Making the common case fast
◼ We will improve performance by pipelining
◼ Four loads:
◼ Speedup
= 8/3.5 = 2.3
◼ Non-stop:
◼ Speedup
= 2n/(0.5n + 1.5) ≈ 4
n is the number of
stages
Prediction
correct
Prediction
incorrect