Advanced Computer Architecture
Advanced Computer Architecture
Falguni Sinhababu
Government College of Engineering and Leather Technology
1
MULTICYCLE OPERATIONS
MUL IF ID EX MEM WB
Without Pipeline
MUL IF ID EX EX EX EX MEM WB
MUL IF ID EX EX EX EX MEM WB
With Pipeline
9 clock
cycles per
iteration
(with 4
Our program has 1000 iterations and there are 9
clock cycles per iteration so there are total of
stalls)
9000 iterations.
7 clock
cycles per
iteration
(with 2
stalls)
Scheduling
the unroll
loop
LOADs are done using the two Load/Store unit. Then two ADD.D instructions. The S.D there
will be delay of 2 cycles. For 4 instructions we need 8 cycles.
V1 V2 V3 V4 V5
LV v1
MULV v3,v1,v2
ADDV v5, v3, v4
Chain Chain
Load Unit
Mult. Add
Memory
Space Space
Advanced Computer Architecture 45
SIMD ARRAY PROCESSING VS. VLIW
VLIW Array processor