Unit-4-Pipeline and Vector Processing
Unit-4-Pipeline and Vector Processing
Vector
Reference Processing
: Chapter 9 from Computer System Architecture by Morris Mano
• How to achieve?
• Concurrent Data Processing.
• Multiprocessor System.
• Parallel processing can be viewed from various
level of complexity
• Lowest level: Parallel and serial operation by the type
of register used.
• Higher level: multiple functional units
1 A1 B1 - - -
2 A2 B2 A1 * B1 C1 -
3 A3 B3 A2 * B2 C2 A1 * B1 + C1
4 A4 B4 A3 * B3 C3 A2 * B2 + C2
5 A5 B5 A4 * B4 C4 A3 * B3 + C3
6 A6 B6 A5 * B5 C5 A4 * B4 + C4
7 A7 B7 A6 * B6 C6 A5 * B5 + C5
8 - - A7 * B7 C7 A6 * B6 + C6
9 - - - - A7 * B7 + C7
• Pipeline Unit:
• k segment pipeline with clock cycle time tp to complete n tasks.
• Clock cycles required to complete n task= k + (n - 1)
• Time to complete n task= k*tp + (n - 1)*tp = (k+n-1) * tp
• Non Pipeline Unit:
• tn to complete each task
• Time to complete n task= n * tn
TR= 10ns
Speedup=Tn / Tp = 320 /110
© Ronak Patel, Computer Engineering Department , CSPIT, CHARUSAT
Instruction Pipeline
• An instruction pipeline reads consecutive instructions from memory while
previous instructions are being executed in other segment.
• The instruction fetch segment can be implemented by FIFO buffer.
• Whenever execution unit is not using memory, the control increments the
program counter and read the next instruction.
• Reduce average access time to memory for reading instructions.
• Instruction phases:
1. Fetch the instruction from memory
2. Decode the instruction.
3. Calculate the effective address.
4. Fetch the operands from memory.
5. Execute the instruction.
6. Store the result in the proper place.
© Ronak Patel, Computer Engineering Department , CSPIT, CHARUSAT
Reduce the Phases
• A register mode instruction does not need an
effective address calculation.
Advantage: Data dependency is taken care of by the compiler rather than the hardware.
?
© Ronak Patel, Computer Engineering Department , CSPIT, CHARUSAT