Pipelining - Computer Architecture and Organization
Instruction pipelining is similar to the use of an assembly line in a manufacturing
plant. An assembly line takes advantage of the fact that a product goes through
various stages of production. By laying the production process out in an assembly
line, products at various stages can be worked on simultaneously. This process is
also referred to as pipelining, because, as in a pipeline, new inputs are accepted at
one end before previously accepted inputs appear as outputs at the other end.
It should be clear that this process will speed up instruction execution. If the fetch and execute
stages were of equal duration, the instruction cycle time would be halved. However, if we look
more closely at this pipeline (Figure 14.9b), we will see that this doubling of execution rate is
unlikely for two reasons:
The execution time will generally be longer than the fetch time. Execution will involve reading
and storing operands and the performance of some operation. Thus, the fetch stage may have to
wait for some time before it can empty its buffer.
A conditional branch instruction makes the address of the next instruction to be fetched unknown.
Thus, the fetch stage must wait until it receives the next instruction address from the execute
stage. The execute stage may then have to wait while the next instruction is fetched.
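To put rough numbers on the first of these reasons: with a two-stage pipeline, the clock must accommodate the slower stage, so the speedup falls short of 2. A minimal sketch, assuming hypothetical stage times of 20 ns for fetch and 30 ns for execute (these figures are illustrative, not from the text):

    # Hypothetical stage latencies; the ratio, not the values, is the point.
    fetch_ns, execute_ns = 20, 30
    n = 1000                                   # instructions executed

    sequential = n * (fetch_ns + execute_ns)   # no overlap at all
    # Pipelined: the effective cycle time is set by the slower stage.
    pipelined = (fetch_ns + execute_ns) + (n - 1) * max(fetch_ns, execute_ns)

    print(round(sequential / pipelined, 2))    # ~1.67, not the ideal 2.0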
Guessing can reduce the time loss from the second reason. A simple rule is the following: When a
conditional branch instruction is passed on from the fetch to the execute stage, the fetch stage
fetches the next instruction in memory after the branch instruction. Then, if the branch is not
taken, no time is lost. If the branch is taken, the fetched instruction must be discarded and a new
instruction fetched.
While these factors reduce the potential effectiveness of the two-stage pipeline, some
speedup occurs. To gain further speedup, the pipeline must have more stages. Let us
consider the following decomposition of the instruction processing.
Fetch instruction (FI): Read the next expected instruction into a buffer.
Decode instruction (DI): Determine the opcode and the operand specifiers.
Calculate operands (CO): Calculate the effective address of each source operand. This
may involve displacement, register indirect, indirect, or other forms of address
calculation.
Fetch operands (FO): Fetch each operand from memory. Operands in registers need not
be fetched.
Execute instruction (EI): Perform the indicated operation and store the result, if any, in
the specified destination operand location.
Write operand (WO): Store the result in memory.
With this decomposition, the various stages will be of more nearly equal duration.
For the sake of illustration, let us assume equal duration. Using this assumption,
Figure 14.10 shows that a six-stage pipeline can reduce the execution time for 9
instructions from 54 time units (9 instructions × 6 stages, executed strictly in
sequence) to 14 time units.
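The 14-unit figure is an instance of the general pipeline timing formula: with k stages of equal duration and n instructions, the first instruction completes after k time units and one more completes every unit thereafter, for k + (n - 1) units in total. A short sketch that reproduces the layout of Figure 14.10 (an approximation of the chart, not a copy of it):

    STAGES = ["FI", "DI", "CO", "FO", "EI", "WO"]   # the six-stage decomposition
    N = 9                                           # instructions, as in Figure 14.10

    total = len(STAGES) + (N - 1)                   # 6 + 8 = 14 time units
    print("total time units:", total)

    # Instruction i (0-based) occupies stage s during time unit i + s + 1.
    for i in range(N):
        row = [".."] * total
        for s, name in enumerate(STAGES):
            row[i + s] = name
        print(f"I{i + 1}: " + " ".join(row))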
Several comments are in order: The diagram assumes that each instruction goes
through all six stages of the pipeline. This will not always be the case. For
example, a load instruction does not need the WO stage. However, to simplify the
pipeline hardware, the timing is set up assuming that each instruction requires all
six stages. Also, the diagram assumes that all of the stages can be performed in
parallel. In particular, it is assumed that there are no memory conflicts. For
example, the FI, FO, and WO stages involve a memory access. The diagram
implies that all these accesses can occur simultaneously. Most memory systems
will not permit that. However, the desired value may be in cache, or the FO or WO
stage may be null. Thus, much of the time, memory conflicts will not slow down
the pipeline.
Several other factors serve to limit the performance enhancement. If the six stages
are not of equal duration, there will be some waiting involved at various pipeline
stages, as discussed before for the two-stage pipeline. Another difficulty is the
conditional branch instruction, which can invalidate several instruction fetches. A
similar unpredictable event is an interrupt. Figure 14.11 illustrates the effects of
the conditional branch, using the same program as Figure 14.10. Assume that
instruction 3 is a conditional branch to instruction 15. Until the instruction is
executed, there is no way of knowing which instruction will come next. The
pipeline, in this example, simply loads the next instruction in sequence (instruction
4) and proceeds. In Figure 14.10, the branch is not taken, and we get the full
performance benefit of the enhancement. In Figure 14.11, the branch is taken. This
is not determined until the end of time unit 7. At this point, the pipeline must be
cleared of instructions that are not useful. During time unit 8, instruction 15 enters
the pipeline. No instructions complete during time units 9 through 12; this is the
performance penalty incurred because we could not anticipate the branch.
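A minimal sketch of the Figure 14.11 scenario, under the same six-stage model (the schedule helper below is an assumption used for illustration): instruction 3 enters the pipeline at time unit 3, resolves its branch in the EI stage at time 7, the speculatively fetched I4 through I7 are flushed, and I15 enters at time 8.

    STAGE_NAMES = ["FI", "DI", "CO", "FO", "EI", "WO"]

    def schedule(fetch_time):
        # Map each stage to the time unit an instruction occupies it,
        # given the time unit at which it is fetched.
        return {name: fetch_time + s for s, name in enumerate(STAGE_NAMES)}

    i3 = schedule(3)                       # I3 enters FI at time 3
    print("branch resolved at time", i3["EI"])        # 7

    flushed = [f"I{k}" for k in range(4, 8)]          # I4..I7 discarded
    i15 = schedule(i3["EI"] + 1)           # target fetched at time 8
    print("flushed:", flushed)
    print("I15 completes at time", i15["WO"])         # 13; nothing completes 9-12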
Figure 14.12 indicates the logic needed for pipelining to account for branches and
interrupts.
Other problems arise that did not appear in our simple two-stage organization. The
CO stage may depend on the contents of a register that could be altered by a
previous instruction that is still in the pipeline. Other such register and memory
conflicts could occur. The system must contain logic to account for this type of
conflict.
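As a sketch of what such interlock logic has to decide (the register-set representation is an assumption for illustration, not the book's design): before the CO stage reads a register, check whether any older instruction still in the pipeline has yet to write that register, and stall if so.

    def must_stall(co_source_regs, pending_dest_regs):
        # co_source_regs: registers the CO stage is about to read.
        # pending_dest_regs: registers that older, still-in-flight
        # instructions will write but have not yet written.
        return any(r in pending_dest_regs for r in co_source_regs)

    # An older instruction will write R1; address calculation reads R1 -> stall.
    print(must_stall({"R1", "R6"}, {"R1"}))   # True
    print(must_stall({"R2", "R6"}, {"R1"}))   # False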
To clarify pipeline operation, it might be useful to look at an alternative depiction.
Figures 14.10 and 14.11 show the progression of time horizontally across the
figures, with each row showing the progress of an individual instruction. Figure
14.13 shows the same sequence of events, with time progressing vertically down the
figure, and each row showing the state of the pipeline at a given point in time. In
Figure 14.13a (which corresponds to Figure 14.10), the pipeline is full at time 6,
with 6 different instructions in various stages of execution, and remains full
through time 9; we assume that instruction I9 is the last instruction to be executed.
In Figure 14.13b (which corresponds to Figure 14.11), the pipeline is full at times
6 and 7. At time 7, instruction 3 is in the execute stage and executes a branch to
instruction 15. At this point, instructions I4 through I7 are flushed from the
pipeline, so that at time 8, only two instructions are in the pipeline, I3 and I15.
In the previous subsection, we mentioned some of the situations that can result in
less than optimal pipeline performance. In this subsection, we examine this issue
in a more systematic way. Chapter 16 revisits this issue, in more detail, after we
have introduced the complexities found in superscalar pipeline organizations.
A pipeline hazard occurs when the pipeline, or some portion of the pipeline, must
stall because conditions do not permit continued execution. Such a pipeline stall
is also referred to as a pipeline bubble. There are three types of hazards: resource,
data, and control.
A data hazard occurs when two instructions in sequence access the same register or
memory location and the overlap in the pipeline would produce a different result
than strict sequential execution. There are three types of data hazards:
• Read after write (RAW), or true dependency: An instruction modifies a location,
and a subsequent instruction reads that location. A hazard occurs if the read takes
place before the write is complete.
• Write after read (WAR), or antidependency: An instruction reads a location, and a
subsequent instruction writes to it. A hazard occurs if the write takes place before
the read.
• Write after write (WAW), or output dependency: Two instructions both write
to the same location. A hazard occurs if the write operations take place in the
reverse order of the intended sequence.
The example of Figure 14.16 is a RAW hazard. The other two hazards are best
discussed in the context of the superscalar organization covered in Chapter 16.
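The three classes can be stated compactly in terms of the locations each instruction reads and writes (the set-based encoding below is an assumption for illustration, not the book's notation):

    def classify(i_reads, i_writes, j_reads, j_writes):
        # i is the earlier instruction, j the later one.
        found = []
        if i_writes & j_reads:
            found.append("RAW")   # j reads what i writes (true dependency)
        if i_reads & j_writes:
            found.append("WAR")   # j writes what i reads (antidependency)
        if i_writes & j_writes:
            found.append("WAW")   # both write the same place (output dependency)
        return found

    # ADD R1 <- R2 + R3, then SUB R4 <- R1 - R5: a RAW hazard on R1.
    print(classify({"R2", "R3"}, {"R1"}, {"R1", "R5"}, {"R4"}))   # ['RAW']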
A control hazard, also known as a branch hazard, occurs when the pipeline makes
the wrong decision on a branch prediction and therefore brings instructions into
the pipeline that must subsequently be discarded. We discuss approaches to
dealing with control hazards next.
A variety of approaches have been taken for dealing with conditional branches:
• Multiple streams
• Prefetch branch target
• Loop buffer
• Branch prediction (sketched below)
• Delayed branch
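Of these, branch prediction is the easiest to sketch in isolation. One widely used scheme (not detailed in this excerpt) keeps a 2-bit saturating counter per branch, so the prediction flips only after two consecutive mispredictions:

    class TwoBitPredictor:
        # States 0-1 predict not taken; states 2-3 predict taken.
        def __init__(self):
            self.counters = {}                       # branch address -> state 0..3

        def predict(self, addr):
            return self.counters.get(addr, 0) >= 2   # True means predict taken

        def update(self, addr, taken):
            state = self.counters.get(addr, 0)
            # Saturate at 0 and 3 so a single anomalous outcome does not
            # flip a strongly biased branch.
            self.counters[addr] = min(state + 1, 3) if taken else max(state - 1, 0)

    p = TwoBitPredictor()
    for outcome in [True, True, False, True]:        # a mostly taken loop branch
        print("predict taken:", p.predict(0x40), "actual:", outcome)
        p.update(0x40, outcome)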