0% found this document useful (0 votes)

333 views43 pages

CH-1 1 Pipelining

Pipelining hazards can occur in instruction pipelines that prevent the next instruction from executing as planned. There are three main types of hazards: structural hazards which occur when two instructions need to use the same functional unit; data hazards which happen when an instruction needs results from a previous instruction; and control hazards which arise from conditional branch instructions altering the instruction flow. Managing these hazards effectively is important for achieving optimal pipelining performance.

Uploaded by

Devanshi Gudsariya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

333 views43 pages

CH-1 1 Pipelining

Uploaded by

Devanshi Gudsariya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

PIPELINING AND HAZARDS

UNIT - IV
CONTENT

1. Introduction of Pipelining
2. Instruction Pipelining
3. Arithmetic Pipelining
4. Pipelining Hazards
5. Numerical on Pipelining
1. INTRODUCTION OF PIPELINING
INTRODUCTION

◼ A program consists of several number of instructions. These instructions may

be executed in the following two-way:

◼ The primary goal of computer architecture is to enhance the performance and

speed of the computer. This can be achieved by:
◼ Improving the hardware
◼ Arranging the hardware so that multiple operations can be performed simultaneously.
INTRODUCTION

◼ In non-pipelined (sequential) architecture, all the instructions of a program are

executed sequentially one after the other
◼ Pipelining is referred as
◼ A technique in which a given task is divided into a number of subtasks that need to be
performed in sequence.
◼ One of the processes of arranging the hardware so that simultaneous execution of multiple
instructions takes place, thus, improving the overall performance.

◼ The main advantage of pipelining is the simultaneous execution of various subtasks,

which improves the system's throughput.
PIPELINING
◼ A technique of decomposing a sequential process into suboperations, with each
subprocess being executed in a partially dedicated segment that operates concurrently
with all other segments.
◼ Example:
OPERATIONS IN EACH PIPELINING STAGE

Pipelined Execution

Non-Pipelined Execution: 3*7 = 21 Clock

pulses
INSTRUCTION CYCLE

◼ There are several stages of processing an instruction. A pipeline can be of three,

four, ﬁve, or six stages.

3-stage 4-stage 5-stage 6-stage

Fetch Fetching the instruction Fetching the instruction Fetching the instruction
Decode Decoding the instruction Decoding the instruction Decoding the instruction
Execute Executing the instruction Memory Access for operands Calculate the effective address
Write Back Executing the instruction of the operand

Write Back Memory Access for operands

Executing the instruction
Write Back
SEQUENTIAL VS. PIPELINED EXECUTION OF INSTRUCTIONS
◼ Consider that there are three instructions- I1, I2, and I3 and there are 4 stages of
execution – Fetch (F), Decode (D), Execute (E), and Write back (W).
◼ It takes 12 machine cycles to execute these three instructions in sequential processing
and only 6 machine cycles in pipelining.

4-stage pipelining
PERFORMANCE MEASURES FOR THE GOODNESS
OF A PIPELINE
◼ In order to formulate the performance measures for the goodness of a
pipeline in processing a series of instructions
◼ A space-time chart (called the Gantt’s chart) is used.
◼ In this chart, the vertical axis represents the segments (four in this case) and the
horizontal axis represents time (the time (T) taken by each subunit to perform its task
is the same, therefore, known as unit time)

13 time units are

required to execute
10 instructions
using 4-stage
pipelining
PERFORMANCE MEASURES FOR THE GOODNESS OF A PIPELINE
SPEED-UP S(N)

◼
SPEED-UP S(N)

◼
Mainly two
types of
pipelining

1. Instruction 2. Arithmetic
Pipelining Pipelining
2. INSTRUCTION PIPELINING
INSTRUCTION PIPELINING
◼ Instruction pipelines are used to divide the task of executing a stream of instructions into
subtasks to be executed in different pipeline segments to improve the throughput of the
computer system.
◼ For example, if we have a stream of instructions, then one segment of the pipeline can
read the instructions while another segment can decode the previous instruction. In this
way, more than one instruction will be handled simultaneously by the computer system
which will improve its throughput. The instruction pipeline will be more efﬁcient if the
instructions are divided into equal-duration segments.

◼ A typical example of an instruction pipeline used by computer systems consists of the

following segments:
◼ Segment 1: This segment will fetch the instruction from the memory
◼ Segment 2: This segment will decode the instruction and ﬁnd out the effective address
◼ Segment 3: This segment will fetch the operands from the memory
◼ Segment 4: This segment will execute the instruction
INSTRUCTION PIPELINING
3. ARITHMETIC PIPELINE
ARITHMETIC PIPELINING

◼ Arithmetic pipelines are used to divide an arithmetic task into subtasks to be executed in
different pipeline segments.
◼ The main purpose is to speed up the arithmetic operations

◼ Pipeline arithmetic units are usually found in very high-speed computers. They are used
to implement floating-point operations, multiplication of fixed-point numbers, and similar
computations encountered in scientific problems.
◼ Floating-point operations are easily decomposed into suboperations.
◼ A pipeline multiplier is essentially an array multiplier, with special adders designed to minimize the carry
propagation time through the partial products.
ARITHMETIC PIPELINING: FLOATING POINT ADDER
Exponents Mantissas
We know that two floating point numbers are represented in their a b A B

normalized form using mantissa and exponents. Mantissa represents the R R

precision of the number and the exponent represents the range.
Compare Difference
Let us consider two floating point numbers X and Y. Segment 1: exponents
by subtraction

X = A x 2a
Choose exponent Align mantissa
Y = B x 2b Segment 2:

A and B are two fractions that represent the mantissa and a R

and b are the exponents.
Segment 3: Add or subtract
mantissas
1. Compare the exponents In the arithmetic pipeline,
2. Align the mantissa these four steps are R R
performed in four different
3. Add/sub the mantissa segments to improve the Segment 4: Adjust Normalize
exponent result
4. Normalize the result speed and throughput of
the system R R
ARITHMETIC PIPELINING:
FLOATING POINT ADDER
EXAMPLE: FLOATING POINT ADDER

◼ The following numerical example may clarify the suboperations performed in each
segment. For simplicity, we use decimal numbers, although Figure refers to binary numbers.
1. Consider the two normalized ﬂoating-point numbers:

2. The two exponents are subtracted in the first segment to obtain 3 - 2 = 1. The larger
exponent 3 is chosen as the exponent of the result.
3. The next segment shifts the mantissa of Y to the right to obtain
EXAMPLE: FLOATING POINT ADDER
4. This aligns the two mantissa under the same exponent. The addition of the two mantissa
in segment 3 produces the sum Z = 1 .0324 * 103.
5. The sum is adjusted by normalizing the result so that it has a fraction with a nonzero first
digit. This is done by shifting the mantissa once to the right and incrementing the
exponent by one to obtain the normalized sum.
Z = 0.10324 * 104.
6. The comparator, shifter, adder-subtractor, incrementer, and decrementer in the
floating-point pipeline are implemented with combinational circuits. Suppose that the
time delays of the four segments are t1 = 60 ns, t2 = 70 ns, t3 = 100 ns, t4 = 80 ns, and the
interface registers have a delay of tr = 10 ns.
7. The clock cycle is chosen to be tp = t3 + tr = 110 ns.
8. An equivalent non-pipeline floating point adder-subtractor will have a delay time tn = t1 + t2
+ t3 + t4 + tr = 320 ns.
9. In this case the pipelined adder has a speedup of 320/110 = 2.9 over the nonpipelined
adder.
4. PIPELINING HAZARDS
PIPELINING HAZARDS

◼ Pipeline hazards are situations that prevent the next instruction in the instruction stream
from executing during its designated clock cycles.

◼ Any condition that causes a stall in the pipeline operations can be called a hazard.

◼ There are primarily three types of hazards:

1. Data Hazards

2. Control Hazards or instruction Hazards

3. Structural Hazards.
1. DATA HAZARDS

◼ A data hazard is any condition in which either the source or the destination operands of an
instruction are not available at the time expected in the pipeline. As a result, some
operation has to be delayed, and the pipeline stalls.

◼ When the execution of an instruction is dependent on the results of a prior instruction

that’s still being processed in a pipeline, data hazards occur. If the execution is done in a
pipelined processor, it is highly likely that the interleaving of these two instructions can lead
to incorrect results due to data dependency between the instructions. Thus the pipeline
needs to be stalled as and when necessary to avoid errors.
1. DATA HAZARDS

◼ Consider the following scenario.

DATA HAZARDS CLASSIFICATION

◼ Data hazards are divided into three types according to the order in which READ or WRITE
operations are performed on the register:

1. Flow/True Data Dependency [RAW (or Read after Write)]:

This is when one instruction makes use of data from a previous instruction.

Example,

ADD X0, X1, X2

SUB X4, X3, X0

DATA HAZARDS CLASSIFICATION

2. Anti-Data Dependency [WAR (or Write after Read)]

When the second instruction is written to a register before the ﬁrst instruction is read, this is known as a race
condition. In the case of a simple structure of a pipeline, this is uncommon. WAR, on the other hand, can occur in
some machines having complex and speciﬁc instructions.
Example,
ADD X2, X1, X0
SUB X0, X3, X4

3. Output data dependency [WAW (or Write after Write)]

This is a situation where two simultaneous instructions must write the same register in the same sequence they
were issued.
Example,
ADD X0, X1, X2
SUB X0, X4, X5
DATA HAZARDS CLASSIFICATION

◼ Important Note:

WAW and WAR hazards can only occur when instructions are executed in parallel or out of order.
These occur because the same register numbers have been allotted by the compiler although
avoidable.

This situation is ﬁxed by renaming one of the registers by the compiler or by delaying the updating of
a register until the appropriate value has been produced.

Modern CPUs not only have incorporated Parallel execution with multiple ALUs but also out of order
issues and execution of instructions along with many stages of pipelines.
1. DATA HAZARDS

◼ Solution 1: At the IF stage of the SUB instruction, add three bubbles. This will make it easier for SUB – ID
(Instruction Decoder) to work at t6. As a result, all subsequent instructions in the pipe are similarly delayed.

◼ Solution 2: Forwarding of Data – Data forwarding is the process of sending a result straight to that
functional unit that needs it: a result is transferred from one unit’s output to another’s input. The goal is to
have the solution ready for the next instruction as soon as possible.
2. STRUCTURAL HAZARDS
◼ Hardware resource conﬂicts among the
instructions in the pipeline cause structural
hazards. Memory, a GPR Register, or an ALU
might all be used as resources here.

◼ When more than one instruction in the pipe

requires access to the very same resource in the
same clock cycle, a resource conﬂict is said to
arise.

◼ In an overlapping pipelined execution, this is a

situation where the hardware cannot handle all
potential combinations.
2. STRUCTURAL HAZARDS

◼ Solution: For a portion of the

pipeline, instructions must be
performed in series rather than
parallel.
3. CONTROL HAZARDS
◼ Control hazards are called Branch hazards and are caused by Branch Instructions. Branch
instructions control the ﬂow of program/ instructions execution. Recall that we use
conditional statements in the higher-level language either for iterative loops or with
conditions checking (correlate with for, while, if, and case statements). These are transformed
into one of the variants of BRANCH instructions. It is necessary to know the value of the
condition being checked to get the program ﬂow.

◼ Thus a Conditional hazard occurs when the decision to execute an instruction is based on
the result of another instruction like a conditional branch, which checks the condition’s
resultant value.

◼ The branch and jump instructions decide the program ﬂow by loading the appropriate
location in the Program Counter(PC). The PC has the value of the next instruction to be
fetched and executed by CPU. Consider the following sequence of instructions.
3. CONTROL HAZARDS
SOLUTION FOR CONTROL HAZARDS

1. Stall:

Stall the given pipeline as soon as any branch instructions are decoded. Just don’t allow IF anymore.
Stalling reduces throughput as it always does. According to statistics, at least 30% of the instructions
in a program are BRANCH. With Stalling, the pipeline is effectively operating at 50% capacity.

2. Prediction:

Consider a for or a while loop that is repeated 100 times. We know the program would run 100 times
without the given branch condition being met. The program only exits the loop for the 101st time. As a
result, it’s better to let the pipeline run its course and then ﬂush/undo when the branch condition is
met. This has less of an impact on the pipeline’s throttle and stalling.
SOLUTION FOR CONTROL HAZARDS
3. Dynamic Branch Prediction :

A history record is maintained with the help of Branch Table Buffer (BTB). The BTB is a kind of cache, which
has a set of entries, with the PC address of the Branch Instruction and the corresponding effective branch
address. This is maintained for every branch instruction that occurs.

Branch Instruction Address Target Branch Address taken

4. Reordering Instructions:

Delayed branching entails reordering the instructions to move the branch instruction later in the
sequence, allowing safe and beneﬁcial instructions that are unaffected by the result of a branch to be
brought in earlier in the sequence, delaying the fetch of the branch instruction. If such instructions are
not available, NOP is used. The Compiler is used to implement this delayed branch.
5. NUMERICAL ON PIPELINING
QUESTION 1

In certain scientiﬁc computations it is necessary to perform the arithmetic operation (Ai + Bi)(Ci +
Di) with a stream of numbers. Specify a pipeline conﬁguration to carry out this task. Use the
contents of all registers in the pipeline for i = 1 through 6.

Solution:
QUESTION 2

Determine the number of clock cycles that it takes to process 200 tasks in a six-segment
pipeline

Solution: Pipelined execution is = n + m – 1

n = 6 segments

m = 200 tasks

(n + m – 1) = 6 + 200 – 1 = 205 cycles

QUESTION 3

Draw a space-time diagram for a six-segment pipeline showing the time it takes to process eight
tasks.

Solution:

(n + m – 1) = 6 + 8 – 1 = 13 cycles
QUESTION 4
A non-pipeline system takes 50 ns to process a task. The same task can be processed in a
six-segment pipeline with a clock cycle of 10 ns. Determine the speedup ratio of the
pipeline for 100 tasks. What is the maximum speedup that can be achieved?
Solution:
QUESTION 5
The pipeline of Fig has the following propagation times: 40 ns for the operands to be read from memory into registers R1 and R2, 45
ns for the signal to propagate through the multiplier, 5 ns for the register transfer time into R3, and 15 ns to add the two numbers
into R5.

a) What is the minimum clock cycle time that can be used?

b) A non-pipeline system can perform the same operation by removing R3 and R4. How long will it take to multiply and add the
operands without using the pipeline?
c) Calculate the speedup of the pipeline for 10 tasks and again for 100 tasks.
d) What Is the maximum speedup that can be achieved?
QUESTION 5

Solution:

PPT-Unit-4 CPU Scheduling and Algorithms
No ratings yet
PPT-Unit-4 CPU Scheduling and Algorithms
56 pages
Fundamentals of Information Systems, 6e - Ralph M. Stair, George Reynolds (New)
No ratings yet
Fundamentals of Information Systems, 6e - Ralph M. Stair, George Reynolds (New)
333 pages
Unit 5
No ratings yet
Unit 5
86 pages
V-Unit Co
No ratings yet
V-Unit Co
18 pages
Presentation 5156 Content Document 20250301102853AM
No ratings yet
Presentation 5156 Content Document 20250301102853AM
40 pages
Coa M3 Bit
No ratings yet
Coa M3 Bit
4 pages
Unit II - Asymptotic Notations
No ratings yet
Unit II - Asymptotic Notations
9 pages
COA - Module-5
No ratings yet
COA - Module-5
35 pages
11-Subroutine Call and Return
No ratings yet
11-Subroutine Call and Return
6 pages
Queue
100% (1)
Queue
26 pages
1 Sem Btech - Fundamentals of Computers Part 1
No ratings yet
1 Sem Btech - Fundamentals of Computers Part 1
140 pages
OS PPT Introduction
No ratings yet
OS PPT Introduction
43 pages
COA Unit - IV Notes
No ratings yet
COA Unit - IV Notes
25 pages
Aggregate Functions PPT DWI
No ratings yet
Aggregate Functions PPT DWI
12 pages
GE3151-Lab Manual
No ratings yet
GE3151-Lab Manual
117 pages
8086 Signals
No ratings yet
8086 Signals
11 pages
18ECE205J - FPGA-based Embedded System Design - Unit - 1
No ratings yet
18ECE205J - FPGA-based Embedded System Design - Unit - 1
151 pages
DL - & - CO - Unit 5 - Material (N)
No ratings yet
DL - & - CO - Unit 5 - Material (N)
15 pages
Computer Organization
No ratings yet
Computer Organization
1 page
Computer Graphic - Chapter 02
No ratings yet
Computer Graphic - Chapter 02
108 pages
Superscalar Vs Superpipeline Processor
No ratings yet
Superscalar Vs Superpipeline Processor
17 pages
5.4 Error Handling in File Operations
No ratings yet
5.4 Error Handling in File Operations
10 pages
EEF011 Computer Architecture 計算機結構: Exploiting Instruction-Level Parallelism with Software Approaches
0% (1)
EEF011 Computer Architecture 計算機結構: Exploiting Instruction-Level Parallelism with Software Approaches
40 pages
BTES-401-18 (05-12-2023) Solution
No ratings yet
BTES-401-18 (05-12-2023) Solution
11 pages
OS Chap 5 Slides
No ratings yet
OS Chap 5 Slides
82 pages
Deadlock in DBMS
No ratings yet
Deadlock in DBMS
3 pages
Microprocessor Notes Vtu Brief 6th Sem
100% (4)
Microprocessor Notes Vtu Brief 6th Sem
31 pages
DAA Question Bank-Unit 3
No ratings yet
DAA Question Bank-Unit 3
30 pages
Unit - Ii Control Structures
No ratings yet
Unit - Ii Control Structures
18 pages
Lecture 12 Stack and Subroutines
No ratings yet
Lecture 12 Stack and Subroutines
24 pages
Superscalar and Super Pipelined Processors
No ratings yet
Superscalar and Super Pipelined Processors
3 pages
Final Report-Industrial Training at Keltron Controls, Aroor
83% (12)
Final Report-Industrial Training at Keltron Controls, Aroor
49 pages
CAO Units PDF
No ratings yet
CAO Units PDF
357 pages
DBMS Notes
No ratings yet
DBMS Notes
367 pages
Synchronisation Hardware
100% (4)
Synchronisation Hardware
17 pages
Computer Programming PDF
No ratings yet
Computer Programming PDF
260 pages
Form One Computer Notes
No ratings yet
Form One Computer Notes
73 pages
AVL Trees - Horowitz Sahani
No ratings yet
AVL Trees - Horowitz Sahani
31 pages
Data Structures Unit 2 Notes
No ratings yet
Data Structures Unit 2 Notes
51 pages
8086 Instruction Set 1N2
No ratings yet
8086 Instruction Set 1N2
22 pages
Unit 1 - BD - Introduction To Big Data
No ratings yet
Unit 1 - BD - Introduction To Big Data
83 pages
Problem Solving Unit 1
No ratings yet
Problem Solving Unit 1
6 pages
DAA Practical File Questions
No ratings yet
DAA Practical File Questions
6 pages
Memory Organization
No ratings yet
Memory Organization
99 pages
Unit-5 Control Statements
No ratings yet
Unit-5 Control Statements
16 pages
Operating Systems Unit - 5: I/O and File Management
No ratings yet
Operating Systems Unit - 5: I/O and File Management
48 pages
Computer Science Department: Majlis Arts and Science College, Puramannur
No ratings yet
Computer Science Department: Majlis Arts and Science College, Puramannur
20 pages
Three Address Code (TAC) : Addresses and Instructions
No ratings yet
Three Address Code (TAC) : Addresses and Instructions
28 pages
Final Examination - Attempt Review
No ratings yet
Final Examination - Attempt Review
26 pages
MCSE-103 by Mohd Abdullah
No ratings yet
MCSE-103 by Mohd Abdullah
9 pages
Memory Reference Instructions Execution
100% (1)
Memory Reference Instructions Execution
13 pages
Vector Computers
No ratings yet
Vector Computers
43 pages
File Allocation Methods
No ratings yet
File Allocation Methods
9 pages
Coa Previous Q Papers
No ratings yet
Coa Previous Q Papers
8 pages
CS 6303 Computer Architecture TWO Mark With Answer
100% (1)
CS 6303 Computer Architecture TWO Mark With Answer
14 pages
Stack and SUBROUTINES Bindu Agarwalla
No ratings yet
Stack and SUBROUTINES Bindu Agarwalla
15 pages
Chapter 6 Assembly Language-PPandMS-1617
No ratings yet
Chapter 6 Assembly Language-PPandMS-1617
14 pages
Cse-IV-unix and Shell Programming (10cs44) - Notes
No ratings yet
Cse-IV-unix and Shell Programming (10cs44) - Notes
161 pages
Unit-3 C++ Functions: 2140705 Object Oriented Programming With C++
No ratings yet
Unit-3 C++ Functions: 2140705 Object Oriented Programming With C++
52 pages
Part - A: Database Management System Lab
No ratings yet
Part - A: Database Management System Lab
26 pages
IHI0014Q Etm Architecture Spec
No ratings yet
IHI0014Q Etm Architecture Spec
420 pages
DEADLOCK
No ratings yet
DEADLOCK
8 pages
COA Class Test-1
No ratings yet
COA Class Test-1
3 pages
Unit-1 Introduction To Microprocessor Architecture PDF
No ratings yet
Unit-1 Introduction To Microprocessor Architecture PDF
15 pages
CONSTRUCTOR AND DESTRUCTOR (C++)
No ratings yet
CONSTRUCTOR AND DESTRUCTOR (C++)
24 pages
COA Unit 1
No ratings yet
COA Unit 1
33 pages
2-Fold Analogue Limit Monitor 62 100 Safety-Related: Connection Not Required
No ratings yet
2-Fold Analogue Limit Monitor 62 100 Safety-Related: Connection Not Required
20 pages
Csa 2022
No ratings yet
Csa 2022
6 pages
Computer Organization and Architecture: Notes On RISC-Pipelining
No ratings yet
Computer Organization and Architecture: Notes On RISC-Pipelining
14 pages
Cursor-Based Linked Lists
No ratings yet
Cursor-Based Linked Lists
4 pages
Robots PDF
No ratings yet
Robots PDF
16 pages
4 Intel 286 To Pentium Architecture
No ratings yet
4 Intel 286 To Pentium Architecture
25 pages
Operating System
No ratings yet
Operating System
39 pages
HC2021.C1.4 Intel Arijit
No ratings yet
HC2021.C1.4 Intel Arijit
22 pages
2019 Summer Model Answer Paper (Msbte Study Resources)
100% (1)
2019 Summer Model Answer Paper (Msbte Study Resources)
38 pages
GE3151 PYTHON Syllabus
No ratings yet
GE3151 PYTHON Syllabus
2 pages
PL Anandam 30 Agustus 2015 Ok
No ratings yet
PL Anandam 30 Agustus 2015 Ok
6 pages
Kernel I/O Subsystem in Operating System
No ratings yet
Kernel I/O Subsystem in Operating System
2 pages
CH4 External and Internal Architecture Mips
No ratings yet
CH4 External and Internal Architecture Mips
9 pages
Rodmach Troubleshooting
No ratings yet
Rodmach Troubleshooting
29 pages
Hello: Let's Get Started!
No ratings yet
Hello: Let's Get Started!
28 pages
Microprocessor and Microcontroller Fundamentals
No ratings yet
Microprocessor and Microcontroller Fundamentals
24 pages
Leonardo Inventario
No ratings yet
Leonardo Inventario
12 pages
Computer Evolution
No ratings yet
Computer Evolution
14 pages
Week 5
No ratings yet
Week 5
16 pages
ICT Grade 4 - 1st Term Evaluation 2024
No ratings yet
ICT Grade 4 - 1st Term Evaluation 2024
5 pages
Unit2 - Building Blocks With Exercises Future
No ratings yet
Unit2 - Building Blocks With Exercises Future
9 pages
October 2022 IT Passport Examination
No ratings yet
October 2022 IT Passport Examination
8 pages
Muthayammal College of Engineering MKC
No ratings yet
Muthayammal College of Engineering MKC
4 pages
Textbook of Engineering Chemistry
From Everand
Textbook of Engineering Chemistry
C. Parameswara Murthy
No ratings yet

CH-1 1 Pipelining

Uploaded by

CH-1 1 Pipelining

Uploaded by

PIPELINING AND HAZARDS

◼ A program consists of several number of instructions. These instructions may

◼ The primary goal of computer architecture is to enhance the performance and

◼ In non-pipelined (sequential) architecture, all the instructions of a program are

◼ The main advantage of pipelining is the simultaneous execution of various subtasks,

Non-Pipelined Execution: 3*7 = 21 Clock

◼ There are several stages of processing an instruction. A pipeline can be of three,

3-stage 4-stage 5-stage 6-stage

Write Back Memory Access for operands

13 time units are

◼ A typical example of an instruction pipeline used by computer systems consists of the

normalized form using mantissa and exponents. Mantissa represents the R R

A and B are two fractions that represent the mantissa and a R

◼ There are primarily three types of hazards:

2. Control Hazards or instruction Hazards

◼ When the execution of an instruction is dependent on the results of a prior instruction

◼ Consider the following scenario.

1. Flow/True Data Dependency [RAW (or Read after Write)]:

ADD X0, X1, X2

SUB X4, X3, X0

2. Anti-Data Dependency [WAR (or Write after Read)]

3. Output data dependency [WAW (or Write after Write)]

◼ When more than one instruction in the pipe

◼ In an overlapping pipelined execution, this is a

◼ Solution: For a portion of the

Branch Instruction Address Target Branch Address taken

Solution: Pipelined execution is = n + m – 1

(n + m – 1) = 6 + 200 – 1 = 205 cycles

a) What is the minimum clock cycle time that can be used?

You might also like