0% found this document useful (0 votes)

174 views28 pages

Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX

Pipelining is a technique where the instruction execution process is divided into discrete stages so that multiple instructions can be overlapped in execution. By partitioning the instruction execution process into stages like fetch, decode, execute etc and ensuring independent hardware for each stage, multiple instructions can progress through the pipeline simultaneously, improving instruction throughput. However, pipelining can introduce hazards like structural hazards from resource conflicts, data hazards from dependencies between instructions, and control hazards from branches. Solutions involve buffering, forwarding, interlocking, and dynamic/static scheduling.

Uploaded by

Syed Ashmad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

174 views28 pages

Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX

Uploaded by

Syed Ashmad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Pipelining Basic Concepts

Partition the function of instruction execution into smaller functions

(called a stage), allocate separate concurrently operating hardware
components so that at any given time, there exist several instructions
in various stages of execution.

Instruction Operand Execute

fetch fetch
IF OF EX

1
Typical Non-Pipelined Execution

EX I0 I1 I2
OF I0 I1 I2
IF I0 I1 I2 I3
1 2 3 4 5 6 7 8 9 10

Time to execute n instructions: 3nt

2
Ideal Pipelined Execution

EX I0 I1 I2 I3 I4 I5 I6 I7
OF I0 I1 I2 I3 I4 I5 I6 I7 I8
IF I0 I1 I2 I3 I4 I5 I6 I7 I8 I9
1 2 3 4 5 6 7 8 9 10

Time to execute n instructions: (2 + n)t

3
Pipeline Turbulence

Consider the presence of a branch instruction in the pipeline:

EX I0 I1 I2 I3 Ibr Ik
OF I0 I1 I2 I3 Ibr Ik Ik+1
IF I0 I1 I2 I3 Ibr Ik Ik+1 Ik+2
1 2 3 4 5 6 7 8 9 10

Branch instructions introduce control hazards into the pipeline,

negatively impacting pipeline performance.

4
Multicycle Execution Units
EX

Integer unit

FP/integer
multiply

IF ID MEM WB
EX

FP adder

FP/integer
divider

FIGURE 3.42 The DLX pipeline with three additional unpipelined, floating-point,
functional units.

5
SuperScalar/Multiple Issue

Floating
Point
Unit
Instr Buffer
Issue
Unit Integer Memory

Switch
Crossbar
Unit 1 Module 0

Integer Memory
Unit 2 Module 1

6
Hazards

Structural Hazards: arising from resource conflicts when the

hardware cannot support all possible combinations of instructions in
simultaneous overlapped execution.
Data Hazards: arising when an instruction in the pipeline depends on
the results of a previous instruction still in the pipeline.
Control Hazards: arising from the pipelining of branches or other
instructions that change the PC.

Hazards are easily solved by stalling, but stalling reduces pipeline

efficiency.

Structural hazards can be solved by extra hardware resource — a

cost/performance tradeoff.
Load/Store architectures substantially reduce complexity of hazards.

7
Structural Hazards

• Split I/D caches

• Pipelined Execute Units
• Buffers on Execute Units
• Hardware replication

8
Pipelined & Buffered Execution Units

Integer unit

FP/integer multiply

M1 M2 M3 M4 M5 M6 M7

IF ID MEM WB
FP adder

A1 A2 A3 A4

FP/integer divider

DIV

FIGURE 3.44 A pipeline that supports multiple outstanding FP operations.

9
Data Hazards

RAW: read after write

WAW: write after write (only present when writes occur at different
stages in a pipeline)

WAR: write after read (only possible when writes may occur earlier
than some reads)

Note: RAR (read after read) is not a hazard.

10
Data Hazards: Potential Solutions

• Pipeline interlocking

• Forwarding

• Compiler optimizations

11
Pipeline Interlocking

• Stall pipeline until data hazard is eliminated

• Loses performance benefits of pipelining

12
Pipeline Interlocking

Assume RAW conflict between I3 and I4.

EX I0 I1 I2 I3 I4 I5 I6
OF I0 I1 I2 I3 I4 I5 I6 I7
IF I0 I1 I2 I3 I4 I5 I6 I7 I8
1 2 3 4 5 6 7 8 9 10

13
Data Forwarding

• Organize data path with routes back from later pipe stages into
earlier pipe stages.

• Efficient operation best supported when operand encoding is simple.

14
Data Forwarding

rd rs1 rs2 rd rs1 rs2

Operand Execute
fetch

15
Static Scheduling

• Compiler optimization to schedule instructions as per data hazard

considerations.

• Organize code into basic blocks (single entry/single exit)

• Schedule instructions to minimize data hazards between adjacent

instructions.

16
Dynamic Scheduling

• Scoreboarding
• Tomasulo’s Algorithm
• VLIW

17
An Architecture for Scoreboarding
Registers Data buses

FP mult
FP mult

FP divide

FP add

Integer unit

Scoreboard
Control/ Control/
status status

FIGURE 4.3 The basic structure of a DLX processor with a scoreboard.

18
An Architecture for Tomasulo
From instruction unit
Floating-
From point
memory operation
queue FP registers
Load buffers
6
5
4
3
2 Operand Store buffers
1 buses 3
2
1

Operation bus To
memory

3 2
2 Reservation 1
1 stations

FP adders FP multipliers

Common data bus (CDB)

FIGURE 4.8 The basic structure of a DLX FP unit using Tomasulo's algorithm.

19
Structural/Data Hazards and the Original Pentium

Floating
Point
Unit
Instr Buffer
Issue
Unit Integer Memory

Switch
Crossbar
Unit 1 Module 0

Integer Memory
Unit 2 Module 1

20
Control Hazards

• Evaluate branches earlier

• Stalls/pipelined interlocks

• Delayed Branch

• Statically predict taken/not-taken & flush when not

• Canceling (or nullifying) branch: compiler prediction/flush

• Dynamic (hardware) prediction

• Conditional Instructions

21
Delayed Branch

Redefine architecture so branches take effect after n instrs after branch

• Where to get instructions to fill branch delay slot?

– Noop
– Before the branch
– After the branch (from both destinations or, if needed/possible,
only one)
• Compiler effectiveness
– Fills about 60% of delay slots (when one slot)
– About 80% instrs in delay slot useful
• Complications: deep pipelines & superscalar implementations

22
Predict Taken/Not-Taken, Statically or Dynamically

Assume I3 is a conditional branch & branch resolved in EX stage.

Correct Prediction:

EX I0 I1 I2 I3 I4 I5 I6 I7
OF I0 I1 I2 I3 I4 I5 I6 I7 I8
IF I0 I1 I2 I3 I4 I5 I6 I7 I8 I9
1 2 3 4 5 6 7 8 9 10

Incorrect Prediction:

EX I0 I1 I2 I3 Ik
OF I0 I1 I2 I3 I4 Ik Ik+1
IF I0 I1 I2 I3 I4 I5 Ik Ik+1 IK k + 2
1 2 3 4 5 6 7 8 9 10

23
Branch Prediction

Taken

Not taken

Predict taken Predict taken

Taken

Taken Not taken

Not taken

Predict not taken Predict not taken

Taken

Not taken

FIGURE 4.13 The states in a two-bit prediction scheme.

24
Branch Prediction w/ History

Branch address
4

2–bit per branch predictors

XX XX prediction

2–bit global branch history

FIGURE 4.20 A (2,2) branch-prediction buffer uses a two-bit global history to

choose from among four predictors for each branch address.

25
Branch Target Buffers

PC of instruction to fetch

Look up Predicted PC

Number of
entries
in branch-
target
buffer

No: instruction is
= not predicted to be Branch
branch. Proceed normally predicted
taken or
Yes: then instruction is branch and predicted untaken
PC should be used as the next PC

FIGURE 4.22 A branch-target buffer.

26
Send PO to
memory and

Step-by-step use of Branch Target Buffers

branch-target
buffer

No Entry found in Yes

branch-target
buffer?

Send out
predicted
Is
PC
No instruction Yes
a taken
branch?

No Taken Yes
branch?
Normal
instruction
execution

Enter Mispredicted Branch

branch IO branch, kill fetched correctly
and next PC instruction; restart predicted;
EX into branch fetch at other continue
target buffer target; delete execution with
entry from no stalls
target buffer

FIGURE 4.23 The steps involved in handling an instruction with a branch-target

buffer.
27
Exceptions

1. Synchronous versus Asynchronous

2. User requested versus coerced
3. User maskable (or not)
4. Within or between instructions
5. Resumption versus Termination
6. Precise/Imprecise

Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
No ratings yet
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
64 pages
How To Optimize Human Biology: Where Genome Editing and Artificial Intelligence Collide
No ratings yet
How To Optimize Human Biology: Where Genome Editing and Artificial Intelligence Collide
27 pages
Alarm Management Presentation PDF
No ratings yet
Alarm Management Presentation PDF
43 pages
Processor Structure and Function
100% (1)
Processor Structure and Function
55 pages
Regular Expression Question Solution
100% (2)
Regular Expression Question Solution
68 pages
Mouly CV
No ratings yet
Mouly CV
3 pages
III - I R09 Regular Dec 2013
No ratings yet
III - I R09 Regular Dec 2013
162 pages
III - I R09 Regular Dec 2013
No ratings yet
III - I R09 Regular Dec 2013
162 pages
Human Resource Management As Strategic Business Contributor
100% (4)
Human Resource Management As Strategic Business Contributor
19 pages
Onur Ddca 2025 Lecture15b Branch Prediction Beforelecture
No ratings yet
Onur Ddca 2025 Lecture15b Branch Prediction Beforelecture
188 pages
Comp206 Lecture9
No ratings yet
Comp206 Lecture9
53 pages
Module 5 Part2 Pipelining
No ratings yet
Module 5 Part2 Pipelining
36 pages
X Ray
No ratings yet
X Ray
18 pages
16ECE315-Digital Design Through Verilog HDL
No ratings yet
16ECE315-Digital Design Through Verilog HDL
1 page
Torsion of Multi-Cell Cross-Section - Hw7 - B
No ratings yet
Torsion of Multi-Cell Cross-Section - Hw7 - B
12 pages
CAQA5e ch3
No ratings yet
CAQA5e ch3
45 pages
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
No ratings yet
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
19 pages
SQL Server Integration Services An Introduction
100% (6)
SQL Server Integration Services An Introduction
23 pages
Slides Chapter 6 Pipelining
No ratings yet
Slides Chapter 6 Pipelining
60 pages
Pipeline - Instr - Super Branch
No ratings yet
Pipeline - Instr - Super Branch
48 pages
Chapter 5
No ratings yet
Chapter 5
38 pages
Kuliah 14 Pipeliningg
No ratings yet
Kuliah 14 Pipeliningg
28 pages
Group 17 - 2151177
No ratings yet
Group 17 - 2151177
15 pages
ERTOS Course Outcomes
No ratings yet
ERTOS Course Outcomes
2 pages
5.1-5.3 Pipelining and Parallel Processing
No ratings yet
5.1-5.3 Pipelining and Parallel Processing
56 pages
Introduction To Pipelining Introduction To Pipelining
No ratings yet
Introduction To Pipelining Introduction To Pipelining
35 pages
Pipelining (All Slides)
No ratings yet
Pipelining (All Slides)
45 pages
Moduel 5
No ratings yet
Moduel 5
46 pages
SRM Pipelining 05
No ratings yet
SRM Pipelining 05
42 pages
Digital Design Through Verilog HDL Course Outcomes For Lab
No ratings yet
Digital Design Through Verilog HDL Course Outcomes For Lab
1 page
CA Lecture 12
No ratings yet
CA Lecture 12
48 pages
Pipelining: Basic Concepts
No ratings yet
Pipelining: Basic Concepts
20 pages
Slot24 25 CH14 ProcessorStructureAndFunction 42 Slots
No ratings yet
Slot24 25 CH14 ProcessorStructureAndFunction 42 Slots
42 pages
CEA201 - Chapter 14 - Processor Structure and Function
No ratings yet
CEA201 - Chapter 14 - Processor Structure and Function
42 pages
Puppets and Therapy
No ratings yet
Puppets and Therapy
6 pages
Module 5 - Processor Structure and Function
No ratings yet
Module 5 - Processor Structure and Function
74 pages
ch4 3
No ratings yet
ch4 3
61 pages
Chapter 4
No ratings yet
Chapter 4
78 pages
Chapter 3 PPTV 31 Sem IIv 31
No ratings yet
Chapter 3 PPTV 31 Sem IIv 31
40 pages
CA Unit-2 Chapter-2
No ratings yet
CA Unit-2 Chapter-2
36 pages
Basics and Hazards of Pipeline Controller
No ratings yet
Basics and Hazards of Pipeline Controller
23 pages
10 Pipelining
No ratings yet
10 Pipelining
44 pages
Pipeline Hazards: Structural Hazards: Resource Conflict
No ratings yet
Pipeline Hazards: Structural Hazards: Resource Conflict
49 pages
L05 PipeliningII
No ratings yet
L05 PipeliningII
36 pages
Lecutre-7 Instruction Pipelining
No ratings yet
Lecutre-7 Instruction Pipelining
29 pages
Lecutre-7 Instruction Pipelining
No ratings yet
Lecutre-7 Instruction Pipelining
29 pages
Pipelining
No ratings yet
Pipelining
22 pages
Chapter 10 Principles of Pipelining
No ratings yet
Chapter 10 Principles of Pipelining
124 pages
CH10-Processor Structure and Function
No ratings yet
CH10-Processor Structure and Function
14 pages
Instruction-Level Parallelism (ILP), Since The
100% (1)
Instruction-Level Parallelism (ILP), Since The
57 pages
Unit - 1 Microprocessor Architecture
No ratings yet
Unit - 1 Microprocessor Architecture
52 pages
Canvas Pipelining and Parallel Processors
No ratings yet
Canvas Pipelining and Parallel Processors
5 pages
Dpco Unit 4
No ratings yet
Dpco Unit 4
21 pages
Pipelining New
No ratings yet
Pipelining New
33 pages
Ch2 Lec7 Instruction Piplining
No ratings yet
Ch2 Lec7 Instruction Piplining
34 pages
CH14 COA9e Processor Structure and Function
No ratings yet
CH14 COA9e Processor Structure and Function
40 pages
Lec3 PDF
No ratings yet
Lec3 PDF
15 pages
Branch Handling 1
No ratings yet
Branch Handling 1
50 pages
New Oríkì Orúko
No ratings yet
New Oríkì Orúko
3 pages
12 - Processor Structure and Function
No ratings yet
12 - Processor Structure and Function
73 pages
DLCO Module 6 Sem 3
No ratings yet
DLCO Module 6 Sem 3
40 pages
Iii-I R09 Dec 2014 Result PDF
No ratings yet
Iii-I R09 Dec 2014 Result PDF
184 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
53 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
SAS Library Data Transformations and Data Manipulation in SAS
No ratings yet
SAS Library Data Transformations and Data Manipulation in SAS
31 pages
Monday Tuesday Wednesday Thursday Friday I. Objectives: Weekly Learning Plan
No ratings yet
Monday Tuesday Wednesday Thursday Friday I. Objectives: Weekly Learning Plan
43 pages
Operators 140917230056 Phpapp01
No ratings yet
Operators 140917230056 Phpapp01
20 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
Chapter 8 - Pipelining
No ratings yet
Chapter 8 - Pipelining
38 pages
Expectancy Theory of Motivation
No ratings yet
Expectancy Theory of Motivation
4 pages
CH 01
No ratings yet
CH 01
37 pages
III-I R09 DEC 2014 RESULT (12 Batch)
No ratings yet
III-I R09 DEC 2014 RESULT (12 Batch)
551 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
55 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
Application Form For 1st Year (Session 2020-21) : Sarojini Naidu College For Women
No ratings yet
Application Form For 1st Year (Session 2020-21) : Sarojini Naidu College For Women
1 page
L10-L11-Instruction Pipelining
No ratings yet
L10-L11-Instruction Pipelining
38 pages
Verilog HDL Basics
No ratings yet
Verilog HDL Basics
73 pages
SystemC and Codesign Additional Lectures
No ratings yet
SystemC and Codesign Additional Lectures
58 pages
William Stallings Computer Organization and Architecture 8 Edition Processor Structure and Function
No ratings yet
William Stallings Computer Organization and Architecture 8 Edition Processor Structure and Function
74 pages
HRY-312 Computer Organization Introduction To Pipelining
No ratings yet
HRY-312 Computer Organization Introduction To Pipelining
30 pages
IO Ports in 8051
No ratings yet
IO Ports in 8051
8 pages
EX - CX - 044 - Master Inspection Characteristics MIC
No ratings yet
EX - CX - 044 - Master Inspection Characteristics MIC
61 pages
Digital Design and Synthesis: Behavioral Verilog
No ratings yet
Digital Design and Synthesis: Behavioral Verilog
41 pages
Watson 1999 Liberal Communitarianism As Political Theory
No ratings yet
Watson 1999 Liberal Communitarianism As Political Theory
8 pages
English Grammar For Enginners
No ratings yet
English Grammar For Enginners
177 pages
DIP and Soft Computing Syllabus
No ratings yet
DIP and Soft Computing Syllabus
5 pages
The Master of Animals in Old World Iconography
No ratings yet
The Master of Animals in Old World Iconography
21 pages
International MKT Case Study 2 IKEA
No ratings yet
International MKT Case Study 2 IKEA
3 pages
01 MI Introduction
No ratings yet
01 MI Introduction
72 pages
Verilog Code For Basic Logic Gates
No ratings yet
Verilog Code For Basic Logic Gates
4 pages
Visit 2 Fatima Bilal
No ratings yet
Visit 2 Fatima Bilal
6 pages
COA Unit 3
No ratings yet
COA Unit 3
89 pages
Development of Timing and State Diagrams Refined The Block Diagram Cleaned Up and Added Better Names, Added Detail
No ratings yet
Development of Timing and State Diagrams Refined The Block Diagram Cleaned Up and Added Better Names, Added Detail
21 pages
Texture Analysis and Fracture Identificat
No ratings yet
Texture Analysis and Fracture Identificat
5 pages
Travers Poster Presentation 4 21 17
No ratings yet
Travers Poster Presentation 4 21 17
1 page
WWW - Manaresults.co - In: Applications (Common To ECE, ETM)
No ratings yet
WWW - Manaresults.co - In: Applications (Common To ECE, ETM)
2 pages
Six Strategies For Effective Learning Bookmarks: Interleaving Interleaving Interleaving Interleaving
No ratings yet
Six Strategies For Effective Learning Bookmarks: Interleaving Interleaving Interleaving Interleaving
1 page
Microprocessor 8085 Architecture
No ratings yet
Microprocessor 8085 Architecture
19 pages
SOC 110 Week 1 Assignment The Value of Teams
No ratings yet
SOC 110 Week 1 Assignment The Value of Teams
5 pages
M.Sc. (ICT in Agriculture and Rural Development)
No ratings yet
M.Sc. (ICT in Agriculture and Rural Development)
4 pages
The Look Up Table (LUT)
No ratings yet
The Look Up Table (LUT)
5 pages
Sce554 Reflective Essay
No ratings yet
Sce554 Reflective Essay
2 pages
Exadel Studio Pro: Getting Started Guide For Creating A JSF Application
No ratings yet
Exadel Studio Pro: Getting Started Guide For Creating A JSF Application
9 pages
Arduino Ir Obstacle Sensor Tutorial and 2 PDF
No ratings yet
Arduino Ir Obstacle Sensor Tutorial and 2 PDF
3 pages
Development and Validation of The Positive Evaluation Core Beliefs Scale For Social Anxiety (Gavril Andreea)
No ratings yet
Development and Validation of The Positive Evaluation Core Beliefs Scale For Social Anxiety (Gavril Andreea)
7 pages
ShBrushSql Js
No ratings yet
ShBrushSql Js
2 pages
CCNA Certification Study Guide Volume 1: Exam 200-301 v1.1
From Everand
CCNA Certification Study Guide Volume 1: Exam 200-301 v1.1
Todd Lammle
5/5 (1)
Stack Computers: The New Wave
From Everand
Stack Computers: The New Wave
Philip Koopman
No ratings yet
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
From Everand
LEARN MPLS FROM SCRATCH PART-B: A Beginners guide to next level of networking
POONAM DEVI
No ratings yet

Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX

Uploaded by

Pipelining Basic Concepts: Instruction Fetch Execute Operand Fetch IF OF EX

Uploaded by

Pipelining Basic Concepts

Partition the function of instruction execution into smaller functions

Instruction Operand Execute

Time to execute n instructions: 3nt

Time to execute n instructions: (2 + n)t

Consider the presence of a branch instruction in the pipeline:

Branch instructions introduce control hazards into the pipeline,

Structural Hazards: arising from resource conflicts when the

Hazards are easily solved by stalling, but stalling reduces pipeline

Structural hazards can be solved by extra hardware resource — a

• Split I/D caches

FIGURE 3.44 A pipeline that supports multiple outstanding FP operations.

RAW: read after write

Note: RAR (read after read) is not a hazard.

• Stall pipeline until data hazard is eliminated

• Loses performance benefits of pipelining

Assume RAW conflict between I3 and I4.

• Efficient operation best supported when operand encoding is simple.

rd rs1 rs2 rd rs1 rs2

• Compiler optimization to schedule instructions as per data hazard

• Organize code into basic blocks (single entry/single exit)

• Schedule instructions to minimize data hazards between adjacent

FIGURE 4.3 The basic structure of a DLX processor with a scoreboard.

Common data bus (CDB)

• Evaluate branches earlier

• Statically predict taken/not-taken & flush when not

• Canceling (or nullifying) branch: compiler prediction/flush

• Dynamic (hardware) prediction

Redefine architecture so branches take effect after n instrs after branch

• Where to get instructions to fill branch delay slot?

Assume I3 is a conditional branch & branch resolved in EX stage.

Predict taken Predict taken

Taken Not taken

Predict not taken Predict not taken

FIGURE 4.13 The states in a two-bit prediction scheme.

2–bit per branch predictors

2–bit global branch history

FIGURE 4.20 A (2,2) branch-prediction buffer uses a two-bit global history to

FIGURE 4.22 A branch-target buffer.

Step-by-step use of Branch Target Buffers

No Entry found in Yes

Enter Mispredicted Branch

FIGURE 4.23 The steps involved in handling an instruction with a branch-target

1. Synchronous versus Asynchronous

You might also like