0% found this document useful (0 votes)

107 views31 pages

Pipelined Data-Path in MIPS Architecture

This document discusses pipelining in computer processors. It explains that pipelining allows overlapping execution of multiple instructions to improve throughput. Pipelining divides instruction execution into stages, like fetch, decode, execute, and writeback, so that a new instruction can begin execution each clock cycle. While pipelining improves throughput, it can introduce hazards like structural hazards from limited hardware resources, control hazards from branches, and data hazards from instructions dependent on earlier instructions. The document uses MIPS as an example pipeline and discusses how its design makes hazards easier to avoid and pipelining more feasible.

Uploaded by

DeepanshGoyal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

107 views31 pages

Pipelined Data-Path in MIPS Architecture

Uploaded by

DeepanshGoyal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

Computer Organization

CS1403
Pipelined Data-Path

Mayank Pandey, MNNIT, Allahabad, India

Pipelining
• Start work ASAP!! Do not waste time!
6 PM 7 8 9 10 11 12 1 2 AM
Time
Task
order
A
Not pipelined
B

Assume 30 min. each task – wash, dry, fold, store – and that
separate tasks use separate hardware and so can be overlapped
6 PM 7 8 9 10 11 12 1 2 AM
Time

Task
order

A Pipelined
B

D
Pipelined vs. Single-Cycle
Program
execution 2 4 6 8 10 12 14 16 18
order Time
(in instructions)
Instruction Data Single-cycle
lw $1, 100($0) fetch
Reg ALU
access
Reg

Instruction Data
lw $2, 200($0) 8 ns fetch
Reg ALU
access
Reg

Instruction
lw $3, 300($0) 8 ns fetch
...
8 ns

Assume 2 ns for memory access, ALU operation; 1 ns for register access:

therefore, single cycle clock 8 ns; pipelined clock cycle 2 ns.
Program
execution 2 4 6 8 10 12 14
Time
order
(in instructions)
Instruction Data
lw $1, 100($0) Reg ALU Reg
fetch access

Instruction Data
Pipelined
lw $2, 200($0) 2 ns Reg ALU Reg
fetch access

Instruction Data
lw $3, 300($0) 2 ns Reg ALU Reg
fetch access

2 ns 2 ns 2 ns 2 ns 2 ns
Pipelining: Keep in Mind
• Pipelining does not reduce latency of a single task,
it increases throughput of entire workload
• Pipeline rate limited by longest stage
– potential speedup = number pipe stages
– unbalanced lengths of pipe stages reduces speedup
• Time to fill pipeline and time to drain it – when
there is slack in the pipeline – reduces speedup
Pipelining MIPS
• What makes it easy with MIPS?
– all instructions are same length
• so fetch and decode stages are similar for all instructions
– just a few instruction formats
• simplifies instruction decode and makes it possible in one stage
– memory operands appear only in load/stores
• so memory access can be deferred to exactly one later stage
– operands are aligned in memory
• one data transfer instruction requires one memory access stage
Pipelining MIPS
• What makes it hard?
– structural hazards: different instructions, at different stages, in the
pipeline want to use the same hardware resource
– control hazards: succeeding instruction, to put into pipeline, depends
on the outcome of a previous branch instruction, already in pipeline
– data hazards: an instruction in the pipeline requires data to be
computed by a previous instruction still in the pipeline

• Before actually building the pipelined datapath and control

we first briefly examine these potential hazards individually…
Structural Hazards
• Structural hazard: inadequate hardware to simultaneously support all
instructions in the pipeline in the same clock cycle
• E.g., suppose single – not separate – instruction and data memory in
pipeline below with one read port
– then a structural hazard between first and fourth lw instructions
Program
execution 2 4 6 8 10 12 14
Time
order
(in instructions)
Instruction Data
lw $1, 100($0) Reg ALU Reg
fetch access
Pipelined
Instruction Data
lw $2, 200($0) 2 ns Reg ALU Reg
fetch access

Instruction Data
Hazard if single memory
lw $3, 300($0) 2 ns Reg ALU Reg
fetch access
Instruction Data
lw $4, 400($0) Reg ALU Reg
2 ns fetch access

2 ns 2 ns 2 ns 2 ns 2 ns

• MIPS was designed to be pipelined: structural hazards are easy to

avoid!
Control Hazards
• Control hazard: need to make a decision based on the result of a previous
instruction still executing in pipeline
• Solution 1 Stall the pipeline

Program
execution 2 4 6 8 10 12 14 16
order Time
(in instructions)
Instruction Data
add $4, $5, $6 fetch
Reg ALU
access
Reg Note that branch outcome is
Instruction Data computed in ID stage with
beq $1, $2, 40 Reg ALU Reg
2ns fetch access
added hardware (later…)
Instruction Data
lw $3, 300($0) bubble Reg ALU Reg
fetch access

4 ns 2ns

Pipeline stall
Control Hazards
• Solution 2 Predict branch outcome
– e.g., predict branch-not-taken :
Program
execution 2 4 6 8 10 12 14
order Time
(in instructions)
Instruction Data
add $4, $5, $6 fetch
Reg ALU
access
Reg

Instruction Data
beq $1, $2, 40 Reg ALU Reg
2 ns fetch access

Instruction Data
lw $3, 300($0) Reg ALU Reg
2 ns fetch access

Prediction success
Program
execution 2 4 6 8 10 12 14
order Time
(in instructions)
Instruction Data
add $4, $5 ,$6 Reg ALU Reg
fetch access

Instruction Data
beq $1, $2, 40 Reg ALU Reg
fetch access
2 ns
bubble bubble bubble bubble bubble

Instruction Data
or $7, $8, $9 Reg ALU Reg
fetch access
4 ns
Prediction failure: undo (=flush) lw
Control Hazards
Solution 3 Delayed branch: always execute the sequentially next
statement with the branch executing after one instruction delay –
compiler’s job to find a statement that can be put in the slot that is
independent of branch outcome
MIPS does this

1/22/2019 Mayank Pandey, MNNIT, Allahabad, India 10

Data Hazards
• Data hazard: instruction needs data from the result of a previous
instruction still executing in pipeline
• Solution Forward data if possible…

2 4 6 8 10
Time

IF ID EX
Instruction pipeline diagram:
add $s0, $t0, $t1 MEM WB
shade indicates use –
left=write, right=read

Program
execution 2 4 6 8 10
order Time
(in instructions)
Without forwarding – blue line
add $s0, $t0, $t1 IF ID EX MEM WB
– data has to go back in time;
with forwarding – red line
sub $t2, $s0, $t3 IF ID EX MEM WB
– data is available in time

Mayank Pandey, MNNIT, Allahabad, India

Data Hazards
• Forwarding may not be enough
– e.g., if an R-type instruction following a load uses the result of the load –
called load-use data hazard
2 4 6 8 10 12 14
Program Time
execution
order
(in instructions)

lw $s0, 20($t1) IF ID EX MEM WB Without a stall it is impossible

to provide input to the sub
sub $t2, $s0, $t3 IF ID EX MEM WB instruction in time

2 4 6 8 10 12 14
Program Time
execution
order
(in instructions)

lw $s0, 20($t1) IF ID EX MEM WB With a one-stage stall, forwarding

can get the data to the sub
bubble bubble bubble bubble bubble instruction in time
sub $t2, $s0, $t3 IF ID EX MEM WB
Reordering Code to Avoid Pipeline Stall
• Example:
lw $t0, 0($t1)
lw $t2, 4($t1)
Data hazard
sw $t2, 0($t1)
sw $t0, 4($t1)

• Reordered code:
lw $t0, 0($t1)
lw $t2, 4($t1)
sw $t0, 4($t1)
Interchanged
sw $t2, 0($t1)
Pipelined Datapath
• We now move to actually building a pipelined datapath
• First recall the 5 steps in instruction execution
1. Instruction Fetch & PC Increment (IF)
2. Instruction Decode and Register Read (ID)
3. Execution or calculate address (EX)
4. Memory access (MEM)
5. Write result into register (WB)
• Review: single-cycle processor
– all 5 steps done in a single clock cycle
– dedicated hardware required for each step

• What happens if we break the execution into multiple cycles, but keep
the extra hardware?
Review - Single-Cycle Data-path “Steps”

ADD

4 ADD

PC <<2
Instruction I
ADDR RD
32 16 32
5 5 5
Instruction
Memory RN1 RN2 WN
RD1 Zero
Register File ALU
WD
RD2 M
U ADDR
X
Data RD M

16
E
X 32
Memory U
X
T WD
N
D

EX
Execute/ Address
IF ID Calc. MEM WB
Instruction Fetch Instruction Decode Memory Access Write Back
Pipelined Datapath – Key Idea
• What happens if we break the execution into multiple cycles,
but keep the extra hardware?
– Answer: We may be able to start executing a new instruction at each
clock cycle - pipelining
• …but we shall need extra registers to hold data between
cycles – pipeline registers
Pipelined Datapath
Pipeline registers wide enough to hold data coming in
ADD

4 ADD
64 bits 128 bits
PC <<2 97 bits 64 bits
Instruction I
ADDR RD
32 32
Instruction 16 5 5 5

Memory RN1 RN2 WN

RD1
Zero
Register File ALU
WD
RD2 M
U ADDR
X
Data
E MemoryRD M
U
16 X 32 X
T WD
N
D

IF/ID ID/EX EX/MEM MEM/WB

Pipelined Datapath
Pipeline registers wide enough to hold data coming in
ADD

4 ADD
64 bits 128 bits
PC <<2 97 bits 64 bits
Instruction I
ADDR RD
32 16 32
5 5 5
Instruction
RN1 RN2 WN
Memory
RD1
Zero
Register File ALU
WD
RD2 M
U ADDR
X
Data
E MemoryRD M
U
16 X 32 X
T WD
N
D

IF/ID ID/EX EX/MEM MEM/WB

Bug in the Datapath

IF/ID ID/EX EX/MEM MEM/WB

ADD

4 ADD

PC Instruction I <<2
ADDR RD
32 16 32
5 5 5
Instruction
RN1 RN2 WN
Memory RD1
Register File ALU
WD
RD2 M
U ADDR
X
Data RD M

16
E
X 32
Memory U
X
T WD
N
D

Write register number comes from another later instruction!

Corrected Datapath
IF/ID ID/EX EX/MEM MEM/WB
ADD
ADD
4 64 bits 133 bits
<<2 102 bits 69 bits
PC
ADDR RD 5
RN1 RD1
32
Zero
Instruction 5
RN2 ALU
Memory Register
WN
5 File RD2 M
WD U ADDR
X
Data
E RD M

16 X 32
Memory U
X
T WD
N
5 D

Destination register number is also passed through ID/EX, EX/MEM

and MEM/WB registers, which are now wider by 5 bits
Pipelined Example
• Consider the following instruction sequence:
lw $t0, 10($t1)
sw $t3, 20($t4)
add $t5, $t6, $t7
sub $t8, $t9, $t10
Single-Clock-Cycle Diagram: Clock Cycle 1
LW
Single-Clock-Cycle Diagram: Clock Cycle 2
SW LW
Single-Clock-Cycle Diagram: Clock Cycle 3
ADD SW LW
Single-Clock-Cycle Diagram: Clock Cycle 4
SUB ADD SW LW
Single-Clock-Cycle Diagram: Clock Cycle 5
SUB ADD SW LW
Single-Clock-Cycle Diagram: Clock Cycle 6
SUB ADD SW
Single-Clock-Cycle Diagram: Clock Cycle 7
SUB ADD
Single-Clock-Cycle Diagram: Clock Cycle 8
SUB
Alternative View – Multiple-Clock-Cycle Diagram

CC 1 CC 2 CC 3 CC 4 CC 5 CC 6 CC 7 CC 8
Time axis
IM REG ALU DM REG
lw $t0, 10($t1)

IM REG ALU DM REG

sw $t3, 20($t4)

add $t5, $t6, $t7 IM REG ALU DM REG

sub $t8, $t9, $t10 IM REG ALU DM REG

Notes
• One significant difference in the execution of an R-type instruction
between multi-cycle and pipelined implementations:
– register write-back for the R-type instruction is the 5th (the last write-
back) pipeline stage vs. the 4th stage for the multi-cycle
implementation. Why?
– think of structural hazards when writing to the register file…
• Worth repeating: the essential difference between the pipeline and
multi-cycle implementations is the insertion of pipeline registers to
decouple the 5 stages
• The CPI of an ideal pipeline (no stalls) is 1. Why?

Lect8 Pipelined DP Control
No ratings yet
Lect8 Pipelined DP Control
59 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
77 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
CODch 6 Slides
No ratings yet
CODch 6 Slides
77 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
85 pages
Chapter 6
No ratings yet
Chapter 6
43 pages
Module 4-Pipelining
No ratings yet
Module 4-Pipelining
39 pages
L15 MipsPipeline
No ratings yet
L15 MipsPipeline
26 pages
Pipelining for Enhanced Performance
No ratings yet
Pipelining for Enhanced Performance
71 pages
Lecture 13 Pipelining
No ratings yet
Lecture 13 Pipelining
12 pages
L117-19 MIPS Pipeline Implementation
No ratings yet
L117-19 MIPS Pipeline Implementation
37 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
L14 MipsPipeline Ovw
No ratings yet
L14 MipsPipeline Ovw
17 pages
Chapter 10 Principles of Pipelining
No ratings yet
Chapter 10 Principles of Pipelining
124 pages
8 Pipeline DDP Control
No ratings yet
8 Pipeline DDP Control
54 pages
16.482 / 16.561 Computer Architecture and Design: Instructor: Dr. Michael Geiger Fall 2013
No ratings yet
16.482 / 16.561 Computer Architecture and Design: Instructor: Dr. Michael Geiger Fall 2013
42 pages
MIPS Pipeline: Data and Control Path Data and Control Path
No ratings yet
MIPS Pipeline: Data and Control Path Data and Control Path
46 pages
Pipelined Processor Execution Diagram
100% (1)
Pipelined Processor Execution Diagram
31 pages
Understanding Pipelining and Hazards
No ratings yet
Understanding Pipelining and Hazards
19 pages
CS 162 Computer Architecture Lecture 3: Pipelining Contd.: Instructor: L.N. Bhuyan
No ratings yet
CS 162 Computer Architecture Lecture 3: Pipelining Contd.: Instructor: L.N. Bhuyan
21 pages
Pipelining in Computer Architecture
No ratings yet
Pipelining in Computer Architecture
64 pages
CH 6
No ratings yet
CH 6
29 pages
Ca06 2014 PDF
No ratings yet
Ca06 2014 PDF
53 pages
Pipelining & Vector Processing Guide
No ratings yet
Pipelining & Vector Processing Guide
29 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
39 pages
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
No ratings yet
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
58 pages
MIPS Pipelining and Hazards Explained
No ratings yet
MIPS Pipelining and Hazards Explained
48 pages
3.2 Pipeline Processing
No ratings yet
3.2 Pipeline Processing
18 pages
MIPS Processor Architecture Overview
No ratings yet
MIPS Processor Architecture Overview
70 pages
CS530 Fall2015 Lecture9
No ratings yet
CS530 Fall2015 Lecture9
5 pages
Lecture 9
No ratings yet
Lecture 9
21 pages
Pipelining - Modified1
No ratings yet
Pipelining - Modified1
51 pages
Lecture Notes Pipelining Stages 7B
No ratings yet
Lecture Notes Pipelining Stages 7B
7 pages
CA Unit-2 Chapter-2
No ratings yet
CA Unit-2 Chapter-2
36 pages
Advanced Pipelining Techniques
No ratings yet
Advanced Pipelining Techniques
44 pages
Embedded Computer Architecture 5SAI0
No ratings yet
Embedded Computer Architecture 5SAI0
59 pages
MIPS Pipelining Performance and Hazards
No ratings yet
MIPS Pipelining Performance and Hazards
20 pages
Pipe Lining
No ratings yet
Pipe Lining
66 pages
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
No ratings yet
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
81 pages
Pipelining and Parallelism
No ratings yet
Pipelining and Parallelism
41 pages
Pipelining Basic Concept
No ratings yet
Pipelining Basic Concept
23 pages
Unit 5 Pipeline Hazard
No ratings yet
Unit 5 Pipeline Hazard
31 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
72 pages
Pipeline Processor Design
No ratings yet
Pipeline Processor Design
89 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
Computer Systems Pipelining Guide
No ratings yet
Computer Systems Pipelining Guide
7 pages
Computer Architecture and Organization
No ratings yet
Computer Architecture and Organization
49 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
60 pages
CAO Pipelining Lecture
No ratings yet
CAO Pipelining Lecture
50 pages
31 Pipeline Hazards 25-04-2024
No ratings yet
31 Pipeline Hazards 25-04-2024
35 pages
07 MIPS Pipelining CH4
No ratings yet
07 MIPS Pipelining CH4
73 pages
Computer Systems Architecture: Thorsten Altenkirch and Liyang Hu
No ratings yet
Computer Systems Architecture: Thorsten Altenkirch and Liyang Hu
20 pages
05 Pipelining
No ratings yet
05 Pipelining
37 pages
MIPS Pipeline Stages & Hazards
No ratings yet
MIPS Pipeline Stages & Hazards
84 pages
Lecture # Pipelining and Datahazards
No ratings yet
Lecture # Pipelining and Datahazards
70 pages
CA Unit 3 Answers
No ratings yet
CA Unit 3 Answers
10 pages
Session 1 Data 2
No ratings yet
Session 1 Data 2
6 pages
Random Variables: Complete Business Statistics, 8/e Instructor's Solutions Manual, Chapter 3
No ratings yet
Random Variables: Complete Business Statistics, 8/e Instructor's Solutions Manual, Chapter 3
33 pages
Professional Ethics: Unit 2 - Part 3 of 3: Trade-Related Aspects of Intellectual Property Rights
No ratings yet
Professional Ethics: Unit 2 - Part 3 of 3: Trade-Related Aspects of Intellectual Property Rights
36 pages
Professional Ethics: Unit 2 - Part 2 of 3: Intellectual Property
No ratings yet
Professional Ethics: Unit 2 - Part 2 of 3: Intellectual Property
46 pages
Experiment No: 08 Aim: Tool Used: Theory
No ratings yet
Experiment No: 08 Aim: Tool Used: Theory
8 pages
CSE and IT Student Registration List
No ratings yet
CSE and IT Student Registration List
4 pages
Introduction to TEX and LATEX Typesetting
No ratings yet
Introduction to TEX and LATEX Typesetting
21 pages
Sentiment Analysis: Srishti Chaubey
No ratings yet
Sentiment Analysis: Srishti Chaubey
40 pages
Conditions for n+2 to be Prime
No ratings yet
Conditions for n+2 to be Prime
1 page
NCERT Class 12 Physics Part 2 PDF
67% (3)
NCERT Class 12 Physics Part 2 PDF
254 pages
Set de Instrucciones Arquitectura ARM
No ratings yet
Set de Instrucciones Arquitectura ARM
4 pages
1.5 Addressing Modes
No ratings yet
1.5 Addressing Modes
20 pages
Assembly to Binary Code Conversion
No ratings yet
Assembly to Binary Code Conversion
3 pages
Introduction To MIPS Architecture
No ratings yet
Introduction To MIPS Architecture
10 pages
AVR Microcontrollers Overview
No ratings yet
AVR Microcontrollers Overview
30 pages
Understanding Superscalar Architecture
No ratings yet
Understanding Superscalar Architecture
9 pages
170 Infrared Keybox
No ratings yet
170 Infrared Keybox
1 page
STMicroelectronics STM32F3DISCOVERY Datasheet
No ratings yet
STMicroelectronics STM32F3DISCOVERY Datasheet
8 pages
Processors and Memory Hierarchy Overview
No ratings yet
Processors and Memory Hierarchy Overview
49 pages
Understanding Microcontrollers 8051
No ratings yet
Understanding Microcontrollers 8051
34 pages
Take A Way: Exploring The Security Implications of AMD's Cache Way Predictors
No ratings yet
Take A Way: Exploring The Security Implications of AMD's Cache Way Predictors
13 pages
Microprocessor Systems Laboratory 3
No ratings yet
Microprocessor Systems Laboratory 3
2 pages
Unit 11 - Week 9: Assignment 9
No ratings yet
Unit 11 - Week 9: Assignment 9
4 pages
5.IA 64 and Itanium Processors
No ratings yet
5.IA 64 and Itanium Processors
9 pages
Instruction Format 8051
No ratings yet
Instruction Format 8051
26 pages
Computer Architecture: Instruction Pipeline
No ratings yet
Computer Architecture: Instruction Pipeline
28 pages
Pipeline Optimization: Dinesh Sharma
No ratings yet
Pipeline Optimization: Dinesh Sharma
50 pages
Desktop P4gok0u
No ratings yet
Desktop P4gok0u
26 pages
MAA Assigment 3
No ratings yet
MAA Assigment 3
9 pages
Modern Computer Architecture
No ratings yet
Modern Computer Architecture
2 pages
Microprocessor Intel x86 Evolution and Main Features
No ratings yet
Microprocessor Intel x86 Evolution and Main Features
3 pages
Electrical Part List
100% (1)
Electrical Part List
13 pages
Microprocessor & Microcontroller From Scratch To Expert
No ratings yet
Microprocessor & Microcontroller From Scratch To Expert
15 pages
Risc Cisc
100% (5)
Risc Cisc
17 pages
8085 Instruction Set Guide
No ratings yet
8085 Instruction Set Guide
91 pages
8085 All
No ratings yet
8085 All
165 pages
Computer Architecture Quiz
No ratings yet
Computer Architecture Quiz
6 pages
Control Unit Operations in Computing
No ratings yet
Control Unit Operations in Computing
34 pages
Computer Architecture Lesson 2 (Instruction Set Architecture)
No ratings yet
Computer Architecture Lesson 2 (Instruction Set Architecture)
9 pages

Pipelined Data-Path in MIPS Architecture

Uploaded by

Pipelined Data-Path in MIPS Architecture

Uploaded by

Computer Organization

Mayank Pandey, MNNIT, Allahabad, India

Assume 2 ns for memory access, ALU operation; 1 ns for register access:

• Before actually building the pipelined datapath and control

• MIPS was designed to be pipelined: structural hazards are easy to

1/22/2019 Mayank Pandey, MNNIT, Allahabad, India 10

Mayank Pandey, MNNIT, Allahabad, India

lw $s0, 20($t1) IF ID EX MEM WB Without a stall it is impossible

lw $s0, 20($t1) IF ID EX MEM WB With a one-stage stall, forwarding

Memory RN1 RN2 WN

IF/ID ID/EX EX/MEM MEM/WB

IF/ID ID/EX EX/MEM MEM/WB

IF/ID ID/EX EX/MEM MEM/WB

Write register number comes from another later instruction!

Destination register number is also passed through ID/EX, EX/MEM

IM REG ALU DM REG

add $t5, $t6, $t7 IM REG ALU DM REG

sub $t8, $t9, $t10 IM REG ALU DM REG

You might also like