
Computer Organization and Assembly Language: Pipeline: Introduction

This document introduces pipelining in computer processors. It defines pipelining as a way to speed up the execution of instructions by overlapping the execution of multiple instructions. It provides an analogy using a laundry process to illustrate how pipelining works by performing tasks like washing, drying, and folding clothes simultaneously across multiple loads of laundry. The document then discusses how pipelining can be applied to a digital system by breaking computations into stages separated by pipeline registers. Finally, it shows how pipelining can be applied to a MIPS processor by splitting the instruction execution process into five stages - fetch, decode, execute, memory, and writeback.

Uploaded by

Aroosa Sheikh


Computer Organization and Assembly Language

Pipeline: Introduction

CSCE430/830 Pipeline
Pipelining Outline

• Introduction 
– Defining Pipelining
– Pipelining Instructions
• Hazards
– Structural hazards
– Data Hazards
– Control Hazards
• Performance
• Controller implementation

What is Pipelining?

• A way of speeding up execution of instructions

• Key idea: overlap execution of multiple instructions

The Laundry Analogy

• Ann, Brian, Cathy, and Dave each have one load of clothes (A, B, C, D) to wash, dry, and fold
• Washer takes 30 minutes
• Dryer takes 30 minutes
• “Folder” takes 30 minutes
• “Stasher” takes 30 minutes to put clothes into drawers

If we do laundry sequentially...

[Figure: timeline from 6 PM to 2 AM in 30-minute slots; loads A, B, C, and D in task order, each occupying four consecutive slots, one load after another.]
• Time Required: 8 hours for 4 loads

To Pipeline, We Overlap Tasks

[Figure: timeline starting at 6 PM; loads A, B, C, and D in task order, overlapped across seven 30-minute slots.]
• Time Required: 3.5 Hours for 4 Loads
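The two totals above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming four 30-minute steps per load (washer, dryer, folder, stasher) as on the laundry slide; the function names are made up for illustration.

```python
# Laundry timing sketch: four loads, four 30-minute steps each.
STAGE_MINUTES = 30
STAGES = ["wash", "dry", "fold", "stash"]
LOADS = ["A", "B", "C", "D"]

def sequential_minutes(num_loads, num_stages, stage_minutes):
    """Each load finishes all of its steps before the next load starts."""
    return num_loads * num_stages * stage_minutes

def pipelined_minutes(num_loads, num_stages, stage_minutes):
    """A new load starts every slot; the last load starts in slot
    num_loads - 1 and then occupies num_stages slots."""
    return (num_stages + num_loads - 1) * stage_minutes

seq = sequential_minutes(len(LOADS), len(STAGES), STAGE_MINUTES)
pipe = pipelined_minutes(len(LOADS), len(STAGES), STAGE_MINUTES)
print(f"sequential: {seq / 60} hours")   # 8.0
print(f"pipelined:  {pipe / 60} hours")  # 3.5
```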

To Pipeline, We Overlap Tasks

[Figure: timeline starting at 6 PM; loads A, B, C, and D in task order, overlapped across seven 30-minute slots.]

• Pipelining doesn’t help latency of a single task; it helps throughput of the entire workload
• Pipeline rate is limited by the slowest pipeline stage
• Multiple tasks operate simultaneously
• Potential speedup = number of pipe stages
• Unbalanced lengths of pipe stages reduce speedup
• Time to “fill” the pipeline and time to “drain” it reduce speedup
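The effects of fill/drain time and of unbalanced stages can be explored numerically. The sketch below assumes a lock-step pipeline in which every slot lasts as long as the slowest stage; `pipeline_speedup` and the unbalanced stage times are hypothetical, chosen only so that both cases do the same 120 minutes of work per task.

```python
def pipeline_speedup(num_tasks, stage_times):
    """Speedup of a pipeline over doing every task start to finish, one at a time.

    Assumes the pipeline advances in lock step at the pace of its slowest stage.
    """
    slowest = max(stage_times)
    unpipelined = num_tasks * sum(stage_times)
    pipelined = (len(stage_times) + num_tasks - 1) * slowest
    return unpipelined / pipelined

balanced = [30, 30, 30, 30]
unbalanced = [30, 60, 15, 15]   # hypothetical: same total work, one slow stage

for n in (4, 100, 10_000):
    print(n,
          round(pipeline_speedup(n, balanced), 2),
          round(pipeline_speedup(n, unbalanced), 2))
```

With few tasks, fill and drain time keeps the speedup well below the number of stages. As the task count grows, the balanced case approaches 4 (the number of stages), while the unbalanced case approaches only 2 (total work divided by the slowest stage).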
Pipelining a Digital System
1 nanosecond = 10^-9 second
1 picosecond = 10^-12 second

• Key idea: break big computation up into pieces

[Figure: one block of logic with a 1ns delay]

• Separate each piece with a pipeline register

[Figure: the same logic split into five 200ps stages separated by pipeline registers]
Pipelining a Digital System

• Why do this? Because it's faster for repeated computations

Non-pipelined: 1 operation finishes every 1ns
[Figure: one 1ns block]

Pipelined: 1 operation finishes every 200ps
[Figure: five 200ps stages]

Comments about pipelining

• Pipelining increases throughput, but not latency
– Answer available every 200ps, BUT
– A single computation still takes 1ns
• Limitations:
– Computations must be divisible into stage-sized pieces
– Pipeline registers add overhead
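The throughput/latency distinction and the cost of register overhead can be put into numbers. This is a rough sketch, assuming the 1ns of logic splits evenly across stages; `pipeline_timing` is a made-up helper and the 20ps register overhead is a hypothetical value, not one from the slides.

```python
def pipeline_timing(total_logic_ps, num_stages, register_overhead_ps=0):
    """Return (cycle time, latency of one computation) in picoseconds.

    Assumes the logic splits evenly across the stages.
    """
    cycle = total_logic_ps / num_stages + register_overhead_ps
    latency = cycle * num_stages
    return cycle, latency

print(pipeline_timing(1000, 1))      # unpipelined: (1000.0, 1000.0)
print(pipeline_timing(1000, 5))      # ideal 5 stages: (200.0, 1000.0)
print(pipeline_timing(1000, 5, 20))  # hypothetical 20ps overhead: (220.0, 1100.0)
```

The cycle time sets how often an answer becomes available (throughput), while the latency of any one computation never drops below the original 1ns and grows once register overhead is counted.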

Pipelining a Processor

• Recall the 5 steps in instruction execution:

1. Instruction Fetch (IF)
2. Instruction Decode and Register Read (ID)
3. Execute operation or calculate address (EX)
4. Memory access (MEM)
5. Write result into register (WB)

• Review: Single-Cycle Processor

– All 5 steps done in a single clock cycle
– Dedicated hardware required for each step

Review - Single-Cycle Processor

• What do we need to add to actually split the datapath into stages?
The Basic Pipeline For MIPS

[Figure: pipeline diagram with columns labeled Cycle 1 through Cycle 7; four instructions in instruction order, each shown as the stage sequence Ifetch, Reg, ALU, DMem, Reg.]

What do we need to add to actually split the datapath into stages?
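A diagram of this kind can be generated programmatically. The sketch below assumes an ideal pipeline (one instruction enters per cycle, no stalls) and uses the stage abbreviations IF, ID, EX, MEM, WB from the five-step list; the instruction names and both helper functions are placeholders for illustration.

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]

def pipeline_diagram(instructions):
    """Build rows showing which stage each instruction occupies in each cycle.

    Assumes an ideal pipeline: one instruction enters per cycle, nothing stalls.
    """
    total_cycles = len(instructions) + len(STAGES) - 1
    rows = []
    for i, name in enumerate(instructions):
        cells = ["" for _ in range(total_cycles)]
        for s, stage in enumerate(STAGES):
            cells[i + s] = stage   # instruction i enters the pipeline in cycle i
        rows.append((name, cells))
    return rows

def render(rows):
    """Format the rows as a fixed-width text table with a cycle header."""
    total_cycles = len(rows[0][1])
    header = " " * 8 + "".join(f"{'C' + str(c + 1):<5}" for c in range(total_cycles))
    lines = [header]
    for name, cells in rows:
        lines.append(f"{name:<8}" + "".join(f"{cell:<5}" for cell in cells))
    return "\n".join(lines)

print(render(pipeline_diagram(["inst1", "inst2", "inst3", "inst4"])))
```

Reading one column of the output shows which instructions are in flight during that cycle, which is where the overlap comes from.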

Basic Pipelined Processor

Pipeline example: lw

[Figures: five slides following the lw instruction through the stages IF, ID, EX, MEM, and WB in turn]

Single-Cycle vs. Pipelined Execution

Non-pipelined (time axis 0 to 1800ps):
lw $1, 100($0): Instruction Fetch, REG RD, ALU, MEM, REG WR
lw $2, 200($0): starts 800ps after the first lw
lw $3, 300($0): starts 800ps after the second lw

Pipelined (time axis 0 to 1600ps):
lw $1, 100($0): Instruction Fetch, REG RD, ALU, MEM, REG WR
lw $2, 200($0): starts 200ps after the first lw
lw $3, 300($0): starts 200ps after the second lw
Each of the five stages is given 200ps
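The totals implied by the timing diagram can be worked out directly. This is a sketch using the figure's numbers (800ps per instruction without pipelining, five 200ps stages with it); the function names are hypothetical.

```python
def single_cycle_total_ps(num_instructions, instruction_ps=800):
    """Each instruction runs to completion before the next one starts."""
    return num_instructions * instruction_ps

def pipelined_total_ps(num_instructions, num_stages=5, stage_ps=200):
    """The first instruction takes num_stages cycles; each later one
    finishes one cycle after its predecessor."""
    return (num_stages + num_instructions - 1) * stage_ps

for n in (3, 1_000_000):
    single = single_cycle_total_ps(n)
    piped = pipelined_total_ps(n)
    print(n, single, piped, round(single / piped, 2))
```

For the three lw instructions this gives 2400ps against 1400ps. Note that one pipelined lw takes 5 × 200ps = 1000ps, longer than the 800ps it needs on its own, so the gain is in throughput only; over a long instruction stream the ratio approaches 800/200 = 4.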

Speedup
• Consider the unpipelined processor introduced previously. Assume that
it has a 1 ns clock cycle and that it uses 4 cycles for ALU operations and
branches, and 5 cycles for memory operations. Assume that the relative
frequencies of these operations are 40%, 20%, and 40%, respectively.
Suppose that due to clock skew and setup, pipelining the processor
adds 0.2 ns of overhead to the clock. Ignoring any latency impact, how
much speedup in the instruction execution rate will we gain from a
pipeline?

Average instruction execution time (unpipelined)

= 1 ns * ((40% + 20%)*4 + 40%*5)
= 4.4 ns

Average instruction execution time (pipelined)

= 1 ns + 0.2 ns overhead
= 1.2 ns (ideally, one instruction completes every clock cycle)

Speedup from pipeline

= Average instruction time unpipelined / Average instruction time pipelined
= 4.4 ns / 1.2 ns ≈ 3.7
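The same calculation can be written as a short script, which makes it easy to try other instruction mixes. This is a sketch of the worked example only; the dictionary layout and function names are assumptions, and the pipelined case assumes one instruction completes per clock cycle with no stalls.

```python
CLOCK_NS = 1.0
OVERHEAD_NS = 0.2

# (relative frequency, cycles on the unpipelined processor)
MIX = {
    "alu": (0.40, 4),
    "branch": (0.20, 4),
    "memory": (0.40, 5),
}

def unpipelined_avg_ns(mix, clock_ns):
    """Frequency-weighted average cycles per instruction, times the clock."""
    return clock_ns * sum(freq * cycles for freq, cycles in mix.values())

def pipelined_avg_ns(clock_ns, overhead_ns):
    # Assumes one instruction completes per (slower) clock cycle, no stalls.
    return clock_ns + overhead_ns

unpiped = unpipelined_avg_ns(MIX, CLOCK_NS)
piped = pipelined_avg_ns(CLOCK_NS, OVERHEAD_NS)
print(round(unpiped, 2), round(piped, 2), round(unpiped / piped, 2))
```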

Comments about Pipelining

• The good news
– Multiple instructions are being processed at the same time
– This works because stages are isolated by registers
– Best-case speedup of N, the number of pipeline stages
• The bad news
– Instructions interfere with each other: hazards
» Example: different instructions may need the same piece of hardware (e.g., memory) in the same clock cycle
» Example: an instruction may require a result produced by an earlier instruction that is not yet complete

Pipeline Hazards

• Limits to pipelining: hazards prevent the next instruction from executing during its designated clock cycle
– Structural hazards: two different instructions use the same hardware in the same cycle
– Data hazards: an instruction depends on the result of a prior instruction still in the pipeline
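A data hazard of the kind described above can be spotted by comparing the register each instruction writes with the registers the next few instructions read. The sketch below is a simplified illustration, not how the hardware does it: the `Instr` record, the `raw_hazards` function, the two-instruction window, and the example program are all assumptions, and it ignores cases such as an intermediate instruction overwriting the same register.

```python
from typing import NamedTuple, Optional, Tuple

class Instr(NamedTuple):
    text: str
    dest: Optional[str]      # register written, if any
    srcs: Tuple[str, ...]    # registers read

def raw_hazards(program, window=2):
    """Find read-after-write dependences between nearby instructions.

    window is how many following instructions might still find the producer
    in the pipeline; 2 is an assumption, not a figure from the slides.
    """
    found = []
    for i, producer in enumerate(program):
        if producer.dest is None:
            continue
        for consumer in program[i + 1 : i + 1 + window]:
            if producer.dest in consumer.srcs:
                found.append((producer.text, consumer.text, producer.dest))
    return found

program = [
    Instr("lw  $1, 100($0)", "$1", ("$0",)),
    Instr("add $2, $1, $1", "$2", ("$1",)),
    Instr("sub $3, $4, $5", "$3", ("$4", "$5")),
]
for producer, consumer, reg in raw_hazards(program):
    print(f"{consumer!r} needs {reg} from {producer!r}")
```

Here the add reads $1 while the lw that produces it is still in the pipeline, so the pair is reported; the sub uses unrelated registers and is not.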

Summary - Pipelining Overview

• Pipelining increases throughput (but not latency)
• Hazards limit performance
– Structural hazards
– Data hazards

Pipelining Outline

• Introduction
– Defining Pipelining
– Pipelining Instructions
• Hazards
– Structural hazards 
– Data Hazards
• Performance
