Lecture 6

The document discusses pipelining extensions such as bypassing and deeper pipelines to improve performance. It covers control hazards, RISC/CISC loads and stores, and how instructions flow through a pipeline. Examples illustrate structural hazards and show how bypassing can eliminate stalls between dependent instructions. Bypassing forwards a result from a pipeline latch directly to a later instruction that needs it as an operand, instead of waiting for the register write, which reduces pipeline stalls.

Lecture: Pipelining Extensions

• Topics: bypassing, deeper pipelines, control hazards

RISC/CISC Loads/Stores

• Registers and memory
• Complex and reduced instrs
• Format of a load/store

Pipeline Summary

                   RR                 ALU     DM         RW
ADD R3 ← R1, R2    Rd R1, R2          R1+R2   --         Wr R3
BEZ R1, [R5]       Rd R1, R5;         --      --         --
                   Compare, Set PC
LD  R6 ← 8[R3]     Rd R3              R3+8    Get data   Wr R6
ST  R6 → 8[R3]     Rd R3, R6          R3+8    Wr data    --

Problem 4
• For the following code sequence, show how the instrs flow through the pipeline:
    ADD R3 ← R1, R2
    LD  R7 ← 8[R6]
    ST  R9 → 4[R8]
    BEZ R4, [R5]

Problem 4
• For the following code sequence, show how the instrs flow through the pipeline:
    ADD R3 ← R1, R2
    LD  R7 ← 8[R6]
    ST  R9 → 4[R8]
    BEZ R4, [R5]

       CYC-1  CYC-2  CYC-3  CYC-4  CYC-5  CYC-6
ADD    IF     RR     ALU    DM     RW
LD            IF     RR     ALU    DM     RW
ST                   IF     RR     ALU    DM
BEZ                         IF     RR

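The flow above can be reproduced with a small sketch. It assumes one instruction enters the pipeline per cycle with no stalls, and that an instruction is shown only up to the last stage in which the Pipeline Summary table has it doing work. The names `STAGES`, `LAST_STAGE`, and `flow` are illustrative and do not come from the slides.

```python
# Stages of the 5-stage pipeline, in order.
STAGES = ["IF", "RR", "ALU", "DM", "RW"]

# Assumption taken from the Pipeline Summary table: BEZ finishes in RR,
# ST finishes in DM, and ADD/LD finish by writing a register in RW.
LAST_STAGE = {"ADD": "RW", "LD": "RW", "ST": "DM", "BEZ": "RR"}


def flow(opcodes):
    """Return one {cycle: stage} dict per instruction.

    Instruction i (0-based) is fetched in cycle i + 1 and advances one
    stage per cycle; hazards and stalls are ignored in this sketch.
    """
    table = []
    for i, op in enumerate(opcodes):
        last = STAGES.index(LAST_STAGE[op])
        table.append({i + 1 + s: STAGES[s] for s in range(last + 1)})
    return table


if __name__ == "__main__":
    for op, row in zip(["ADD", "LD", "ST", "BEZ"], flow(["ADD", "LD", "ST", "BEZ"])):
        print(op, row)
```

In this model ST occupies four stages and BEZ two, which matches the number of times each instruction name appears in the slide's answer.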
Hazards

• Structural hazards: different instructions in different stages (or the same stage) contend for the same resource

• Data hazards: an instruction cannot continue because it needs a value that has not yet been generated by an earlier instruction

• Control hazards: fetch cannot continue because it does not know the outcome of an earlier branch. This is a special case of a data hazard, kept as a separate category because the two are treated in different ways

Structural Hazards

• Example: a unified instruction and data cache → stage 4 (MEM) and stage 1 (IF) can never coincide

• The later instruction and all its successors are delayed until a cycle is found when the resource is free → these are pipeline bubbles

• Structural hazards are easy to eliminate: increase the number of resources (for example, implement separate instruction and data caches)

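The unified-cache example can be illustrated with a minimal sketch. It assumes a single-ported cache, that only loads and stores use the cache in stage 4, and that there are no other stalls, so an instruction fetched in cycle f reaches stage 4 in cycle f + 3. The function name `fetch_cycles` and its `unified_cache` flag are hypothetical.

```python
def fetch_cycles(opcodes, unified_cache=True):
    """Return the cycle in which each instruction is fetched.

    With a unified cache, a fetch is pushed back (a bubble is inserted)
    whenever an earlier LD/ST is using the cache in its memory stage.
    """
    mem_busy = set()  # cycles in which an earlier LD/ST occupies the cache
    fetches = []
    cycle = 1
    for op in opcodes:
        if unified_cache:
            while cycle in mem_busy:
                cycle += 1  # bubble: this instruction and its successors slip
        fetches.append(cycle)
        if op in ("LD", "ST"):
            mem_busy.add(cycle + 3)  # IF at cycle, memory stage 3 cycles later
        cycle += 1
    return fetches


if __name__ == "__main__":
    program = ["LD", "ADD", "ADD", "ADD", "ADD"]
    print(fetch_cycles(program, unified_cache=True))
    print(fetch_cycles(program, unified_cache=False))
```

With separate instruction and data caches (`unified_cache=False`) the bubble disappears, which is the fix the slide recommends.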
Problem 5
• Show the instruction occupying each stage in each cycle (no bypassing) if I1 is R1+R2→R3 and I2 is R3+R4→R5 and I3 is R7+R8→R9

       CYC-1  CYC-2  CYC-3  CYC-4  CYC-5  CYC-6  CYC-7  CYC-8
IF
D/R
ALU
DM
RW

Problem 5
• Show the instruction occupying each stage in each cycle (no bypassing) if I1 is R1+R2→R3 and I2 is R3+R4→R5 and I3 is R7+R8→R9

       CYC-1  CYC-2  CYC-3  CYC-4  CYC-5  CYC-6  CYC-7  CYC-8
IF     I1     I2     I3     I3     I3     I4     I5
D/R           I1     I2     I2     I2     I3     I4
ALU                  I1                   I2     I3
DM                          I1                   I2     I3
RW                                 I1                   I2

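The stall pattern in this answer can be checked with a short scheduling sketch. It assumes an in-order pipeline without bypassing, where RW writes the register file in the first half of a cycle and D/R reads it in the second half, so a consumer's last D/R cycle may coincide with the producer's RW cycle. The function name `schedule_no_bypass` and the `(dest, sources)` instruction encoding are illustrative choices, not part of the slides.

```python
def schedule_no_bypass(instrs):
    """instrs: list of (dest_reg, [source_regs]).

    Returns, per instruction, a dict mapping each stage name to the list
    of cycles the instruction spends in that stage.
    """
    rw_cycle = {}  # register -> cycle in which its latest producer is in RW
    schedule = []
    prev_dr_start = prev_dr_end = 0
    for dest, srcs in instrs:
        # IF frees up in the cycle the previous instruction enters D/R.
        if_start = max(1, prev_dr_start)
        # D/R frees up once the previous instruction has left it.
        dr_start = max(if_start + 1, prev_dr_end + 1)
        # Wait in D/R until every source has been written back.
        dr_end = max([dr_start] + [rw_cycle.get(r, 0) for r in srcs])
        schedule.append({
            "IF": list(range(if_start, dr_start)),
            "D/R": list(range(dr_start, dr_end + 1)),
            "ALU": [dr_end + 1],
            "DM": [dr_end + 2],
            "RW": [dr_end + 3],
        })
        rw_cycle[dest] = dr_end + 3
        prev_dr_start, prev_dr_end = dr_start, dr_end
    return schedule


if __name__ == "__main__":
    program = [("R3", ["R1", "R2"]), ("R5", ["R3", "R4"]), ("R9", ["R7", "R8"])]
    for name, row in zip(["I1", "I2", "I3"], schedule_no_bypass(program)):
        print(name, row)
```

I2 sits in D/R for three cycles (two stall cycles plus its normal one), and I3 is held in IF behind it even though it has no dependence of its own.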
Bypassing: 5-Stage Pipeline

[Figure: 5-stage pipeline with bypassing; pipeline latches labeled PC/L1, L2, L3, L4, L5. Source: H&P textbook]

Problem 6
• Show the instruction occupying each stage in each cycle (with bypassing) if I1 is R1+R2→R3 and I2 is R3+R4→R5 and I3 is R3+R8→R9. Identify the input latch for each input operand.

       CYC-1  CYC-2  CYC-3  CYC-4  CYC-5  CYC-6  CYC-7  CYC-8
IF
D/R
ALU
DM
RW

Problem 6
• Show the instruction occupying each stage in each cycle (with bypassing) if I1 is R1+R2→R3 and I2 is R3+R4→R5 and I3 is R3+R8→R9. Identify the input latch for each input operand.

            CYC-1   CYC-2   CYC-3   CYC-4   CYC-5   CYC-6   CYC-7   CYC-8
IF          I1      I2      I3      I4      I5
D/R                 I1      I2      I3      I4
Latches                     L3, L3  L4, L3  L5, L3
ALU                         I1      I2      I3
DM                                  I1      I2      I3
RW                                          I1      I2      I3

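The latch choices in this answer follow a simple rule, sketched below under stated assumptions: every instruction is a two-source ALU operation, instructions issue back to back with no stalls, L3 holds register-file values read in D/R, L4 holds the previous instruction's ALU result, and L5 holds the result of the instruction two ahead. The function name `alu_input_latches` and the tuple encoding are hypothetical.

```python
def alu_input_latches(instrs):
    """instrs: list of (dest_reg, (src1, src2)) ALU operations.

    Returns one tuple of latch names per instruction, giving the latch
    that supplies each source operand when the instruction is in ALU.
    """
    result = []
    for i, (_, srcs) in enumerate(instrs):
        latches = []
        for reg in srcs:
            if i >= 1 and instrs[i - 1][0] == reg:
                latches.append("L4")  # produced by the instruction just ahead
            elif i >= 2 and instrs[i - 2][0] == reg:
                latches.append("L5")  # produced two instructions ahead
            else:
                latches.append("L3")  # value came from the register file
        result.append(tuple(latches))
    return result


if __name__ == "__main__":
    program = [("R3", ("R1", "R2")), ("R5", ("R3", "R4")), ("R9", ("R3", "R8"))]
    print(alu_input_latches(program))
```

The most recent producer is checked first, so if two earlier instructions write the same register the newer value is the one forwarded.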
Pipeline Implementation
• Signals for the muxes have to be generated – some of this can happen during ID
• Need look-up tables in decode stage to identify situations that merit bypassing/stalling
– the number of inputs to the muxes goes up

Problem 7

• For the 5-stage pipeline (RR and RW take half a cycle)

    IF | D/RR | AL | DM | RW

• For the following pairs of instructions, how many stalls will the 2nd instruction experience (with and without bypassing)?

– ADD R3 ← R1+R2
  ADD R5 ← R3+R4
– LD R2 ← [R1]
  ADD R4 ← R2+R3
– LD R2 ← [R1]
  SD R3 → [R2]
– LD R2 ← [R1]
  SD R2 → [R3]

Problem 7

• For the 5-stage pipeline (RR and RW take half a cycle)

    IF | D/RR | AL | DM | RW

• For the following pairs of instructions, how many stalls will the 2nd instruction experience (with and without bypassing)?

– ADD R3 ← R1+R2
  ADD R5 ← R3+R4      without: 2   with: 0
– LD R2 ← [R1]
  ADD R4 ← R2+R3      without: 2   with: 1
– LD R2 ← [R1]
  SD R3 → [R2]        without: 2   with: 1
– LD R2 ← [R1]
  SD R2 → [R3]        without: 2   with: 0

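These stall counts can be derived with a small sketch. It assumes the producer is fetched in cycle 1 and the consumer in cycle 2, that an ADD result exists at the end of the ALU stage (cycle 3) and a load result at the end of DM (cycle 4), and that the consumer needs the value either at the start of its ALU stage (operand or address) or at the start of its DM stage (store data). The function name `stalls` and its parameters are illustrative.

```python
def stalls(producer, needed_in, bypassing):
    """Stall cycles for the 2nd instruction of a dependent pair.

    producer:  "ADD" or "LD" (the 1st instruction, fetched in cycle 1)
    needed_in: "ALU" if the 2nd instruction uses the value as an ALU
               operand or address, "DM" if it is store data
    """
    if not bypassing:
        # The value is only visible through the register file. The producer
        # is in RW in cycle 5; the consumer's D/R would normally be cycle 3.
        # RW writes in the first half-cycle and D/R reads in the second.
        return 5 - 3
    produced_at_end_of = {"ADD": 3, "LD": 4}[producer]
    normally_needed_in = {"ALU": 4, "DM": 5}[needed_in]
    return max(0, produced_at_end_of + 1 - normally_needed_in)


if __name__ == "__main__":
    pairs = [
        ("ADD then dependent ADD", "ADD", "ALU"),
        ("LD then dependent ADD", "LD", "ALU"),
        ("LD then SD using it as address", "LD", "ALU"),
        ("LD then SD using it as data", "LD", "DM"),
    ]
    for label, producer, needed_in in pairs:
        print(label, stalls(producer, needed_in, False), stalls(producer, needed_in, True))
```

The last pair shows why a load followed by a store of the loaded value needs no stall with bypassing: the store does not need its data until DM, one cycle later than an ALU operand would be needed.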
Summary

• For the 5-stage pipeline, bypassing can eliminate delays between the following example pairs of instructions:

    add/sub R1, R2, R3
    add/sub/lw/sw R4, R1, R5

    lw R1, 8(R2)
    sw R1, 4(R3)

• The following pairs of instructions will have intermediate stalls:

    lw R1, 8(R2)
    add/sub/lw R3, R1, R4 or sw R3, 8(R1)

    fmul F1, F2, F3
    fadd F5, F1, F4

