0% found this document useful (0 votes)

22 views30 pages

Lec 1

Data

Uploaded by

alhdhyryfwlyd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views30 pages

Lec 1

Data

Uploaded by

alhdhyryfwlyd

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 30

Pipeline: Hazards

Fall, 2017

These slides are adapted from notes by Dr. David Patterson (UCB)

1
Single-Cycle vs. Pipelined Execution

Non-Pipelined
Instruction 0 200 400 600 800 1000 1200 1400 1600 1800
Order Time
l w $ 1 , 1 0 0 ( $ 0 ) Instruction REG ALU MEM REG
Fetch RD WR
Instruction REG REG
lw $2, 200($0) ALU MEM
Fetch RD WR
800ps
Instruction
lw $3, 300($0) Fetch
800ps
800ps
Pipelined
Instruction 0 200 400 600 800 1000 1200 1400 1600
Order Time
l w $ 1 , 1 0 0 ( $ 0 ) Instruction REG ALU MEM REG
Fetch RD WR
Instruction REG REG
lw $2, 200($0) ALU MEM
Fetch RD WR
200ps
Instruction REG REG
lw $3, 300($0) ALU MEM
Fetch RD WR
200ps
200ps 200ps 200ps 200ps 200ps

2
Speedup
• Consider the unpipelined processor introduced previously. Assume that it has
a 1 ns clock cycle and it uses 4 cycles for ALU operations and branches, and
5 cycles for memory operations, assume that the relative frequencies of these
operations are 40%, 20%, and 40%, respectively. Suppose that due to clock
skew and setup, pipelining the processor adds 0.2ns of overhead to the clock.
Ignoring any latency impact, how much speedup in the instruction execution
rate will we gain from a pipeline?

Average instruction execution time

= 1 ns * ((40% + 20%)*4 + 40%*5)
= 4.4ns

Speedup from pipeline

= Average instruction time unpiplined/Average instruction time pipelined
= 4.4ns/1.2ns = 3.7

3
Comments about Pipelining

• The good news

– Multiple instructions are being processed at same time
– This works because stages are isolated by registers
– Best case speedup of N
• The bad news
– Instructions interfere with each other - hazards
• Example: different instructions may need the same piece of
hardware (e.g., memory) in same clock cycle
• Example: instruction may require a result produced by an
earlier instruction that is not yet complete

4
Pipeline Hazards
• Limits to pipelining: Hazards prevent next
instruction from executing during its
designated clock cycle
– Structural hazards: two different instructions use
same h/w in same cycle
– Data hazards: Instruction depends on result of
prior instruction still in the pipeline
– Control hazards: Pipelining of branches & other
instructions that change the PC

5
Structural Hazards
• Attempt to use same resource twice at same time
• Example: Single Memory for instructions, data
– Accessed by IF stage
– Accessed at same time by MEM stage
• Solutions ?
– Delay second access by one clock cycle
– Provide separate memories for instructions, data
•This is what the book does
•This is called a “Harvard Architecture”
•Real pipelined processors have separate caches
6
Pipelined Example -
Executing Multiple Instructions
• Consider the following instruction
sequence:
lw $r0, 10($r1)
sw $sr3, 20($r4)
add $r5, $r6, $r7
sub $r8, $r9, $r10

7
Executing Multiple Instructions
Clock Cycle 1
LW

8
Executing Multiple Instructions
Clock Cycle 2
SW LW

9
Executing Multiple Instructions
Clock Cycle 3
ADD SW LW

10
Executing Multiple Instructions
Clock Cycle 4
SUB ADD SW LW

11
Executing Multiple Instructions
Clock Cycle 5
SUB ADD SW LW

12
Executing Multiple Instructions
Clock Cycle 6
SUB ADD SW

13
Executing Multiple Instructions
Clock Cycle 7
SUB ADD

14
Executing Multiple Instructions
Clock Cycle 8
SUB

15
Alternative View - Multicycle Diagram
CC 1 CC 2 CC 3 CC 4 CC 5 CC 6 CC 7 CC 8

lw $r0, 10($r1) IM REG ALU DM REG

sw $r3, 20($r4) IM REG ALU DM REG

add $r5, $r6, $r7 IM REG ALU DM REG

sub $r8, $r9, $r10 IM REG ALU DM REG

16
Alternative View - Multicycle Diagram
CC 1 CC 2 CC 3 CC 4 CC 5 CC 6 CC 7 CC 8

lw $r0, 10($r1) IM REG ALU DM REG

Memory Conflict

sw $r3, 20($r4) IM REG ALU DM REG

add $r5, $r6, $r7 IM REG ALU DM REG

sub $r8, $r9, $r10 IM REG ALU DM REG

17
One Memory Port Structural Hazards
Time (clock cycles)
Cycle 1 Cycle 2 Cycle 3 Cycle 4 Cycle 5 Cycle 6 Cycle 7

ALU
Ifetch Reg DMem Reg
Load
n
s

ALU
Ifetch Reg DMem Reg
t
Instr 1
r.

ALU
Ifetch Reg DMem Reg
Instr 2
O
r
Stall Bubble Bubble Bubble Bubble Bubble
d
e
r

ALU
Ifetch Reg DMem Reg
Instr 3
18
Structural Hazards
Some Common Structural Hazards:
• Memory:
– we’ve already mentioned this one.
• Floating point:
– Since many floating point instructions require many
cycles, it’s easy for them to interfere with each other.
• Starting up more of one type of instruction than
there are resources.
– For instance, the PA-8600 can support two ALU + two
load/store instructions per cycle - that’s how much
hardware it has available.
19
Dealing with Structural Hazards
Stall
– low cost, simple
– Increases CPI
– use for rare case since stalling has performance effect
Pipeline hardware resource
– useful for multi-cycle resources
– good performance
– sometimes complex e.g., RAM
Replicate resource
– good performance
– increases cost (+ maybe interconnect delay)
– useful for cheap or divisible resources
20
Structural Hazards
• Structural hazards are reduced with these rules:
– Each instruction uses a resource at most once
– Always use the resource in the same pipeline stage
– Use the resource for one cycle only
• Many RISC ISA’s designed with this in mind
• Sometimes very complex to do this.
– For example, memory of necessity is used in the IF and
MEM stages.

21
Structural Hazards
We want to compare the performance of two machines.
Which machine is faster?
– Machine A: Dual ported memory - so there are no memory
stalls
– Machine B: Single ported memory, but its pipelined
implementation has a 1.05 times faster clock rate
Assume:
– Ideal CPI = 1 for both
– Loads are 40% of instructions executed

22
Speedup from Pipelining
Speedup from pipelining =
Average instruction time unpipelined
Average instruction time pipelined
CPI unpipelined ×Clock cycle unpipelined
CPI pipelined ×Clock cycle pipelined

CPI pipelined = Ideal CPI + Pipeline stall clock

cycles per instruction
23
CPI unpipelined = Ideal CPI ×Pipeline depth
Speed Up Equations for Pipelining

CPIpipelined = Ideal CPI + Average Stall cycles per Inst

Ideal CPI × Pipeline depth Cycle Timeunpipelined

Speedup = ×
Ideal CPI + Pipeline stall CPI Cycle Timepipelined

For simple RISC pipeline, the Ideal CPI on a pipelined

processor = 1:

Pipeline depth Cycle Timeunpipelined

Speedup = ×
1 + Pipeline stall CPI Cycle Timepipelined

24
Structural Hazards
We want to compare the performance of two machines. Which machine is faster?
• Machine A: Dual ported memory - so there are no memory stalls
• Machine B: Single ported memory, but its pipelined implementation has a 1.05 times
faster clock rate
Assume:
• Ideal CPI = 1 for both
• Loads are 40% of instructions executed

25
Summary - Structural Hazards
• Speed Up <= Pipeline Depth; if ideal CPI is 1, then:
Pipeline Depth Clock Cycle Unpipelined
Speedup = X
1 + Pipeline stall CPI Clock Cycle Pipelined

• Hazards limit performance on computers:

– Structural: need more HW resources
– Data (RAW,WAR,WAW):
– Control

26
Data Hazards
• Data hazards occur when data is used
before it is stored
Time (in clock cycles)

Value of CC 1 CC 2 CC 3 CC 4 CC 5 CC 6 CC 7 CC 8 CC 9
register $2: 10 10 10 10 10/– 20 – 20 – 20 – 20 – 20
Program
execution
order
(in instructions)
sub $2, $1, $3 IM Reg DM Reg

and $12, $2, $5 IM Reg DM Reg

or $13, $6, $2 IM Reg DM Reg

add $14, $2, $2 IM Reg DM Reg

sw $15, 100($2) IM Reg DM Reg

The use of the result of the SUB instruction in the next three instructions causes a
data hazard, since the register is not written until after those instructions read it.
27
Data Hazards
Execution Order is: Read After Write (RAW)
InstrI
InstrJ InstrJ tries to read operand before InstrI writes it
I: add r1,r2,r3
J: sub r4,r1,r3

• Caused by a “Dependence” (in compiler nomenclature). This

hazard results from an actual need for communication.

28
Data Hazards
Execution Order is: Write After Read (WAR)
InstrI
InstrJ
InstrJ tries to write operand before InstrI reads i
– Gets wrong operand
I: sub r4,r1,r3
J: add r1,r2,r3
K: mul r6,r1,r7

– Called an “anti-dependence” by compiler writers.

This results from reuse of the name “r1”.

• Can’t happen in MIPS 5 stage pipeline because:

–All instructions take 5 stages, and
– Reads are always in stage 2, and
– Writes are always in stage 5
29
Data Hazards
Execution Order is: Write After Write (WAW)
InstrI
InstrJ InstrJ tries to write operand before InstrI writes it
– Leaves wrong result ( InstrI not InstrJ )

I: sub r1,r4,r3
J: add r1,r2,r3
K: mul r6,r1,r7
• Called an “output dependence” by compiler writers
This also results from the reuse of name “r1”.
• Can’t happen in MIPS 5 stage pipeline because:
–All instructions take 5 stages, and
– Writes are always in stage 5

•Will see WAR and WAW in later more

complicated pipes
30

Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
No ratings yet
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
64 pages
Chapter 17 - Pipelining Hazards
No ratings yet
Chapter 17 - Pipelining Hazards
33 pages
Lecture # Pipelining
No ratings yet
Lecture # Pipelining
36 pages
06 - CS F342 Pipelining (ForMIDSEM - Upto35slides)
No ratings yet
06 - CS F342 Pipelining (ForMIDSEM - Upto35slides)
69 pages
1.pipelining & ILP
No ratings yet
1.pipelining & ILP
38 pages
1.pipelining & ILP
No ratings yet
1.pipelining & ILP
37 pages
L04 Pipelining
No ratings yet
L04 Pipelining
38 pages
Question Ans CA
No ratings yet
Question Ans CA
28 pages
Unit 5.2 Processor
No ratings yet
Unit 5.2 Processor
40 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
60 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
61 pages
Lecture-5-09 01 2025
No ratings yet
Lecture-5-09 01 2025
25 pages
CH 6
No ratings yet
CH 6
29 pages
Chapter 10 Principles of Pipelining
No ratings yet
Chapter 10 Principles of Pipelining
124 pages
Pipelining 2019
No ratings yet
Pipelining 2019
82 pages
Chapter 04 Processor 2
No ratings yet
Chapter 04 Processor 2
28 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
39 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
37 pages
Pipelining
No ratings yet
Pipelining
43 pages
CS530 Fall2015 Lecture9
No ratings yet
CS530 Fall2015 Lecture9
5 pages
Pipelined Processor Design: Computer Architecture and Assembly Language
No ratings yet
Pipelined Processor Design: Computer Architecture and Assembly Language
22 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
77 pages
Presentation 1
No ratings yet
Presentation 1
22 pages
Computer Architecture and Organization
No ratings yet
Computer Architecture and Organization
49 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
94 pages
Pipelining - Modified1
No ratings yet
Pipelining - Modified1
51 pages
CAO Pipelining Lecture
No ratings yet
CAO Pipelining Lecture
50 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
214 pages
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
No ratings yet
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
58 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
37 pages
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
No ratings yet
Pipelining. Pipeline Hazards: Sabina Batyrkhanovna
19 pages
CA Slides#5 Pipeline Hazards
No ratings yet
CA Slides#5 Pipeline Hazards
33 pages
Pipeline Hazards. Presentation
100% (2)
Pipeline Hazards. Presentation
20 pages
Piplining
No ratings yet
Piplining
23 pages
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
No ratings yet
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
60 pages
Week 11 Reduced
No ratings yet
Week 11 Reduced
29 pages
Pipeline Hazards Selected
No ratings yet
Pipeline Hazards Selected
44 pages
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
No ratings yet
Computer Architecture: Pipelining: Dr. Ashok Kumar Turuk
136 pages
Pipeline
No ratings yet
Pipeline
39 pages
ILP - Appendix C PDF
No ratings yet
ILP - Appendix C PDF
52 pages
Embedded Firmware Design Approaches and Development Languages Class Notes
No ratings yet
Embedded Firmware Design Approaches and Development Languages Class Notes
10 pages
Pipelining Preview: Basics & Challenges
No ratings yet
Pipelining Preview: Basics & Challenges
75 pages
L14 MipsPipeline Ovw
No ratings yet
L14 MipsPipeline Ovw
17 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
94 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Week 4 - Pipelining
No ratings yet
Week 4 - Pipelining
44 pages
Mikrotik
67% (3)
Mikrotik
1,005 pages
CODch 6 Slides
No ratings yet
CODch 6 Slides
77 pages
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
No ratings yet
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
81 pages
ARM Notes For Students
100% (3)
ARM Notes For Students
24 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
85 pages
Pipe Lining
No ratings yet
Pipe Lining
66 pages
CA-unit 4-Material
No ratings yet
CA-unit 4-Material
31 pages
Helping Slides Pipelining Hazards Solutions
No ratings yet
Helping Slides Pipelining Hazards Solutions
55 pages
Embedded Systems Design: Pipelining and Instruction Scheduling
No ratings yet
Embedded Systems Design: Pipelining and Instruction Scheduling
48 pages
Instruction Pipelining: 1 Zelalem Birhanu, Aait
No ratings yet
Instruction Pipelining: 1 Zelalem Birhanu, Aait
20 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
ImagePRESS Server G100 Installation and Service Guide
No ratings yet
ImagePRESS Server G100 Installation and Service Guide
78 pages
ITN Module 9
No ratings yet
ITN Module 9
24 pages
STM 32 H 523 Ce
No ratings yet
STM 32 H 523 Ce
226 pages
Rswa Admin en A4
No ratings yet
Rswa Admin en A4
72 pages
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
No ratings yet
Lecture 13-14: Pipelines Hazards": Suggested Reading:" (HP Chapter 4.5-4.7) "
51 pages
Alibaba Cloud Cloud Monitor User Guide 20190903
No ratings yet
Alibaba Cloud Cloud Monitor User Guide 20190903
313 pages
DLS-3 v1-3 IM EN NA 29005694 R001
No ratings yet
DLS-3 v1-3 IM EN NA 29005694 R001
120 pages
HRY-312 Computer Organization Introduction To Pipelining
No ratings yet
HRY-312 Computer Organization Introduction To Pipelining
30 pages
527307-001C - Basler Camera Firmware Upgrade
No ratings yet
527307-001C - Basler Camera Firmware Upgrade
22 pages
HCIP-Routing & Switching-IENP V2.5 Lab Guide
No ratings yet
HCIP-Routing & Switching-IENP V2.5 Lab Guide
240 pages
Processor Organization & Instruction Cycle
No ratings yet
Processor Organization & Instruction Cycle
31 pages
Atollic TrueSTUDIO Installation Guide
No ratings yet
Atollic TrueSTUDIO Installation Guide
48 pages
JHS-770 Software Upgrade Procedure
100% (2)
JHS-770 Software Upgrade Procedure
19 pages
Get Printer Driver Directory
No ratings yet
Get Printer Driver Directory
2 pages
Readme First
No ratings yet
Readme First
3 pages
CM SP Erouter I08 120329
No ratings yet
CM SP Erouter I08 120329
70 pages
2014fa CS61C L31 DG PipelineII 6up
No ratings yet
2014fa CS61C L31 DG PipelineII 6up
4 pages
UFT Install Guide
No ratings yet
UFT Install Guide
56 pages
V73 Supported Platforms and Hardware
No ratings yet
V73 Supported Platforms and Hardware
13 pages
IEC 60870-5-103 Master Protocol Profile 2
No ratings yet
IEC 60870-5-103 Master Protocol Profile 2
22 pages
MikroTik hAP AC2 DataSheet
No ratings yet
MikroTik hAP AC2 DataSheet
2 pages
11-Neo Connect Plus User Manual
No ratings yet
11-Neo Connect Plus User Manual
26 pages
DCC Microproject
No ratings yet
DCC Microproject
12 pages
Important Question Bank BD
No ratings yet
Important Question Bank BD
3 pages
WhatsNew in Codesmart
No ratings yet
WhatsNew in Codesmart
4 pages
Differences Between Enterprise, Standard Edition 2 On Oracle 12.2
No ratings yet
Differences Between Enterprise, Standard Edition 2 On Oracle 12.2
9 pages
WP Hostbridge Soap and Rest 090303
No ratings yet
WP Hostbridge Soap and Rest 090303
12 pages
DTPS FAQ - Import A Robot Backup
No ratings yet
DTPS FAQ - Import A Robot Backup
4 pages
Step 2: Create An API Token: To Install The Latest Version of Using Snap On Ubuntu Or, Run
No ratings yet
Step 2: Create An API Token: To Install The Latest Version of Using Snap On Ubuntu Or, Run
2 pages
Windows Powershell Quick Reference Windows Powershell Quick Reference
No ratings yet
Windows Powershell Quick Reference Windows Powershell Quick Reference
2 pages
Epson LX-350 9-Pin Dot Matrix Printer Datasheet
No ratings yet
Epson LX-350 9-Pin Dot Matrix Printer Datasheet
2 pages
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
From Everand
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
Kerwin Mathew
No ratings yet
Electronics II Essentials
From Everand
Electronics II Essentials
The Editors of REA
No ratings yet

Lec 1

Uploaded by

Lec 1

Uploaded by

Pipeline: Hazards

Average instruction execution time

Speedup from pipeline

• The good news

lw $r0, 10($r1) IM REG ALU DM REG

sw $r3, 20($r4) IM REG ALU DM REG

add $r5, $r6, $r7 IM REG ALU DM REG

sub $r8, $r9, $r10 IM REG ALU DM REG

lw $r0, 10($r1) IM REG ALU DM REG

sw $r3, 20($r4) IM REG ALU DM REG

add $r5, $r6, $r7 IM REG ALU DM REG

sub $r8, $r9, $r10 IM REG ALU DM REG

CPI pipelined = Ideal CPI + Pipeline stall clock

CPIpipelined = Ideal CPI + Average Stall cycles per Inst

Ideal CPI × Pipeline depth Cycle Timeunpipelined

For simple RISC pipeline, the Ideal CPI on a pipelined

Pipeline depth Cycle Timeunpipelined

• Hazards limit performance on computers:

and $12, $2, $5 IM Reg DM Reg

or $13, $6, $2 IM Reg DM Reg

add $14, $2, $2 IM Reg DM Reg

sw $15, 100($2) IM Reg DM Reg

• Caused by a “Dependence” (in compiler nomenclature). This

– Called an “anti-dependence” by compiler writers.

• Can’t happen in MIPS 5 stage pipeline because:

•Will see WAR and WAW in later more

You might also like