CS 162 Computer Architecture

Lecture 3: Pipelining Contd.

Instructor: L.N. Bhuyan

www.cs.ucr.edu/~bhuyan/cs162

1 1999 ©UCB
Single Cycle Datapath (From Ch 5)
[Figure: single-cycle MIPS datapath. PC, instruction memory (Imem), register file (Regs), sign extend, ALU, and data memory (Dmem). Instruction fields 25:21, 20:16, 15:11, and 15:0 feed the register file and sign extender. Control signals: PCSrc, RegDst, RegWrite, ALUSrc, ALUOp, MemRead, MemWrite, MemToReg.]

Required Changes to Datapath
° Introduce registers to separate the 5 stages by putting IF/ID, ID/EX, EX/MEM, and MEM/WB registers in the datapath.
° The next PC value is computed in the 3rd stage, but the next instruction must be fetched in the next cycle. Move the PCSrc mux to the 1st stage. The PC is incremented unless there is a new branch address.
° The branch address is computed in the 3rd stage. With pipelining, the PC value has changed by then, so the PC value must be carried along with the instruction. Width of IF/ID register = (IR) + (PC) = 64 bits.
Changes to Datapath Contd.
° For the lw instruction, the write register address is needed at stage 5, but by then the IR holds another instruction. So the destination field of the IR must be carried along as the instruction moves through the stages. See the connection in the figure.
Length of ID/EX register = (Reg1: 32) + (Reg2: 32) + (offset: 32) + (PC: 32) + (destination register: 5) = 133 bits
Assignment: What are the lengths of the EX/MEM and MEM/WB registers?
Pipelined Datapath (with Pipeline Regs) (6.2)
[Figure: pipelined datapath with five stages (Fetch, Decode, Execute, Memory, Write Back) separated by the pipeline registers IF/ID, ID/EX, EX/MEM, and MEM/WB.]
Pipeline register widths: IF/ID = 64 bits, ID/EX = 133 bits, EX/MEM = 102 bits, MEM/WB = 69 bits
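The pipeline register widths quoted on these slides (64, 133, 102, and 69 bits) can be reproduced by summing field widths. The sketch below is only illustrative: the IF/ID and ID/EX fields are the ones listed on the slides, while the EX/MEM and MEM/WB field breakdowns and all field names are assumptions chosen to match the totals shown in the figure.

```python
# Field widths in bits for each pipeline register.
# IF/ID and ID/EX fields follow the slides; the EX/MEM and MEM/WB
# breakdowns are assumptions that happen to match the figure's totals.
PIPELINE_REGISTER_FIELDS = {
    "IF/ID": {"instruction": 32, "pc": 32},
    "ID/EX": {"reg1": 32, "reg2": 32, "offset": 32, "pc": 32, "dest_reg": 5},
    "EX/MEM": {"branch_target": 32, "zero": 1, "alu_result": 32,
               "store_data": 32, "dest_reg": 5},
    "MEM/WB": {"mem_data": 32, "alu_result": 32, "dest_reg": 5},
}


def register_width(name):
    """Total width of one pipeline register (control bits not included)."""
    return sum(PIPELINE_REGISTER_FIELDS[name].values())


widths = {name: register_width(name) for name in PIPELINE_REGISTER_FIELDS}
for name, bits in widths.items():
    print(f"{name}: {bits} bits")
```

Control bits are left out here, so the totals cover the data fields only.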
Pipelined Control (6.3)
• Start with single-cycle controller
• Group control lines by pipeline stage needed
• Extend pipeline registers with control bits

[Figure: the Control unit decodes the instruction, and its EX, Mem, and WB control groups travel through the ID/EX, EX/MEM, and MEM/WB pipeline registers.]

Control lines by stage:
EX: RegDst, ALUOp, ALUSrc
Mem: Branch, MemRead, MemWrite
WB: MemToReg, RegWrite
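One way to picture "extend pipeline registers with control bits" is to follow a single instruction's control groups as each stage consumes its own group and passes the rest on. The sketch below is a rough model, not the textbook's implementation: the class names, the function names, and the lw signal values (taken from the usual MIPS control table) are assumptions for illustration.

```python
from typing import NamedTuple


class ExControl(NamedTuple):
    reg_dst: int
    alu_op: int
    alu_src: int


class MemControl(NamedTuple):
    branch: int
    mem_read: int
    mem_write: int


class WbControl(NamedTuple):
    mem_to_reg: int
    reg_write: int


def decode_lw():
    """Control groups for lw (values assumed from the usual MIPS table)."""
    return (ExControl(reg_dst=0, alu_op=0b00, alu_src=1),
            MemControl(branch=0, mem_read=1, mem_write=0),
            WbControl(mem_to_reg=1, reg_write=1))


def pipeline_register_contents(control):
    """Control groups held by each pipeline register for one instruction."""
    ex, mem, wb = control
    return {
        "ID/EX": (ex, mem, wb),   # all three groups leave the decode stage
        "EX/MEM": (mem, wb),      # EX group was used up in the EX stage
        "MEM/WB": (wb,),          # Mem group was used up in the MEM stage
    }


regs = pipeline_register_contents(decode_lw())
for name, groups in regs.items():
    print(name, [type(group).__name__ for group in groups])
```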
Pipelined Processor: Datapath + Control
• More work to correctly handle pipeline hazards
[Figure: pipelined datapath with control. The Control unit's WB, M, and EX fields are carried in the ID/EX, EX/MEM, and MEM/WB registers and drive PCSrc, RegWrite, Branch, MemWrite, MemRead, MemToReg, ALUSrc, ALUOp, and RegDst. Instruction [15-0] feeds the sign extender and ALU control; Instruction [20-16] and Instruction [15-11] feed the RegDst mux.]
Recap
° If all pipeline stages can be kept busy, up to one instruction can retire (complete) per clock cycle (thereby achieving single-cycle throughput)
° The pipeline paradox (for MIPS): any instruction still takes 5 cycles to execute (even though one instruction can retire per cycle)
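The gap between latency and throughput in the recap can be checked with a little arithmetic. The sketch below assumes a hazard-free instruction stream and, for comparison, an unpipelined machine that spends all 5 cycles on one instruction before starting the next; both function names are made up for this example.

```python
def pipelined_cycles(num_instructions, depth=5):
    """Cycles to finish a hazard-free stream on a `depth`-stage pipeline."""
    if num_instructions == 0:
        return 0
    # The first instruction needs `depth` cycles; each later one retires
    # one cycle after its predecessor.
    return depth + (num_instructions - 1)


def unpipelined_cycles(num_instructions, depth=5):
    """Cycles when each instruction runs start to finish on its own."""
    return depth * num_instructions


for n in (1, 5, 1000):
    fast = pipelined_cycles(n)
    slow = unpipelined_cycles(n)
    print(f"{n} instructions: {fast} vs {slow} cycles, speedup {slow / fast:.2f}")
```

For long streams the speedup approaches the pipeline depth, although each individual instruction still spends 5 cycles in flight.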
Problems for Pipelining
° Hazards prevent the next instruction from executing during its designated clock cycle, limiting speedup
• Structural hazards: HW cannot support this combination of instructions (e.g., a single memory for instructions and data)
• Data hazards: an instruction depends on the result of a prior instruction still in the pipeline
• Control hazards: conditional branches and other instructions may stall the pipeline, delaying later instructions
Single Memory is a Structural Hazard
[Figure: pipeline diagram with time (clock cycles) across and instruction order down: Load, Instr 1, Instr 2, Instr 3, Instr 4. Each instruction passes through M (memory), Reg, ALU, M (memory), Reg.]
• Can’t read same memory twice in same clock cycle
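The single-memory conflict can be located by listing which instructions touch memory in each cycle. This is a minimal sketch under stated assumptions: instruction i (counting from 0) is fetched in cycle i + 1, the MEM stage is the 4th stage, and only the load accesses data memory. The function name and the tuple layout are hypothetical.

```python
def memory_conflicts(program):
    """Return {cycle: [accesses]} for cycles with more than one memory access."""
    accesses = {}
    for index, (name, uses_data_memory) in enumerate(program):
        fetch_cycle = index + 1                  # IF stage of this instruction
        accesses.setdefault(fetch_cycle, []).append(f"{name} fetch")
        if uses_data_memory:
            mem_cycle = index + 4                # MEM is the 4th stage
            accesses.setdefault(mem_cycle, []).append(f"{name} data access")
    return {cycle: who for cycle, who in accesses.items() if len(who) > 1}


program = [
    ("Load", True),
    ("Instr 1", False),
    ("Instr 2", False),
    ("Instr 3", False),
    ("Instr 4", False),
]
conflicts = memory_conflicts(program)
print(conflicts)
```

With this sequence the only clash is in cycle 4, where the load's data access coincides with the fetch of Instr 3.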
EX: MIPS multicycle datapath: Structural Hazard in Memory
[Figure: MIPS multicycle datapath in which a single memory holds both instructions and data. Components: PC, Memory (instruction or data), Instruction Register, Memory Data Register, Registers, A and B registers, ALU, ALUOut.]
Structural Hazards limit performance
° Example: if there are 1.3 memory accesses per instruction (30% of instructions executed are loads and stores) and only one memory access per cycle, then
• Average CPI ≥ 1.3
• Otherwise the datapath resource is more than 100% utilized

Structural Hazard Solution: Add more Hardware
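The CPI bound in this example is just memory accesses per instruction divided by accesses available per cycle. A small sketch, with a hypothetical function name and parameters:

```python
def min_cpi(accesses_per_instruction, accesses_per_cycle=1.0, ideal_cpi=1.0):
    """Lower bound on CPI imposed by memory bandwidth."""
    return max(ideal_cpi, accesses_per_instruction / accesses_per_cycle)


load_store_fraction = 0.30
# One instruction fetch plus one data access for each load or store.
accesses = 1.0 + load_store_fraction

single_port_cpi = min_cpi(accesses, accesses_per_cycle=1)
dual_port_cpi = min_cpi(accesses, accesses_per_cycle=2)
print(f"single-ported memory: CPI >= {single_port_cpi:.2f}")
print(f"two accesses per cycle: CPI >= {dual_port_cpi:.2f}")
```

With a second port the memory is no longer the bottleneck, which is the "add more hardware" solution in numbers.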
Speed Up Equation for Pipelining

$$\mathrm{CPI}_{\text{pipelined}} = \text{Ideal CPI} + \text{Pipeline stall clock cycles per instruction}$$

$$\text{Speedup} = \frac{\text{Ideal CPI} \times \text{Pipeline depth}}{\text{Ideal CPI} + \text{Pipeline stall CPI}} \times \frac{\text{Clock Cycle}_{\text{unpipelined}}}{\text{Clock Cycle}_{\text{pipelined}}}$$

With Ideal CPI = 1:

$$\text{Speedup} = \frac{\text{Pipeline depth}}{1 + \text{Pipeline stall CPI}} \times \frac{\text{Clock Cycle}_{\text{unpipelined}}}{\text{Clock Cycle}_{\text{pipelined}}}$$
Example: Dual-port vs. Single-port
° Machine A: Dual-ported memory
° Machine B: Single-ported memory, but its pipelined implementation has a 1.05 times faster clock rate
° Ideal CPI = 1 for both
° Loads are 40% of instructions executed
SpeedUpA = Pipeline Depth/(1 + 0) x (clock_unpipe/clock_pipe)
         = Pipeline Depth
SpeedUpB = Pipeline Depth/(1 + 0.4 x 1) x (clock_unpipe/(clock_unpipe/1.05))
         = (Pipeline Depth/1.4) x 1.05
         = 0.75 x Pipeline Depth
SpeedUpA / SpeedUpB = Pipeline Depth/(0.75 x Pipeline Depth) = 1.33

° Machine A is 1.33 times faster
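The arithmetic of this example can be replayed in a few lines. The sketch assumes ideal CPI = 1 and normalises the clock ratio so that machine A has ratio 1, as the slide does; the pipeline depth of 5 is an assumption that cancels out of the final ratio.

```python
def pipeline_speedup(depth, stall_cpi, clock_ratio=1.0):
    """Speedup over the unpipelined machine, assuming ideal CPI = 1.

    clock_ratio is (unpipelined clock cycle) / (pipelined clock cycle).
    """
    return depth / (1.0 + stall_cpi) * clock_ratio


DEPTH = 5  # assumed; any positive depth gives the same A/B ratio

# Machine A: dual-ported memory, so no structural stalls.
speedup_a = pipeline_speedup(DEPTH, stall_cpi=0.0)
# Machine B: 40% loads, each costing 1 stall cycle, with a 1.05x faster clock.
speedup_b = pipeline_speedup(DEPTH, stall_cpi=0.4 * 1, clock_ratio=1.05)

ratio = speedup_a / speedup_b
print(f"SpeedUpA = {speedup_a:.2f}, SpeedUpB = {speedup_b:.2f}, A/B = {ratio:.2f}")
```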

Data hazard on $1: every instruction after the add reads $1
add $1, $2, $3
sub $4, $1, $3
and $6, $1, $7
or  $8, $1, $9
xor $10, $1, $11
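The dependences in this sequence can be found mechanically by matching each instruction's destination against the sources of later instructions. The sketch below handles only three-register ALU instructions in the form used on the slide; the function names are hypothetical, and the distance-based classification is an assumption based on the usual 5-stage pipeline with forwarding and a register file that writes before it reads within a cycle.

```python
import re


def parse(line):
    """Split 'op $d, $s, $t' into (op, dest, [sources])."""
    op, operands = line.split(None, 1)
    regs = re.findall(r"\$\d+", operands)
    return op, regs[0], regs[1:]


def raw_dependencies(program):
    """List (producer_index, consumer_index, register) read-after-write pairs."""
    parsed = [parse(line) for line in program]
    deps = []
    for i, (_, dest, _) in enumerate(parsed):
        for j in range(i + 1, len(parsed)):
            _, later_dest, sources = parsed[j]
            if dest in sources:
                deps.append((i, j, dest))
            if later_dest == dest:
                break  # the register is overwritten, later reads see the new value
    return deps


def resolution(distance):
    """How a dependence at this instruction distance is usually handled."""
    if distance == 1:
        return "forward from EX/MEM"
    if distance == 2:
        return "forward from MEM/WB"
    if distance == 3:
        return "register file (write then read in the same cycle)"
    return "no hazard"


program = [
    "add $1, $2, $3",
    "sub $4, $1, $3",
    "and $6, $1, $7",
    "or $8, $1, $9",
    "xor $10, $1, $11",
]
deps = raw_dependencies(program)
for producer, consumer, reg in deps:
    print(program[consumer], "needs", reg, "->", resolution(consumer - producer))
```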
Data Hazard Solution:
• “Forward” result from one stage to another
[Figure: pipeline diagram (stages IF, ID/RF, EX, MEM, WB; time in clock cycles across, instruction order down) for add $1,$2,$3; sub $4,$1,$3; and $6,$1,$7; or $8,$1,$9; xor $10,$1,$11, with the add result forwarded to the later instructions.]
• “or” is OK if the register file is implemented properly (write in the first half of the clock cycle, read in the second half)
Hazard Detection for Forwarding
° A hazard must be detected just before execution so that, in case of a hazard, the data can be forwarded to the input of the ALU.
° It can be detected when a source register (Rs or Rt or both) of the instruction in the EX stage is equal to the destination register (Rd) of an instruction further along the pipeline (in either the MEM or WB stage).
° Compare the Rs and Rt register numbers in the ID/EX register with Rd in the EX/MEM and MEM/WB registers => Rs, Rt, and Rd must all be carried from the IF/ID register to the ID/EX register (only Rd was carried before).
° If they match, forward the data to the input of the ALU through the multiplexor.

See Fig. 6.43, p. 488 of the text
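The comparisons described above can be sketched as a small forwarding unit. This is an illustrative model rather than the figure's exact logic: the class and constant names are made up, and two conditions the slide does not state are assumed from the usual textbook design (the earlier instruction must actually write a register, and register $0 is never forwarded).

```python
from dataclasses import dataclass


@dataclass
class IdEx:
    rs: int
    rt: int


@dataclass
class Later:
    """Forwarding-relevant contents of EX/MEM or MEM/WB."""
    rd: int
    reg_write: bool


NO_FORWARD, FROM_MEM_WB, FROM_EX_MEM = 0b00, 0b01, 0b10


def select(src, ex_mem, mem_wb):
    """Mux select for one ALU input, given its source register number."""
    # The most recent result (in EX/MEM) takes priority over the older one.
    if ex_mem.reg_write and ex_mem.rd != 0 and ex_mem.rd == src:
        return FROM_EX_MEM
    if mem_wb.reg_write and mem_wb.rd != 0 and mem_wb.rd == src:
        return FROM_MEM_WB
    return NO_FORWARD


def forwarding_unit(id_ex, ex_mem, mem_wb):
    """Return the (Rs input, Rt input) mux selects for the instruction in EX."""
    return select(id_ex.rs, ex_mem, mem_wb), select(id_ex.rt, ex_mem, mem_wb)


# sub $4,$1,$3 in EX while add $1,$2,$3 is in MEM.
sub_selects = forwarding_unit(IdEx(rs=1, rt=3), Later(1, True), Later(0, False))
# and $6,$1,$7 in EX while sub is in MEM and add is in WB.
and_selects = forwarding_unit(IdEx(rs=1, rt=7), Later(4, True), Later(1, True))
print(sub_selects, and_selects)
```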

[Figure: pipeline diagram (stages IF, ID/RF, EX, MEM, WB) for lw $1,0($2) followed by sub $4,$1,$3.]
• Can’t solve with forwarding alone
• Must stall instruction dependent on load
• “Load-Use” hazard
Data Hazard Even with Forwarding
• Must stall pipeline 1 cycle (insert 1 bubble)
[Figure: pipeline diagram (stages IF, ID/RF, EX, MEM, WB; time in clock cycles) for lw $1,0($2); sub $4,$1,$6; and $6,$1,$7; or $8,$1,$9, with one bubble delaying sub and each later instruction.]
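The load-use check amounts to asking whether the instruction being decoded reads the register that a load currently in EX will write. The sketch below assumes a 5-stage pipeline with full forwarding, so the only stalls are adjacent load-use pairs; the Instr class and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Instr:
    op: str
    dest: Optional[int]
    sources: Tuple[int, ...]


def must_stall(in_ex, in_id):
    """True when the instruction in ID reads a register a load in EX will write."""
    return in_ex.op == "lw" and in_ex.dest is not None and in_ex.dest in in_id.sources


def count_bubbles(program):
    """One bubble per adjacent load-use pair (full forwarding assumed)."""
    return sum(must_stall(prev, cur) for prev, cur in zip(program, program[1:]))


program = [
    Instr("lw", 1, (2,)),       # lw  $1, 0($2)
    Instr("sub", 4, (1, 6)),    # sub $4, $1, $6
    Instr("and", 6, (1, 7)),    # and $6, $1, $7
    Instr("or", 8, (1, 9)),     # or  $8, $1, $9
]
bubbles = count_bubbles(program)
total_cycles = 5 + (len(program) - 1) + bubbles
print(f"bubbles: {bubbles}, total cycles: {total_cycles}")
```

Only sub stalls: by the time and and or reach EX, the loaded value can be forwarded or read from the register file.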
Compiler Schemes to Improve Load Delay
° The compiler detects the data dependency and inserts nop instructions until the data is available
sub $2, $1, $3
nop
and $12, $2, $5
or $13, $6, $2
add $14, $2, $2
sw $15, 100($2)
° The compiler finds independent instructions to fill the delay slots
Software Scheduling to Avoid Load Hazards
Try producing fast code for
a = b + c;
d = e - f;
assuming a, b, c, d, e, and f are in memory.
Slow code:          Fast code:
LW  Rb,b            LW  Rb,b
LW  Rc,c            LW  Rc,c
ADD Ra,Rb,Rc        LW  Re,e
SW  a,Ra            ADD Ra,Rb,Rc
LW  Re,e            LW  Rf,f
LW  Rf,f            SW  a,Ra
SUB Rd,Re,Rf        SUB Rd,Re,Rf
SW  d,Rd            SW  d,Rd
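The benefit of the reordering can be counted directly. The sketch below encodes each instruction as a hypothetical (op, destination, sources) tuple and assumes full forwarding, so that the only stall is an instruction using a loaded register immediately after its LW.

```python
# Each instruction is (op, destination register, source registers).
SLOW = [
    ("LW", "Rb", ()),
    ("LW", "Rc", ()),
    ("ADD", "Ra", ("Rb", "Rc")),
    ("SW", None, ("Ra",)),
    ("LW", "Re", ()),
    ("LW", "Rf", ()),
    ("SUB", "Rd", ("Re", "Rf")),
    ("SW", None, ("Rd",)),
]

FAST = [
    ("LW", "Rb", ()),
    ("LW", "Rc", ()),
    ("LW", "Re", ()),
    ("ADD", "Ra", ("Rb", "Rc")),
    ("LW", "Rf", ()),
    ("SW", None, ("Ra",)),
    ("SUB", "Rd", ("Re", "Rf")),
    ("SW", None, ("Rd",)),
]


def load_use_stalls(code):
    """Count instructions that use a register loaded by the previous LW."""
    stalls = 0
    for (prev_op, prev_dest, _), (_, _, sources) in zip(code, code[1:]):
        if prev_op == "LW" and prev_dest in sources:
            stalls += 1
    return stalls


slow_stalls = load_use_stalls(SLOW)
fast_stalls = load_use_stalls(FAST)
print(f"slow code stalls: {slow_stalls}, fast code stalls: {fast_stalls}")
```

Both versions execute the same eight instructions; the fast version only moves independent loads and the store between each load and its first use.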
