Week 11

Computer Architecture

EE-371/CS-330
Spring 2019

Hasan Baig
[email protected]

Habib University

The contents of these lecture slides are prepared with the help of the official lecture slides of the book "Computer Organization and Design – RISC-V Edition" by Patterson and Hennessy.
2
Recap
3
Performance Issues

• Longest delay determines clock period
  – Critical path: load instruction
  – Instruction memory → register file → ALU → data memory → register file
• Not feasible to vary period for different instructions
• Violates design principle
  – Making the common case fast
• We will improve performance by pipelining
4
Pipelining Restaurant Analogy

A buffet/delivery restaurant in which processes execute one at a time.

Tasks:
1. Customer – grab food and dine in
2. Delivery guy – deliver food
3. Worker – purchase groceries (check groceries)

[Figure: restaurant floor plan — token counter (take/give order), cash counter, kitchen (grab food), dining hall, goodies collection, delivery address]
5
Pipelining Laundry Analogy

An implementation technique in which multiple instructions are overlapped in execution.

• Four loads:
  – Speedup = 8/3.5 = 2.3
• Non-stop:
  – Speedup = 2n/(0.5n + 1.5) ≈ 4 = number of stages
6
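The laundry arithmetic on this slide can be sketched directly: each load takes 2 hours unpipelined, while the pipelined line finishes one load every half hour after the pipeline fills. A minimal check of both speedup figures (the function names and the 2-hour/0.5-hour constants are taken from the slide's analogy, not from any standard library):

```python
# Laundry-analogy speedup from the slide: 2 hours per load unpipelined;
# pipelined, one load completes every 0.5 h once the pipeline is full.

def nonpipelined_time(n):
    return 2.0 * n            # n loads, strictly sequential

def pipelined_time(n):
    return 0.5 * n + 1.5      # first load takes 2 h, each later load adds 0.5 h

for n in (4, 1000):
    print(n, round(nonpipelined_time(n) / pipelined_time(n), 2))
```

For n = 4 this reproduces the slide's 8/3.5 ≈ 2.3; as n grows the ratio approaches 4, the number of stages.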
Pipelining
Five stages, one step per stage
1. IF: Instruction fetch from memory
2. ID: Instruction decode & register read
3. EX: Execute operation or calculate address
4. MEM: Access memory operand
5. WB: Write result back to register
7
Pipelining Example

• Assume time for stages is
  – 100 ps for register read or write
  – 200 ps for other stages
• Compare pipelined datapath with single-cycle datapath

Instr    | Instr fetch | Register read | ALU op | Memory access | Register write | Total time
ld       | 200 ps      | 100 ps        | 200 ps | 200 ps        | 100 ps         | 800 ps
sd       | 200 ps      | 100 ps        | 200 ps | 200 ps        |                | 700 ps
R-format | 200 ps      | 100 ps        | 200 ps |               | 100 ps         | 600 ps
beq      | 200 ps      | 100 ps        | 200 ps |               |                | 500 ps
8
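The per-instruction totals in the table above follow from summing only the stages each instruction actually uses. A small sketch of that bookkeeping (the stage names are labels chosen here for readability):

```python
# Per-stage delays from the slide: 100 ps for register read/write,
# 200 ps for the other stages.
STAGE_PS = {"IF": 200, "reg_read": 100, "ALU": 200, "MEM": 200, "reg_write": 100}

# Stages each instruction class actually uses.
STAGES_USED = {
    "ld":       ["IF", "reg_read", "ALU", "MEM", "reg_write"],
    "sd":       ["IF", "reg_read", "ALU", "MEM"],
    "R-format": ["IF", "reg_read", "ALU", "reg_write"],
    "beq":      ["IF", "reg_read", "ALU"],
}

totals = {i: sum(STAGE_PS[s] for s in used) for i, used in STAGES_USED.items()}
print(totals)  # {'ld': 800, 'sd': 700, 'R-format': 600, 'beq': 500}
```

The single-cycle clock must accommodate the slowest case (ld at 800 ps), which is exactly the critical-path argument from the recap slide.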
Pipelining Example

Instr    | Instr fetch | Register read | ALU op | Memory access | Register write | Total time
ld       | 200 ps      | 100 ps        | 200 ps | 200 ps        | 100 ps         | 800 ps
sd       | 200 ps      | 100 ps        | 200 ps | 200 ps        |                | 700 ps
R-format | 200 ps      | 100 ps        | 200 ps |               | 100 ps         | 600 ps
beq      | 200 ps      | 100 ps        | 200 ps |               |                | 500 ps

Single-cycle (Tc = 800 ps)
9
Pipelining Example

Instr    | Instr fetch | Register read | ALU op | Memory access | Register write | Total time
ld       | 200 ps      | 100 ps        | 200 ps | 200 ps        | 100 ps         | 800 ps
sd       | 200 ps      | 100 ps        | 200 ps | 200 ps        |                | 700 ps
R-format | 200 ps      | 100 ps        | 200 ps |               | 100 ps         | 600 ps
beq      | 200 ps      | 100 ps        | 200 ps |               |                | 500 ps

Pipelined (Tc = 200 ps)
10
Pipelining Speedup

• If all stages are balanced
  – i.e., all take the same time
  – Time between instructions (pipelined)
    = Time between instructions (nonpipelined) / Number of stages
• If not balanced, speedup is less
• Speedup due to increased throughput
  – Latency (time for each instruction) does not decrease
11
Pipelining Speedup

[Figure: timing diagrams for three instructions — nonpipelined total time T = 2400 ps, pipelined total time T = 1400 ps]
12
Pipelining Speedup

Instructions = 1,000,000

Pipelined: each instruction will add 200 ps
  Total time = 1,000,000 × 200 + 1,400 = 200,001,400 ps

Non-pipelined: each instruction will add 800 ps
  Total time = 1,000,000 × 800 + 2,400 = 800,002,400 ps
13
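The totals above extend the earlier 3-instruction example (1400 ps pipelined, 2400 ps nonpipelined) by a million instructions, each adding one more clock period. A quick sketch of that arithmetic and the resulting speedup:

```python
# Extending the 3-instruction example from the previous slide by
# 1,000,000 instructions: each extra instruction adds one clock period.
n = 1_000_000
pipelined = n * 200 + 1_400       # 200 ps per added instruction
nonpipelined = n * 800 + 2_400    # 800 ps per added instruction

print(pipelined, nonpipelined)    # 200001400 800002400
print(nonpipelined / pipelined)   # just under 4, the ideal 800/200 ratio
```

This shows why speedup approaches, but never quite reaches, the stage count: the pipeline fill time (the constant 1,400 ps) never amortizes away completely.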
Latency Exercise
14
Latency Solution

1. R-type: 30 + 250 + 150 + 25 + 200 + 25 + 20 = 700 ps
2. ld: 30 + 250 + 150 + 25 + 200 + 250 + 25 + 20 = 950 ps
3. sd: 30 + 250 + 150 + 200 + 25 + 250 = 905 ps
4. beq: 30 + 250 + 150 + 25 + 200 + 5 + 25 + 20 = 705 ps
5. I-type: 30 + 250 + 150 + 25 + 200 + 25 + 20 = 700 ps


15
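The sums in the solution above can be double-checked mechanically. The per-component delays below are copied from the solution lines themselves (the exercise's original component table is not reproduced on these slides, so the grouping into lists is only a transcription, not an independent datapath model):

```python
# Per-instruction component latencies, transcribed from the solution slide.
latencies = {
    "R-type": [30, 250, 150, 25, 200, 25, 20],
    "ld":     [30, 250, 150, 25, 200, 250, 25, 20],
    "sd":     [30, 250, 150, 200, 25, 250],
    "beq":    [30, 250, 150, 25, 200, 5, 25, 20],
    "I-type": [30, 250, 150, 25, 200, 25, 20],
}

totals = {instr: sum(parts) for instr, parts in latencies.items()}
print(totals)  # {'R-type': 700, 'ld': 950, 'sd': 905, 'beq': 705, 'I-type': 700}
```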
Recap Problems in single-cycle processor

• Longest delay determines clock period
  – Critical path: load instruction
  – Instruction memory → register file → ALU → data memory → register file
• Not feasible to vary period for different instructions
• Violates design principle
  – Making the common case fast
• We will improve performance by pipelining
16
Recap
17
Quick Review Stages in Processor
Five stages, one step per stage
1. IF: Instruction fetch from memory
2. ID: Instruction decode & register read
3. EX: Execute operation or calculate address
4. MEM: Access memory operand
5. WB: Write result back to register
18
Recap

• Data read in the second half of a clock cycle
• Data write in the first half of a clock cycle
19
Pipelining and ISA Design

• RISC-V ISA designed for pipelining
  – All instructions are 32 bits
    • Easier to fetch and decode in one cycle
    • c.f. x86: 1- to 17-byte instructions
  – Few and regular instruction formats
    • Can decode and read registers in one step
20
Pipelining Hazards

• Situations that prevent starting the next instruction in the next cycle → Hazards
• Structural hazard
  – A required resource is busy
• Data hazard
  – Need to wait for previous instruction to complete its data read/write
• Control hazard
  – Deciding on control action depends on previous instruction
21
Pipelining Structural Hazards

When a planned instruction cannot execute in the proper clock cycle because the hardware does not support the combination of instructions that are set to execute.

• Conflict for use of a resource
• In RISC-V pipeline with a single memory
  – Load/store requires data access
  – Instruction fetch would have to stall for that cycle
• Would cause a pipeline "bubble"
• Hence, pipelined datapaths require separate instruction/data memories
22
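The single-memory conflict can be made concrete with a toy model. This is only an illustrative sketch under an assumed timing convention (stage k of instruction i occupies cycle i + k, no stalls): a load's MEM access in some cycle collides with whichever instruction is in IF that same cycle.

```python
# Toy model of the single-memory structural hazard: in a 5-stage
# pipeline (IF=0, ID=1, EX=2, MEM=3, WB=4), instruction i is in
# stage k during cycle i + k. With one shared memory, a ld/sd in
# MEM collides with the instruction doing IF in the same cycle.

def mem_if_conflicts(instrs):
    """Return the cycles in which a load/store's MEM access would
    conflict with another instruction's fetch."""
    conflicts = []
    for i, op in enumerate(instrs):
        if op in ("ld", "sd"):
            mem_cycle = i + 3
            if mem_cycle < len(instrs):   # some instruction fetches that cycle
                conflicts.append(mem_cycle)
    return conflicts

print(mem_if_conflicts(["ld", "add", "add", "add", "sub"]))  # [3]
```

With separate instruction and data memories, as the slide concludes, the conflict list is empty by construction.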
Pipelining Data Hazards

Data hazards occur when the pipeline must be stalled because one step must wait for another to complete.

add x19, x0, x1
sub x2, x19, x3
23
Pipelining Data Hazards

• Use result when it is computed
  – Don't wait for it to be stored in a register
  – Requires extra connections in the datapath

Forwarding – Also called bypassing. A method of resolving a data hazard by retrieving the missing data element from internal buffers rather than waiting for it to arrive from programmer-visible registers or memory.
25
Pipelining Data Hazards

• Forwarding paths are valid only if the destination stage is later in time than the source stage
  – Source: output of the MEM stage in the first instruction
  – Destination: input to the EX stage
26
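The decision to forward can be written as a simple predicate. This sketch follows the standard Patterson & Hennessy pipeline-register naming (EX/MEM, ID/EX); the function name and its boolean-flag signature are conventions chosen here, not hardware described on the slide:

```python
# Classic EX-hazard forwarding test: forward the ALU result sitting in
# the EX/MEM pipeline register to the EX stage input when the previous
# instruction writes the register the current instruction is reading.

def forward_from_ex_mem(ex_mem_regwrite, ex_mem_rd, id_ex_rs):
    return (ex_mem_regwrite          # previous instruction writes a register
            and ex_mem_rd != 0       # x0 is hardwired to zero, never forwarded
            and ex_mem_rd == id_ex_rs)  # and it is the register we need

# add x19, x0, x1 followed by sub x2, x19, x3: x19 must be forwarded.
print(forward_from_ex_mem(True, 19, 19))  # True
```

Note the x0 guard: since x0 always reads as zero, forwarding a "result" destined for it would be wrong.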
Pipelining Data Hazards

• Load-use data hazards
• Can't always avoid stalls by forwarding
  – If value not computed when needed
  – Can't use forwarding backward in time!
27
Pipelining Data Hazards

Code scheduling to avoid stalls
• Reorder code to avoid use of a load result in the next instruction
• C code for a = b + e; c = b + f;
  (assume all variables are in memory, addressed at offsets from x31)

Original order (13 cycles):     Reordered (11 cycles):
ld x1, 0(x31)                   ld x1, 0(x31)
ld x2, 8(x31)                   ld x2, 8(x31)
(stall)                         ld x4, 16(x31)
add x3, x1, x2                  add x3, x1, x2
sd x3, 24(x31)                  sd x3, 24(x31)
ld x4, 16(x31)                  add x5, x1, x4
add x5, x1, x4                  sd x5, 32(x31)
(stall)
sd x5, 32(x31)
28
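The cycle counts above can be reproduced with a small stall counter. This sketch assumes the usual model for this 5-stage pipeline: n instructions take n + 4 cycles, plus one bubble whenever an instruction reads a register loaded by the immediately preceding ld (forwarding resolves every other dependence); the tuple encoding of the programs is an ad hoc choice for illustration:

```python
# Count load-use stalls: one bubble when an instruction reads a register
# that the immediately preceding ld is still fetching from memory.

def cycles(prog):
    """prog: list of (opcode, dest_reg, source_regs) tuples."""
    stalls = sum(
        1
        for prev, cur in zip(prog, prog[1:])
        if prev[0] == "ld" and prev[1] in cur[2]
    )
    return len(prog) + 4 + stalls   # 5-stage pipeline: n + 4 cycles, plus stalls

original = [
    ("ld", "x1", ()), ("ld", "x2", ()),
    ("add", "x3", ("x1", "x2")), ("sd", None, ("x3",)),
    ("ld", "x4", ()),
    ("add", "x5", ("x1", "x4")), ("sd", None, ("x5",)),
]
reordered = [
    ("ld", "x1", ()), ("ld", "x2", ()), ("ld", "x4", ()),
    ("add", "x3", ("x1", "x2")), ("sd", None, ("x3",)),
    ("add", "x5", ("x1", "x4")), ("sd", None, ("x5",)),
]
print(cycles(original), cycles(reordered))  # 13 11
```

Moving the third ld up eliminates both load-use pairs, which is exactly the two-cycle saving the slide reports.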
Pipelining Control Hazards

• Also called branch hazard
• Branch determines flow of control
  – Fetching next instruction depends on branch outcome
  – Pipeline can't always fetch correct instruction
    • Still working on ID stage of branch
• In RISC-V pipeline
  – Need to compare registers and compute target early in the pipeline
  – Add hardware to do it in ID stage
29
Pipelining Control Hazards

Stall on branch
• Wait until branch outcome is determined before fetching the next instruction
30
Pipelining Control Hazards

Branch prediction
• Longer pipelines can't readily determine branch outcome early
  – Stall penalty becomes unacceptable
• Predict outcome of branch
  – Only stall if prediction is wrong
• In RISC-V pipeline
  – Can predict branches not taken
  – Fetch instruction after branch, with no delay
31
Pipelining Control Hazards

Branch prediction
32
Pipelining Control Hazards

More-realistic branch prediction
• Static branch prediction
  – Based on typical branch behavior
  – Example: loop and if-statement branches
    • Predict backward branches taken
    • Predict forward branches not taken
• Dynamic branch prediction
  – Hardware measures actual branch behavior
    • e.g., record recent history of each branch
  – Assume future behavior will continue the trend
    • When wrong, stall while re-fetching, and update history
33
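The dynamic scheme above can be illustrated with the simplest history-based predictor, a 1-bit scheme that predicts each branch will repeat its last outcome. This is a minimal sketch, not the hardware on any particular slide:

```python
# 1-bit dynamic branch predictor: remember the branch's last outcome
# and predict it repeats; update the history after each actual outcome.

def run_1bit_predictor(outcomes, initial=False):
    """Return how many of the given branch outcomes were predicted correctly."""
    prediction, correct = initial, 0
    for taken in outcomes:
        correct += (prediction == taken)   # was the guess right?
        prediction = taken                 # record actual behavior as history
    return correct

# A loop branch taken 9 times, then falling through once:
outcomes = [True] * 9 + [False]
print(run_1bit_predictor(outcomes))  # 8 of 10 correct
```

The two misses (the first iteration and the loop exit) motivate the 2-bit saturating counters used in real predictors, which tolerate a single anomalous outcome without flipping the prediction.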
Pipelining Summary of Overview

• Pipelining improves performance by increasing instruction throughput
  – Executes multiple instructions in parallel
  – Each instruction has the same latency
• Subject to hazards
  – Structural, data, control
• Instruction set design affects complexity of pipeline implementation
34
Pipelining Activity

For each code sequence below, state whether it must stall, can avoid stalls using only forwarding, or can execute without stalling or forwarding.
