ECE/CS 752: Advanced Computer Architecture I 1

This document discusses superscalar pipelining and techniques to improve processor performance beyond single instruction pipelining. It introduces the concept of superscalar machines that can dispatch and execute multiple instructions per cycle by exploiting instruction level parallelism. Several challenges of implementing superscalar designs are discussed, including limiting factors on instruction level parallelism and managing dependencies between instructions. Different classifications of instruction level parallelism machines such as superscalar, superpipelined, VLIW and hybrid approaches are also covered.

Uploaded by

Nusrat Mary Chowdhury

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views4 pages

ECE/CS 752: Advanced Computer Architecture I 1

Uploaded by

Nusrat Mary Chowdhury

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

ECE/CS 752: Advanced Computer Architecture I 1

Pipelining to Superscalar Pipelining to Superscalar

Prof.Mikko H.Lipasti
UniversityofWisconsinMadison
LecturenotesbasedonnotesbyJohnP.Shen
UpdatedbyMikko Lipasti
Pipelining to Superscalar Pipelining to Superscalar
Forecast
Limitsofpipelining
Thecaseforsuperscalar p
Instructionlevelparallelmachines
Superscalarpipelineorganization
Superscalarpipelinedesign
Limits of Pipelining Limits of Pipelining
IBMRISCExperience
Controlanddatadependencesadd15%
BestcaseCPIof1.15,IPCof0.87
D i li (hi h f ) if Deeperpipelines(higherfrequency)magnify
dependencepenalties
Thisanalysisassumes100%cachehitrates
Hitratesapproach100%forsomeprograms
Manyimportantprogramshavemuchworsehit
rates
Later!
Processor Performance Processor Performance
Processor Performance = ---------------
Time
Program
Instructions Cycles
I i
Time
=
X X
Inthe1980s(decadeofpipelining):
CPI:5.0=>1.15
Inthe1990s(decadeofsuperscalar):
CPI:1.15=>0.5(bestcase)
Inthe2000s(decadeofmulticore):
MarginalCPIimprovement
Program Instruction Cycle
(code size)
X X
(CPI) (cycle time)
Amdahls Law Amdahls Law
No. of
Processors
N
1
h 1- h
1- f
f
h=fractionoftimeinserialcode
f=fractionthatisvectorizable
v=speedupforf
Overallspeedup:
Time
1 1 f
v
f
f
Speedup

1
1
Revisit Amdahls Law Revisit Amdahls Law
Sequentialbottleneck
Evenifvisinfinite
Performancelimitedbynonvectorizable
f
v
f
f
v

1
1
1
1
lim
y
portion(1f)
No. of
Processors
N
Time
1
h 1- h
1- f
f
ECE/CS 752: Advanced Computer Architecture I 2
Pipelined Performance Model Pipelined Performance Model
Pipeline
Depth
N
1
g=fractionoftimepipelineisfilled
1g=fractionoftimepipelineisnotfilled
(stalled)
1-g g
1
Pipeline
Depth
N
1
Pipelined Performance Model Pipelined Performance Model
g=fractionoftimepipelineisfilled
1g=fractionoftimepipelineisnotfilled
(stalled)
1-g g
1
Pipelined Performance Model Pipelined Performance Model
Pipeline
Depth
N
1
TyrannyofAmdahlsLaw[BobColwell]
Whengisevenslightlybelow100%,abig
performancehitwillresult
Stalledcyclesarethekeyadversaryandmustbe
minimizedasmuchaspossible
1-g g
1
Motivation for Superscalar Motivation for Superscalar
[Agerwala and Cocke] [Agerwala and Cocke]
5
6
7
8

p
n=12
n=100
Speedupjumpsfrom3to4.3
forN=6,f=0.8,buts=2instead
ofs=1(scalar)
0 0.2 0.4 0.6 0.8 1
0
1
2
3
4
5
Vectorizability f
S
p
e
e
d
u
p

p
n=4
n=6
n=6,s=2
Typical Range
Superscalar Proposal Superscalar Proposal
ModeratetyrannyofAmdahlsLaw
Easesequentialbottleneck
Moregenerallyapplicable g y pp
Robust(lesssensitivetof)
RevisedAmdahlsLaw:

v
f
s
f
Speedup

1
1
Limits on Instruction Level Limits on Instruction Level
Parallelism (ILP) Parallelism (ILP)
WeissandSmith[1984] 1.58
Sohi andVajapeyam[1987] 1.81
TjadenandFlynn[1970] 1.86(Flynns bottleneck)
TjadenandFlynn[1973] 1.96
Uht[1986] 2.00
Smithet al. [1989] 2.00 Smithet al. [1989] 2.00
J ouppi andWall [1988] 2.40
J ohnson[1991] 2.50
Acostaet al. [1986] 2.79
Wedig[1982] 3.00
Butler et al. [1991] 5.8
MelvinandPatt [1991] 6
Wall [1991] 7(J ouppi disagreed)
Kuck et al. [1972] 8
RisemanandFoster [1972] 51(nocontrol dependences)
NicolauandFisher [1984] 90(Fishers optimism)
ECE/CS 752: Advanced Computer Architecture I 3
Superscalar Proposal Superscalar Proposal
Gobeyondsingleinstructionpipeline,
achieveIPC>1
Dispatchmultipleinstructionspercycle
id ll li bl f f Providemoregenerallyapplicableformof
concurrency(notjustvectors)
Gearedforsequentialcodethatishardto
parallelizeotherwise
Exploitfinegrainedorinstructionlevel
parallelism(ILP)
Classifying ILP Machines Classifying ILP Machines
[Jouppi,DECWRL1991]
BaselinescalarRISC
Issueparallelism=IP=1
Operationlatency=OP=1
PeakIPC=1
1
2
3
4
5
6
IF DE EX WB
1 2 3 4 5 6 7 8 9 0
TIME IN CYCLES (OF BASELINE MACHINE)
S
U
C
C
E
S
S
I
V
E
I
N
S
T
R
U
C
T
I
O
N
S
Classifying ILP Machines Classifying ILP Machines
[Jouppi,DECWRL1991]
Superpipelined:cycletime=1/mofbaseline
Issueparallelism=IP=1inst/minorcycle
Operationlatency=OP=mminorcycles
P k IPC i t / j l ( d ?) PeakIPC=minstr/majorcycle(mxspeedup?)
1
2
3
4
5
IF DE EX WB
6
1 2
3 4 5 6
Classifying ILP Machines Classifying ILP Machines
[Jouppi,DECWRL1991]
Superscalar:
Issueparallelism=IP=ninst/cycle
Operationlatency=OP=1cycle
PeakIPC=ninstr/cycle(nxspeedup?) / y ( p p )
IF DE EX WB
1
2
3
4
5
6
9
7
8
Classifying ILP Machines Classifying ILP Machines
[Jouppi,DECWRL1991]
VLIW:VeryLongInstructionWord
Issueparallelism=IP=ninst/cycle
Operationlatency=OP=1cycle
PeakIPC=ninstr/cycle=1VLIW/cycle / y / y
IF DE
EX
WB
Classifying ILP Machines Classifying ILP Machines
[Jouppi,DECWRL1991]
SuperpipelinedSuperscalar
Issueparallelism=IP=ninst/minorcycle
Operationlatency=OP=mminorcycles
PeakIPC=nxminstr/majorcycle / j y
IF DE EX WB
1
2
3
4
5
6
9
7
8
ECE/CS 752: Advanced Computer Architecture I 4
Superscalar vs. Superpipelined Superscalar vs. Superpipelined
Roughlyequivalentperformance
Ifn=mthenbothhaveaboutthesameIPC
Parallelismexposedinspacevs.time
Timein Cycles (of BaseMachine)
0 1 2 3 4 5 6 7 8 9
SUPERPIPELINED
10 11 12 13
SUPERSCALAR
Key:
IFetch
Dcode
Execute
Writeback
Superpipelining Superpipelining: Result Latency : Result Latency
Superpipelining - J ouppi, 1989
essentially describes apipelined execution stage
J ouppi s basemachine J ouppi s basemachine
Underpipelined machine
Superpipelined machine
Underpipelined machines cannot
issue instructions as fast as they are
executed
Note - key charact eristic of Superpipe lined
machines is that results are not available
to M-1 suc cess ive instructions
Superscalar Challenges Superscalar Challenges
I-cache
FETCH
DECODE
Branch
Predictor
Instruction
Buffer
Instruction
Flow
DECODE
COMMIT
D-cache Store
Queue
Reorder
Buffer
Integer Floating-point Media Memory
Register
Data
Memory
Data
EXECUTE
(ROB)
Flow
Flow

T o o Index Volume-1 PDF
50% (4)
T o o Index Volume-1 PDF
281 pages
Module 2 ACA Notes
100% (1)
Module 2 ACA Notes
31 pages
Unit 2 - Advanced Computer Architecture - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Advanced Computer Architecture - WWW - Rgpvnotes.in
59 pages
Chapter 4 Processors and Memory Hierarchy: Module-2
No ratings yet
Chapter 4 Processors and Memory Hierarchy: Module-2
31 pages
@vtucode - in 21CS643 Module 2 2021 Scheme
No ratings yet
@vtucode - in 21CS643 Module 2 2021 Scheme
49 pages
2 3 4 5 Merged Merged
No ratings yet
2 3 4 5 Merged Merged
164 pages
2 3 4 Merged
No ratings yet
2 3 4 Merged
134 pages
Pipelining - Lec 2-3-4
No ratings yet
Pipelining - Lec 2-3-4
72 pages
02b ILP Superscalar VLIW
No ratings yet
02b ILP Superscalar VLIW
20 pages
Lecture13 Pipeline2
No ratings yet
Lecture13 Pipeline2
51 pages
ACA Module2 2018.PDF Extra
No ratings yet
ACA Module2 2018.PDF Extra
48 pages
7TH - Unit 2-21ec74h6 - Ca
No ratings yet
7TH - Unit 2-21ec74h6 - Ca
95 pages
L03 Pipelining
No ratings yet
L03 Pipelining
45 pages
Processors
100% (4)
Processors
44 pages
Chapter 04 Processors and Memory Hierarchy
75% (8)
Chapter 04 Processors and Memory Hierarchy
50 pages
Superscalar Vs VLIW
No ratings yet
Superscalar Vs VLIW
30 pages
740 Fall10 Lecture4 Afterlecture Pipelining
No ratings yet
740 Fall10 Lecture4 Afterlecture Pipelining
24 pages
03a ILP Superscalar VLIW
No ratings yet
03a ILP Superscalar VLIW
21 pages
ITEC582-Chapter 16m
No ratings yet
ITEC582-Chapter 16m
55 pages
Lec9 Multiple Issue Processors
No ratings yet
Lec9 Multiple Issue Processors
33 pages
15CS72 ACA Module2Final
No ratings yet
15CS72 ACA Module2Final
29 pages
Arch3 Pipelining Afterlecture
No ratings yet
Arch3 Pipelining Afterlecture
180 pages
Lec 14
No ratings yet
Lec 14
36 pages
Comparch PDF
No ratings yet
Comparch PDF
84 pages
Chapter 04 Processors and Memory Hierarchy PDF
No ratings yet
Chapter 04 Processors and Memory Hierarchy PDF
50 pages
Stud CSA Processors Mod2 Part1
No ratings yet
Stud CSA Processors Mod2 Part1
64 pages
Aca Notes
No ratings yet
Aca Notes
23 pages
Chapter 4 (Processors and Memory Hierarchy)
100% (1)
Chapter 4 (Processors and Memory Hierarchy)
17 pages
Subb Arao
No ratings yet
Subb Arao
191 pages
Design Proposal of An Automatic Smart MultiInsect Mosquito Killing System IEEE
No ratings yet
Design Proposal of An Automatic Smart MultiInsect Mosquito Killing System IEEE
6 pages
Coa Unit 5
No ratings yet
Coa Unit 5
20 pages
Instruction Pipelining and SuperScalar Development - 2019
No ratings yet
Instruction Pipelining and SuperScalar Development - 2019
53 pages
Advanced Computer Architecture
No ratings yet
Advanced Computer Architecture
60 pages
Aca Notes
No ratings yet
Aca Notes
19 pages
Advanced Processor Superscalarclass
50% (2)
Advanced Processor Superscalarclass
73 pages
Subba Thesis
No ratings yet
Subba Thesis
182 pages
Csa Mod 2
100% (1)
Csa Mod 2
28 pages
Advanced Unix Programming
From Everand
Advanced Unix Programming
Prof. N. B Venkateswarlu
No ratings yet
Mod5 1
No ratings yet
Mod5 1
18 pages
Byou Dissertation
No ratings yet
Byou Dissertation
177 pages
WRL Research Report 89/7: Available Instruction-Level Parallelism For Superscalar and Superpipelined Machines
No ratings yet
WRL Research Report 89/7: Available Instruction-Level Parallelism For Superscalar and Superpipelined Machines
35 pages
ACA Mod2
No ratings yet
ACA Mod2
45 pages
08 Architecture
No ratings yet
08 Architecture
51 pages
20 Advanced Processor Designs
No ratings yet
20 Advanced Processor Designs
28 pages
DSP q1
No ratings yet
DSP q1
7 pages
CSO Computer Programming
No ratings yet
CSO Computer Programming
73 pages
Batch 2 ICS 2101 AND BIT 2102 (1) - 1
No ratings yet
Batch 2 ICS 2101 AND BIT 2102 (1) - 1
17 pages
A Superpipeline Approach To The MIPS Arch
No ratings yet
A Superpipeline Approach To The MIPS Arch
5 pages
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
From Everand
PLC: Programmable Logic Controller – Arktika.: EXPERIMENTAL PRODUCT BASED ON CPLD.
Franco Mario
No ratings yet
Success Starts With IELTS: British Council 2012
No ratings yet
Success Starts With IELTS: British Council 2012
33 pages
Design and Analysis of A 32-Bit Pipelined Mips Risc Processor
No ratings yet
Design and Analysis of A 32-Bit Pipelined Mips Risc Processor
18 pages
Superscalar and VLIW Architectures
No ratings yet
Superscalar and VLIW Architectures
35 pages
Law485 Director
No ratings yet
Law485 Director
67 pages
MGP 2025 Test Code 813215 Sol Eng
No ratings yet
MGP 2025 Test Code 813215 Sol Eng
12 pages
Ieltsreadingpreparationtips 121229130101 Phpapp02
No ratings yet
Ieltsreadingpreparationtips 121229130101 Phpapp02
29 pages
Lecture 2
No ratings yet
Lecture 2
17 pages
Soumya Pandey - Freelance Agreement PDF
No ratings yet
Soumya Pandey - Freelance Agreement PDF
4 pages
Types of Processor
No ratings yet
Types of Processor
6 pages
Advanced Computer Architectures: 17CS72 (As Per CBCS Scheme)
No ratings yet
Advanced Computer Architectures: 17CS72 (As Per CBCS Scheme)
32 pages
Computer Organization and Architecture: Instruction-Level Parallelism and Superscalar Processors
No ratings yet
Computer Organization and Architecture: Instruction-Level Parallelism and Superscalar Processors
43 pages
Advanced Computer Architecture Prof Thriveni T K
No ratings yet
Advanced Computer Architecture Prof Thriveni T K
59 pages
Rock Cycle - Metamorphic Rocks
No ratings yet
Rock Cycle - Metamorphic Rocks
33 pages
IELTS in The US and Beyond: A Truly Global Experience: NAFSA Region XII 2012
No ratings yet
IELTS in The US and Beyond: A Truly Global Experience: NAFSA Region XII 2012
30 pages
IELTS in The US and Beyond: A Truly Global Experience: NAFSA Region XII 2012
No ratings yet
IELTS in The US and Beyond: A Truly Global Experience: NAFSA Region XII 2012
30 pages
Advanced Computer Architecture: BY Dr. Radwa M. Tawfeek
No ratings yet
Advanced Computer Architecture: BY Dr. Radwa M. Tawfeek
36 pages
Superscalar - Superpipeline - Processor
No ratings yet
Superscalar - Superpipeline - Processor
10 pages
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
No ratings yet
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
29 pages
The Microarchitecture of Superscalar Processors: Paper
No ratings yet
The Microarchitecture of Superscalar Processors: Paper
16 pages
Stack Computers: The New Wave
From Everand
Stack Computers: The New Wave
Philip Koopman
No ratings yet
Gateway
No ratings yet
Gateway
80 pages
Gateway
No ratings yet
Gateway
80 pages
Study Guide 300-615 Dcit Troubleshooting Cisco Data Centre Infrastructure
From Everand
Study Guide 300-615 Dcit Troubleshooting Cisco Data Centre Infrastructure
Anand Vemula
No ratings yet
Parallelism in Uniprocessor System and Granularity
100% (5)
Parallelism in Uniprocessor System and Granularity
5 pages
霍尼韦尔 Lks310 燃烧器控制器说明书
No ratings yet
霍尼韦尔 Lks310 燃烧器控制器说明书
8 pages
Unit I Instruction Level Parallelism Two Mark Questions: Dept of Cse G.SURESH. M.Tech, Asst Prof / CSE
No ratings yet
Unit I Instruction Level Parallelism Two Mark Questions: Dept of Cse G.SURESH. M.Tech, Asst Prof / CSE
12 pages
Superscaling in Computer Architecture
No ratings yet
Superscaling in Computer Architecture
9 pages
Name: Rafi Dar: Very Large Instruction Word
No ratings yet
Name: Rafi Dar: Very Large Instruction Word
18 pages
Computer Organization and Architecture What Does Superscalar Mean?
No ratings yet
Computer Organization and Architecture What Does Superscalar Mean?
14 pages
Riphah International University: Student Information System
No ratings yet
Riphah International University: Student Information System
3 pages
K.1.1 Sisters and Brothers (Social Studies)
No ratings yet
K.1.1 Sisters and Brothers (Social Studies)
10 pages
Studies Abroad Counselors
No ratings yet
Studies Abroad Counselors
38 pages
Monal
100% (1)
Monal
4 pages
Cs 201 Long Quiz 2
No ratings yet
Cs 201 Long Quiz 2
3 pages
Analysis of The Task Superscalar Architecture Hardware Design
No ratings yet
Analysis of The Task Superscalar Architecture Hardware Design
10 pages
Open The Dor
No ratings yet
Open The Dor
9 pages
Natwar Lal Joshi - Resume 2023
No ratings yet
Natwar Lal Joshi - Resume 2023
1 page
Jigs and Fixtures
No ratings yet
Jigs and Fixtures
5 pages
AEDT Icepak Intro 2019R1 L3 Flow and Thermal Boundary Conditions
No ratings yet
AEDT Icepak Intro 2019R1 L3 Flow and Thermal Boundary Conditions
20 pages
Publications Requirements 1.4
No ratings yet
Publications Requirements 1.4
11 pages
Company Law Sujith
No ratings yet
Company Law Sujith
8 pages
A New Current-Source Converter Using A Symmetric Gate-Commutated Thyristor (SGCT)
No ratings yet
A New Current-Source Converter Using A Symmetric Gate-Commutated Thyristor (SGCT)
8 pages
8.1.2.7 Lab - Using The Windows Calculator With Network Addresses
No ratings yet
8.1.2.7 Lab - Using The Windows Calculator With Network Addresses
7 pages
Multiple Issue
No ratings yet
Multiple Issue
10 pages
Superscalar Processor
No ratings yet
Superscalar Processor
4 pages
REPSE Requirements
No ratings yet
REPSE Requirements
6 pages
G9 DLL Q1 Week4
No ratings yet
G9 DLL Q1 Week4
3 pages
WWW Study-India Co in
No ratings yet
WWW Study-India Co in
16 pages
Module 1.session 3.ISCM.2021
No ratings yet
Module 1.session 3.ISCM.2021
18 pages
Chapter 17 - Answer PDF
No ratings yet
Chapter 17 - Answer PDF
5 pages
Macronix MX25L12855FXCI 10G Datasheet
No ratings yet
Macronix MX25L12855FXCI 10G Datasheet
15 pages
Rea P6 Extra Practice 1
No ratings yet
Rea P6 Extra Practice 1
16 pages
7 - SCR
No ratings yet
7 - SCR
11 pages
The Ielts Exam - : Reading
No ratings yet
The Ielts Exam - : Reading
11 pages
TOPIC 7 Unemployment
No ratings yet
TOPIC 7 Unemployment
13 pages
The Ielts Exam - Listening
No ratings yet
The Ielts Exam - Listening
9 pages
Application of High Power Thyristors in HVDC and FACTS Systems
No ratings yet
Application of High Power Thyristors in HVDC and FACTS Systems
8 pages
EE Lab 10
No ratings yet
EE Lab 10
7 pages
700-V Asymmetrical 4H-Sic Gate Turn-Off Thyristors (Gto'S)
No ratings yet
700-V Asymmetrical 4H-Sic Gate Turn-Off Thyristors (Gto'S)
3 pages
Brake Drum
No ratings yet
Brake Drum
4 pages
Air Conditioning
No ratings yet
Air Conditioning
4 pages
7.2.5.3 Packet Tracer - Configuring IPv6 Addressing Instructions PDF
No ratings yet
7.2.5.3 Packet Tracer - Configuring IPv6 Addressing Instructions PDF
3 pages
Lab Assignment 2
No ratings yet
Lab Assignment 2
7 pages
DLL - FP Wk8 Day 1
No ratings yet
DLL - FP Wk8 Day 1
5 pages
223 Dak 17 DRG Cul Misc GW Typ 01
No ratings yet
223 Dak 17 DRG Cul Misc GW Typ 01
2 pages
Translation Certification: Form H-1
No ratings yet
Translation Certification: Form H-1
2 pages

ECE/CS 752: Advanced Computer Architecture I 1

Uploaded by

ECE/CS 752: Advanced Computer Architecture I 1

Uploaded by

ECE/CS 752: Advanced Computer Architecture I 1

Pipelining to Superscalar Pipelining to Superscalar

You might also like