
Computer Architecture

Lecture 13
Instruction-level Parallelism
And Superscalar Processors
Instructor: Sultana Jahan Soheli
Assistant Professor, ICE, NSTU
Reference Books
• Computer Organization and Architecture:
Designing for Performance, by William Stallings
(8th Edition)
– Any later edition is fine
Superscalar Architecture
• A processor architecture in which common instructions—integer and
floating-point arithmetic, loads, stores, and conditional
branches—can be initiated simultaneously and executed independently
• Refers to a machine that is designed to improve the performance of
the execution of scalar instructions
– In most applications, the bulk of the operations are on scalar
quantities
Superscalar Architecture
• Deals with the ability to execute instructions independently and
concurrently in different pipelines
Superpipelined Architecture
• Superpipelining exploits the fact that many pipeline stages perform
tasks that require less than half a clock cycle
• Thus, a doubled internal clock speed allows the performance of two
tasks in one external clock cycle
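The timing contrast between these organizations can be illustrated with a small sketch. This is a simplified cycle-counting model, not a real pipeline: it assumes an idealized k-stage pipeline with no hazards or stalls, and the parameters n (instruction count), k (stage count), and m (degree of parallelism) are illustrative.

```python
import math

def base_cycles(n, k):
    """Simple k-stage pipeline: one instruction enters per cycle,
    so the first finishes after k cycles and each later one adds 1."""
    return k + (n - 1)

def superscalar_cycles(n, k, m):
    """Degree-m superscalar: up to m instructions enter per cycle,
    so the drain term counts groups of m instead of single instructions."""
    return k + math.ceil(n / m) - 1

def superpipelined_cycles(n, k, m):
    """Degree-m superpipelining: each stage is subdivided m ways, so a
    new instruction enters every 1/m external cycle; counted here in
    external clock cycles."""
    return k + (n - 1) / m
```

For example, with a 4-stage pipeline and 6 instructions, the model gives 9 external cycles for the base pipeline, 6 for a degree-2 superscalar machine, and 6.5 for a degree-2 superpipelined machine, showing why both approaches outperform the base design and why their totals are close to each other.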
Superscalar vs. Superpipelined
Limitations
• The superscalar approach depends on the ability to execute multiple
instructions in parallel
• The term instruction-level parallelism refers to the degree to which, on
average, the instructions of a program can be executed in parallel
• A combination of compiler-based optimization and hardware
techniques can be used to maximize instruction-level parallelism
• Obstacles for achieving parallelism:
– True data dependency
– Procedural dependency
– Resource conflicts
– Output dependency
– Anti-dependency
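The first, fourth, and fifth obstacles in the list above are register data dependencies, and they can be classified mechanically from each instruction's destination and source registers. The sketch below is a minimal illustration under an assumed encoding of an instruction as a (destination, sources) pair; procedural dependencies and resource conflicts involve control flow and hardware resources, so they are not modeled here.

```python
def dependency(first, second):
    """Classify how `second` depends on `first`, where `first` comes
    earlier in program order. Each instruction is a tuple
    (dest_register, set_of_source_registers)."""
    d1, s1 = first
    d2, s2 = second
    deps = []
    if d1 in s2:
        # second reads a value that first writes
        deps.append("true (read-after-write)")
    if d1 == d2:
        # both write the same register; writes must not be reordered
        deps.append("output (write-after-write)")
    if d2 in s1:
        # second overwrites a register that first still needs to read
        deps.append("anti (write-after-read)")
    return deps or ["none"]
```

For instance, `r4 := r3 + r5` has a true dependency on a preceding `r3 := r1 + r2`, while two instructions writing `r3` have an output dependency even if neither reads the other's result.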
Effect of Dependencies
Design Issues
• Instruction-Level Parallelism and Machine Parallelism
– Instruction-level parallelism exists when instructions in a
sequence are independent and thus can be executed in
parallel by overlapping
– The degree of instruction-level parallelism is determined
by the frequency of true data dependencies and
procedural dependencies in the code
– These factors, in turn, are dependent on the instruction set
architecture and on the application
– Also determined by operation latency: the time until the
result of an instruction is available for use as an operand in
a subsequent instruction
Design Issues
• Machine parallelism is a measure of the ability
of the processor to take advantage of
instruction-level parallelism
– Determined by the number of instructions that
can be fetched and executed at the same time
(the number of parallel pipelines) and
– by the speed and sophistication of the
mechanisms that the processor uses to find
independent instructions
Design Issues
• Instruction Issue Policy: The processor must also
be able to identify instruction-level parallelism
and coordinate the fetching, decoding, and
execution of instructions in parallel
• Instruction issue refers to the process of initiating
instruction execution in the processor’s
functional units
• Instruction issue policy refers to the protocol
used to issue instructions
Design Issues
• Superscalar instruction issue policies fall into
the following categories:
– In-order issue with in-order completion
– In-order issue with out-of-order completion
– Out-of-order issue with out-of-order completion
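The effect of the completion policy can be seen in a toy single-issue model, sketched below. It assumes one instruction issues per cycle and that in-order completion forces results to retire in program order; the latency values and the one-issue-per-cycle restriction are simplifying assumptions for illustration, not a full superscalar model.

```python
def completion_times(latencies, in_order_completion):
    """Toy model: instruction i issues at cycle i (0-based) and, absent
    other constraints, finishes at issue + latency. With in-order
    completion, an instruction may not retire before its predecessor."""
    finishes = []
    for i, lat in enumerate(latencies):
        finish = i + lat
        if in_order_completion and finishes:
            # results must retire in program order, one per cycle at most
            finish = max(finish, finishes[-1] + 1)
        finishes.append(finish)
    return finishes
```

With latencies [3, 1, 1], out-of-order completion lets the two short instructions finish at cycles 2 and 3 while the long one is still in flight, whereas in-order completion delays them to cycles 4 and 5 behind the long instruction.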
Design Issues
• Register Renaming: When out-of-order
techniques are used, the values in registers
cannot be fully known at each point in time
just from a consideration of the sequence of
instructions dictated by the program
• Solution: Duplication of resources
– Also referred to as register renaming
• When a new register value is created, a new
register is allocated for that value
Design Issues
• Subsequent instructions that access that value
as a source operand in that register must go
through a renaming process:
– The register references in those instructions must
be revised to refer to the register containing the
needed value
– The same original register reference in several
different instructions may refer to different actual
registers, if different values are intended
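This renaming process amounts to keeping a table that maps each architectural register name to its latest physical copy, allocating a fresh physical register on every write. The sketch below is an illustrative toy (the `Renamer` class and the `pN` naming are assumptions, not a real processor's scheme), ignoring physical-register reclamation.

```python
class Renamer:
    """Toy register renamer: every architectural write allocates a fresh
    physical register; reads map to the latest physical copy."""

    def __init__(self):
        self.next_phys = 0
        self.map = {}  # architectural name -> current physical name

    def _fresh(self):
        name = f"p{self.next_phys}"
        self.next_phys += 1
        return name

    def read(self, arch):
        # A register read before any write gets its own initial copy
        if arch not in self.map:
            self.map[arch] = self._fresh()
        return self.map[arch]

    def write(self, arch):
        self.map[arch] = self._fresh()
        return self.map[arch]

    def rename(self, dest, sources):
        # Sources are looked up under the OLD mapping, then dest is renamed
        srcs = [self.read(s) for s in sources]
        return (self.write(dest), srcs)
```

Renaming the sequence R3 := R3 op R5; R4 := R3 + 1; R3 := R5 + 1; R7 := R3 op R4 makes the third instruction write a different physical register (p4) than the first (p2), so the output dependency on R3 between the first and third instructions, and the anti-dependency between the second and third, both disappear; only the true dependencies remain.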
Design Issues
• Superscalar Execution
Design Issues
• Superscalar Implementation Issues:
a) Instruction fetch strategies that simultaneously fetch multiple
instructions, often by predicting the outcomes of, and fetching
beyond, conditional branch instructions
– These functions require the use of multiple pipeline fetch and
decode stages, and branch prediction logic
b) Logic for determining true dependencies involving register values,
and mechanisms for communicating these values to where they
are needed during execution
c) Mechanisms for initiating, or issuing, multiple instructions in
parallel
d) Resources for parallel execution of multiple instructions, including
multiple pipelined functional units and memory hierarchies
capable of simultaneously servicing multiple memory references
e) Mechanisms for committing the process state in correct order
Thank you!
