
Pipelining

Amudhan AN

Course Instructor: Amudhan AN


What is Pipelining

Pipelining is a technique used in computer architecture to increase instruction
throughput by overlapping the execution of multiple instructions.
It involves dividing the instruction processing into separate stages, with each
stage handling a different part of the instruction's execution.
This allows multiple instructions to be processed simultaneously, improving
overall efficiency and performance.
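This overlap can be sketched as a tiny schedule generator. A minimal sketch, not from the slides: it assumes an ideal hazard-free pipeline where each instruction advances one stage per clock cycle (the stage names are the classic five-stage pipeline described below).

```python
# A minimal sketch: compute, for an ideal hazard-free pipeline, which clock
# cycle each instruction occupies each stage in.
STAGES = ["IF", "ID", "EX", "MEM", "WB"]

def schedule(num_instructions):
    """Return {instruction_index: {stage_name: cycle_number}}."""
    plan = {}
    for i in range(num_instructions):
        # Instruction i enters IF in cycle i + 1 and advances one stage per cycle.
        plan[i] = {stage: i + 1 + s for s, stage in enumerate(STAGES)}
    return plan

plan = schedule(3)
# In cycle 3, I0 is in EX while I1 is in ID and I2 is in IF: overlapped execution.
print(plan[0]["EX"], plan[1]["ID"], plan[2]["IF"])  # 3 3 3
```

In cycle 3 all three instructions are active at once, which is exactly the throughput gain pipelining provides.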



Advantages:
1. Increased Instruction Throughput: Pipelining allows multiple instructions to be processed simultaneously, significantly increasing the number of instructions completed per unit of time. This leads to faster overall execution of programs.
2. Efficient CPU Resource Utilization: Each stage of the pipeline can work on different parts of multiple instructions concurrently, ensuring that the CPU is not idle and its resources are used more efficiently.
3. Improved Performance: By overlapping the execution of instructions, pipelining reduces the effective time per instruction, leading to improved performance and faster execution of programs. This makes systems more responsive and capable of handling complex tasks more effectively.
4. Reduced Overall Latency for Instruction Sequences: Although the latency of an individual instruction does not change significantly, the total time for a sequence of instructions decreases because multiple instructions are in flight at once.
5. Simplified Instruction Execution: Breaking down instruction execution into smaller, manageable stages simplifies the design and implementation of the CPU. This modular approach makes it easier to design, debug, and optimize each stage individually.
Stages of the Pipeline
Fetch:
• The Instruction Fetch (IF) stage is where the CPU retrieves the next instruction
to be executed from memory.
• Program Counter (PC): Holds the address of the next instruction.
• Instruction Memory: The instruction is fetched from this memory location.
• Increment PC: After fetching, the PC is incremented to point to the next instruction.
• Operations:
1.Fetch Instruction: Use the address in the PC to get the instruction from memory.
2.Update PC: Increment the PC to point to the next instruction.



Example:
• PC = 0x0000
• Instruction = Memory[PC]
• PC = PC + 4
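A minimal sketch of this fetch step in Python, with a dict standing in for instruction memory. The addresses and 32-bit encodings are made-up examples, not from the slides; byte addressing with 4-byte instructions is assumed.

```python
# Sketch of the fetch step: Instruction = Memory[PC], then PC = PC + 4.
instruction_memory = {
    0x0000: 0x20090005,  # hypothetical 32-bit instruction word
    0x0004: 0x200A0003,
}

pc = 0x0000
instruction = instruction_memory[pc]  # Fetch: use the PC as the address
pc = pc + 4                           # Update PC to the next instruction
print(hex(instruction), hex(pc))
```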



Decode
• Definition:
• The Instruction Decode (ID) stage interprets the fetched instruction and
prepares the necessary operands for execution.
• Key Points:
• Control Unit: Decodes the instruction to determine what action is needed.
• Register File: Reads the necessary operands from the registers.
• Immediate Values: Extracts any immediate values if present.
• Operations:
1.Decode Instruction: Control unit interprets the opcode.
2.Read Operands: Fetch the necessary operands from the register file.
3.Sign Extend: For immediate values, extend the sign if needed.
Opcode = Instruction[31:26]
Rs = Instruction[25:21]
Rt = Instruction[20:16]
Immediate = SignExtend(Instruction[15:0])
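The field extraction above can be sketched with shifts and masks for a 32-bit MIPS-style I-type word. The encoding 0x214AFFFF is a made-up example (opcode 8, rs = 10, rt = 10, immediate = -1), not taken from the slides.

```python
# Sketch of the decode-stage field extraction for a MIPS-style I-type word.
def sign_extend(value, bits=16):
    """Interpret a `bits`-wide unsigned field as a signed integer."""
    if value & (1 << (bits - 1)):
        value -= 1 << bits
    return value

instruction = 0x214AFFFF  # hypothetical example encoding

opcode    = (instruction >> 26) & 0x3F          # Instruction[31:26]
rs        = (instruction >> 21) & 0x1F          # Instruction[25:21]
rt        = (instruction >> 16) & 0x1F          # Instruction[20:16]
immediate = sign_extend(instruction & 0xFFFF)   # SignExtend(Instruction[15:0])

print(opcode, rs, rt, immediate)  # 8 10 10 -1
```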

Opcode (short for operation code) is a unique code that specifies the operation to be performed by
a computer's processor. It's essentially the instruction part of a machine language instruction.
For example, in the instruction "ADD A, B", "ADD" is the opcode, specifying the addition operation.
The registers "A" and "B" are the operands, the data on which the operation is performed.
Key points about opcodes:
•Unique Identifier: Each opcode corresponds to a specific operation.
•Binary Representation: Opcodes are typically represented in binary format.
•Part of Machine Language: They are the fundamental building blocks of machine language.
•Instruction Set Architecture: The set of opcodes supported by a processor is defined by its
instruction set architecture (ISA).



This slide covers only the use of registers in the various stages.
A register file is a collection of registers, each capable of storing a fixed number of bits of data. It's a high-speed storage element within a
processor, used to store temporary data during program execution. Think of it as a small, high-speed memory that's directly accessible by
the processor.
Why is it Important in a 5-Stage Pipeline?
In a 5-stage pipeline, the register file plays a crucial role in efficiently passing data between different stages. Here's how:
Instruction Decode (ID) Stage:
The instruction is decoded to determine the source and destination registers.
The register file reads the values from the source registers.
Execute (EX) Stage:
The ALU uses the values from the register file to perform arithmetic or logical operations.
The result of the operation is stored in a temporary register.
Memory Access (MEM) Stage:
For load instructions, the memory address is calculated using values from the register file.
For store instructions, the data to be stored is obtained from the register file.
Write Back (WB) Stage:
The final result of the operation is written back to the destination register in the register file.
Key Points:
High Speed: Register files are designed to provide very fast access to data, significantly impacting overall processor performance.
Organization: They are typically organized as an array of registers, each with its own address.
Read/Write Ports: Multiple read and write ports allow for concurrent access to different registers, enhancing pipeline efficiency.
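The register file described above can be sketched as a small class with two read ports (used by the ID stage) and one write port (used by the WB stage). This is a simplified model, not a hardware description; hard-wiring register 0 to zero is a MIPS-style assumption.

```python
# Sketch of a register file: two read ports for ID, one write port for WB.
class RegisterFile:
    def __init__(self, num_regs=32):
        self.regs = [0] * num_regs

    def read(self, rs, rt):
        """ID stage: read both source operands in the same cycle."""
        return self.regs[rs], self.regs[rt]

    def write(self, rd, value):
        """WB stage: write the result back to the destination register."""
        if rd != 0:  # writes to register 0 are ignored (MIPS-style)
            self.regs[rd] = value

rf = RegisterFile()
rf.write(2, 7)
rf.write(3, 5)
print(rf.read(2, 3))  # (7, 5)
```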
Execute:
• Definition:
• The Execution (EX) stage performs the operation specified by the decoded instruction.
• Key Points:
• Arithmetic Logic Unit (ALU): Performs arithmetic and logical operations.
• Branch Calculations: Determines the branch target address if the instruction is
a branch.
• ALU Control: Selects the appropriate ALU operation based on the instruction type.
• Operations:
1.ALU Operation: Perform the required arithmetic or logical operation.
2.Branch Evaluation: Calculate the branch target if it’s a branch instruction.



ALU Operation: Result = ALU(RsValue, RtValue)
•Context: This takes place in the Execution (EX) stage of the pipeline.
•ALU: Arithmetic Logic Unit, responsible for performing arithmetic and logical operations.
•RsValue: Value from the source register Rs.
•RtValue: Value from the source register Rt.
•Operation: The ALU performs a specified operation (like addition, subtraction, etc.) on
RsValue and RtValue, then stores the result in Result.

Example:
•Instruction: ADD R1, R2, R3
•Rs: Register R2
•Rt: Register R3
•Operation: Result = R2 + R3 (Value from R2 plus value from R3).
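A sketch of the EX-stage ALU for this example. The operation is selected here by a mnemonic string rather than real ALU-control bits, which is a simplification.

```python
# Sketch of the ALU: Result = ALU(RsValue, RtValue).
def alu(op, rs_value, rt_value):
    if op == "ADD":
        return rs_value + rt_value
    if op == "SUB":
        return rs_value - rt_value
    if op == "AND":
        return rs_value & rt_value
    if op == "OR":
        return rs_value | rt_value
    raise ValueError(f"unsupported ALU operation: {op}")

# ADD R1, R2, R3 with R2 holding 6 and R3 holding 4:
print(alu("ADD", 6, 4))  # 10
```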



Branch Address Calculation: BranchAddr = PC + (SignExtend(Immediate) << 2)
•Context: This calculation is used to determine the target address for branch instructions.
•PC (Program Counter): Holds the address of the current instruction.
•Immediate: A value embedded in the instruction, often representing an offset.
•SignExtend(Immediate): Extends the immediate value to match the bit-width of the target address, preserving its sign.
•Shift Left (<< 2): Multiplies the immediate value by 4, which aligns it with the instruction word size (assuming 32-bit instructions).
•Operation: Adds the shifted, sign-extended immediate value to the PC to compute the target address of the branch.

Example:
•Instruction: BEQ R1, R2, offset
•PC: 0x1000
•Immediate (offset): 0x0004
•Sign-Extended Immediate: 0x0004
•Shifted Immediate: 0x0004 << 2 = 0x0010
•Branch Address: PC + 0x0010 = 0x1010
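The calculation above as code, following the slide's formula BranchAddr = PC + (SignExtend(Immediate) << 2). Note that real MIPS uses PC + 4 as the base; the slide's simpler form is kept here.

```python
# Sketch of the branch-target calculation from the slide.
def sign_extend16(value):
    """Sign-extend a 16-bit field to a Python int."""
    return value - 0x10000 if value & 0x8000 else value

def branch_target(pc, immediate):
    # BranchAddr = PC + (SignExtend(Immediate) << 2)
    return pc + (sign_extend16(immediate) << 2)

print(hex(branch_target(0x1000, 0x0004)))  # 0x1010 (matches the example)
```

A negative offset works the same way: an immediate of 0xFFFC (-4) from PC = 0x1010 branches back to 0x1000.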
Memory
• Definition:
• The Memory Access (MEM) stage is used for load and store instructions
to access memory.
• Key Points:
• Load/Store Unit: Handles memory read or write operations.
• Data Memory: Reads or writes data from/to memory.
• Memory Address: The effective address is calculated and used.
• Operations:
1.Memory Read: Load the data from the calculated address.
2.Memory Write: Store the data to the calculated address.
Example:
• If Load instruction: Data = Memory[Address]
• If Store instruction: Memory[Address] = RtValue
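A sketch of the MEM stage for loads and stores, with a dict standing in for data memory. Treating unwritten addresses as holding 0 is an assumption of this sketch.

```python
# Sketch of the MEM stage: loads read Memory[Address], stores write RtValue.
data_memory = {}

def mem_stage(is_load, address, rt_value=None):
    if is_load:
        return data_memory.get(address, 0)   # Data = Memory[Address]
    data_memory[address] = rt_value          # Memory[Address] = RtValue
    return None

mem_stage(False, 0x100, 42)      # store: Memory[0x100] = 42
print(mem_stage(True, 0x100))    # load: 42
```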



Write Back :
• Definition:
• The Write Back (WB) stage writes the result of the instruction back to the
register file.
• Key Points:
• Register File: The destination register is updated with the result.
• Result Selection: Choose between the ALU result and memory data for write-back.
• Final Stage: Completes the instruction execution cycle.
• Operations:
1.Write Result: Write the result to the specified destination register.
Example:
Register[Destination] = Result
Pipeline stages in various ARM architectures



Src: https://www.geeksforgeeks.org/pipelining-in-arm/



Instruction Cycle:
The complete sequence of stages needed to execute an instruction, from fetch to write-back.
Clock Cycle:
The time taken to complete one cycle of the clock, which synchronizes the operations of the CPU.
Role in Pipelining: Each stage of the pipeline typically completes its part of the instruction cycle in one clock cycle.
Parallel Processing: Multiple instructions are processed simultaneously, each at a different stage, thereby utilizing each clock cycle efficiently.
• Example: Pipelining with Clock Cycles
• Instruction 1 (I1):
• Cycle 1: Fetch
• Cycle 2: Decode
• Cycle 3: Execute

• Instruction 2 (I2):
• Cycle 2: Fetch
• Cycle 3: Decode
• Cycle 4: Execute



Disadvantages of pipelining

• Pipeline Hazards: Conditions that interrupt the pipeline, delaying the execution of instructions.
• Increased Complexity: Adding pipeline stages increases the overall design complexity of the processor.
• Stalling: Data dependencies may require instructions to wait, resulting in pipeline stalls.



Types of Data Hazards:
• Read After Write (RAW) - True Dependence:
• Occurs when an instruction needs to read a value that has not yet been written by a previous instruction.

Instruction 1: ADD R2, R3, R4 ; R2 = R3 + R4


Instruction 2: SUB R5, R2, R6 ; R5 = R2 - R6

• If I2 executes before I1 writes to R2, I2 will read the old (stale) value of R2.
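RAW detection between two decoded instructions can be sketched as a set-membership check. The (destination, sources) tuple representation is an assumption of this sketch, not the slides' notation.

```python
# Sketch of RAW-hazard detection: does the later instruction read a
# register that the earlier instruction writes?
def has_raw_hazard(earlier, later):
    """Each instruction is (destination_register, source_registers)."""
    dest, _ = earlier
    _, sources = later
    return dest in sources

i1 = (2, (3, 4))   # ADD R2, R3, R4: writes R2
i2 = (5, (2, 6))   # SUB R5, R2, R6: reads R2
print(has_raw_hazard(i1, i2))  # True
```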



• Write After Read (WAR) - Anti-Dependence:
• Occurs when an instruction writes to a register that a previous instruction needs to read.

I1: MOV R4, R1 ; R4 = R1


I2: ADD R1, R2, R3 ; R1 = R2 + R3

• If I2 executes before I1 reads R1, I1 will read the wrong (new) value of R1.



• Write After Write (WAW) - Output Dependence:
Occurs when two instructions write to the same register, potentially causing the wrong value to be written last.

I1: ADD R1, R2, R3 ; R1 = R2 + R3


I2: SUB R1, R4, R5 ; R1 = R4 - R5

Suppose I2 executes before I1 writes its result to R1.

I2 will write its result to R1.


I1 then writes its result to R1, overwriting the value written by I2.

In program order, R1 should end up holding I2's result; if I1's write happens last, subsequent instructions read the wrong value.
Handling Data Hazards:
• Forwarding (Bypassing):
• Uses hardware to pass the result of an instruction directly to a subsequent instruction that needs it, bypassing the normal write-back stage.
• Example: The result of I1 is forwarded directly to the next instruction I2, which needs the result, without waiting for it to be written back to the register file.
• The forwarding happens through the pipeline (buffer) registers between stages.
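The forwarding decision can be sketched as a multiplexer: if the previous instruction's ALU result targets one of our source registers, use that result instead of the stale register-file value. The function signature here is illustrative, not a real hardware interface.

```python
# Sketch of EX-stage operand forwarding (the bypass mux).
def forward_operand(reg, regfile_value, prev_dest, prev_result):
    if prev_dest is not None and prev_dest == reg:
        return prev_result       # bypass via the pipeline (buffer) register
    return regfile_value

# I1 (ADD R2, R3, R4) produced 9 but has not reached WB yet;
# I2 needs R2, whose register-file copy is still the stale value 0.
print(forward_operand(2, 0, prev_dest=2, prev_result=9))  # 9
```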

• Pipeline Stalls:

• Introduce delays in the pipeline to wait for the necessary data to be available.

• Insert NOP (no-operation) instructions to wait for the ADD to complete before executing the dependent instruction.



• Register Renaming:

• Dynamically assigns different physical registers to eliminate WAR and WAW hazards.

• Example: Instead of using the same logical register, use different physical registers to store intermediate results.



Control Hazards in Pipelining

• Control hazards, also known as branch hazards, occur when the pipeline makes incorrect predictions about the flow of control instructions such as branches, jumps, and calls. These hazards can disrupt the flow of instructions through the pipeline, leading to delays and inefficiencies.



Example: a taken branch in the pipeline

Address   Instruction
2000      I1
2004      I2: BEQ Label    (branch)
2008      I3
...
2050      BI1              (at Label)

          1     2     3     4     5     6     7     8
I1        IF    ID    EX    Mem   WB                      OK
I2              IF    ID    EX    Mem   WB                OK (branch target PC = 2050 known after decode)
I3                    IF    ID    EX    Mem   WB          Problem: fetched before the previous instruction's decode resolved the branch
BI1                         IF    ID    EX    Mem   WB
Solution 1: Stall

          1     2     3     4     5     6     7     8
I1        IF    ID    EX    Mem   WB                      OK
I2              IF    ID    EX    Mem   WB                OK (branch target PC = 2050 resolved in decode)
(stall)               -                                   Introduce a delay until the previous instruction's decode completes
BI1                         IF    ID    EX    Mem   WB

I1: BEQ R0, R1, label   ; Branch if R0 equals R1
I2: NOP                 ; Delay slot (No Operation)


Solution 2: Pipeline Flushing

1. Description: Discard instructions in the pipeline that were fetched based on incorrect predictions.
2. Example: If a branch is taken, flush all subsequent instructions in the pipeline that were fetched assuming the branch was not taken.

          1     2     3     4     5     6     7     8     9
I1        IF    ID    EX    Mem   WB                            OK
I2              IF    ID    EX    Mem   WB                      OK (branch condition resolved in EX, PC = 2050)
I3                    IF    ID    (flushed)                     Problem: fetched assuming the branch was not taken
I4                          IF    (flushed)
BI1                               IF    ID    EX    Mem   WB
Solution 3: Branch Prediction Algorithm
1. Initialize Prediction Table:
Create a table to store branch history and prediction outcomes (e.g., a Branch History Table with 2-bit counters for each branch).
2. Fetch Instruction:
Fetch the next instruction to be executed.
3. Check for Branch:
If the instruction is a branch, proceed to predict its outcome.
If not, execute it as a normal instruction.
4. Predict Branch Outcome:
Use the branch history table to predict whether the branch will be taken or not taken.
Based on the prediction, fetch the next set of instructions.
5. Execute Branch Instruction:
Execute the branch instruction to determine the actual outcome.
6. Update Prediction Table:
If the prediction was correct, strengthen the prediction in the table.
If the prediction was incorrect, update the table to reflect the actual outcome.
7. Handle Mispredictions:
If the prediction was incorrect:
Flush the incorrect instructions from the pipeline.
Fetch the correct set of instructions based on the actual outcome of the branch.
8. Continue Pipeline Execution:
Repeat the process for subsequent instructions and branches.
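The 2-bit saturating-counter scheme mentioned in step 1 can be sketched as follows. Counter values 0-1 predict "not taken" and 2-3 predict "taken"; a correct outcome strengthens the counter, a wrong one weakens it. Starting each branch weakly not-taken (counter = 1) is an assumption of this sketch.

```python
# Sketch of a Branch History Table with 2-bit saturating counters.
class TwoBitPredictor:
    def __init__(self):
        self.table = {}  # branch PC -> 2-bit counter (default 1: weakly not-taken)

    def predict(self, pc):
        """Predict taken when the counter is in states 2 or 3."""
        return self.table.get(pc, 1) >= 2

    def update(self, pc, taken):
        """Saturate the counter toward the actual outcome."""
        counter = self.table.get(pc, 1)
        counter = min(counter + 1, 3) if taken else max(counter - 1, 0)
        self.table[pc] = counter

bp = TwoBitPredictor()
pc = 0x2004
print(bp.predict(pc))  # False (initially predicts not taken)
bp.update(pc, True)    # counter 1 -> 2
print(bp.predict(pc))  # True (now predicts taken)
```

The 2-bit hysteresis means a single misprediction in a long run of taken branches (e.g., a loop exit) does not flip the prediction immediately.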
Structural Hazards in Pipelining
• Definition:
• Structural hazards occur when hardware resources are insufficient to support all concurrent operations in the pipeline. This can cause conflicts and delays in instruction execution.
• Example:
• If two instructions require the same resource (e.g., memory, ALU) at the same time, a structural hazard occurs.



          1        2        3        4        5     6
I1        IF(Mem)  ID       EX       Mem      WB
I2                 IF(Mem)  ID       EX       Mem   WB
I3                          IF(Mem)  ID       EX    Mem
I4                                   IF(Mem)  ID    EX

In cycle 4, I1's Mem stage and I4's IF stage both need the memory unit at the same time: a structural hazard.

Resolution by stalling I4 until the memory unit is free:

          1        2        3        4     5     6        7     8
I1        IF(Mem)  ID       EX       Mem   WB
I2                 IF(Mem)  ID       EX    Mem   WB
I3                          IF(Mem)  ID    EX    Mem      WB
I4                                   -     -     IF(Mem)  ID    EX


Problem 1

• Consider a five-stage pipeline with a cycle time of 6 ns.


• Calculate the execution time of 100 instructions.
• Calculate the speed up due to pipelining
• Also find the utilization.



1. Execution Time
Non-Pipelined Execution:
Execution Time per Instruction: 5 stages * 6 ns = 30 ns per instruction
Total Time for 100 Instructions: 100 * 30 ns = 3000 ns
Pipelined Execution:
Time for the first instruction to complete (pipeline fill): 5 * 6 ns = 30 ns
Time to complete the remaining 99 instructions (one per cycle): 99 * 6 ns = 594 ns
Total Pipelined Execution Time: (5 + 100 - 1) * 6 ns = 30 ns + 594 ns = 624 ns

2. Speedup due to Pipelining

Speedup = Non-Pipelined Time / Pipelined Time
Speedup = 3000 ns / 624 ns ≈ 4.81
Course Instructor : Amudhan AN 36
3. Utilization of the Pipeline
Pipeline Utilization = (stage-cycles actually used / total available stage-cycles) * 100%

The 100 instructions take 5 + 100 - 1 = 104 cycles in total; each instruction occupies 5 stage-cycles, and the pipeline is only partially full during fill and drain.

Utilization = (100 instructions * 5 stages) / (104 cycles * 5 stages) * 100%

Utilization = (500 / 520) * 100% ≈ 96.2%


Problem 2
Consider a 5-stage pipeline with stage delays of 10, 16, 12, 11, and 14 ns.
Calculate the execution time of 100 instructions and the speedup due to
pipelining.



1. Find the Cycle Time
In a pipelined architecture, the cycle time is determined by the stage with the maximum delay, because every stage must operate within the same cycle time to maintain synchronization.
Stage Delays: 10 ns, 16 ns, 12 ns, 11 ns, 14 ns
Cycle Time: 16 ns (the maximum stage delay)

2. Execution Time for 100 Instructions

Non-Pipelined Execution:
Execution Time per Instruction: Sum of all stage delays = 10 + 16 + 12 + 11 + 14 = 63 ns
Total Time for 100 Instructions: 100 * 63 ns = 6300 ns
Pipelined Execution:
Time for the first instruction to complete (pipeline fill): 5 * 16 ns = 80 ns
Time to complete the remaining 99 instructions (one per cycle): 99 * 16 ns = 1584 ns
Total Pipelined Execution Time: (5 + 100 - 1) * 16 ns = 80 ns + 1584 ns = 1664 ns

3. Speedup due to Pipelining

Speedup = Non-Pipelined Time / Pipelined Time
Speedup = 6300 ns / 1664 ns ≈ 3.79
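These timing formulas can be sketched as a small helper: with k stages, n instructions, and cycle time t equal to the slowest stage delay, pipelined execution takes (k + n - 1) cycles in total. The function name is illustrative, not from the slides.

```python
# Sketch of the pipeline timing formulas used in the two problems.
def pipeline_times(k, n, stage_delays):
    t = max(stage_delays)                     # cycle time = slowest stage
    non_pipelined = n * sum(stage_delays)     # each instruction runs all stages
    pipelined = (k + n - 1) * t               # fill, then one completion per cycle
    return non_pipelined, pipelined, non_pipelined / pipelined

np_time, p_time, speedup = pipeline_times(5, 100, [10, 16, 12, 11, 14])
print(np_time, p_time, round(speedup, 2))  # 6300 1664 3.79
```

Running it with equal 6 ns stages reproduces Problem 1: 3000 ns non-pipelined vs (5 + 100 - 1) * 6 = 624 ns pipelined.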
