0% found this document useful (0 votes)

14 views70 pages

MIPS

The document provides an overview of the MIPS processor architecture, detailing its components, instruction execution, and data path operations. It discusses advanced features like pipelining, parallel processing, and various hazards that can occur during instruction execution, along with strategies for optimization. Additionally, it covers branch prediction techniques to enhance performance in pipelined architectures.

Uploaded by

pilotscrown

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views70 pages

MIPS

Uploaded by

pilotscrown

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 70

Unit III MIPS Processor

• A microprocessor is an integrated circuit that

performs computing tasks.
• It acts as the brain of a computer, executing
instructions from memory.
• Evolution: Started from 4-bit processors to
modern multi-core processors.
• Applications: Used in PCs, embedded systems,
industrial automation, and more.
Microprocessor

• Basic Components:
- ALU (Arithmetic Logic Unit): Performs mathematical and
logical operations.
- CU (Control Unit): Directs operations within the processor.
- Registers: Small storage units for temporary data.
• Data Flow: Instructions fetched from memory, decoded, and
executed.
• Instruction Set Architecture (ISA): Defines supported operations
and formats.
Instruction Set and Execution

• Types of Instructions:
- Data transfer (MOV, LOAD, STORE)
- Arithmetic & logical (ADD, SUB, AND, OR)
- Control (JUMP, CALL, RETURN)
• Addressing Modes:
- Immediate, Register, Direct, Indirect
• Execution Cycle:
- Fetch → Decode → Execute → Store
Memory and I/O Interface

• Memory Hierarchy:
- Registers → Cache → RAM → Hard Disk
• I/O Interfacing:
- Parallel and Serial communication
- Memory-mapped and I/O-mapped addressing
• Bus Architecture:
- Address Bus (selects memory locations)
- Data Bus (transfers data)
- Control Bus (manages operations)
Advanced Microprocessor Features

• Pipelining: Overlapping instruction execution to

enhance speed.
• Parallel Processing: Multi-core processors for
improved performance.
• Cache Memory: Stores frequently used data to
reduce latency.
• Power Optimization: Efficient power
management techniques in modern processors.
Load–store architecture (or a register–register architecture) is an
instruction set architecture that divides instructions into two
categories: memory access (load and store between memory and
registers) and ALU operations (which only occur between registers.

Reduced instruction set computer (RISC) is a

computer architecture designed to simplify the individual instructions
given to the computer to accomplish tasks.

Compared to the instructions given to a

complex instruction set computer (CISC), a RISC computer might
require more instructions (more code) in order to accomplish a task
because the individual instructions are written in simpler code.
MIPS Data Path Operation

1. Instruction Fetch (IF): PC fetches instruction from instruction

memory.
2. Instruction Decode (ID): Instruction is decoded, and register
values are read.
3. Execution (EX): ALU processes the instruction.
4. Memory Access (MEM): Data memory is accessed if
required.
5. Write Back (WB): The computed value is written back to the
register file.
In the MIPS processor architecture, the data path elements are the key
components responsible for instruction execution. These elements work together
to fetch, decode, execute, and store results efficiently. Below are the major
elements of the MIPS processor data path:
1. Program Counter (PC)
•Stores the address of the next instruction to be fetched.
•Updates sequentially (PC + 4) unless a branch/jump occurs.
2. Instruction Memory
•Stores the program instructions.
•Fetches the instruction based on the value in the PC.
3. Instruction Decoder / Control Unit
•Decodes the fetched instruction.
•Generates control signals for various components.
4. Register File
•Contains 32 general-purpose registers
•Reads two registers (Rs and Rt) and writes the result to a destination register
(Rd).
5. ALU (Arithmetic Logic Unit)
•Performs arithmetic and logic operations (e.g., addition, subtraction, AND,
OR).
•Supports comparison operations (e.g., branch decisions).
6. Sign Extender
•Converts 16-bit immediate values into 32-bit format for processing.
7.Data Memory
•Used for lw (load word) and sw (store word) instructions.
•Loads/stores data between registers and memory.
8. Multiplexers (MUX)
•Select between different inputs based on control signals.
•Used to choose between ALU inputs, write destinations, and
branch/jump addresses.
9. Branch Logic
•Determines the next PC value for branch/jump instructions.
•Compares register values to decide if branching is required.
10. Pipeline Registers (For Pipelined MIPS)
•Stores intermediate values between instruction stages.
•Helps achieve instruction parallelism.
Control Implementation Schemes
Load word & Store word – ALU addition (to compute memory address)
Arithmetic and logical instructions – Add, sub, and, or, set on less than (for R type instruction)
Branch – beq & bne – ALU subtraction
ALU Control lines & its functions:
0000 – AND
0001 – OR
0010 – Add
0110 – Sub
0111 – slt
1100 – NOR
Pipelining
Pipelining is a technique used in MIPS (Microprocessor without Interlocked
Pipeline Stages) to improve instruction throughput by overlapping the
execution of multiple instructions.
Instead of executing instructions sequentially, pipelining allows multiple
instructions to be in different stages of execution simultaneously.
Each segment consists of input along with combinational circuits.
Each execution has 5 subtasks in MIPS -IF ID OF IE OS

PIPELINE SPEEDUP:
If the stages are perfectly balanced, then the time interval between the
instructions in the pipelined processor assuming ideal condition:
Time between instruction pipelined = Time between instruction non
pipelined / No of Pipelined stages
Pipeline Performance Improvement

• Without pipelining: A single instruction

completes in 5 clock cycles.

• With pipelining: New instructions start every

cycle, achieving one instruction per cycle
throughput (ideal case).
Pipeline Hazards
Pipelining introduces potential issues called hazards, which can cause stalls or
incorrect execution.

Hazards occur when the next instruction cannot execute the following clock
cycles.

Types of hazards:

Structural hazard
Data hazard
Control Hazard

a) Structural Hazards
•Occurs when two instructions try to use the same hardware resource
simultaneously.
•Solution: Use separate instruction and data memory or multi-port registers.
b) Data Hazards
•Occurs when an instruction depends on the result of a previous instruction
that has not yet completed.

• Unavailability of data in an instruction then the pipeline must be stalled, one

instruction must wait for another to be completed.

•When a planned instruction cannot be executed in proper clock cycle because

data needed to execute an instruction is not available.

•In a computer pipeline data hazard arise from the dependency of one
instruction on the previous instruction that is still in the pipeline.
•Example:

ADD $t1, $t2, $t3

SUB $t4, $t1, $t5

# Data hazard: $t1 is not yet updated

The add instruction writes the result in the fifth stage of clock cycle. Sub
instruction must wait – Stall (bubble) for three clock cycles.
Data forwarding:

• Forwarding (Bypassing): Use the result from EX/MEM stage instead

of waiting for WB. (resolving a data hazard)

• As soon as ALU creates sum for the ADD instruction it can be given as
input for SUB instruction.

• If the first instruction is (lw) loaded instead of ADD, then the desired
data is available only after the fourth stage of the first instruction.

• Hence even with the forwarding we must stall for one clock cycle.

• Pipeline stall is also known as bubble.

• Pipeline Stall (NOP insertion): Delay execution until data is available.

Reordering of code:

Conside the code segement c

a=b+e
c=b+f

Lw $t1, 0($t0)
Lw $t2, 4($t0)
ADD $t3, $t1, $t2
sw $t3, 12($t0)
Lw $t4, 8($t0)
ADD $t5, $t1, $t4
sw $t5, 16($t0)
Reordering of code:

Conside the code segement c

a=b+e
c=b+f

Lw $t1, 0($t0)
Lw $t2, 4($t0)
Lw $t4, 8($t0)
ADD $t3, $t1, $t2
sw $t3, 12($t0)
ADD $t5, $t1, $t4
sw $t5, 16($t0)
Control Hazards
•Occurs due to branch (jump) instructions when the next instruction is
unknown

•It arises from the need to make decision based on the result of the
instruction while the others are executing.

•Example:

BEQ $t1, $t2, LABEL

•Solution:
• Branch Prediction (guess the branch outcome).
• Branch Delay Slot (reorder instructions to minimize stalls).

• If TRUE instead of moving to OR instruction it moves to ADD –

control hazard.
Example of MIPS Pipeline Execution

Consider the following instruction sequence:

assembly
LOAD $t1, 0($t2)
ADD $t3, $t1, $t4
SUB $t5, $t3, $t6
Cycle IF ID EX MEM WB

1 LOAD

2 ADD LOAD

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

6 SUB ADD

Pipeline execution:
7 SUB

•Without stalls, one instruction completes per cycle after the pipeline fills.
•Data hazards might require forwarding or stalling to resolve dependencies.
MIPS Pipeline Optimizations

•Forwarding: Reduces stalls by using ALU results before WB.

•Branch Prediction: Reduces control hazard delays.
•Delayed Branching: Executes an instruction in the branch
Cycle IF ID EX MEM WB

1 LOAD

2 ADD LOAD

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

6 SUB ADD

7 SUB

delay slot.
•Multiple Issue (Superscalar Execution): Executes multiple
instructions per cycle.
Branch Prediction:

It is the method of predicting the branch outcome:

Static branch prediction

Dynamic branch prediction

Static branch prediction: Cycle

7
IF

LOAD

ADD

SUB
ID

LOAD

ADD

SUB
EX

LOAD

ADD

SUB
MEM

LOAD

ADD

SUB
WB

LOAD

ADD

SUB

It is used to predict always that the branches are not taken NT

When the prediction is right the pipeling proceeds at full speed.

When the branches are taken the pipeline stalls- when there is misprediction
Dynamic branch prediction:

The dynamic prediction hardware guesses depending on the behaviour of each branch
and may change the prediction for a branch over the life of a program using runtime
information.

Prediction of branches at runtime using runtime information.

It is based on the recent past behaviour to predict the future.

Types:
Cycle IF ID EX MEM WB

1 LOAD

2 ADD LOAD

3 SUB ADD LOAD

One bit branch prediction:

4 SUB ADD LOAD

5 SUB ADD LOAD

6 SUB ADD

7 SUB

After one-bit wrong prediction, predicted bit is inverted. Implementation of this approach
is that contains branch prediction buffer ( small memory indexed by the lower portion of
the address of the branch instruction).

The memory contains a bit 1 or 0 ( depending on this whether the branch is recently taken
or not)
Ex:

NT T T T T NT

Perform bit inversion NT as T

Even if the branch is almost T we can predict incorrectly twice rather than
ones when it is NT. Prediction accuracy is 80%

Two bit branch prediction: Cycle IF ID EX MEM WB

1 LOAD

2 ADD LOAD

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

6 SUB ADD

7 SUB

The prediction changes according to the history of the individual branch

instruction.

00 – Strongly NT
11 – Strongly T
10 – Weakly T
01 – Weakly NT
Cycle IF ID EX MEM WB

1 LOAD

2 ADD LOAD

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

6 SUB ADD

7 SUB

Ripes A Visual Computer Architecture Simulator
100% (1)
Ripes A Visual Computer Architecture Simulator
8 pages
Ca06 2014 PDF
No ratings yet
Ca06 2014 PDF
53 pages
CA Unit 3 Answers
No ratings yet
CA Unit 3 Answers
10 pages
L14 MipsPipeline Ovw
No ratings yet
L14 MipsPipeline Ovw
17 pages
Cse410 10 Pipelining A
No ratings yet
Cse410 10 Pipelining A
7 pages
L15 MipsPipeline
No ratings yet
L15 MipsPipeline
26 pages
Lec07 Pipelining Review
No ratings yet
Lec07 Pipelining Review
121 pages
Unit 5.2 Processor
No ratings yet
Unit 5.2 Processor
40 pages
Lect8 Pipelined DP Control
No ratings yet
Lect8 Pipelined DP Control
59 pages
02a ILP Pipeline
No ratings yet
02a ILP Pipeline
40 pages
UNIT-3: MIPS Instructions
No ratings yet
UNIT-3: MIPS Instructions
15 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
71 pages
CS530 Fall2015 Lecture9
No ratings yet
CS530 Fall2015 Lecture9
5 pages
8 Pipeline DDP Control
No ratings yet
8 Pipeline DDP Control
54 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
ASIC Design of MIPS Based RISC Processor For High Performance
No ratings yet
ASIC Design of MIPS Based RISC Processor For High Performance
7 pages
A4 版本1 （未使用）
No ratings yet
A4 版本1 （未使用）
2 pages
CODch 6 Slides
No ratings yet
CODch 6 Slides
77 pages
Pipelining and Vector Processing
No ratings yet
Pipelining and Vector Processing
30 pages
Embedded Systems Design: Pipelining and Instruction Scheduling
No ratings yet
Embedded Systems Design: Pipelining and Instruction Scheduling
48 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
77 pages
Pipeline: Example
No ratings yet
Pipeline: Example
6 pages
Pipeline 1
No ratings yet
Pipeline 1
6 pages
Advanced Computer Architecture: BY Dr. Radwa M. Tawfeek
No ratings yet
Advanced Computer Architecture: BY Dr. Radwa M. Tawfeek
50 pages
4 20 10 PDF
No ratings yet
4 20 10 PDF
12 pages
Basic Pipelining: CS2100 - Computer Organization
No ratings yet
Basic Pipelining: CS2100 - Computer Organization
83 pages
Lecture-4-08 01 2025
No ratings yet
Lecture-4-08 01 2025
35 pages
Embedded Computer Architecture 5SAI0
No ratings yet
Embedded Computer Architecture 5SAI0
59 pages
Module 5 - Processor Structure and Function
No ratings yet
Module 5 - Processor Structure and Function
74 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
72 pages
Enhancing Performance With Pipelining
No ratings yet
Enhancing Performance With Pipelining
85 pages
Unit - 1 Microprocessor Architecture
No ratings yet
Unit - 1 Microprocessor Architecture
52 pages
Module-5 DDCO
No ratings yet
Module-5 DDCO
35 pages
Parallelism Via Instructions: Instruction-Level Parallelism (ILP)
No ratings yet
Parallelism Via Instructions: Instruction-Level Parallelism (ILP)
21 pages
COA Unit 3
No ratings yet
COA Unit 3
89 pages
01 - Mod 2 - Livro Autorresponsabilidade
No ratings yet
01 - Mod 2 - Livro Autorresponsabilidade
9 pages
DDCO Notes-162-171
No ratings yet
DDCO Notes-162-171
10 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
Advanced Linux Programming
No ratings yet
Advanced Linux Programming
31 pages
21CS403Notes 5
No ratings yet
21CS403Notes 5
16 pages
Processor Organization & Instruction Cycle
No ratings yet
Processor Organization & Instruction Cycle
31 pages
Pipe Lining
No ratings yet
Pipe Lining
16 pages
Pipelining ControlUnitAndHazards
No ratings yet
Pipelining ControlUnitAndHazards
109 pages
Unit 3 Computer Architecture
No ratings yet
Unit 3 Computer Architecture
3 pages
ILP - Appendix C PDF
No ratings yet
ILP - Appendix C PDF
52 pages
Phy 108
No ratings yet
Phy 108
24 pages
Pipe Lining
No ratings yet
Pipe Lining
66 pages
Unit 7 - Basic Processing
No ratings yet
Unit 7 - Basic Processing
85 pages
CO Pipelining PDF Notes
No ratings yet
CO Pipelining PDF Notes
10 pages
Computer Architecture and Organization
No ratings yet
Computer Architecture and Organization
49 pages
Introduction To MIPS Architecture
No ratings yet
Introduction To MIPS Architecture
10 pages
MIPS Pipeline: Data and Control Path Data and Control Path
No ratings yet
MIPS Pipeline: Data and Control Path Data and Control Path
46 pages
PIPELINING
No ratings yet
PIPELINING
30 pages
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
No ratings yet
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
81 pages
Chapter 04 Processor 2
No ratings yet
Chapter 04 Processor 2
28 pages
CSO Lecture Notes Unit - 5
No ratings yet
CSO Lecture Notes Unit - 5
11 pages
Digital Fundamentals & Computer Architecture
No ratings yet
Digital Fundamentals & Computer Architecture
110 pages
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
No ratings yet
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
60 pages
Pipelining - Modified1
No ratings yet
Pipelining - Modified1
51 pages
Python Beyond Limits: Python, #3
From Everand
Python Beyond Limits: Python, #3
AnwaarX
No ratings yet
Gate Cse Cao
100% (1)
Gate Cse Cao
108 pages
CH-1 1 Pipelining
No ratings yet
CH-1 1 Pipelining
43 pages
Reduced Instruction Set Computing (RISC) : Li-Chuan Fang
No ratings yet
Reduced Instruction Set Computing (RISC) : Li-Chuan Fang
42 pages
How Data Hazards Can Be Removed Effectively
No ratings yet
How Data Hazards Can Be Removed Effectively
6 pages
Unit III - Basic Processing Unit
No ratings yet
Unit III - Basic Processing Unit
123 pages
Chapter - 04 Mips Assembly Data Path
No ratings yet
Chapter - 04 Mips Assembly Data Path
137 pages
COAL Assignment (Y86 Processor Architecture)
100% (1)
COAL Assignment (Y86 Processor Architecture)
32 pages
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
No ratings yet
2.1: Advanced Processor Technology: Qn:Explain Design Space of Processor?
29 pages
Parallel Processing Chapter - 2: Basics of Architectural Design
No ratings yet
Parallel Processing Chapter - 2: Basics of Architectural Design
29 pages
CS3351 Dpco Qbank
No ratings yet
CS3351 Dpco Qbank
43 pages
Unit-6: Pipeline & Vector Processing
No ratings yet
Unit-6: Pipeline & Vector Processing
41 pages
Computer Architecture Assignment 1
No ratings yet
Computer Architecture Assignment 1
12 pages
Computer Organization and Architecture
No ratings yet
Computer Organization and Architecture
21 pages
Unit Iv Coa - PPT
No ratings yet
Unit Iv Coa - PPT
99 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
37 pages
Co Unit3
No ratings yet
Co Unit3
41 pages
1822 B.E Ece Batchno 102
No ratings yet
1822 B.E Ece Batchno 102
86 pages
CCS CMCS 611-101 Advanced Computer Architecture Advanced Computer Architecture
100% (2)
CCS CMCS 611-101 Advanced Computer Architecture Advanced Computer Architecture
24 pages
Patterson6e MIPS Ch04 PPT
No ratings yet
Patterson6e MIPS Ch04 PPT
137 pages
Detailed Notes On Data Hazards, Structural Hazards, and Control Hazards
No ratings yet
Detailed Notes On Data Hazards, Structural Hazards, and Control Hazards
1 page
Lesson Plan LP - CS6303 LP Rev. No: 00 Date: 20/06/2014 Page: 01 of 06 Sub Code: CS6303 Sub Name: Unit: I Branch: Be (Cse) Semester: Iii
No ratings yet
Lesson Plan LP - CS6303 LP Rev. No: 00 Date: 20/06/2014 Page: 01 of 06 Sub Code: CS6303 Sub Name: Unit: I Branch: Be (Cse) Semester: Iii
6 pages
Computer-Architecture Q&A
100% (3)
Computer-Architecture Q&A
37 pages
Riscv Design
No ratings yet
Riscv Design
82 pages
Pipeline Hazards
No ratings yet
Pipeline Hazards
53 pages
QUESTION BANK UNIT 5 - Computer Organization and Architecture
No ratings yet
QUESTION BANK UNIT 5 - Computer Organization and Architecture
9 pages
8 - RISCV - Pipelined - Arch2
No ratings yet
8 - RISCV - Pipelined - Arch2
57 pages
Pipelining Vector Processing
No ratings yet
Pipelining Vector Processing
27 pages
Pipeline and Vector Processing
No ratings yet
Pipeline and Vector Processing
18 pages
Coa Unit - 5 Notes
No ratings yet
Coa Unit - 5 Notes
6 pages

MIPS

Uploaded by

MIPS

Uploaded by

Unit III MIPS Processor

• A microprocessor is an integrated circuit that

• Pipelining: Overlapping instruction execution to

Reduced instruction set computer (RISC) is a

Compared to the instructions given to a

1. Instruction Fetch (IF): PC fetches instruction from instruction

• Without pipelining: A single instruction

• With pipelining: New instructions start every

• Unavailability of data in an instruction then the pipeline must be stalled, one

•When a planned instruction cannot be executed in proper clock cycle because

ADD $t1, $t2, $t3

SUB $t4, $t1, $t5

# Data hazard: $t1 is not yet updated

• Forwarding (Bypassing): Use the result from EX/MEM stage instead

• Pipeline stall is also known as bubble.

• Pipeline Stall (NOP insertion): Delay execution until data is available.

Conside the code segement c

Conside the code segement c

BEQ $t1, $t2, LABEL

• If TRUE instead of moving to OR instruction it moves to ADD –

Consider the following instruction sequence:

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

•Forwarding: Reduces stalls by using ALU results before WB.

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

It is the method of predicting the branch outcome:

Static branch prediction

Static branch prediction: Cycle

It is used to predict always that the branches are not taken NT

When the prediction is right the pipeling proceeds at full speed.

Prediction of branches at runtime using runtime information.

It is based on the recent past behaviour to predict the future.

3 SUB ADD LOAD

One bit branch prediction:

5 SUB ADD LOAD

Perform bit inversion NT as T

Two bit branch prediction: Cycle IF ID EX MEM WB

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

The prediction changes according to the history of the individual branch

3 SUB ADD LOAD

4 SUB ADD LOAD

5 SUB ADD LOAD

You might also like