0% found this document useful (0 votes)

40 views30 pages

Chap 4

This document provides an overview of the processor chapter. It discusses CPU performance factors like instruction count and cycle time. It then examines two implementations of a MIPS processor - a simplified version and a more realistic pipelined version. It describes the instruction execution model and how the processor fetches instructions from memory and reads register operands. It also discusses control vs data signals, logic design basics like combinational and sequential elements, and clocking methodology.

Uploaded by

Nguyen Danh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views30 pages

Chap 4

Uploaded by

Nguyen Danh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 30

Chapter 4 — The Processor October 9, 2022

Computer Architecture
Chapter 4: The Processor

Adapted from Computer Organization the Hardware/Software Interface – 5th

Computer Engineering – CSE – HCMUT

Introduction
•  CPU performance factors
–  Instruction count
•  Determined by ISA and compiler
–  CPI and Cycle time
•  Determined by CPU hardware
•  We will examine two MIPS implementations
–  A simplified version
–  A more realistic pipelined version
•  Simple instruction subset, shows most aspects
–  Memory reference: lw, sw
–  Arithmetic/logical: add, sub, and, or, slt
–  Control transfer: beq, j
Chapter 4 - Processor 2

University of Technology – VNUHCM 1

Chapter 4 — The Processor October 9, 2022

Instruction Execution
•  PC → instruction memory, fetch instruction
•  Register numbers → register file, read registers
•  Depending on instruction class
–  Use ALU to calculate
•  Arithmetic result
•  Memory address for load/store
•  Branch condition (comparison)
–  Access data memory for load/store
–  PC ← target address or PC + 4

Chapter 4 - Processor 3

CPU Overview

(số hiệu
thanh ghi)

bộ nhớ lệnh

ALU có thể cộng trừ, nhân chia

Chapter 4 - Processor 4

input and output relationship

University of Technology – VNUHCM 2

Chapter 4 — The Processor October 9, 2022

Execution Model
•  Instruction fetch: PC → instruction address
•  Instruction decode: register operands → register
file
•  Instruction execute:
–  Load/store: compute a memory address
–  Arithmetic: compute an arithmetic result
•  Write back:
–  Load/store: store a value to a register or a memory
location
–  Arithmetic: store a result of register file
Chapter 4 - Processor 5

Multiplexers
n  Can’t just join
wires together
n  Use multiplexers
hai đường tín hiệu
đấu với nhau một
cách rất quái
=> việc xác định
trạng thái kết hợp là
bất khả thi

Chapter 4 - Processor 6

University of Technology – VNUHCM 3

Chapter 4 — The Processor October 9, 2022

Multiplexer

𝐶=𝐴𝑆 +𝐵𝑆

Chapter 4 - Processor 7

Control vs. Data signals

•  Control signal: used for •  Data signal: contains
multiplexer selection or information that is
for directing the operated on by a
operation of a functional unit
functional unit

Chapter 4 - Processor 8

University of Technology – VNUHCM 4

Chapter 4 — The Processor October 9, 2022

Control

Chapter 4 - Processor 9

Logic Design Basics

•  Information encoded in binary
–  Low voltage = 0, High voltage = 1
–  One wire per bit
–  Multi-bit data encoded on multi-wire buses
•  Combinational element
–  Operate on data
–  Output is a function of input
•  State (sequential) elements
–  Store information
Chapter 4 - Processor 10

University of Technology – VNUHCM 5

Chapter 4 — The Processor October 9, 2022

Combinational Elements
•  AND-gate •  Adder
–  Y = A & B –  Y = A + B
A A
Y + Y
B
B

•  Multiplexer •  Arithmetic/Logic Unit

–  Y = S ? I1 : I0 –  Y = F(A,B)
A
I0 M
u Y ALU Y
I1 x
B
S F
Chapter 4 - Processor 11

Sequential Elements
•  Register: stores data in a circuit
–  Uses a clock signal to determine when to update
the stored value
–  Edge-triggered: update when Clk changes from 0
to 1
Clk
D Q
D

Clk
Q

Chapter 4 - Processor 12

University of Technology – VNUHCM 6

Chapter 4 — The Processor October 9, 2022

Sequential Elements
•  Register with write control
–  Only updates on clock edge when write control
input is 1
–  Used when stored value is required later
Clk

D Q Write

Write D
Clk
Q

Chapter 4 - Processor 13

Clocking Methodology
•  Combinational logic transforms data during
clock cycles
–  Between clock edges
–  Input from state elements (a memory or a
register), output to state element
–  Longest delay determines clock period

Chapter 4 - Processor 14

University of Technology – VNUHCM 7

Chapter 4 — The Processor October 9, 2022

Building a Datapath
•  Datapath
–  Elements that process data and addresses
in the CPU
•  Registers, ALUs, mux’s, memories, …
•  We will build a MIPS datapath incrementally
–  Refining the overview design

Chapter 4 - Processor 15

Instruction Fetch

Increment by
4 for next
32-bit instruction
register

Chapter 4 - Processor 16

University of Technology – VNUHCM 8

Chapter 4 — The Processor October 9, 2022

R-Format Instructions
•  Ex. add, sub, and, or, slt
•  Read two register operands
•  Perform arithmetic/logical operation
•  Write register result

Chapter 4 - Processor 17

Load/Store Instructions
•  Read register operands
•  Calculate address using 16-bit offset
–  Use ALU, but sign-extend offset
•  Load: Read memory and update register
•  Store: Write register value to memory

Chapter 4 - Processor 18

University of Technology – VNUHCM 9

Chapter 4 — The Processor October 9, 2022

Branch Instructions
•  Branch taken: condition is satisfied and the
Program Counter (PC) register becomes the
branch target
•  Branch not taken: PC becomes the address of
the next instruction
•  Datapath:
–  Compute the branch target
–  Compare the registers

Chapter 4 - Processor 19

Branch Instructions
•  Read register operands
•  Compare operands
–  Use ALU, subtract and check Zero output
•  Calculate target address
–  Sign-extend displacement
–  Shift left 2 places (word displacement)
–  Add to PC + 4
•  Already calculated by instruction fetch

Chapter 4 - Processor 20

University of Technology – VNUHCM 10

Chapter 4 — The Processor October 9, 2022

Branch Instructions
Just
re-routes
wires

Sign-bit wire
replicated
Chapter 4 - Processor 21

Composing the Elements

một lệnh một chu kỳ
•  First-cut datapath does an instruction in one
clock cycle
–  Each datapath element can only do one function at a
time
–  Hence, we need separate instruction and data
memories
•  Use multiplexers where alternate data sources
are used for different instructions
–  ALU input: register value/immediate
–  Register file write data: results from the ALU/Memory

Chapter 4 - Processor 22

University of Technology – VNUHCM 11

Chapter 4 — The Processor October 9, 2022

R-Type/Load/Store Datapath
thiếu lệnh BRANCH

Chapter 4 - Processor 23

Full Datapath

Chapter 4 - Processor 24

University of Technology – VNUHCM 12

Chapter 4 — The Processor October 9, 2022

ALU Control
•  ALU used for
–  Load/Store: F = add
–  Branch: F = subtract
–  R-type: F depends on funct field
ALU control Function
0000 AND
0001 OR
0010 add
0110 subtract
0111 set-on-less-than
1100 NOR
Chapter 4 - Processor 25

ALU Control
•  Assume 2-bit ALUOp derived from opcode
–  Combinational logic derives ALU control
opcode ALUOp Operation funct ALU function ALU control
lw 00 load word XXXXXX add 0010
sw 00 store word XXXXXX add 0010
beq 01 branch equal XXXXXX subtract 0110
R-type 10 add 100000 add 0010
subtract 100010 subtract 0110
AND 100100 AND 0000
OR 100101 OR 0001
set-on-less-than 101010 set-on-less-than 0111

Chapter 4 - Processor 26

University of Technology – VNUHCM 13

Chapter 4 — The Processor October 9, 2022

Multiple Levels of Decoding

•  Instruction opcode + function => ALUOp =>
ALU control lines
–  Smaller main controller
–  Faster main controller (potentially)

Chapter 4 - Processor 27

The Main Control Unit

•  Control signals derived from instruction
R-type 0 rs rt rd shamt funct
31:26 25:21 20:16 15:11 10:6 5:0

Load/
35 or 43 rs rt address
Store
31:26 25:21 20:16 15:0

Branch 4 rs rt address
31:26 25:21 20:16 15:0

opcode always read, write for sign-extend

read except R-type and add
for load and load
Chapter 4 - Processor 28

University of Technology – VNUHCM 14

Chapter 4 — The Processor October 9, 2022

Datapath with Instruction Fields

Chapter 4 - Processor 29

Datapath With Control

University of Technology – VNUHCM 15

Chapter 4 — The Processor October 9, 2022

R-Type Instruction

add $t1, $t2, $t3

Load Instruction

lw $t1, offset($t2)

University of Technology – VNUHCM 16

Chapter 4 — The Processor October 9, 2022

Branch-on-Equal Instruction

beq $t1, $t2, offset

Implementing Jumps
•  Jump uses word address
•  Update PC with concatenation of
–  Top 4 bits of old PC
–  26-bit jump address
–  00
•  Need an extra control signal decoded from
opcode
Jump 2 address
31:26 25:0
Chapter 4 - Processor 34

University of Technology – VNUHCM 17

Chapter 4 — The Processor October 9, 2022

Datapath With Jumps Added

Chapter 4 - Processor 35

Performance Issues
•  Longest delay determines clock period
–  Critical path: load instruction
–  Instruction memory → register file → ALU → data
memory → register file
•  Not feasible to vary period for different
instructions
•  Violates design principle
–  Making the common case fast
•  We will improve performance by pipelining
Chapter 4 - Processor 36

University of Technology – VNUHCM 18

Chapter 4 — The Processor October 9, 2022

Pipelining Analogy
•  Pipelined laundry: overlapping execution
–  Parallelism improves performance
n  Four loads:
n  Speedup
= 8/3.5 = 2.3
n  Non-stop:
n  Speedup
= 2n/0.5n + 1.5 ≈ 4
= number of stages

Chapter 4 - Processor 37

MIPS Pipeline
•  Five stages, one step per stage
1.  IF: Instruction fetch from memory
2.  ID: Instruction decode & register read
3.  EX: Execute operation or calculate address
4.  MEM: Access memory operand
5.  WB: Write result back to register

Chapter 4 - Processor 38

University of Technology – VNUHCM 19

Chapter 4 — The Processor October 9, 2022

Pipeline Performance
•  Assume time for stages is
–  100ps for register read or write
–  200ps for other stages
•  Compare pipelined datapath with single-cycle
datapath
Instr Instr fetch Register ALU op Memory Register Total time
read access write
lw 200ps 100 ps 200ps 200ps 100 ps 800ps
sw 200ps 100 ps 200ps 200ps 700ps
R-format 200ps 100 ps 200ps 100 ps 600ps
beq 200ps 100 ps 200ps 500ps

Chapter 4 - Processor 39

Pipeline Performance
Single-cycle (Tc= 800ps)

Pipelined (Tc= 200ps)

Chapter 4 - Processor 40

University of Technology – VNUHCM 20

Chapter 4 — The Processor October 9, 2022

Pipeline Speedup
•  If all stages are balanced
–  i.e., all take the same time

•  If not balanced, speedup is less

•  Speedup due to increased throughput
–  Latency (time for each instruction) does not
decrease
Chapter 4 - Processor 41

Pipelining and ISA Design

•  MIPS ISA designed for pipelining
–  All instructions are 32-bits
•  Easier to fetch and decode in one cycle
•  c.f. x86: 1- to 17-byte instructions
–  Few and regular instruction formats
•  Can decode and read registers in one step
–  Load/store addressing
•  Can calculate address in 3rd stage, access memory in 4th
stage
–  Alignment of memory operands
•  Memory access takes only one cycle

Chapter 4 - Processor 42

University of Technology – VNUHCM 21

Chapter 4 — The Processor October 9, 2022

Multi-Cycle Pipeline Diagram

•  Form showing resource usage

Chapter 4 - Processor 43

Multi-Cycle Pipeline Diagram

•  Traditional form

Chapter 4 - Processor 44

University of Technology – VNUHCM 22

Chapter 4 — The Processor October 9, 2022

Hazards
•  Situations that prevent starting the next
instruction in the next cycle
•  Structure hazards
–  A required resource is busy
•  Data hazard
–  Need to wait for previous instruction to complete
its data read/write
•  Control hazard
–  Deciding on control action depends on previous
instruction
Chapter 4 - Processor 45

Structure Hazards
•  Conflict for use of a resource
•  In MIPS pipeline with a single memory
–  Load/store requires data access
–  Instruction fetch would have to stall for that cycle
•  Would cause a pipeline “bubble”
•  Hence, pipelined datapaths require separate
instruction/data memories
–  Or separate instruction/data caches

Chapter 4 - Processor 46

University of Technology – VNUHCM 23

Chapter 4 — The Processor October 9, 2022

Data Hazards
•  An instruction depends on completion of data
access by a previous instruction
–  add $s0, $t0, $t1
sub $t2, $s0, $t3

Chapter 4 - Processor 47

Forwarding (aka Bypassing)

•  Use result when it is computed
–  Don’t wait for it to be stored in a register
–  Requires extra connections in the datapath

Chapter 4 - Processor 48

University of Technology – VNUHCM 24

Chapter 4 — The Processor October 9, 2022

Load-Use Data Hazard

•  Can’t always avoid stalls by forwarding
–  If value not computed when needed
–  Can’t forward backward in time!

Chapter 4 - Processor 49

Code Scheduling to Avoid Stalls

•  Reorder code to avoid use of load result in the
next instruction
•  C code for A = B + E; C = B + F;
lw $t1, 0($t0) lw $t1, 0($t0)
lw $t2, 4($t0) lw $t2, 4($t0)
stall add $t3, $t1, $t2 lw $t4, 8($t0)
sw $t3, 12($t0) add $t3, $t1, $t2
lw $t4, 8($t0) sw $t3, 12($t0)
stall add $t5, $t1, $t4 add $t5, $t1, $t4
sw $t5, 16($t0) sw $t5, 16($t0)
13 cycles 11 cycles
Chapter 4 - Processor 50

University of Technology – VNUHCM 25

Chapter 4 — The Processor October 9, 2022

Control Hazards
•  Branch determines flow of control
–  Fetching next instruction depends on branch
outcome
–  Pipeline can’t always fetch correct instruction
•  Still working on ID stage of branch
•  In MIPS pipeline
–  Need to compare registers and compute target
early in the pipeline
–  Add hardware to do it in ID stage

Chapter 4 - Processor 51

Stall on Branch
•  Wait until branch outcome determined before
fetching next instruction

Chapter 4 - Processor 52

University of Technology – VNUHCM 26

Chapter 4 — The Processor October 9, 2022

Branch Prediction
•  Longer pipelines can’t readily determine
branch outcome early
–  Stall penalty becomes unacceptable
•  Predict outcome of branch
–  Only stall if prediction is wrong
•  In MIPS pipeline
–  Can predict branches not taken
–  Fetch instruction after branch, with no delay

Chapter 4 - Processor 53

MIPS with Predict Not Taken

Prediction
correct

Prediction
incorrect

Chapter 4 - Processor 54

University of Technology – VNUHCM 27

Chapter 4 — The Processor October 9, 2022

More-Realistic Branch Prediction

•  Static branch prediction
–  Based on typical branch behavior
–  Example: loop and if-statement branches
•  Predict backward branches taken
•  Predict forward branches not taken
•  Dynamic branch prediction
–  Hardware measures actual branch behavior
•  e.g., record recent history of each branch
–  Assume future behavior will continue the trend
•  When wrong, stall while re-fetching, and update history

Chapter 4 - Processor 55

Dynamic Branch Prediction

•  In deeper and superscalar pipelines, branch penalty
is more significant
•  Use dynamic prediction
–  Branch prediction buffer (aka branch history table)
–  Indexed by recent branch instruction addresses
–  Stores outcome (taken/not taken)
–  To execute a branch
•  Check table, expect the same outcome
•  Start fetching from fall-through or target
•  If wrong, flush pipeline and flip prediction

Chapter 4 - Processor 100

University of Technology – VNUHCM 28

Chapter 4 — The Processor October 9, 2022

1-Bit Predictor: Shortcoming

•  Inner loop branches mispredicted twice!
outer: …
…
inner: …
…
beq …, …, inner
…
beq …, …, outer

n  Mispredict as taken on last iteration of

inner loop
n  Then mispredict as not taken on first

iteration of outer loop next time around

Chapter 4 - Processor 101

2-Bit Predictor
•  Only change prediction on two successive
mispredictions

Chapter 4 - Processor 102

University of Technology – VNUHCM 29

Chapter 4 — The Processor October 9, 2022

Calculating the Branch Target

•  Even with predictor, still need to calculate the
target address
–  1-cycle penalty for a taken branch
•  Branch target buffer
–  Cache of target addresses
–  Indexed by PC when instruction fetched
•  If hit and instruction is branch predicted taken, can
fetch target immediately

Chapter 4 - Processor 103

Pipeline Summary
The BIG Picture
•  Pipelining improves performance by
increasing instruction throughput
–  Executes multiple instructions in parallel
–  Each instruction has the same latency
•  Subject to hazards
–  Structure, data, control
•  Instruction set design affects complexity of
pipeline implementation
Chapter 4 - Processor 104

University of Technology – VNUHCM 30

SAP HR-Set Up Personnel Action - Configuration Steps
100% (2)
SAP HR-Set Up Personnel Action - Configuration Steps
16 pages
Chapter 04 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
71% (7)
Chapter 04 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
137 pages
Bulbapedia Walkthrough Leaf Green
No ratings yet
Bulbapedia Walkthrough Leaf Green
9 pages
COA 2013 Chapter 4 The Processor
No ratings yet
COA 2013 Chapter 4 The Processor
153 pages
Computer Architecture: Chapter 4: The Processor Part 1
No ratings yet
Computer Architecture: Chapter 4: The Processor Part 1
51 pages
Chapter4 Part1
No ratings yet
Chapter4 Part1
51 pages
Chapter 04MHE Kabir
No ratings yet
Chapter 04MHE Kabir
171 pages
Lecture 12
No ratings yet
Lecture 12
29 pages
Chapter 04.Ppt - Chapter 04
No ratings yet
Chapter 04.Ppt - Chapter 04
182 pages
The Processor: The Hardware/Software Interface 5
No ratings yet
The Processor: The Hardware/Software Interface 5
149 pages
Processor PDF
No ratings yet
Processor PDF
98 pages
Patterson6e MIPS Ch04 PPT
No ratings yet
Patterson6e MIPS Ch04 PPT
137 pages
Patterson6e MIPS Ch04
No ratings yet
Patterson6e MIPS Ch04
137 pages
Chapter - 04 Mips Assembly Data Path
No ratings yet
Chapter - 04 Mips Assembly Data Path
137 pages
Chap 4 1
No ratings yet
Chap 4 1
57 pages
Chapter 4 The Processor
100% (1)
Chapter 4 The Processor
131 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
131 pages
The Processor: Dr. Randa Mohamed
No ratings yet
The Processor: Dr. Randa Mohamed
34 pages
Chapter 04 Computer Architecture and D
No ratings yet
Chapter 04 Computer Architecture and D
95 pages
Chap 4 1
No ratings yet
Chap 4 1
57 pages
Chapter 04
No ratings yet
Chapter 04
169 pages
Chapter 04 RISC V
No ratings yet
Chapter 04 RISC V
130 pages
Chapter 04
No ratings yet
Chapter 04
131 pages
The Processor: Omputer Rganization and Esign
No ratings yet
The Processor: Omputer Rganization and Esign
135 pages
The Processor: CPU Performance Factors
No ratings yet
The Processor: CPU Performance Factors
66 pages
Lec7 ch4 Part2
No ratings yet
Lec7 ch4 Part2
21 pages
Comp206 Lecture7
No ratings yet
Comp206 Lecture7
44 pages
The Processor: Computer Organization and Design
No ratings yet
The Processor: Computer Organization and Design
162 pages
Chapter4 Processor PDF
No ratings yet
Chapter4 Processor PDF
149 pages
Comp206 Inclass8
No ratings yet
Comp206 Inclass8
20 pages
Lec 1
No ratings yet
Lec 1
21 pages
CA 06 Datapath
No ratings yet
CA 06 Datapath
26 pages
Ca Lecture 9
No ratings yet
Ca Lecture 9
26 pages
Processor: Datapath and Control
No ratings yet
Processor: Datapath and Control
47 pages
Chapter 4 Part 1
No ratings yet
Chapter 4 Part 1
31 pages
Chapter ..
No ratings yet
Chapter ..
130 pages
Microlecture 8
No ratings yet
Microlecture 8
19 pages
Lec 2
No ratings yet
Lec 2
28 pages
CO chpt-4
No ratings yet
CO chpt-4
161 pages
4 The Processors
No ratings yet
4 The Processors
112 pages
Unit 3
No ratings yet
Unit 3
20 pages
Micro 20
No ratings yet
Micro 20
11 pages
Computer Architecture Chapter 4: The Processor Part 3: Dr. Phạm Quốc Cường
No ratings yet
Computer Architecture Chapter 4: The Processor Part 3: Dr. Phạm Quốc Cường
23 pages
Lecture 4.1 - The Processor
No ratings yet
Lecture 4.1 - The Processor
29 pages
DPCO Chapter 4
No ratings yet
DPCO Chapter 4
33 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
CS3351 DPCO UNIT 4 Notes
No ratings yet
CS3351 DPCO UNIT 4 Notes
47 pages
4.4 Pipelining
No ratings yet
4.4 Pipelining
39 pages
Ca Unit 3 Prabu
100% (1)
Ca Unit 3 Prabu
24 pages
CH4 External and Internal Architecture Mips
No ratings yet
CH4 External and Internal Architecture Mips
9 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
Ldco Unit 4 Notes
No ratings yet
Ldco Unit 4 Notes
38 pages
Kiến Trúc Máy Tính CS2009: Khoa Khoa học và Kỹ thuật Máy tính BM Kỹ thuật Máy tính Võ Tấn Phương
No ratings yet
Kiến Trúc Máy Tính CS2009: Khoa Khoa học và Kỹ thuật Máy tính BM Kỹ thuật Máy tính Võ Tấn Phương
134 pages
Chapter 5: Processing Unit
No ratings yet
Chapter 5: Processing Unit
18 pages
Lecture08 RISCV Impl 2
No ratings yet
Lecture08 RISCV Impl 2
55 pages
Ch#4 Part 1, 2,34
No ratings yet
Ch#4 Part 1, 2,34
70 pages
CA Chap4 Cpu Nlt2021 Part1
No ratings yet
CA Chap4 Cpu Nlt2021 Part1
81 pages
CA Chap4 CPU
No ratings yet
CA Chap4 CPU
83 pages
CA Chap4 CPU NLT2020
No ratings yet
CA Chap4 CPU NLT2020
82 pages
Chapter4 Pipelining END FA11
No ratings yet
Chapter4 Pipelining END FA11
84 pages
Unit III
No ratings yet
Unit III
43 pages
Accelerated Computing With HIP: Second Edition
From Everand
Accelerated Computing With HIP: Second Edition
Yifan Sun
No ratings yet
gsp5 Tools Menus and How-To List
No ratings yet
gsp5 Tools Menus and How-To List
3 pages
01-Software Project Management
No ratings yet
01-Software Project Management
12 pages
ASTM Laboratory Information Management Systems Rte1nzg
No ratings yet
ASTM Laboratory Information Management Systems Rte1nzg
27 pages
General Register Organization Stack Organization Addressing Modes Data Transfer and Manipulation Program Control Risc and Cisc
No ratings yet
General Register Organization Stack Organization Addressing Modes Data Transfer and Manipulation Program Control Risc and Cisc
46 pages
Kodiaq Accessories 2018 en
No ratings yet
Kodiaq Accessories 2018 en
33 pages
Lovish's Resume Btech Chem
No ratings yet
Lovish's Resume Btech Chem
3 pages
Microsoft Windows Shortcut Keys List: Advertisement
No ratings yet
Microsoft Windows Shortcut Keys List: Advertisement
5 pages
Evaluating The Computational Efficiency and Precision of Pathfinding Algorithms
No ratings yet
Evaluating The Computational Efficiency and Precision of Pathfinding Algorithms
8 pages
The 67 Steps by Tai Lopez
No ratings yet
The 67 Steps by Tai Lopez
7 pages
Revit 2026 Shortcuts
No ratings yet
Revit 2026 Shortcuts
3 pages
Project Planning and Scheduling
No ratings yet
Project Planning and Scheduling
4 pages
DBMS Mini Project Review 1
No ratings yet
DBMS Mini Project Review 1
8 pages
Plate Theory Lecture Notes
No ratings yet
Plate Theory Lecture Notes
82 pages
AutoCAD 2012 DGN Hotfix Readme0
No ratings yet
AutoCAD 2012 DGN Hotfix Readme0
3 pages
Week 02 Theory Slides-S
No ratings yet
Week 02 Theory Slides-S
34 pages
PrivacyIssuesofPublicWi FiNetworks
No ratings yet
PrivacyIssuesofPublicWi FiNetworks
11 pages
PDF Template Generator A Complete Guide
No ratings yet
PDF Template Generator A Complete Guide
37 pages
FitSM Sample Service Portfolio Catalogue v2.0
No ratings yet
FitSM Sample Service Portfolio Catalogue v2.0
9 pages
Transferring Phones
No ratings yet
Transferring Phones
1 page
Mbistarchitect Process Guide: Software Version 2017.3 September 2017
No ratings yet
Mbistarchitect Process Guide: Software Version 2017.3 September 2017
350 pages
CSBS R23 II Year Course Structure and Syllabus
No ratings yet
CSBS R23 II Year Course Structure and Syllabus
52 pages
Tutorial Crack Wep Encryption
No ratings yet
Tutorial Crack Wep Encryption
6 pages
Delta4000 Ds en
0% (1)
Delta4000 Ds en
4 pages
Narrative Report
No ratings yet
Narrative Report
12 pages
Electra Rc-3 Remote Control Dip Switch
No ratings yet
Electra Rc-3 Remote Control Dip Switch
3 pages
The Role of Change Management in Successful Information Management Solutions
No ratings yet
The Role of Change Management in Successful Information Management Solutions
10 pages
Basic Python Book PDF
No ratings yet
Basic Python Book PDF
41 pages
It App Notes
No ratings yet
It App Notes
20 pages

Chap 4

Uploaded by

Chap 4

Uploaded by

Chapter 4 — The Processor October 9, 2022

Adapted from Computer Organization the Hardware/Software Interface – 5th

Computer Engineering – CSE – HCMUT

University of Technology – VNUHCM 1

ALU có thể cộng trừ, nhân chia

input and output relationship

University of Technology – VNUHCM 2

University of Technology – VNUHCM 3

Control vs. Data signals

University of Technology – VNUHCM 4

Logic Design Basics

University of Technology – VNUHCM 5

• Multiplexer • Arithmetic/Logic Unit

University of Technology – VNUHCM 6

University of Technology – VNUHCM 7

University of Technology – VNUHCM 8

University of Technology – VNUHCM 9

University of Technology – VNUHCM 10

Composing the Elements

University of Technology – VNUHCM 11

University of Technology – VNUHCM 12

University of Technology – VNUHCM 13

Multiple Levels of Decoding

The Main Control Unit

opcode always read, write for sign-extend

University of Technology – VNUHCM 14

Datapath with Instruction Fields

Datapath With Control

University of Technology – VNUHCM 15

add $t1, $t2, $t3

University of Technology – VNUHCM 16

beq $t1, $t2, offset

University of Technology – VNUHCM 17

Datapath With Jumps Added

University of Technology – VNUHCM 18

University of Technology – VNUHCM 19

Pipelined (Tc= 200ps)

University of Technology – VNUHCM 20

• If not balanced, speedup is less

Pipelining and ISA Design

University of Technology – VNUHCM 21

Multi-Cycle Pipeline Diagram

Multi-Cycle Pipeline Diagram

University of Technology – VNUHCM 22

University of Technology – VNUHCM 23

Forwarding (aka Bypassing)

University of Technology – VNUHCM 24

Load-Use Data Hazard

Code Scheduling to Avoid Stalls

University of Technology – VNUHCM 25

University of Technology – VNUHCM 26

MIPS with Predict Not Taken

University of Technology – VNUHCM 27

More-Realistic Branch Prediction

Dynamic Branch Prediction

Chapter 4 - Processor 100

University of Technology – VNUHCM 28

1-Bit Predictor: Shortcoming

n Mispredict as taken on last iteration of

iteration of outer loop next time around

Chapter 4 - Processor 102

University of Technology – VNUHCM 29

Calculating the Branch Target

Chapter 4 - Processor 103

University of Technology – VNUHCM 30

You might also like

•  Multiplexer •  Arithmetic/Logic Unit

•  If not balanced, speedup is less

n  Mispredict as taken on last iteration of