Table of Contents
• Processor Architecture
• How the CPU Works
• Pipelining
2
Architecture
In the context of computers, architecture has many meanings
Instruction Set Architecture (ISA)
The parts of a processor design that one needs to understand in order to write assembly/machine code
e.g. instruction set specification, registers
Example ISAs:
Intel: x86, IA32, x86-64, Itanium
ARM
Microarchitecture (or Computer Organization)
Implementation of the architecture
e.g. cache size, core frequency
Platform Architecture (or System Design)
Memory and I/O buses
Memory controllers
Direct memory access
3
Components of the Computer
Since 1946, all computers have had 4 components:
[Figure: the 4 components of a computer]
4
What is the CPU?
CPU is the brain of the computer system
The computer works by executing a program containing many instructions
Program
Sequence of instructions that perform a task
Think of executing a program like playing music
Instruction
A binary code representing the simplest operation performed by the processor
Think of an individual instruction as a note coming from a musical instrument
Code forms:
Machine code: byte-level program that a processor executes
Assembly code: text representation of machine code
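As a hedged illustration of the relationship between the two forms, the bytes below are the IA-32 machine code for the assembly instruction mov eax, 1; the array name is only for illustration.

/* The same instruction in machine code and assembly (IA-32):      */
/*   "mov eax, 1"  ->  B8 01 00 00 00                              */
/* (opcode B8 = MOV EAX, imm32, followed by the 32-bit immediate 1 */
/*  in little-endian byte order)                                   */
unsigned char machine_code[] = { 0xB8, 0x01, 0x00, 0x00, 0x00 };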
5
Memory
k × m array of stored bits (k is usually 2^n)
Address: unique (n-bit) identifier of a location
Contents: m-bit value stored at that location
Basic operations (see the sketch below):
LOAD: read a value from a memory location
STORE: write a value to a memory location
Basic memory types:
RAM: read-and-write
ROM: read-only
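A minimal C sketch of this model, assuming for illustration n = 16 address bits (k = 2^16 locations) and m = 8 bits per location; mem, load and store are hypothetical names, not part of any real memory API.

#include <stdint.h>

#define N_BITS 16                    /* address width n          */
#define K (1u << N_BITS)             /* k = 2^n locations        */

static uint8_t mem[K];               /* m = 8 bits per location  */

/* LOAD: read the m-bit value stored at an n-bit address */
uint8_t load(uint16_t address) { return mem[address]; }

/* STORE: write an m-bit value to an n-bit address */
void store(uint16_t address, uint8_t value) { mem[address] = value; }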
6
Buses
Most processors use the three-bus architecture.
Address bus
Carries the address of memory or I/O device for a data transfer. Determines the addressing range.
Unidirectional: always acts as an output of the CPU
Data bus
Carries data to be transferred between processor and memory or I/O.
Bidirectional: set as input when reading data, and output when writing data
Control bus
Carries status and control signals required for various operations
An assortment of signals, anything not address or data
e.g. R/W*, IO/M*, Interrupt, DMA
7
Address Bus
The address bus contains the address of the memory location or I/O device selected for a data transfer.
Address bus width determines the addressing range.
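For example, a 16-bit address bus can select 2^16 = 65,536 locations, while a 24-bit address bus can select 2^24 = 16,777,216 locations.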
8
Data Bus
Width of the data bus determines the amount of data transferable in one step
Most microcontrollers have 8-bit data buses
Can transfer 1 byte at any one time
A 32-bit word requires 4 transfers (as sketched below)
ARM has a 32-bit data bus
Can transfer 4 bytes at once
Some chips have an external bus with a selectable width of 8, 16 or 32 bits
Selecting a smaller data bus width lowers performance but enables interfacing to lower-cost memory devices
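A minimal sketch of moving a 32-bit word over an 8-bit data bus, assuming a hypothetical write_byte_to_bus() helper that performs a single 8-bit bus transfer; little-endian byte order is chosen purely for illustration.

#include <stdint.h>

void write_byte_to_bus(uint32_t address, uint8_t byte);   /* hypothetical: one 8-bit transfer */

/* Writing a 32-bit word over an 8-bit data bus takes four transfers. */
void write_word_over_8bit_bus(uint32_t address, uint32_t word)
{
    for (int i = 0; i < 4; i++)
        write_byte_to_bus(address + i, (uint8_t)(word >> (8 * i)));   /* little-endian order */
}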
9
Address and Data Buses
[Figure: memory map of an address space starting at $000000, giving 2^24 addressable locations]
10
Inside the CPU: Memory-Facing Registers
Registers store binary data. The following registers interface with memory: the Program Counter (PC), Instruction Register (IR), Address Register, and Data Register.
11
Inside the CPU: Internal Components
These components do not have direct access to memory: the instruction decoder, Control Unit (CU), ALU, and Accumulator (ACC).
12
ALU size
The ALU operates on several bits simultaneously
“The size of the processor”
Usually (but not always) determines data bus size
Typical sizes:
4 bits (remote controllers etc)
8 bits (microcontrollers: 68HC05, 8051, PIC)
16 bits (low-end microprocessors: Intel 8086)
32 bits (most popular size today: ARM, MIPS)
64 bits (servers: IBM POWER, Intel Xeon)
13
Table of Contents
• Processor Architecture
• How the CPU Works
• Pipelining
14
How does the CPU Work?
15
Inside the CPU: An Example Program

C version:
void main(void)
{
    int a = 1;
    int b = 2;
    int c;
    c = a + b;
}

Assembly version:
Address   Assembly
-------   -----------
0x1000    LOAD  0x2000
0x1002    ADD   0x2002
0x1004    STORE 0x2004

Explanation:
LOAD 0x2000    Load the value of a into the Data Register.
ADD 0x2002     Add the previously loaded value of a and the newly loaded value of b, and save the sum in ACC.
STORE 0x2004   Save the added result to the address of c.
16
Executing LOAD instruction
1. The address of the instruction the CPU wants to execute, 0x1000, is in the PC.
2. 0x1000 is put in the Address Register.
3. When 0x1000 enters the Address Register, it automatically accesses location 0x1000 of the memory.
4. The instruction there is read from memory.
5. The instruction is stored in the IR (LOAD 0x2000).
6. The instruction goes into the decoder. At the same time the PC is incremented.
7. The decoder interprets the instruction. The CU understands that it must fetch the value at address 0x2000.
8. The CU generates control signals to read the value at 0x2000 from memory.
9. The value 1 is entered into the Data Register by the control signals generated by the CU.
10. The value in the Data Register is available to any circuit that needs it.
11. Since this value may be operated on by the ALU, it is temporarily stored in the ACC.
17
Executing ADD instruction
1. As with LOAD, the address of the instruction to execute is 0x1002, to which the PC has already been incremented.
2. 0x1002 is put in the Address Register.
3. Address 0x1002 is accessed.
4. The value at 0x1002 is made available.
5. This value is stored in the IR.
6. The value in the IR is sent to the decoder. At the same time the PC is incremented.
7. The decoder interprets the value in the IR. The CU understands that it must add the value at address 0x2002.
8. Based on the decoder's interpretation, the CU generates control signals to read the value at 0x2002. The ALU is given a control signal to add.
9. The data from 0x2002 is loaded and saved in the Data Register.
10. The ALU adds the data in the Data Register to the current value in the ACC.
11. The sum replaces the old value in the ACC.
18
Executing STORE instruction
1. The address of the instruction to be executed, 0x1004, is the PC value.
2. The value in the PC is transferred to the Address Register.
3. Location 0x1004 is accessed.
4. The value from 0x1004 is made available to the CPU.
5. This value is saved in the IR.
6. The value in the IR is made available to the decoder, and the PC is incremented.
7. The decoder interprets the value in the IR.
8. Based on the decoder's interpretation, the CU generates control signals to store the value in the ACC at 0x2004.
9. The output of the ALU is stored in the ACC.
10. Finally, the value in the ACC is stored in location 0x2004.
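The three walkthroughs above can be condensed into a minimal C sketch of the fetch-decode-execute loop for this accumulator machine. The opcode values, the encoding of an instruction as an opcode/operand pair, and the variable names are simplified assumptions for illustration, not the encoding of any real processor.

#include <stdint.h>
#include <stdio.h>

enum { LOAD, ADD, STORE };                         /* assumed opcodes                */
typedef struct { int op; uint16_t operand; } Instr;

int main(void)
{
    static uint8_t mem[0x3000];                    /* data memory                    */
    Instr program[] = {                            /* the example program            */
        { LOAD,  0x2000 },                         /* ACC <- mem[0x2000]       (a)   */
        { ADD,   0x2002 },                         /* ACC <- ACC + mem[0x2002] (b)   */
        { STORE, 0x2004 },                         /* mem[0x2004] <- ACC       (c)   */
    };
    mem[0x2000] = 1;                               /* a = 1                          */
    mem[0x2002] = 2;                               /* b = 2                          */

    uint16_t pc = 0;                               /* program counter (index)        */
    uint8_t acc = 0;                               /* accumulator                    */
    while (pc < 3) {
        Instr ir = program[pc++];                  /* fetch into IR, increment PC    */
        switch (ir.op) {                           /* decode, then execute           */
        case LOAD:  acc = mem[ir.operand];  break;
        case ADD:   acc += mem[ir.operand]; break;
        case STORE: mem[ir.operand] = acc;  break;
        }
    }
    printf("c = %d\n", mem[0x2004]);               /* prints: c = 3                  */
    return 0;
}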
19
Table of Contents
• Processor Architecture
• How the CPU Works
• Pipelining
20
Idea of the Pipeline 1/3
21
Idea of the Pipeline 2/3
With pipelining, different stages (fetch, decode, execute) of successive instructions can be processed at the same time.
Pipelining is used even in the smallest $2 microcontrollers.
For our short program, while LOAD 0x2000 is actually being executed, ADD 0x2002 is being decoded and STORE 0x2004 is being fetched from memory.
22
Idea of the Pipeline 3/3
The 3-stage pipeline of the famous ARM7.
Each cell of the diagram is 1 clock cycle; the pipeline fills from the first cycle to the third cycle.
In the third cycle the first opcode is executed, the second opcode is decoded, and the third opcode is fetched, all at once.
Executing fetch-decode-execute without pipelining would take 3 × 3 = 9 cycles to execute 3 opcodes.
With a pipeline, it takes only 3 + (3 − 1) = 5 cycles to execute 3 opcodes.
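A sketch of that 3-stage timing for the example program (one column per clock cycle; this reconstructs the idea of the ARM7 diagram rather than its exact figure):

Cycle:     1      2      3      4      5
Fetch:     LOAD   ADD    STORE
Decode:           LOAD   ADD    STORE
Execute:                 LOAD   ADD    STORE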
23
A 5-stage Pipeline
The instruction execution steps can be refined to increase the number of pipeline stages
[Figure: non-pipelined vs. pipelined execution of the refined instruction steps]
24
Pipeline Performance
Latency
Defined as the time (or #cycles) from entering the pipeline until an instruction completes
Pipelining doesn’t help latency of single task
Throughput
Defined as the number of instructions executed per time period
Potential speedup = Number of pipeline stages
Trivia
The longest pipeline on a commercial machine is 31 stages on the Intel Pentium 4.
25
Speedup
A k-stage pipeline processes n tasks in k + (n − 1) clock cycles:
k cycles for the first task and
n − 1 cycles for the remaining n − 1 tasks
Total time to process n tasks with k stages (clock period τ):
Pipelined processor: [k + (n − 1)]τ
Non-pipelined processor: nkτ
Speedup (S_k → k as n → ∞):
S_k = T_non-pipelined / T_pipelined = T_1 / T_k = nkτ / [k + (n − 1)]τ = nk / [k + (n − 1)]
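For the 3-stage pipeline and the 3-instruction program above (k = 3, n = 3), S_3 = (3 × 3) / (3 + 3 − 1) = 9/5 = 1.8; as n grows, the speedup approaches k = 3.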
26
Clocking
[Figure: adjacent pipeline stages S_i and S_i+1 separated by a latch; stage delay τ_m, latch delay d]
Latch delay: d
Clock cycle of the pipeline: τ
τ = max(τ_m) + d
Pipeline frequency: f = 1/τ
∴ Pipeline rate limited by slowest pipeline stage.
Also, each added stage adds another latch delay d
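For example (illustrative numbers only): with stage delays of 3 ns, 4 ns and 5 ns and a latch delay of 1 ns, τ = max(τ_m) + d = 5 ns + 1 ns = 6 ns, so f = 1/τ ≈ 167 MHz.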
27
Limits to Pipelining
Hazards prevent the next instruction from executing during its designated clock cycle
Structural hazards
Two instructions attempting to use the same resource at the same time
Data hazards
An instruction attempting to use data before it is available in the register file, e.g. an ADD that needs the result of the immediately preceding LOAD before that result has been written back
Control hazards
Caused by branch instructions, which invalidate instructions already in the pipeline, requiring it to be flushed and refilled
The simplest solution is to stall the pipeline until the hazard is resolved, inserting one or more "bubbles" into the pipeline
More stall cycles = lower performance
More complex solutions include branch prediction and data forwarding, which are outside the scope of this course
28
CPU Performance
Processor performance is a function of:
IC: instruction count
CPI: cycles per instruction
Clock cycle time
CPU time = Seconds / Program
         = (Instructions / Program) × (Cycles / Instruction) × (Seconds / Cycle)
         = IC × CPI × Clock cycle time
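Worked example (illustrative numbers only): a program with IC = 10^9 instructions and CPI = 1.5 on a 1 GHz clock (cycle time 1 ns) takes CPU time = 10^9 × 1.5 × 1 ns = 1.5 s.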
29