Basic Processing Unit
Overview
Instruction Set Processor (ISP)
Central Processing Unit (CPU)
A typical computing task consists of a series
of steps specified by a sequence of machine
instructions that constitute a program.
An instruction is executed by carrying out a
sequence of more rudimentary operations.
Some Fundamental
Concepts
Fundamental Concepts
Processor fetches one instruction at a time and
perform the operation specified.
Instructions are fetched from successive memory
locations until a branch or a jump instruction is
encountered.
Processor keeps track of the address of the memory
location containing the next instruction to be fetched
using Program Counter (PC).
Instruction Register (IR)
Executing an Instruction
Fetch the contents of the memory location pointed
to by the PC. The contents of this location are
loaded into the IR (fetch phase).
IR [[PC]]
Assuming that the memory is byte addressable,
increment the contents of the PC by 4 (fetch phase).
PC [PC] + 4
Carry out the actions specified by the instruction in
the IR (execution phase).
Processor Organization
Internal processor
bus
MDR HAS
TWO INPUTS
AND TWO
OUTPUTS
Control signals
PC
Instruction
Address
lines
decoder and
MAR
control logic
Memory
bus
MDR
Data
lines
IR
Y
R0
Constant 4
Select
Datapath
MUX
Add
ALU
control
lines
Sub
R ( n -
ALU
Carry-in
XOR
TEMP
Z
Fig 1 Single-bus organization of the datapath inside a processor
1 )
Executing an Instruction
Transfer a word of data from one processor
register to another or to the ALU.
Perform an arithmetic or a logic operation
and store the result in a processor register.
Fetch the contents of a given memory
location and load them into a processor
register.
Store a word of data from a processor
register into a given memory location.
Register Transfers
Internal processor
b us
R i in
Ri
R i out
Y
in
Y
Constant 4
Select
MUX
B
ALU
in
out
Fig 2 Input and output gating for the registers shown in Fig1.
Register Transfers
All operations and data transfers are controlled by the processor clock.
Bus
0
D
1
Q
Ri in
Riout
Clock
Figure
Inputand
and
output
ating
g forfor
re
gister
Fig7.3.
3. Input
output
gating
oneone
register
bit. bit.
Performing an Arithmetic or
Logic Operation
The ALU is a combinational circuit that has no
internal storage.
ALU gets the two operands from MUX and bus.
The result is temporarily stored in register Z.
What is the sequence of operations to add the
contents of register R1 to those of R2 and store the
result in R3?
1.
2.
3.
R1out, Yin
R2out, SelectY, Add, Zin
Zout, R3in
Fetching a Word from
Memory
Address into MAR; issue Read operation; data into MDR.
Memory-bus
data lines
MDRoutE
MDRout
Internal processor
bus
MDR
MDR inE
MDRin
Fig7.4.
4 Connection
and control
signalssignals
for register
MDR.MDR.
Figure
Connection
and control
forgister
re
Fetching a Word from
Memory
The response time of each memory access varies
(cache miss, memory-mapped I/O,).
To accommodate this, the processor waits until it
receives an indication that the requested operation
has been completed (Memory-Function-Completed,
MFC).
Move (R1), R2
MAR [R1]
Start a Read operation on the memory bus
Wait for the MFC response from the memory
Load MDR from the memory bus
R2 [MDR]
Step
Timing
Clock
MARin
Assume MAR
is always available
on the address lines
of the memory bus.
MAR [R1]
Address
Read
Start a Read operation on the memory bus
MR
MDRinE
Data
Wait for the MFC response from the memory
MFC
MDR out
Load MDR from the memory bus
R2 [MDR]
Figure 7.5.ofTiming
of a memory
Read operation
operation.
Fig 5 Timing
a Memory
Read
Execution of a Complete
Instruction
Add (R3), R1
Fetch the instruction
Fetch the first operand (the contents of the
memory location pointed to by R3)
Perform the addition
Load the result into R1
Internal processor
b us
R i in
Architecture
Ri
R i out
Y
in
Y
Constant 4
Select
MUX
B
ALU
in
out
Fig 2 Input and output gating for the registers in Figure1.
Execution of a Complete
Instruction
Internal processor
bus
Control signals
PC
Instruction
Address
lines
Add (R3), R1
decoder and
MAR
Step Action
control logic
Memory
bus
MDR
PCout , MARin , Read,Select4,Add, Zin
Zout , PCin , Yin , WMFC
MDRout , IRin
R3out , MARin , Read
R1out , Yin , WMFC
MDRout , SelectY,Add, Zin
Zout , R1in , End
Data
lines
IR
Y
R0
Constant 4
Select
MUX
Add
ALU
control
lines
Sub
R( n - 1)
ALU
Carry-in
Fig 6 Control Sequencer execution of the instruction Add (R3),R1.
XOR
TEMP
Z
Fig 1 Single-bus organization of the datapath inside a processor
Execution of Branch
Instructions
A branch instruction replaces the contents of
PC with the branch target address, which is
usually obtained by adding an offset X given
in the branch instruction.
The offset X is usually the difference between
the branch target address and the address
immediately following the branch instruction.
Conditional branch
Execution of Branch
Instructions
Step Action
1
PCout , MAR in , Read, Select4,Add, Z in
Zout , PCin , Yin , WMF C
MDR out , IR in
Offset-field-of-IRout, Add, Z in
Z out , PCin , End
Fig 7. Control sequence for an unconditional branch instruction.
Multiple-Bus Organization
Bus A
Bus B
Bus C
Incrementer
PC
Re
gister
Registerfile
file
MUX
Constant 4
A
ALU
Instruction
decoder
IR
MDR
MAR
Memory b
data lines
Memory
bus
datalines
Figure 7.8.
us
Address
lines
Fig 8Three-b
Three-busus ororganization
ofdatapath.
the datapath
g anization of the
Multiple-Bus Organization
Add R4, R5, R6
Step Action
1
PCout, R=B, MAR in , Read, IncPC
WMFC
MDR outB , R=B, IR in
R4outA , R5outB , SelectA, Add, R6in , End
Fig 9. Control sequence for the instruction Add R4,R5,R6,
for the three-bus organization in Fig 8.
Internal processor
bus
Control signals
Quiz
PC
Instruction
What is the control
sequence for
execution of the
instruction
Add R1, R2
including the
instruction fetch
phase? (Assume
single bus
architecture)
Address
lines
decoder and
MAR
control logic
Memory
bus
MDR
Data
lines
IR
Y
R0
Constant 4
Select
ALU
control
lines
MUX
Add
Sub
R( n - 1 )
ALU
Carry-in
XOR
TEMP
Z
Fig 1 Single-bus organization of the datapath inside a processor
Hardwired Control
Overview
To execute instructions, the processor must
have some means of generating the control
signals needed in the proper sequence.
Two categories: hardwired control and
microprogrammed control
Hardwired system can operate at high speed;
but with little flexibility.
Control Unit Organization
Clock
CLK
Control step
counter
External
inputs
IR
Decoder/
encoder
Condition
codes
Control signals
Figure 7.10. Control unit organization.
Detailed Block Description
Clock
CLK
Control step
counter
Reset
Step decoder
T 1 T2
Tn
INS1
External
inputs
INS2
IR
Instruction
decoder
Encoder
Condition
codes
INSm
Run
End
Control signals
Figure 7.11. Separation of the decoding and encoding functions.
Generating Zin
Zin = T1 + T6 ADD + T4 BR +
Branch
T4
Add
T6
T1
Figure 7.12. Generation of the Zin control signal for the processor in Figure 7.1.
Generating End
End = T7 ADD + T5 BR + (T5 N + T4 N) BRN +
Branch<0
Add
Branch
N
T7
T5
T4
End
Figure 7.13. Generation of the End control signal.
T5
A Complete Processor
Instruction
unit
Integer
unit
Instruction
cache
Floating-point
unit
Data
cache
Bus interface
Processor
System bus
Main
memory
Input/
Output
Figure 7.14. Block diagram of a complete processor
.
Microprogrammed
Control
Overview
Micro instruction
PCout
MAR in
Read
MDRout
IRin
Yin
Select
Add
Zin
Z out
R1out
R1in
R3out
WMFC
End
Control signals are generated by a program similar to machine
language programs.
Control Word (CW); microroutine; microinstruction
PCin
Figure 7.15 An example of microinstructions for Figure 7.6.
Overview
Step
Action
PCout , MAR in , Read, Select4,Add, Zin
Zout , PCin , Y in , WMF C
MDR out , IR in
R3out , MAR in , Read
R1out , Yin , WMF C
MDR out , SelectY, Add, Zin
Zout , R1in , End
Figure 7.6. Control sequencefor execution of the instruction Add (R3),R1.
Overview
Control store
IR
Starting
address
generator
Clock
PC
Control
store
One function
cannot be carried
out by this simple
organization.
CW
Figure 7.16. Basic organization of a microprogrammed control unit.
Overview
The previous organization cannot handle the situation when the control
unit is required to check the status of the condition codes or external
inputs to choose between alternative courses of action.
Use conditional branch microinstruction.
AddressMicroinstruction
0
PCout , MAR in , Read, Select4,Add, Z in
Zout , PCin , Yin , WMF C
MDRout , IR in
3
Branch to starting addressof appropriatemicroroutine
. ... .. ... ... .. ... .. ... ... .. ... ... .. ... .. ... ... .. ... .. ... ... .. ... ..
25
If N=0, then branch to microinstruction0
26
Offset-field-of-IRout , SelectY, Add, Z in
27
Zout , PCin , End
Figure 7.17. Microroutine for the instruction Branch<0.
Overview
External
inputs
IR
Clock
Starting and
branch address
generator
PC
Control
store
Figure 7.18.
Condition
codes
CW
Organization of the control unit to allow
conditional branching in the microprogram.
Microinstructions
A straightforward way to structure
microinstructions is to assign one bit position
to each control signal.
However, this is very inefficient.
The length can be reduced: most signals are
not needed simultaneously, and many signals
are mutually exclusive.
All mutually exclusive signals are placed in
the same group in binary coding.
Partial Format for the
Microinstructions
Microinstruction
F1
F2
F3
F4
F1 (4 bits)
F2 (3 bits)
F3 (3 bits)
0000: No transfer
0001: PCout
0010: MDRout
0011: Zout
0100: R0out
0101: R1out
0110: R2out
0111: R3out
1010: TEMPout
1011: Offsetout
000: No transfer
001: PCin
010: IRin
011: Zin
100: R0in
101: R1in
110: R2in
111: R3in
000: No transfer 0000: Add
001: MARin
0001: Sub
010: MDRin
011: TEMPin
1111: XOR
100: Yin
F6
F7
F4 (4 bits)
F5
F5 (2 bits)
00: No action
01: Read
10: Write
16 ALU
functions
F8
F6 (1 bit)
F7 (1 bit)
F8 (1 bit)
0: SelectY
1: Select4
0: No action
1: WMFC
0: Continue
1: End
What is the price paid for
this scheme?
Figure 7.19. An example of a partial format for field-encoded microinstructions.
Further Improvement
Enumerate the patterns of required signals in
all possible microinstructions. Each
meaningful combination of active control
signals can then be assigned a distinct code.
Vertical organization(No encoding of control
signals in control memory)
Horizontal organization(Reducing control
fields in memory storage by use of MUXs in
the output word form control memory)
Microprogram Sequencing
If all microprograms require only straightforward
sequential execution of microinstructions except for
branches, letting a PC governs the sequencing
would be efficient.
However, two disadvantages:
Having a separate microroutine for each machine instruction results
in a large total number of microinstructions and a large control store.
Longer execution time because it takes more time to carry out the
required branches.
Example: Add src, Rdst
Four addressing modes: register, autoincrement,
autodecrement, and indexed (with indirect forms).
- Bit-ORing
- Wide-Branch Addressing
- WMFC
Mode
Contents of IR
OP code
0 1
11 10
Rsrc
87
Rdst
4 3
Address
(octal)
Microinstruction
000
4, Add, Zin
PCout, MARin, Read, Select
001
Zout, PCin, Yin, WMFC
002
MDRout, IRin
003
Branch { PC 101 (from Instruction decoder);
PC5,4 [IR10,9]; PC3 [IR 10] [IR9] [IR8]}
121
Rsrcout , MARin , Read, Select4, Add,inZ
122
Zout, Rsrcin
123
Branch {PC 170;PC0 [IR8]}, WMFC
170
MDRout, MARin, Read, WMFC
171
MDRout, Yin
172
Rdstout , SelectY
, Add, Zin
173
Zout, Rdstin, End
Figure 7.21. Microinstruction for Add (Rsrc)+,Rdst.
Note:Microinstruction at location 170 is not executed for this addressing mode.
Microinstructions with NextAddress Field
The microprogram we discussed requires several
branch microinstructions, which perform no useful
operation in the datapath.
A powerful alternative approach is to include an
address field as a part of every microinstruction to
indicate the location of the next microinstruction to
be fetched.
Pros: separate branch microinstructions are virtually
eliminated; few limitations in assigning addresses to
microinstructions.
Cons: additional bits for the address field (around
1/6)
Microinstructions with NextAddress Field
IR
External
Inputs
Condition
codes
Decoding circuits
A R
Control store
I R
Next address
Microinstruction decoder
Control signals
Figure 7.22. Microinstruction-sequencing organization.
Microinstruction
F0
F0 (8 bits)
F1
F1 (3 bits)
Address of next 000: No transfer
microinstruction 001: PCout
010: MDRout
011: Zout
100: Rsrcout
101: Rdstout
110: TEMPout
F4
F5
F2
F3
F2 (3 bits)
F3 (3 bits)
000: No transfer
001: PCin
010: IRin
011: Zin
100: Rsrcin
101: Rdstin
000: No transfer
001: MARin
010: MDRin
011: TEMPin
100: Yin
F6
F7
F4 (4 bits)
F5 (2 bits)
F6 (1 bit)
F7 (1 bit)
0000: Add
0001: Sub
00: No action
01: Read
10: Write
0: SelectY
1: Select4
0: No action
1: WMFC
F9
F10
1111: XOR
F8
F8 (1 bit)
F9 (1 bit)
F10 (1 bit)
0: NextAdrs
1: InstDec
0: No action
1: ORmode
0: No action
1: ORindsrc
Figure 7.23. Format for microinstructions in the example of Section 7.5.3.
Implementation of the
Microroutine
Octal
address
F0
F1
F3
F2
0
1
0
0
F4
F5 F6 F7 F8 F9 F10
01
00
00
00
1
0
0
0
0
1
0
0
0
0
0
1
0
0
0
1
0
0
0
0
121
122
0 1 0 1 0 0 1 0 1 0 0 01 1 0 0 1 0 0 0 0 01
0 1 1 1 1 0 0 0 0 1 1 10 0 0 0 0 0 0 0 0 00
1
0
0
1
0
0
0
0
0
1
1
1
1
1
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
7
7
7
7
0
1
2
3
0
1
2
3
0
0
0
0
0
0
0
0
1
1
1
0
0
0
0
0
1
1
1
0
0
0
0
0
1
1
1
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
0
0
1
1
0
0
1
1
0
1
0
1
0
1
0
1
0
0
0
0
0
0
0
1
0
0
1
1
0
1
1
0
1
1
1
0
0
0
0
1
1
01
00
01
00
00
00
01
10
1
1
0
0
0
0
1
1
0
1
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
01
00
00
00
Figure 7.24. Implementation of the microroutine of Figure 7.21 using a
next-microinstruction address field.
(See Figure 7.23 for encoded signals.)
R15in
R15out
R0in
R0out
Decoder
Decoder
IR
Rsrc
Rdst
InstDecout
External
inputs
Decoding
circuits
Condition
codes
ORmode
ORindsrc
A R
Control store
Next address
F1
F2
F8 F9 F10
Rdstout
Rdstin
Rsrcout
Microinstruction
decoder
Rsrcin
Other control signals
Figure 7.25. Some details of the control-signal-generating circuitry.
bit-ORing
Micro-Programmed Control
Emulation
A micro-program determines the machine
instructions of a computer
Suppose we have two computers M1 and M2 with
different instruction sets
By adapting the micro-program of M1, we can
emulate M2
47
Micro-Programmed Control
Organization
Micro-program is often placed in ROM on CPU chip
Some machines had writable control store, i.e. user
could change instruction set
48