
ECE2015 - CA - All Slides

The document provides information about a computer architecture course taught by Dr. Ellison at VIT-AP. It includes the contact details of the professor, pre-requisites for the course covering digital logic and data representation, objectives and expected outcomes of the course, evaluation criteria, textbook and reference books. It also gives an overview of topics that will be covered like computer organization, structure and function, operations, history of computers from first to second generation.

Uploaded by

Nitya


Computer Architecture

ECE 2015

DR. M. S. ELLISON
ASSOCIATE PROFESSOR
SENSE

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 1

Contact Details
Email: [email protected]
Phone: +91-9491902516
Cabin: CB-206


Course Pre-requisites
Digital logic design
Chap. 1: Digital Logic Circuits
• Logic Gates • Boolean Algebra
• K-Map Simplification • Combinational Circuits
• Flip-Flops • Sequential Circuits
(Please refer to M. Morris Mano, Computer System Architecture, Pearson Education, Third Edition)
Chap. 2: Digital Components
• Integrated Circuits, • Decoders, • Multiplexers
• Registers, • Shift Registers, • Binary Counters
• Memory Unit
Chap. 3: Data Representation
• Data Types • Complements • Fixed Point Representation
• Floating Point Representation
• Other Binary Codes


Objectives:
1. To familiarize students with hardware design, including logic design, the basic structure and behavior of the various functional modules of the computer, and how they interact to provide the processing needs of the user.
2. To obtain a working knowledge of assembly language.
Expected Outcome: On completion of the course, students will have the ability to
1. Understand the overview of basic computer architecture.
2. Learn basic computer logic: adders, multipliers, ALU, and memory.
3. Learn basic assembly language programming, basic Instruction Set Architecture (ISA), and the design of a single-cycle CPU.
4. Understand parallel processors and RISC and CISC architectures.
Mode of Evaluation: Practice/Quiz Tests 20%, Continuous Assessment Tests 60%, Practical Assessment 20%
• Quiz Test: 20%
• Continuous Assessment Test-1: 20%
• Continuous Assessment Test-2: 20%
• Continuous Assessment Test-3: 20%
• Practical Assessment (Mini Project): 20%
Open Hours: Will be displayed


Text Book:

William Stallings, Computer Organization and Architecture: Designing for Performance, Pearson Education, Tenth Edition, 2013.

Reference Books:
1. M. Morris Mano, Rajib Mall, Computer System Architecture, Pearson Education, Third Edition, 2017.
2. Carl Hamacher, Zvonko Vranesic, Safwat Zaky, Computer Organization, McGraw Hill, Fifth Edition, 2011.

Architecture & Organization
Architecture refers to those attributes of a system that are visible to the programmer
◦ Instruction set, number of bits used for data representation, I/O mechanisms, addressing techniques.
◦ e.g. Is there a multiply instruction?

Computer architecture is also referred to as the instruction set architecture (ISA), which includes an algorithm for controlling instruction execution.
[Figure: elements of the ISA; the labels did not survive extraction]

Organization is how features are implemented: the operational units and their interconnections that realize the architectural specification.
◦ Control signals, interfaces, memory technology.
◦ e.g. Is there a hardware multiply unit or is it done by repeated addition?

All members of the Intel x86 family share the same basic architecture
The IBM System/370 family shares the same basic architecture
This gives code compatibility
Organization differs between different versions
Ex: Pipelining


Structure & Function
Structure is the way in which components relate to each other
Function is the operation of individual components as part of the structure

Function: Data processing, Data storage, Data movement, Control
Structure: CPU, Main memory, I/O, System interconnection


Function
All computer functions are:
◦ Data processing
◦ Data storage
◦ Data movement
◦ Control


Operations (1) Data movement


Operations (2) Storage


Operations (3) Processing from/to storage


Operation (4) Processing from storage to I/O


Structure
The Computer
◦ CPU
  ◦ Controls the operation of the computer and performs its data processing functions.
◦ Main memory
  ◦ Stores data
  ◦ Fast Page Mode RAM
  ◦ Synchronous DRAM
  ◦ Extended data output (EDO) RAM
◦ I/O
  ◦ Moves data between the computer and its external environment
◦ System interconnection
  ◦ Provides for communication among CPU, main memory, and I/O
Structure - Top Level


Multicore computer


Review Questions
1. What, in general terms, is the distinction between computer organization and computer architecture?
2. What, in general terms, is the distinction between computer structure and computer function?
3. What are the four main functions of a computer?
4. List and briefly define the main structural components of a computer.
5. List and briefly define the main structural components of a processor.


History of computers
First Generation: Vacuum Tubes
Second Generation: Transistors
Third Generation: Integrated circuits
Later generations: LSI and VLSI


First Gen: Vacuum tubes
ENIAC (Electronic Numerical Integrator and Computer)
Project begun in 1943 for WW-II; the machine was completed in 1946
Weighed 30 tons, occupied 1500 square feet of floor space, and contained more than 18,000 vacuum tubes
Consumed 140 kilowatts of power
Faster than any electromechanical computer, capable of 5000 additions per second


ENIAC
It was a decimal computer, rather than a binary one
Its memory consisted of 20 “accumulators,” each capable of holding a 10-digit
decimal number
A ring of 10 vacuum tubes represented each digit
The major drawback was that it had to be programmed manually by setting switches and plugging and unplugging cables


Von Neumann machine
A program could be represented in a form suitable for storing in memory alongside the data
A computer could then get its instructions by reading them from memory
The program could be set or altered by setting the values of a portion of memory
This idea, known as the stored-program concept, is usually attributed to the mathematician John von Neumann
Shortly afterwards, in 1946, von Neumann and his colleagues began the design of a new stored-program computer, referred to as the IAS computer, at the Princeton Institute for Advanced Studies
It took 6 years to build and is the prototype for all subsequent general-purpose computers


Second GEN: Transistors
The transistor was invented at Bell Labs in 1947; by the 1950s it had launched an electronic revolution.
By the late 1950s, fully transistorized computers were commercially available
A transistor is smaller, cheaper, and dissipates less heat than a vacuum tube, but can be used in the same way to construct computers
The second generation introduced more complex ALUs and CUs
It also saw the use of high-level programming languages and the provision of system software with the computer.

Third gen: Integrated circuits
A single, self-contained transistor is called a discrete component
Through the 1950s and early 1960s, electronic equipment was composed largely of discrete components: transistors, resistors, capacitors, and so on.
Discrete components were manufactured separately → packaged in their own containers → soldered or wired together onto masonite-like circuit boards → then installed in computers, oscilloscopes, and other electronic equipment.
Whenever an electronic device called for a transistor, a small piece of silicon in its own package had to be soldered to a circuit board; this made the manufacturing process expensive and cumbersome.
Early second-generation computers contained about 10,000 transistors. This figure grew to the hundreds of thousands, making the manufacture of newer, more powerful machines increasingly difficult.


Third gen: Integrated circuits
1958  invention of the Integrated circuit
The integrated circuit exploits the fact that  components as
transistors, resistors, and conductors can be fabricated from a
semiconductor such as silicon.
Fabrication of an entire circuit in a tiny piece of silicon; rather than
assemble discrete components using separate pieces of silicon
Many transistors can be produced at the same time on a single
wafer of silicon; and can be connected with a process of
metallization to form circuits


Integrated circuits
Initially, only a few gates or memory cells could be reliably manufactured and packaged together; these early integrated circuits are referred to as small-scale integration (SSI)
As time went on, it became possible to pack more and more components on the same chip.
This growth in density reflects the famous Moore’s law, which was propounded by Gordon Moore, cofounder of Intel, in 1965.


Integrated circuits
Moore observed that the number of transistors that could be put on a single chip was doubling every year, and correctly predicted that this pace would continue
The pace continued year after year and decade after decade; it slowed to a doubling every 18 months in the 1970s but has sustained that rate ever since
Consequences of Moore’s law:
• The cost of a chip has remained virtually unchanged during this period of rapid growth in density. This means that the cost of computer logic and memory circuitry has fallen at a dramatic rate.
• Because logic and memory elements are placed closer together on more densely packed chips, the electrical path length is shortened, increasing operating speed.
• The computer becomes smaller, making it more convenient to place in a variety of environments.
• There is a reduction in power and cooling requirements.
• The interconnections on the integrated circuit are much more reliable than solder connections. With more circuitry on each chip, there are fewer interchip connections.

Later generations
With the introduction of large-scale integration (LSI), more than 1000 components can be placed on a single integrated circuit chip.
Very-large-scale integration (VLSI) achieved more than 10,000 components per chip, while current ultra-large-scale integration (ULSI) chips can contain more than one million components.
The first application of integrated circuit technology to computers was the construction of the processor.
It was also found that this same technology could be used to construct memories.


Later generations
In the 1950s and 1960s, most computer memory was constructed from tiny rings of
ferromagnetic materials
Magnetized one way, a ring (called a core) represented a one; magnetized the other way, it
stood for a zero.
Magnetic-core memory was rather fast; it took as little as a millionth of a second to read a bit
stored in memory.
But it was expensive, bulky, and used destructive readout: The simple act of reading a core
erased the data stored in it.
It was therefore necessary to install circuits to restore the data as soon as it had been
extracted.


Later generations
Then, in 1970, Fairchild produced the first semiconductor memory.
This chip, about the size of a single core, could hold 256 bits of memory.
It was nondestructive and much faster than core. It took only 70 billionths of a second to
read a bit.
However, the cost per bit was higher than that of core.
In 1974, the price per bit of semiconductor memory dropped below the price per bit of core
memory.
Following this, there has been a continuing and rapid decline in memory cost accompanied
by a corresponding increase in physical memory density.
This has led the way to smaller, faster machines with memory sizes of larger and more
expensive machines from just a few years earlier.
Developments in memory technology, together with developments in processor technology
changed the nature of computers in less than a decade.


Microprocessors
A breakthrough was achieved in 1971, when Intel developed its 4004.
The 4004 was the first chip to contain all of the components of a CPU on a single chip: The
microprocessor was born.
The 4004 can add two 4-bit numbers and can multiply only by repeated addition.
By today’s standards, the 4004 is hopelessly primitive, but it marked the beginning of a
continuing evolution of microprocessor capability and power.
The next major step in the evolution of the microprocessor was the introduction in 1972 of the
Intel 8008.
This was the first 8-bit microprocessor and was almost twice as complex as the 4004.


Microprocessors
The next major event was the introduction, in 1974, of the Intel 8080.
This was the first general-purpose microprocessor.
Whereas the 4004 and the 8008 had been designed for specific applications, the 8080 was designed to be the CPU of a general-purpose microcomputer.
Like the 8008, the 8080 is an 8-bit microprocessor.
It is faster, has a richer instruction set, and has a larger addressing capability.


Microprocessors
About the same time, 16-bit microprocessors began to be developed.
However, it was not until the end of the 1970s that powerful, general-purpose 16-bit
microprocessors appeared. One of these was the 8086.
The next step in this trend occurred in 1981, when both Bell Labs and Hewlett-Packard
developed 32-bit, single-chip microprocessors.
Intel introduced its own 32-bit microprocessor, the 80386, in 1985

Speeding it up
• Pipelining
• On board cache
• On board L1 & L2 cache
• Branch prediction
• Data flow analysis


Performance Balance
• Processor speed increased
• Memory capacity increased
• Memory speed lags behind processor speed

Logic and Memory Performance Gap


Solutions
• Increase number of bits retrieved at one time
  • Make DRAM “wider” rather than “deeper”
• Change DRAM interface
  • Cache
• Reduce frequency of memory access
  • More complex cache and cache on chip
• Increase interconnection bandwidth
  • High speed buses
  • Hierarchy of buses


I/O Devices
• Peripherals with intensive I/O demands
• Large data throughput demands
• Processors can handle this
• Solutions:
• Caching
• Higher-speed interconnection buses
• More elaborate bus structures
• Multiple-processor configurations


Typical I/O Device Data Rates


Key is Balance
• Processor components
• Main memory
• I/O devices
• Interconnection structures


Improvements in Chip Organization and Architecture
• Increase hardware speed of processor
  • Fundamentally due to shrinking logic gate size
  • More gates, packed more tightly, increasing clock rate
  • Propagation time for signals reduced
• Increase size and speed of caches
  • Dedicating part of processor chip
  • Cache access times drop significantly
• Change processor organization and architecture
  • Increase effective speed of execution
  • Parallelism


Problems with Clock Speed and Logic Density
• Power
• Power density increases with density of logic and clock speed
• Dissipating heat
• RC delay
• Speed at which electrons flow limited by resistance and capacitance of metal wires
connecting them
• Delay increases as RC product increases
• Wire interconnects thinner, increasing resistance
• Wires closer together, increasing capacitance
• Memory latency
• Memory speeds lag processor speeds
• Solution:
• More emphasis on organizational and architectural approaches


Intel Microprocessor Performance


Increased Cache Capacity
• Typically two or three levels of cache between processor and main memory
• Chip density increased
• More cache memory on chip
• Faster cache access

• Pentium chip devoted about 10% of chip area to cache

• Pentium 4 devotes about 50%


More Complex Execution Logic
• Enable parallel execution of instructions
• Pipeline works like assembly line
• Different stages of execution of different instructions at same time along pipeline
• Superscalar allows multiple pipelines within single processor
• Instructions that do not depend on one another can be executed in parallel


Diminishing Returns
• Internal organization of processors complex
• Can get a great deal of parallelism
• Further significant increases likely to be relatively modest
• Benefits from cache are reaching limit
• Increasing clock rate runs into power dissipation problem
• Some fundamental physical limits are being reached


New Approach – Multiple Cores
• Multiple processors on single chip
• Large shared cache
• Within a processor, increase in performance proportional to square root of
increase in complexity
• If software can use multiple processors, doubling number of processors almost
doubles performance
• So, use two simpler processors on the chip rather than one more complex
processor
• With two processors, larger caches are justified
• Power consumption of memory logic less than processing logic
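
As a rough worked comparison of the two options above: the symbols $S$ (relative performance) and $C$ (relative complexity) are introduced here for illustration, and the two-core figure assumes software that can keep both processors busy.

```latex
% One more complex core: performance grows with the square root of complexity
S_{\text{complex}} \propto \sqrt{C}
\quad\Rightarrow\quad
C \to 2C \ \text{yields}\ \sqrt{2} \approx 1.41 \times \text{ the performance}

% Two simpler cores on the same chip, software permitting
S_{\text{two cores}} \approx 2 \times \text{ the performance}
```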


IAS Computer

A main memory, which stores both data and instructions

An arithmetic and logic unit (ALU) capable of operating on binary data
A control unit, which interprets the instructions in memory and causes them to
be executed
Input and output (I/O) equipment operated by the control unit


Working of IAS
The memory of the IAS consists of 1000 storage locations, called words, of 40 binary digits
(bits) each.
Both data and instructions are stored there.
Numbers are represented in binary form, and each instruction is a binary code.
Each number is represented by a sign bit and a 39-bit value.
A word may also contain two 20-bit instructions, with each instruction consisting of an 8-bit
operation code (opcode) specifying the operation to be performed and a 12-bit address
designating one of the words in memory (numbered from 0 to 999).
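
The word layout described above can be sketched in a few lines of Python. This is only an illustration: the function names are made up for this example, and the two sample opcodes (00000001 for LOAD M(X), 00000101 for ADD M(X)) are assumed from the commonly published IAS instruction table rather than taken from this slide.

```python
WORD_BITS = 40    # one IAS memory word
INSTR_BITS = 20   # each word can hold two instructions
ADDR_BITS = 12    # address field of an instruction

def encode_instruction(opcode, address):
    """Pack an 8-bit opcode and a 12-bit address into a 20-bit instruction."""
    return (opcode << ADDR_BITS) | address

def split_word(word):
    """Return the (left, right) 20-bit instructions held in a 40-bit word."""
    if not 0 <= word < (1 << WORD_BITS):
        raise ValueError("word does not fit in 40 bits")
    left = word >> INSTR_BITS
    right = word & ((1 << INSTR_BITS) - 1)
    return left, right

def decode_instruction(instr):
    """Return the (opcode, address) fields of a 20-bit instruction."""
    opcode = instr >> ADDR_BITS
    address = instr & ((1 << ADDR_BITS) - 1)
    return opcode, address

# Assumed opcodes: LOAD M(X) = 0b00000001, ADD M(X) = 0b00000101
sample = (encode_instruction(0b00000001, 500) << INSTR_BITS) \
         | encode_instruction(0b00000101, 501)
left, right = split_word(sample)
print(decode_instruction(left))   # (1, 500)
print(decode_instruction(right))  # (5, 501)
```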


EXAMPLE


Registers in IAS
Memory buffer register (MBR): Contains a word to be stored in memory or sent to the I/O unit,
or is used to receive a word from memory or from the I/O unit.
Memory address register (MAR): Specifies the address in memory of the word to be written
from or read into the MBR.
Instruction register (IR): Contains the 8-bit opcode of the instruction being executed.
Instruction buffer register (IBR): Employed to hold temporarily the righthand instruction from
a word in memory.
Program counter (PC): Contains the address of the next instruction-pair to be fetched from
memory.
Accumulator (AC) and multiplier quotient (MQ): Employed to hold temporarily operands and
results of ALU operations.

Computer function
The basic function performed by a computer is execution of a program → a set of instructions stored in memory
Instruction processing consists of two steps
◦ The processor reads (fetches) instructions from memory, one at a time
◦ The processor executes each fetched instruction

The processing required for a single instruction is known as an instruction cycle


Instruction fetch and execute
At the beginning of each instruction cycle, the processor fetches an instruction from memory
A register called the program counter (PC) holds the address of the instruction to be fetched
next
Unless told otherwise, the processor always increments the PC after each instruction fetch so
that it will fetch the next instruction in sequence i.e. the instruction located in the next higher
memory address
The fetched instruction is loaded into a register in the processor known as the
instruction register (IR)
The instruction contains bits that specify the action the processor needs to take
The processor interprets the instruction and performs the required action


Example
Suppose a processor contains a single data register, called an accumulator (AC)
Both instructions and data are 16 bits long
The instruction format provides 4 bits for the opcode → as many as 2^4 = 16 different opcodes, and the remaining 12 bits allow up to 2^12 = 4096 (4K) words of memory to be directly addressed
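
A minimal sketch of the fetch-execute loop for such a machine is given below. The opcode assignments (1 = load AC from memory, 2 = store AC to memory, 5 = add to AC from memory), the program placement at address 300, and the treatment of all addresses as hexadecimal are assumptions made for this illustration; the slide figure that listed them did not survive extraction.

```python
def run(memory, pc, steps):
    """Run `steps` instruction cycles; return the final (AC, PC)."""
    ac = 0
    for _ in range(steps):
        ir = memory[pc]                 # fetch: instruction goes into IR
        pc += 1                         # PC now points at the next instruction
        opcode, addr = ir >> 12, ir & 0xFFF   # 4-bit opcode, 12-bit address
        if opcode == 0x1:               # assumed: load AC from memory
            ac = memory[addr]
        elif opcode == 0x2:             # assumed: store AC to memory
            memory[addr] = ac
        elif opcode == 0x5:             # assumed: add memory word to AC
            ac = (ac + memory[addr]) & 0xFFFF   # keep the result in 16 bits
        else:
            raise ValueError(f"unknown opcode {opcode:#x}")
    return ac, pc

memory = {
    0x300: 0x1940,   # load AC from location 940
    0x301: 0x5941,   # add contents of location 941 to AC
    0x302: 0x2941,   # store AC to location 941
    0x940: 0x0003,
    0x941: 0x0002,
}
ac, pc = run(memory, 0x300, 3)
print(hex(memory[0x941]))   # 0x5
```

Three passes through the loop correspond to three instruction cycles, each with one fetch step and one execute step.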

Example
In this example, three instruction cycles, each consisting of a fetch cycle and an execute cycle, are needed to add the contents of location 940 to the contents of 941.
With a more complex set of instructions, fewer cycles would be needed.
Some older processors, for example, included instructions that contain more than one memory address.
For example, the PDP-11 processor includes an instruction, expressed symbolically as ADD B,A, that stores the sum of the contents of memory locations B and A into memory location A. A single instruction cycle then consists of the following steps:
◦ Fetch the ADD instruction.
◦ Read the contents of memory location A into the processor.
◦ Read the contents of memory location B into the processor. In order that the contents of A are not lost, the processor must have at least two registers for storing memory values, rather than a single accumulator.
◦ Add the two values.
◦ Write the result from the processor to memory location A.

States
Instruction address calculation (iac): Determine the address of the next instruction to be
executed.
Instruction fetch (if): Read instruction from its memory location into the processor.
Instruction operation decoding (iod): Analyze instruction to determine type of operation to be
performed and operand(s) to be used.
Operand address calculation (oac): If the operation involves reference to an operand in
memory or available via I/O, then determine the address of the operand.
Operand fetch (of): Fetch the operand from memory or read it in from I/O.
Data operation (do): Perform the operation indicated in the instruction.
Operand store (os): Write the result into memory or out to I/O.


States
The oac state appears twice, because an instruction may involve a read, a write, or both.
Example → ADD A,B results in the following sequence of states: iac, if, iod, oac, of, oac, of, do, oac, os.
A single instruction can also specify an operation to be performed on an array of numbers or a string of characters → this involves repetitive operand fetch and/or store operations


IAS structure

Example of addition
1. LOAD M(X) 500, ADD M(X) 501 (PC=1)
MAR ← PC
MBR ← M[MAR]
IBR ← MBR[20:39]
IR ← MBR[0:7]
MAR ← MBR[8:19]
MBR ← M[MAR]
AC ← MBR
IR ← IBR[0:7]
MAR ← IBR[8:19]
PC ← PC+1
MBR ← M[MAR]
AC ← AC+MBR

2. STOR M(X) 500, OTHER INSTRUCTION (PC=2)
MAR ← PC
MBR ← M[MAR]
IBR ← MBR[20:39]
IR ← MBR[0:7]
MAR ← MBR[8:19]
MBR ← AC
M[MAR] ← MBR
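
The register transfers above can be traced with a small simulation. This is a sketch under stated assumptions: the opcodes for LOAD, ADD, and STOR are taken from the commonly published IAS instruction table, the data values 7 and 8 are invented, the "other instruction" in the second word is arbitrarily chosen to be LOAD M(500), and arithmetic is done on plain Python integers rather than 40-bit words.

```python
# Assumed opcodes (not given on this slide)
LOAD, ADD, STOR = 0b00000001, 0b00000101, 0b00100001

def instr(opcode, address):
    """Pack an 8-bit opcode and a 12-bit address into 20 bits."""
    return (opcode << 12) | address

def word(left, right):
    """Pack two 20-bit instructions into one 40-bit word."""
    return (left << 20) | right

def execute(regs, mem):
    """Execute the instruction selected by IR (opcode) and MAR (address)."""
    if regs["IR"] == LOAD:
        regs["MBR"] = mem[regs["MAR"]]          # MBR <- M[MAR]
        regs["AC"] = regs["MBR"]                # AC <- MBR
    elif regs["IR"] == ADD:
        regs["MBR"] = mem[regs["MAR"]]          # MBR <- M[MAR]
        regs["AC"] = regs["AC"] + regs["MBR"]   # AC <- AC + MBR
    elif regs["IR"] == STOR:
        regs["MBR"] = regs["AC"]                # MBR <- AC
        mem[regs["MAR"]] = regs["MBR"]          # M[MAR] <- MBR
    else:
        raise ValueError("opcode not modelled in this sketch")

def run_word(regs, mem):
    """Fetch one word and execute its left, then right, instruction."""
    regs["MAR"] = regs["PC"]                    # MAR <- PC
    regs["MBR"] = mem[regs["MAR"]]              # MBR <- M[MAR]
    regs["IBR"] = regs["MBR"] & 0xFFFFF         # IBR <- MBR[20:39]
    regs["IR"] = regs["MBR"] >> 32              # IR <- MBR[0:7]
    regs["MAR"] = (regs["MBR"] >> 20) & 0xFFF   # MAR <- MBR[8:19]
    execute(regs, mem)
    regs["IR"] = regs["IBR"] >> 12              # IR <- IBR[0:7]
    regs["MAR"] = regs["IBR"] & 0xFFF           # MAR <- IBR[8:19]
    regs["PC"] += 1                             # PC <- PC + 1
    execute(regs, mem)

mem = {
    1: word(instr(LOAD, 500), instr(ADD, 501)),
    2: word(instr(STOR, 500), instr(LOAD, 500)),  # right half is a placeholder
    500: 7,
    501: 8,
}
regs = {"PC": 1, "AC": 0, "MBR": 0, "MAR": 0, "IR": 0, "IBR": 0}
run_word(regs, mem)
run_word(regs, mem)
print(mem[500], regs["PC"])   # 15 3
```

Bit 0 is the leftmost bit in the slide's notation, which is why MBR[0:7] is obtained by shifting the 40-bit word right by 32.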


Answer
P1
This program stores the absolute value of the content of memory location 0FA into memory location 0FB.

P2
OPCODE OPERAND
00000001 000000000010
First, the CPU must access memory to fetch the instruction. The instruction contains the address of the data we want to load. During the execute phase, the CPU accesses memory again to load the data value located at that address, for a total of two trips to memory.


Answer

P3
The vectors A, B, and C are each stored in 1,000 contiguous locations in memory, beginning at locations 1001, 2001, and 3001, respectively.
The program begins with the left half of location 3.
A counting variable N is set to 999 and decremented after each step until it reaches −1.
Thus, the vectors are processed from high location to low location.

Example 3
int main(void)
{
    int a = 15, b = 5, c;
    if (a >= b)
        c = a - b;
    else
        c = a + b;
    return 0;
}


Example 3 (continued)
• Optimized


Example 3 (with a > b)

main () {
    int a=15, b=5, c;
    if (a > b)
        c = a - b;
    else
        c = a + b;
}

IAS program:
0   15            a
1   5             b
2                 c
3   1
4   begin
5   . a > b
5   load M(0)
6   sub M(1)
7   sub M(3)
8   jump+ M(10)
9   jump M(14)
10  . True, c = a - b
10  load M(0)
11  sub M(1)
12  stor M(2)
13  jump M(17)
14  . False, c = a + b
14  load M(0)
15  add M(1)
16  stor M(2)
17  halt
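
The listing above can be checked with a tiny interpreter. This is a simplified sketch, not a faithful IAS model: it assumes one instruction per numbered location (as the listing is written), plain Python integers in place of 40-bit words, and that `jump+` branches when the accumulator is non-negative. The names `PROGRAM` and `run` are illustrative.

```python
# Instructions keyed by location, copied from the listing above
PROGRAM = {
    5: ("load", 0), 6: ("sub", 1), 7: ("sub", 3),
    8: ("jump+", 10), 9: ("jump", 14),
    10: ("load", 0), 11: ("sub", 1), 12: ("stor", 2), 13: ("jump", 17),
    14: ("load", 0), 15: ("add", 1), 16: ("stor", 2),
    17: ("halt", None),
}

def run(a, b):
    """Execute the listing with M(0)=a, M(1)=b, M(3)=1 and return c = M(2)."""
    mem = {0: a, 1: b, 2: 0, 3: 1}
    ac, pc = 0, 5
    while True:
        op, x = PROGRAM[pc]
        pc += 1
        if op == "load":
            ac = mem[x]
        elif op == "add":
            ac += mem[x]
        elif op == "sub":
            ac -= mem[x]
        elif op == "stor":
            mem[x] = ac
        elif op == "jump":
            pc = x
        elif op == "jump+":
            if ac >= 0:        # a - b - 1 >= 0 is the same test as a > b
                pc = x
        elif op == "halt":
            return mem[2]

print(run(15, 5))   # 10  (a > b, so c = a - b)
print(run(3, 5))    # 8   (a <= b, so c = a + b)
```

Subtracting the constant 1 stored at location 3 is what turns the non-negative test of `jump+` into a strict a > b comparison.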
Example 6
Give it a try.
int main(void) {
    int a = 2, b = 2, I;
    I = 1;
    while (I < 10) {
        a = a + b;
        I = I + 1;
    }
    return 0;
}


Problem
Write a program to add 5 numbers located at consecutive memory locations starting at address 500. Add
another 5 numbers located at consecutive memory locations starting at address 600. Subtract the lower
value from the higher value and store it at location having address 700.
Sol:
800: LOAD M(500) [Left Instr]
800: ADD M(501) [Right Instr]
801: ADD M(502) [Left Instr]
801: ADD M(503) [Right Instr]
802: ADD M(504) [Left Instr]
802: STOR M(701) [Right Instr]
803: LOAD M(600) [Left Instr]
803: ADD M(601) [Right Instr]
804: ADD M(602) [Left Instr]
804: ADD M(603) [Right Instr]
805: ADD M(604) [Left Instr]
805: SUB M(701) [Right Instr]
806: JUMP+ M(807,20:39) [Left Instr]
806: STOR M(702) [Right Instr]
807: LOAD -M(702) [Left Instr]
807: STOR M(700) [Right Instr]

Performance Assessment
• Performance is one of the key parameters to consider, along with cost, size, security, reliability, and, in some cases, power consumption.
• Application performance depends not just on the raw speed of the processor, but also on the instruction set, choice of implementation language, efficiency of the compiler, and skill of the programmer.
• Clock Speed
  • The System Clock: At the most fundamental level, the speed of a processor is dictated by the pulse frequency produced by the clock, measured in cycles per second, or Hertz (Hz).
  • Clock signals are generated by a quartz crystal
  • The rate of pulses is known as the clock rate, or clock speed
  • One increment, or pulse, of the clock is referred to as a clock cycle, or a clock tick.
  • The time between pulses is the cycle time, $\tau = 1/f$.
  • For example, a 1-GHz processor receives 1 billion pulses per second, so its cycle time is 1 ns.


Performance Assessment
• Millions of instructions per second (MIPS), or MIPS rate

$$\text{MIPS rate} = \frac{f}{CPI \times 10^{6}}$$

CPI: average cycles per instruction
f: Clock frequency (number of cycles per second)
Example: Consider a program having 500 million instructions, running on a 400 MHz processor. The program consists of three major types of instructions: ALU related, load/store, and branching. These instructions require 1, 2, and 4 CPI, with an instruction mix of 60, 30, and 10% respectively in the program. Estimate the MIPS rate of the processor.
Solution: CPI = 0.6 × 1 + 0.3 × 2 + 0.1 × 4 = 1.6
MIPS rate = (400 × 10^6)/(1.6 × 10^6) = 250 MIPS
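
The calculation above can be reproduced with a short Python sketch. The function names are illustrative, and the execution-time helper uses the standard relation T = (instruction count × CPI) / f, which goes slightly beyond what the slide states.

```python
def average_cpi(mix):
    """Weighted average CPI from (fraction_of_instructions, cpi) pairs."""
    return sum(fraction * cpi for fraction, cpi in mix)

def mips_rate(clock_hz, cpi):
    """MIPS rate = f / (CPI * 10^6)."""
    return clock_hz / (cpi * 1e6)

def execution_time(instruction_count, cpi, clock_hz):
    """Execution time in seconds: T = Ic * CPI / f (standard relation)."""
    return instruction_count * cpi / clock_hz

# Instruction mix from the example: ALU 60% @ 1, load/store 30% @ 2, branch 10% @ 4
mix = [(0.6, 1), (0.3, 2), (0.1, 4)]
cpi = average_cpi(mix)
print(round(cpi, 3))                                 # 1.6
print(round(mips_rate(400e6, cpi), 3))               # 250.0
print(round(execution_time(500e6, cpi, 400e6), 3))   # 2.0
```

Note that the 500 million instruction count does not enter the MIPS rate itself; it only matters for the total execution time, about 2 seconds here.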
• Similarly, there are other performance parameters.


For computer storage and processing → minus signs and periods (radix points) cannot be used
Only binary digits (0 and 1) may be used to represent numbers
If limited to non-negative integers, the representation is straightforward


Sign-Magnitude representation
There are several conventions used to represent negative as well as positive integers → all of which involve treating the most significant (leftmost) bit in the word as a sign bit.
If the sign bit is 0 → the number is positive; if the sign bit is 1 → the number is negative
The simplest form of representation that employs a sign bit is the sign-magnitude representation
In an n-bit word, the rightmost n − 1 bits hold the magnitude of the integer


Sign-Magnitude representation
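General case:
$A = +\sum_{i=0}^{n-2} 2^i a_i$ if $a_{n-1} = 0$
$A = -\sum_{i=0}^{n-2} 2^i a_i$ if $a_{n-1} = 1$

DRAWBACKS
Addition and subtraction require consideration of both the signs of the numbers and their
relative magnitudes to carry out the required operation
Another drawback is that there are two representations of 0: +0 (00000000) and -0 (10000000) in an 8-bit word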
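
As a small illustration of the sign-magnitude rule and of the two representations of zero, here is a Python sketch. The helper name `sm_decode` is hypothetical, and it assumes the word is given as a string of '0'/'1' characters with the sign bit first.

```python
def sm_decode(bits):
    """Decode a sign-magnitude bit string (sign bit first) into an integer."""
    sign = -1 if bits[0] == "1" else 1
    magnitude = int(bits[1:], 2)  # the rightmost n-1 bits hold the magnitude
    return sign * magnitude


print(sm_decode("00010010"), sm_decode("10010010"))  # same magnitude, opposite signs
print(sm_decode("00000000") == sm_decode("10000000"))  # both patterns decode to zero
```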

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 104

2’s complement representation
2’s complement representation also uses the MSB as a sign bit

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 107

2’s complement representation
Consider an n-bit integer, A, in 2’s complement representation
If A is positive, then the sign bit, $a_{n-1}$, is zero. The remaining bits represent the magnitude of
the number in the same fashion as for sign-magnitude representation:
$A = \sum_{i=0}^{n-2} 2^i a_i$ for $A \ge 0$

The number zero is identified as positive and therefore has a 0 sign bit and a magnitude of all
0s.
The range of positive integers that may be represented is from 0 (all of the magnitude bits are
0) through $2^{n-1} - 1$ (all of the magnitude bits are 1).
Any larger number would require more bits

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 108

2’s complement representation
Now, for a negative number the sign bit, $a_{n-1}$, is one
The remaining n-1 bits can take on any one of $2^{n-1}$ values
Therefore, the range of negative integers that can be represented is from -1 to $-2^{n-1}$
The weight of the most significant bit is $-2^{n-1}$
This is the convention used in 2’s complement representation, yielding the following expression
for negative numbers:
$A = -2^{n-1} a_{n-1} + \sum_{i=0}^{n-2} 2^i a_i$

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 109

2’s complement representation
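For $a_{n-1} = 0$, the term $-2^{n-1} a_{n-1} = 0$ and the equation
$A = -2^{n-1} a_{n-1} + \sum_{i=0}^{n-2} 2^i a_i$ defines a non-negative integer
When $a_{n-1} = 1$, the term $2^{n-1}$ is subtracted from the
summation term, yielding a negative integer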
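
The weighted-sum definition can be written directly as a Python sketch. The function name `twos_value` is an illustrative choice; it assumes the word is a string of '0'/'1' characters with the most significant bit first.

```python
def twos_value(bits):
    """Value of a 2's complement bit string: the MSB has weight -2^(n-1)."""
    n = len(bits)
    value = -int(bits[0]) * 2 ** (n - 1)
    # Remaining bits a_(n-2) .. a_0 carry the usual positive weights 2^i
    for i, bit in enumerate(reversed(bits[1:])):
        value += int(bit) * 2 ** i
    return value


print(twos_value("10000011"), twos_value("01111111"), twos_value("1111"))
```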

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 110

Working
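Consider an n-bit sequence of binary digits $a_{n-1} a_{n-2} \ldots a_1 a_0$ interpreted as a 2’s complement
integer A, so that its value is
$A = -2^{n-1} a_{n-1} + \sum_{i=0}^{n-2} 2^i a_i$

If A is a positive number, the sign-extension rule (fill the new leftmost bits with copies of the sign bit) clearly works. Now, suppose A is negative and we want
to construct an m-bit representation, with m > n. Then
$A = -2^{m-1} a_{m-1} + \sum_{i=0}^{m-2} 2^i a_i$
Equating the two expressions with $a_{n-1} = a_{m-1} = 1$ gives $\sum_{i=n-1}^{m-2} 2^i a_i = 2^{m-1} - 2^{n-1}$, which holds only when bits $a_{n-1}$ through $a_{m-2}$ are all 1.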

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 111

Working

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 112

2’s complement representation using value box
A useful illustration of the nature of 2’s complement representation is a value box, in which the value on the far right
in the box is 1 ($2^0$)
Each succeeding position to the left is double in value, until the leftmost position, which is
negated
The most negative 2’s complement number that can be represented is $-2^{n-1}$; if any of the
bits other than the sign bit is one, it adds a positive amount to the number

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 113

Converting between different bit lengths
It is sometimes desirable to take an n-bit integer and store it in m bits, where m > n
In sign-magnitude notation, this is easily accomplished: simply move the sign bit to
the new leftmost position and fill in with zeros.

For 2’s complement negative numbers, this procedure does not work

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 119

Converting between different bit lengths
The rule for 2’s complement integers is to move the sign bit to the new leftmost position and
fill in with copies of the sign bit.
For positive numbers, fill in with zeros, and for negative numbers, fill in with ones. This is called
sign extension.
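
Sign extension can be sketched in a few lines of Python. The names `sign_extend` and `twos_to_int` are hypothetical helpers, assuming bit strings with the sign bit first; the check confirms that the extended word keeps the same value.

```python
def twos_to_int(bits):
    """Interpret a bit string as a 2's complement integer."""
    value = int(bits, 2)
    return value - (1 << len(bits)) if bits[0] == "1" else value


def sign_extend(bits, m):
    """Extend an n-bit 2's complement string to m bits by copying the sign bit."""
    return bits[0] * (m - len(bits)) + bits


for word in ("00010010", "11101110"):  # +18 and -18 in 8 bits
    wide = sign_extend(word, 16)
    print(word, "->", wide, twos_to_int(word) == twos_to_int(wide))
```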

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 120

Integer arithmetic: negation
In sign-magnitude representation, the rule for forming the negation of an integer is simple:
invert the sign bit
In 2’s complement notation,
Take the Boolean complement of each bit of the integer
Treating the result as an unsigned binary integer, add 1
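
The two-step negation rule can be sketched as follows. The function name `negate` is illustrative; the sketch assumes a fixed word length equal to the length of the input string and discards any carry out of the top bit.

```python
def negate(bits):
    """2's complement negation: complement every bit, then add 1 (mod 2^n)."""
    n = len(bits)
    inverted = "".join("1" if b == "0" else "0" for b in bits)
    result = (int(inverted, 2) + 1) % (1 << n)  # carry out of the MSB is ignored
    return format(result, f"0{n}b")


print(negate("00010010"))  # +18 -> -18
print(negate("00000000"))  # zero negates to zero
print(negate("10000000"))  # the most negative value has no positive counterpart
```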

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 121

Integer arithmetic: Addition

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 122

Integer arithmetic: subtraction

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 125

Integer arithmetic: multiplication
Compared with addition and subtraction, multiplication is a complex operation, whether
performed in hardware or software
A wide variety of algorithms have been used in various computers
Multiplication of unsigned integers
Multiplication involves the generation of partial products, one for each digit in the multiplier. These
partial products are then summed to produce the final product
The partial products are easily defined. When the multiplier bit is 0, the partial product is 0. When the
multiplier bit is 1, the partial product is the multiplicand
The total product is produced by summing the partial products. For this operation, each successive
partial product is shifted one position to the left relative to the preceding partial product
The multiplication of two n-bit binary integers results in a product of up to 2n bits in length (e.g., in binary, 11 x 11
= 1001)

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 129

Integer arithmetic: multiplication
Compared with the pencil-and-paper approach, there are several things we can do to make
computerized multiplication more efficient
First, we can perform a running addition on the partial products rather than waiting until the
end.
For each 1 in the multiplier, an add and a shift operation are required; but for each 0, only a
shift is required.
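
A minimal Python sketch of the running-addition idea for unsigned operands is shown below. The function name `shift_add_multiply` is an assumption for illustration; it models the shifted partial products arithmetically rather than register by register.

```python
def shift_add_multiply(multiplicand, multiplier, n):
    """Unsigned multiply of two n-bit integers using shift and add."""
    product = 0
    for i in range(n):
        if (multiplier >> i) & 1:
            # Multiplier bit is 1: add the multiplicand, shifted i places left
            product += multiplicand << i
        # Multiplier bit is 0: the partial product is 0, so only the shift applies
    return product


print(format(shift_add_multiply(0b1011, 0b1101, 4), "08b"))  # 11 x 13
```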

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 130

Array multiplication
Let us consider the multiplicand to be M = (A3, A2, A1, A0) and the multiplier to be Q = (B3, B2,
B1, B0)

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 131

Hardware configuration of 4*4 Array multiplier

For an m-bit*n-bit multiplier we need m*n AND gates, n half adders, and (m-2)*n full adders
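
The component counts quoted for the array multiplier can be tabulated with a tiny Python sketch. The function name `array_multiplier_components` is hypothetical, and the formulas are simply those stated on the slide.

```python
def array_multiplier_components(m, n):
    """Component counts for an m-bit x n-bit array multiplier (slide formulas)."""
    return {
        "and_gates": m * n,
        "half_adders": n,
        "full_adders": (m - 2) * n,
    }


print(array_multiplier_components(4, 4))
```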

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 132

Signed multiplication using array
multiplier

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 134

Multiplication

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 135

Multiplication

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 136

Multiplying Negative Numbers
Straightforward multiplication does not work for negative numbers. E.g., if we multiply 11 (1011) by 13 (1101) as unsigned integers, we get 143 (10001111).
If we interpret these as two’s complement numbers, we have -5 (1011) times
-3 (1101), which should equal 15, but the product 10001111 read as a two’s complement number is -113, which is incorrect.
It will also not work if only one of the multiplicand and the multiplier is negative.
If 1001 and 0011 are treated as the unsigned integers 9 and 3, the multiplication proceeds simply (Figure a).
But if 1001 is interpreted as the two’s complement value -7, then each partial product must be a
negative two’s complement number of 2n (8) bits, as shown in Figure b.
Note: this is accomplished by padding out each partial product to the left with binary 1s
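
The padding of partial products can be illustrated with a Python sketch for the case of a negative multiplicand and a non-negative multiplier. The function name `multiply_sign_extended` is hypothetical; it sign-extends the multiplicand to 2n bits before forming partial products and keeps only the low 2n bits of the sum.

```python
def multiply_sign_extended(multiplicand_bits, multiplier_bits):
    """Multiply a 2's complement multiplicand by a non-negative multiplier.

    Each partial product is the multiplicand sign-extended to 2n bits.
    """
    n = len(multiplicand_bits)
    width = 2 * n
    mask = (1 << width) - 1
    extended = int(multiplicand_bits[0] * n + multiplicand_bits, 2)
    total = 0
    for i, bit in enumerate(reversed(multiplier_bits)):
        if bit == "1":
            total = (total + (extended << i)) & mask  # discard carries beyond 2n bits
    return format(total, f"0{width}b")


print(multiply_sign_extended("1001", "0011"))  # -7 x 3 as an 8-bit pattern
```

The result 11101011 is the 8-bit two's complement pattern for -21. This approach still does not handle a negative multiplier.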

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 137

If the multiplier is negative
If the multiplier is negative, straightforward multiplication also will not work.
The reason is that the bits of the multiplier no longer correspond to the shifts or multiplications that must take
place.
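For example, the decimal number -3 is written 1101 in 4-bit two's complement. If we simply took partial products
based on each bit position, we would have
$1101 \leftrightarrow -(1 \times 2^3 + 1 \times 2^2 + 0 \times 2^1 + 1 \times 2^0) = -(2^3 + 2^2 + 2^0)$
instead of
$-3 = -(2^1 + 2^0)$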

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 138

Solutions to the above issues
Solution 1
◦ Convert to positive if required
◦ Multiply as above
◦ If signs were different, negate answer

Solution 2
◦ Booth’s algorithm
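
As a rough sketch of Solution 2, the following Python function follows the usual register-level description of Booth's algorithm with registers A, Q and a one-bit Q-1. The function and variable names are illustrative, and the sketch assumes the multiplicand is not the most negative n-bit value, since its negation does not fit in an n-bit A register.

```python
def booth_multiply(multiplicand, multiplier, n):
    """Booth's algorithm for two n-bit 2's complement integers.

    Assumes multiplicand != -2^(n-1).
    """
    mask = (1 << n) - 1
    m = multiplicand & mask
    a, q, q_1 = 0, multiplier & mask, 0
    for _ in range(n):
        pair = (q & 1, q_1)
        if pair == (1, 0):
            a = (a - m) & mask  # A <- A - M
        elif pair == (0, 1):
            a = (a + m) & mask  # A <- A + M
        # Arithmetic shift right of the combined A, Q, Q-1
        q_1 = q & 1
        q = (q >> 1) | ((a & 1) << (n - 1))
        a = (a >> 1) | (a & (1 << (n - 1)))  # the sign bit of A is preserved
    result = (a << n) | q
    if result & (1 << (2 * n - 1)):  # reinterpret the 2n-bit pattern as signed
        result -= 1 << (2 * n)
    return result


print(booth_multiply(7, 3, 4), booth_multiply(-7, 3, 4), booth_multiply(-5, -3, 4))
```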

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 139

Booth Multiplication

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 140

Booth Multiplication: Example

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 141

Division

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 142

Division Restoring Algorithm

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 143

Non-restoring Algorithm

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 144

Signed Division process
To deal with negative numbers, the remainder is defined by
$D = Q \times V + R$
where D is the dividend, V the divisor, Q the quotient, and R the remainder
Consider the following examples of integer division with all possible combinations of signs of D
and V
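
The four sign combinations can be checked with a Python sketch. It assumes the common hardware convention that the quotient is truncated toward zero, so the remainder takes the sign of the dividend; the function name `signed_divide` is illustrative, and Python's own `//` operator is deliberately applied only to magnitudes because it rounds toward negative infinity.

```python
def signed_divide(d, v):
    """Return (Q, R) with D = Q * V + R and Q truncated toward zero."""
    q = abs(d) // abs(v)
    if (d < 0) != (v < 0):
        q = -q  # signs differ, so the quotient is negative
    r = d - q * v  # remainder follows from the defining equation
    return q, r


for d, v in [(7, 3), (7, -3), (-7, 3), (-7, -3)]:
    print(d, v, signed_divide(d, v))
```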

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 145

Floating Point Numbers
So far we have seen unsigned and signed integers. In real life we also need to be able to represent
numbers with fractional parts (like -12.5 and 45.39).

These are called floating point numbers.

We will learn the IEEE 32-bit floating point representation.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 146

Floating Point Numbers
In the decimal system, a decimal point (radix point) separates the whole numbers from the
fractional part
Examples:
37.25 ( whole = 37, fraction = 25/100)
123.567
10.12345678

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 147

Floating Point Numbers
For example, 37.25 can be analyzed as:

10^1    10^0    10^-1    10^-2

Tens    Units   Tenths   Hundredths
3       7       2        5

37.25 = (3 x 10) + (7 x 1) + (2 x 1/10) + (5 x 1/100)

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 148

Binary Equivalence
The binary equivalent of a floating point number can be determined by computing the
binary representation for each part separately.

1) For the whole part:

Use subtraction or division method previously learned.
2) For the fractional part:
Use the subtraction or multiplication method (to be shown
next)
DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 149
Fractional Part – Multiplication Method
In the binary representation of a floating point number the column values will be as
follows:

… 2^5  2^4  2^3  2^2  2^1  2^0  .  2^-1  2^-2  2^-3  2^-4 …

… 32   16   8    4    2    1    .  1/2   1/4   1/8   1/16 …
… 32   16   8    4    2    1    .  .5    .25   .125  .0625 …

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 150

Fractional Part – Multiplication Method
Ex 1. Find the binary equivalent of 0.25
Step 1: Multiply the fraction by 2 until the fractional part becomes 0
.25
x 2
0.5
x 2
1.0
Step 2: Collect the whole parts in forward order. Put them
after the radix point
.   .5   .25   .125   .0625
.   0    1
Fractional Part – Multiplication Method
Ex 2. Find the binary equivalent of 0.625
Step 1: Multiply the fraction by 2 until the fractional part becomes 0
.625
x 2
1.25
x 2
0.50
x 2
1.0
Step 2: Collect the whole parts in forward order. Put them
after the radix point
. .5 .25 .125 .0625
. 1 0 1
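
The multiplication method translates into a short loop. This is a sketch with an illustrative function name, `fraction_to_binary`, and an assumed `max_bits` cutoff, which is needed because many decimal fractions (for example 0.1) never reach a fractional part of 0.

```python
def fraction_to_binary(fraction, max_bits=16):
    """Convert a fraction in [0, 1) to binary digits by repeated doubling."""
    bits = []
    while fraction != 0 and len(bits) < max_bits:
        fraction *= 2
        whole = int(fraction)  # the whole part is the next binary digit
        bits.append(str(whole))
        fraction -= whole
    return "." + "".join(bits)


print(fraction_to_binary(0.25), fraction_to_binary(0.625))
```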
DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 152
Fractional Part – Subtraction Method

Start with the column values again, as follows:

… 2^0  .  2^-1  2^-2  2^-3  2^-4   2^-5    2^-6 …

… 1    .  1/2   1/4   1/8   1/16   1/32    1/64 …
… 1    .  .5    .25   .125  .0625  .03125  .015625 …

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 153

Fractional Part – Subtraction Method
Starting with 0.5, subtract the column values from left to right.
Insert a 0 in the column if the value cannot be subtracted, or a 1
if it can be. Continue until the fraction becomes .0

Ex 1. Convert 0.25

Column:  .5   .25   .125   .0625
Bit:     0    1

  .25
- .25
  .0

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 154

Binary Equivalent of FP number
Ex 2. Convert 37.25, using subtraction method.
64   32   16   8    4    2    1    .  .5    .25   .125  .0625
2^6  2^5  2^4  2^3  2^2  2^1  2^0  .  2^-1  2^-2  2^-3  2^-4
0    1    0    0    1    0    1    .  0     1

Whole part:      Fractional part:
  37               .25
- 32             - .25
   5               .0
-  4
   1
-  1
   0

37.25 (base 10) = 100101.01 (base 2)
DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 155
Binary Equivalent of FP number
Ex 3. Convert 18.625, using subtraction method.
64   32   16   8    4    2    1    .  .5    .25   .125  .0625
2^6  2^5  2^4  2^3  2^2  2^1  2^0  .  2^-1  2^-2  2^-3  2^-4
0    0    1    0    0    1    0    .  1     0     1

Whole part:      Fractional part:
  18               .625
- 16             - .5
   2               .125
-  2             - .125
   0               .0

18.625 (base 10) = 10010.101 (base 2)
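
The subtraction method for both parts can be sketched in Python as below. The function name `subtraction_method` and the fixed column counts (seven whole-part columns, four fractional columns, matching the tables above) are assumptions made for illustration; the value is assumed non-negative and small enough to fit the columns.

```python
def subtraction_method(value, whole_bits=7, frac_bits=4):
    """Convert a non-negative number to binary by subtracting column values."""
    whole = int(value)
    frac = value - whole
    digits = []
    for i in range(whole_bits - 1, -1, -1):  # columns 64, 32, ..., 1
        column = 2 ** i
        if whole >= column:
            digits.append("1")
            whole -= column
        else:
            digits.append("0")
    digits.append(".")
    for i in range(1, frac_bits + 1):  # columns .5, .25, .125, .0625
        column = 2 ** -i
        if frac >= column:
            digits.append("1")
            frac -= column
        else:
            digits.append("0")
    return "".join(digits)


print(subtraction_method(37.25), subtraction_method(18.625))
```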
DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 156
Problem storing binary form

We have no way to store the radix point!

Very large (or very small) numbers would take too much space

A standards committee came up with a way to store floating point numbers (numbers that have a radix
point)

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 157

IEEE Floating Point Representation

Floating point numbers can be stored in 32 bits by dividing the bits into three parts:
the sign, the exponent, and the mantissa.

Bit 1: sign | Bits 2-9: exponent (8 bits) | Bits 10-32: mantissa (23 bits)

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 158

Sign
Base = 2
Exponent = Value of the 8-bit exponent field - Bias
(where Bias = $2^{k-1} - 1$, k = number of bits in the exponent field; for k = 8, Bias = 127)

Significand

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 159

Floating point representation
Any floating-point number can be expressed in many ways
To simplify operations on floating-point numbers, it is typically required that they
be normalized
A normalized number is one in which the most significant digit of the significand is nonzero
For base 2 representation, a normalized number is therefore one in which the most
significant bit of the significand is one
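The typical convention is that there is one bit to the left of the radix point, so a normalized nonzero number has the form
$\pm 1.bbb \ldots b \times 2^{\pm E}$
where b is a binary digit (either 0 or 1)

Because the most significant bit is always one, it need not be stored.
Thus, the 23-bit field is used to store a 24-bit significand with a value in the half open
interval [1, 2)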
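
To see the three fields of a stored number, the following Python sketch uses the standard `struct` module to obtain the 32-bit pattern of a value. The function name `float32_fields` is an illustrative choice; the example value -12.5 equals -1.1001 x 2^3 in binary, so its biased exponent is 3 + 127 = 130.

```python
import struct


def float32_fields(x):
    """Split the IEEE 754 single-precision encoding of x into its fields."""
    (word,) = struct.unpack(">I", struct.pack(">f", x))
    sign = word >> 31
    biased_exponent = (word >> 23) & 0xFF
    fraction = word & 0x7FFFFF  # the 23 stored bits; the leading 1 is implied
    return sign, biased_exponent, fraction


sign, exponent, fraction = float32_fields(-12.5)
print(sign, exponent - 127, format(fraction, "023b"))
```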

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 160

Sample problems
Convert the following decimal numbers into binary using IEEE 754 floating point representation.
(i) -3347.7991 x 2^21
(ii) 157.4773 x 2^-11
(iii) -1234.5997 x 2^17
(iv) -488.6791 x 2^-31

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 161

Floating point representation
In the IBM base-16 format, the exponent base is 16 rather than 2; because $2^{20} = 16^5$, the exponent is stored to represent 5 rather than 20

The advantage of using a larger exponent base is that a greater range can be achieved for the same
number of exponent bits.
However, this greater range comes at the expense of less precision.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 162

Floating point Arithmetic

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 163

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 164
ADDITION ALGORITHM

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 165

Floating point arithmetic
For addition and subtraction, it is necessary to ensure that both operands have the same
exponent value.
This may require shifting the radix point on one of the operands to achieve alignment.
Multiplication and division are more straightforward.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 166

Floating point arithmetic
Exponent overflow: A positive exponent exceeds the maximum possible exponent value. In
some systems, this may be designated as +∞ or -∞.
Exponent underflow: A negative exponent is less than the minimum possible exponent
value (e.g., -200 is less than -127). This means that the number is too small to be represented, and it
may be reported as 0.
Significand underflow: In the process of aligning significands, digits may flow off the right
end of the significand. As we shall discuss, some form of rounding is required.
Significand overflow: The addition of two significands of the same sign may result in a carry
out of the most significant bit. This can be fixed by realignment, as we shall explain.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 167

Floating point arithmetic: Addition and
subtraction

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 168

Floating point arithmetic: Addition and
subtraction
Phase 1: Zero check. Because addition and subtraction are identical except for a
sign change, the process begins by changing the sign of the subtrahend if it is a
subtract operation. Next, if either operand is 0, the other is reported as the result.
Phase 2: Significand alignment. The next phase is to manipulate the numbers so that the
two exponents are equal.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 169

Floating point arithmetic: Addition and
subtraction
Phase 3: Addition. Next, the two significands are added together, taking into account their
signs. Because the signs may differ, the result may be 0. There is also the possibility of
significand overflow by 1 digit. If so, the significand of the result is shifted right and the
exponent is incremented. An exponent overflow could occur as a result; this would be
reported and the operation halted.
Phase 4: Normalization. The final phase normalizes the result. Normalization consists of
shifting significand digits left until the most significant digit (bit, or 4 bits for base-16
exponent) is nonzero. Each shift causes a decrement of the exponent and thus could cause
an exponent underflow. Finally, the result must be rounded off and then reported. We defer
a discussion of rounding until after a discussion of multiplication and division.
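
The four phases can be sketched in Python on a simplified format. This is a toy model under stated assumptions, not the IEEE procedure: a number is a tuple (sign, exponent, significand) with value (-1)^sign x significand x 2^exponent, the significand is a p-bit integer whose top bit is set (or 0 for zero), bits shifted out during alignment are simply dropped, and exponent overflow/underflow checks are omitted. The function name `fp_add` is hypothetical.

```python
def fp_add(a, b, p=8):
    """Add two toy floating-point numbers given as (sign, exponent, significand)."""
    (sa, ea, ma), (sb, eb, mb) = a, b
    # Phase 1: zero check
    if ma == 0:
        return b
    if mb == 0:
        return a
    # Phase 2: significand alignment (shift the smaller-exponent operand right)
    if ea < eb:
        ma >>= eb - ea
        ea = eb
    else:
        mb >>= ea - eb
    exponent = ea
    # Phase 3: signed addition of the significands
    total = (-ma if sa else ma) + (-mb if sb else mb)
    if total == 0:
        return (0, 0, 0)
    sign = 1 if total < 0 else 0
    magnitude = abs(total)
    if magnitude >= (1 << p):  # significand overflow by one digit
        magnitude >>= 1
        exponent += 1
    # Phase 4: normalization (shift left until the top bit is set)
    while magnitude < (1 << (p - 1)):
        magnitude <<= 1
        exponent -= 1
    return (sign, exponent, magnitude)


print(fp_add((0, -7, 192), (0, -8, 128)))  # 1.5 + 0.5
print(fp_add((0, -7, 192), (1, -7, 160)))  # 1.5 - 1.25
```

In the first call the sum overflows the 8-bit significand and is shifted right; in the second the small difference must be shifted left twice to normalize it.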

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 170

Floating point arithmetic: Addition and
subtraction

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 171

ADDITION & SUBTRACTION

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 172

MULTIPLICATION

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 173

DIVISION

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 174

Precision considerations
The exponent and significand are stored in ALU registers.
The length of the register is almost always greater than the length of the significand plus an
implied bit.
The register contains additional bits, called guard bits, which are used to pad out the right end
of the significand with 0s.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 175

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 176
Rounding
The result of any operation on the significands is generally stored in a longer register.
When the result is put back into the floating-point format, the extra bits must be disposed of.
Round to nearest: The result is rounded to the nearest representable number.
Round toward +∞: The result is rounded up toward plus infinity.
Round toward -∞: The result is rounded down toward negative infinity.
Round toward 0: The result is rounded toward zero.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 177

Rounding
Round to nearest is the default rounding mode listed in the standard and is defined as follows:
The representable value nearest to the infinitely precise result shall be delivered.
If the extra bits, beyond the 23 bits that can be stored, are 10010, then the extra bits amount to
more than one-half of the last representable bit position.
In this case, the correct answer is to add binary 1 to the last representable bit, rounding up to
the next representable number.
If the extra bits are 01111, they are less than one-half of the last representable bit position.
The correct way is simply to drop the extra bits (truncate).
If the result of a computation is exactly midway between two representable numbers, the value
is rounded up if the last representable bit is currently 1 and not rounded up if it is currently 0.
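
The three cases above (more than half, less than half, exactly midway) can be sketched on an integer that holds the kept bits followed by the extra bits. The function name `round_to_nearest_even` is illustrative, `extra_bits` is assumed to be at least 1, and a carry out of the kept bits (which would require renormalization) is not handled here.

```python
def round_to_nearest_even(value, extra_bits):
    """Drop extra_bits low-order bits from value using round-to-nearest-even."""
    kept = value >> extra_bits
    remainder = value & ((1 << extra_bits) - 1)
    half = 1 << (extra_bits - 1)
    if remainder > half or (remainder == half and kept & 1):
        kept += 1  # round up; on a tie only when the last kept bit is 1
    return kept


print(format(round_to_nearest_even(0b1010_10010, 5), "04b"))  # extra bits 10010
print(format(round_to_nearest_even(0b1010_01111, 5), "04b"))  # extra bits 01111
print(format(round_to_nearest_even(0b1011_10000, 5), "04b"))  # exactly midway
```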

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 178

Rounding
Rounding to plus and minus infinity provides an efficient method for monitoring and
controlling errors in floating-point computations by producing two values for each result.
The two values correspond to the lower and upper endpoints of an interval that contains the
true result.
If the range between the upper and lower bounds is sufficiently narrow, then a sufficiently
accurate result has been obtained.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 179

Rounding
Rounding toward zero is, in fact, simple truncation: The extra bits are ignored.
However, the result is that the magnitude of the truncated value is always less than or equal to
the more precise original value, and it affects every operation for which there are non-zero extra
bits.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 180

Special BIT patterns
An exponent of zero together with a fraction of zero represents positive or negative zero,
depending on the sign bit. As was mentioned, it is useful to have an exact value of 0
represented.
An exponent of all ones together with a fraction of zero represents positive or negative infinity,
depending on the sign bit. It is also useful to have a representation of infinity. This leaves it up to
the user to decide whether to treat overflow as an error condition or to carry the value and
proceed with whatever program is being executed.
An exponent of zero together with a nonzero fraction represents a denormalized number. In
this case, the bit to the left of the binary point is zero and the true exponent is -126 or -1022.
The number is positive or negative depending on the sign bit.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 183

Special BIT patterns
An exponent of all ones together with a nonzero fraction is given the value NaN,
which means Not a Number, and is used to signal various exception conditions.
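
The special patterns can be summarized as a small classifier over a 32-bit word. This is a sketch for the single-precision format only; the function name `classify_float32` is an illustrative choice.

```python
def classify_float32(word):
    """Classify a 32-bit IEEE 754 pattern by its exponent and fraction fields."""
    exponent = (word >> 23) & 0xFF
    fraction = word & 0x7FFFFF
    if exponent == 0:
        return "zero" if fraction == 0 else "denormalized"
    if exponent == 0xFF:
        return "infinity" if fraction == 0 else "NaN"
    return "normalized"


for pattern in (0x00000000, 0x80000000, 0x00000001, 0x7F800000, 0x7FC00000, 0x3F800000):
    print(format(pattern, "08X"), classify_float32(pattern))
```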

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 184

8086 microprocessor
The 8086 Microprocessor has two units
1. Bus Interface Unit
2. Execution Unit

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 185

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 186
Bus interface Unit
The Bus Interface Unit consists of different units such as the Instruction Queue, Segment
Registers, Instruction Pointer, Address adder.
It interfaces the processor to the outside world, performing all the external bus operations
like fetch, read, write, input and output of data.
The BIU uses an instruction queue for pipelined instructions: a 6-byte first-in-first-out register.
It provides a full 16-bit bidirectional data bus and 20-bit address bus.
Specifically, it performs Instruction fetch, Instruction queuing, Operand fetch and storage,
Address relocation and Bus control.
The BIU uses a mechanism known as an instruction stream queue to implement a pipeline
architecture.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 187

Bus interface Unit
This queue permits prefetch of up to six bytes of instruction code.
Whenever the queue of the BIU is not full (it has room for at least two more bytes) and, at
the same time, the EU is not requesting it to read or write operands from memory, the BIU is
free to look ahead in the program by prefetching the next sequential instruction.
These prefetched instructions are held in its FIFO queue.
With its 16-bit data bus, the BIU fetches two instruction bytes in a single memory cycle.
After a byte is loaded at the input end of the queue, it automatically shifts up through the
FIFO to the empty location nearest the output.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 188

Execution Unit
The Execution Unit consists of the following units such as Control circuitry, Instruction
decoder, ALU, Pointer and Index register, Flag register.
The EU decodes and executes the instructions fetched by the BIU.
It extracts instructions from the top of the queue in the BIU, decodes them, generates
operands if necessary, passes them to the BIU and requests it to perform the read or write
bus cycles to memory or I/O and perform the operation specified by the instruction on the
operands.
If the BIU is already in the process of fetching an instruction when the EU requests it to
read or write operands from memory or I/O, the BIU first completes the instruction fetch
bus cycle before initiating the operand read / write cycle.
It also tests the status and control flags and updates these flags based on the results of the
instruction.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 189

Software Model of the 8086 Microprocessors

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 190

Register organization
8086 has a powerful set of registers of 16-bit each
The registers are categorized into 4 groups
1. General data registers
2. Segment registers
3. Pointer and index registers
4. Flag register

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 191

General Registers
8086 contains 4 general data registers AX, BX, CX, and DX.
They are used to hold data, variables, results etc., temporarily for faster operation.

AX - the Accumulator
BX - the Base Register
CX - the Count Register
DX - the Data Register

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 192

General Registers
AX is used as a 16-bit accumulator, with the lower 8-bits designated as AL and higher 8-bits as
AH for 8-bit operations.
It is the default operand of many arithmetic and logic instructions and is also used to store the result of
those operations.
BX register is used as a general purpose register as well as to store the offset for forming
physical address in certain addressing modes.
CX register is used as a default counter in case of string and loop instructions.
It is also used for the count of the number of bits by which the contents of an operand must be
shifted or rotated during the execution of the multibit shift or rotate instructions.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 193

General Registers
DX register is used in I/O operations to hold the address of the I/O port.
DX register also holds the remainder after a word division and holds the high-order bits (MSB)
of the result after a word multiplication (32-bit).

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 194

Segment Registers
There are 4 segment registers
1. Code segment
2. Data segment
3. Stack segment
4. Extra segment

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 195

Segment Registers
Code segment (CS) register is used to address a memory location in the code segment of
memory where the executable program or instructions are stored
Stack segment (SS) register is used for addressing stack segment of memory which is used to
store stack data.
The data segment (DS) register points to the data segment of the memory where the data is
stored
The extra segment (ES) register points to the extra segment of the memory. This is used as
another data segment for extra data storage.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 196

Pointers and index Registers

•All are 16 bits wide; their low and high bytes are not separately accessible

•Used as memory pointers

◦ Example: MOV AH, [SI]
◦ Move the byte stored in memory location whose address is contained in register SI to register AH

•IP is not under direct control of the programmer

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 197
Pointers and index Registers
The function of IP is similar to a program counter, but it contains the offset address instead of
the actual address of the next instruction.
It contains the offset address within the code segment and the IP is combined with the CS
register to generate the actual address of the next instruction to be executed.
Stack pointer also contains the offset value which is added to the SS register to obtain the
actual address of the stack segment.
Base pointer is similar to the SP since it also contains the offset value pointing to the stack
segment.
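
How a segment register and an offset combine into an actual (physical) address can be sketched in Python. On the 8086 the 16-bit segment value is shifted left by four bits and added to the 16-bit offset, giving a 20-bit address; the function name `physical_address` and the register values in the example are hypothetical.

```python
def physical_address(segment, offset):
    """20-bit 8086 physical address: segment * 16 + offset, wrapped to 20 bits."""
    return ((segment << 4) + offset) & 0xFFFFF


# Hypothetical register contents: CS = 1000H, IP = 0020H
print(format(physical_address(0x1000, 0x0020), "05X"))
```

The same calculation applies to SS:SP, SS:BP, DS:SI and ES:DI.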

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 198

Pointers and index Registers
The index registers are used as general purpose registers as well as for offset storage purpose.
The source index register is used to store the offset of the source data in the data segment.
And the destination index register is used to store the offset of the destination data in the data
or extra segment.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 199

FLAG Register

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 200

FLAG Registers
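Overflow flag is set when the result of a signed operation does not fit in the destination; it is based on the carries into and out of the most significant bit of the ALU result.
Overflow occurs when signed numbers are added or subtracted and the result falls outside the representable range.
For 8-bit operation, the overflow flag is set if the carry from the D6 bit into the D7 bit differs from the carry out of the D7 bit
Similarly, for 16-bit operation, the overflow flag is set if the carry from the D14 bit into the D15 bit differs from the carry
out of the D15 bit
Trap flag, when set, puts the processor into single step mode; otherwise it is reset.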
In single step mode, the processor executes one instruction at a time and it is useful for
debugging programs.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 201

FLAG Registers
Interrupt flag enables maskable interrupts: when it is set, a maskable interrupt (INTR) is recognized by the processor; when it is reset, INTR is ignored.
Direction flag is used by string manipulation instructions, i.e., the direction flag selects the
increment or decrement mode for the DI and SI registers in string instructions
If DF = 1, the registers are automatically decremented; if DF = 0, the registers
are incremented.
Carry flag is set when there is a carry out of the MSB during addition (or a borrow during subtraction). Otherwise, CF = 0.
Auxiliary carry flag is set when there is a carry from the D3 bit to the D4 bit, i.e., out of the low nibble.
If there is no such carry, then AF = 0.
Zero flag is set when the result of an ALU operation is 0. Otherwise, for any non-zero value, ZF = 0.
Sign flag is set when the result of an ALU operation has 1 as its MSB. Otherwise, SF = 0.
Parity flag is set when the low-order byte of the result of an ALU operation has an even number of 1’s. Otherwise, PF = 0.
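
The status flags can be illustrated for an 8-bit addition with a Python sketch. The function name `add8_flags` is hypothetical; it follows the standard 8086 definitions, with the overflow flag computed as the carry into D7 XOR the carry out of D7.

```python
def add8_flags(a, b):
    """Add two 8-bit values and return (result, status flags) as on the 8086."""
    raw = a + b
    result = raw & 0xFF
    carry_out = int(raw > 0xFF)
    carry_into_msb = ((a & 0x7F) + (b & 0x7F)) >> 7
    flags = {
        "CF": carry_out,
        "AF": int((a & 0x0F) + (b & 0x0F) > 0x0F),  # carry from D3 to D4
        "ZF": int(result == 0),
        "SF": result >> 7,
        "PF": int(bin(result).count("1") % 2 == 0),  # even number of 1s
        "OF": carry_into_msb ^ carry_out,
    }
    return result, flags


print(add8_flags(0x7F, 0x01))  # signed overflow: +127 + 1
print(add8_flags(0xFF, 0x01))  # unsigned carry, zero result
```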

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 202

Addressing modes
Addressing mode is the way of locating the data or operands, the types of operands used and
the way they are accessed for executing an instruction.
Based on the flow of instructions, the instructions in 8086 can be categorized as
1. Sequential control flow instructions
2. Control transfer instructions
Sequential control flow are the instructions in which after execution, the control is transferred
to the next instruction appearing immediately after it in the program. Eg. Arithmetic
instructions, logical, data transfer, and processor control instructions.
Control transfer instructions transfer their control to some predefined address after their
execution. Eg. INT, CALL, RET, and JUMP instructions.

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 203

Addressing modes
For sequential control flow instructions:
1. Register addressing mode
2. Immediate addressing mode
3. Direct addressing mode
4. Register indirect addressing mode
5. Register relative addressing mode
6. Indexed addressing mode
7. Based indexed addressing mode
8. Relative based indexed addressing mode

For control transfer instructions:
1. Intrasegment direct addressing mode
2. Intrasegment indirect addressing mode
3. Intersegment direct addressing mode
4. Intersegment indirect addressing mode

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 204

Sequential Control flow modes
1. Register addressing mode
2. Immediate addressing mode
3. Direct addressing mode
4. Register indirect addressing mode
5. Register relative addressing mode
6. Indexed addressing mode
7. Based indexed addressing mode
8. Relative based indexed addressing mode

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 205

Register addressing mode
In this mode, both the operands are specified by registers.
Eg. MOV AX, BX
All the registers can be used in this mode.
Both the source and destination registers should be of the same size
A segment register to segment register movement of data is not allowed

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 206

Immediate addressing mode
In this mode, the source operand is specified as immediate data byte or word and the
destination is either a register or a memory location.
Eg. MOV AL, 22H; MOV BX, 3456H
All the general purpose registers can be used in this mode; segment registers cannot be loaded with immediate data.
Both the source and destination should be of the same size

DR. ELLISON | COMPUTER ARCHITECTURE (ECE2015) | VIT-AP 207

Direct addressing mode
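In this mode, the memory operand is addressed by an offset given in the instruction itself. In
the examples below, the source is a memory location and the destination is a register.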
Eg. MOV AL, [1234H]; MOV BX, [5000H]
Here, a 16-bit memory address i.e. the offset address is directly specified in the instruction as a
part of it.
The content of the physical address which is formed from the offset address is the source data.
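
The formation of the physical address from the offset can be sketched in a few lines of Python. This is only an illustration: the function name is made up here, and the DS value 2000H is an assumed example value, not one taken from the slide.

```python
def physical_address(segment: int, offset: int) -> int:
    """20-bit physical address: segment * 16 + offset, wrapped to 20 bits."""
    return ((segment << 4) + offset) & 0xFFFFF


# Assumed example: DS = 2000H, offset 1234H as in MOV AL, [1234H].
print(hex(physical_address(0x2000, 0x1234)))  # 0x21234
```

The byte stored at that physical address is what MOV AL, [1234H] would copy into AL.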


Register indirect addressing mode
Register indirect addressing mode allows data to be addressed at any memory location using
the offset registers: BP, BX, DI or SI
DS is the default segment when the registers BX, DI or SI are used.
SS is the default segment when the register BP is used.
Eg. MOV AX, [BX]


Register relative addressing mode
In this mode, the data in a segment of memory are addressed by adding an 8-bit or 16-bit
displacement to the contents of a base register (BX or BP) or an index register (SI or DI).
DS is the default segment when the registers BX, DI or SI are used.
SS is the default segment when the register BP is used.
Eg. MOV AX, 1000H [BX]


Indexed addressing mode
In this mode, the offset address of the operand is stored in one of the index registers like SI or
DI.
DS is the default segment for SI and DI in this mode (ES is paired with DI only in string
instructions).
Eg. MOV AX, [SI]


Based-Indexed addressing mode
In this mode, one base register (BX or BP) and one index register (SI or DI) are used to indirectly
address memory.
The effective address is formed by adding contents of a base register to the contents of the
index register.
Eg. MOV AX, [BX] [DI]


Relative Based-Indexed addressing mode
It is similar to the based indexed mode, but it adds a displacement along with the base register
and index register to form the memory address
The effective address is formed by adding the 8-bit or 16-bit displacement with the addition
result of the base register and the index register.
Eg. MOV AX, 2000H [BX] [DI]
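
The register-based modes above differ only in which terms are added to form the effective address. A minimal Python sketch, assuming hypothetical register contents and a made-up helper name, shows the pattern together with the default segment rule (SS when BP is the base, DS otherwise):

```python
def effective_address(regs, base=None, index=None, disp=0):
    """Return (default segment name, 16-bit offset) for a memory operand."""
    ea = disp
    if base is not None:
        ea += regs[base]       # BX or BP
    if index is not None:
        ea += regs[index]      # SI or DI
    segment = "SS" if base == "BP" else "DS"
    return segment, ea & 0xFFFF  # offsets wrap around at 64 KB


# Hypothetical register contents, chosen only for illustration.
regs = {"BX": 0x1000, "BP": 0x0200, "SI": 0x0010, "DI": 0x0020}

print(effective_address(regs, base="BX"))                            # MOV AX, [BX]
print(effective_address(regs, base="BX", disp=0x1000))               # MOV AX, 1000H [BX]
print(effective_address(regs, base="BX", index="DI"))                # MOV AX, [BX] [DI]
print(effective_address(regs, base="BX", index="DI", disp=0x2000))   # MOV AX, 2000H [BX] [DI]
```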

Control transfer instruction modes
1. Intrasegment direct addressing mode
2. Intrasegment indirect addressing mode
3. Intersegment direct addressing mode
4. Intersegment indirect addressing mode

If the location to which control is to be transferred lies in a segment other than the
current one, the mode is called intersegment mode.
If the destination lies in the same segment, the mode is called intrasegment mode.


Intrasegment direct mode
In this mode, the effective branch address to which the control is to be transferred lies in the
same segment in which the control transfer instruction lies and it appears directly in the
instruction as an immediate displacement value.
The effective branch address is the sum of a signed 8-bit or 16-bit displacement and the
current contents of IP.
Eg. JMP [02] ( Eff. offset Addr = (IP) + 02H; here 02 denotes the displacement, not a memory address )
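
Because the displacement is signed, the same addition can move control forwards or backwards. The sketch below is a Python illustration with an assumed helper name and assumed IP values; IP is taken to already point at the instruction following the jump.

```python
def near_jump_target(ip_after_fetch: int, disp: int, bits: int = 8) -> int:
    """New IP for an intrasegment direct jump with a signed displacement."""
    sign_bit = 1 << (bits - 1)
    signed = (disp & (sign_bit - 1)) - (disp & sign_bit)  # two's complement decode
    return (ip_after_fetch + signed) & 0xFFFF


print(hex(near_jump_target(0x0102, 0x02)))              # forward by 2
print(hex(near_jump_target(0x0102, 0xFE)))              # 0xFE is -2: backward by 2
print(hex(near_jump_target(0x0100, 0xFFF0, bits=16)))   # 16-bit displacement of -16
```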


Intrasegment indirect mode
In this mode, the effective branch address is the contents of the register or memory location
that is accessed using any one of the data addressing modes.
The contents of the IP will be replaced by the effective branch address
Eg. JMP BX ( Eff. offset Addr = (BX) )
In this instruction, control jumps to the address held in the 16-bit register BX.
The value of BX is copied into IP with the CS value unchanged.
The physical address of the next instruction is then obtained from the current contents of CS
and the new value of IP.


Intersegment direct mode
This addressing mode is used to provide means of branching from one segment to another
segment.
Eg. JMP 2000H: 3000H
i.e. the JMP instruction loads CS with 2000H and loads IP with 3000H


Intersegment indirect mode
This addressing mode replaces the contents of the IP and CS with the contents of two
consecutive words in memory that are addressed using indirect addressing.
Eg. JMP [5000H] or JMP [BX], JMP [SI], JMP [DI]
i.e. the contents of [5000H] and [5000H+1] in DS are loaded into IP, and the contents of
[5000H+2] and [5000H+3] in DS are loaded into CS.
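
The four-byte fetch can be sketched in Python as follows. The memory contents and the DS value 1000H are assumptions made for this illustration, and memory is modelled as a plain dictionary of bytes; the 8086 stores each word low byte first.

```python
def load_far_pointer(memory: dict, ds: int, offset: int):
    """Read IP then CS from four consecutive bytes at DS:offset."""
    base = (ds << 4) + offset

    def word(addr: int) -> int:
        # Little-endian: low byte at addr, high byte at addr + 1.
        return memory[addr & 0xFFFFF] | (memory[(addr + 1) & 0xFFFFF] << 8)

    new_ip = word(base)
    new_cs = word(base + 2)
    return new_cs, new_ip


# Assumed memory image: DS = 1000H, so DS:5000H is physical address 15000H.
memory = {0x15000: 0x00, 0x15001: 0x30, 0x15002: 0x00, 0x15003: 0x20}
cs, ip = load_far_pointer(memory, 0x1000, 0x5000)
print(hex(cs), hex(ip))  # 0x2000 0x3000
```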


What is an Instruction Set?
The complete collection of instructions that are understood by a CPU
Machine Code
Binary
Usually represented by programmer as assembly codes


Elements of an Instruction
Operation code (Op code)
◦ Do this
Source Operand reference
◦ To this
Result Operand reference
◦ Put the answer here
Next Instruction Reference
◦ When you have done that, do this...


Instruction Representation
In machine code each instruction has a unique bit pattern
For human consumption (well, programmers anyway) a symbolic representation is used
◦ e.g. ADD, SUB, LOAD

Operands can also be represented in this way

◦ ADD A,B


Simple Instruction Format


Instruction Types
Data processing
Data storage (main memory)
Data movement (I/O)
Program flow control


Number of Addresses (a)
3 addresses
◦ Operand 1, Operand 2, Result
◦ a = b + c;
◦ May be a fourth - next instruction (usually implicit)
◦ Not common
◦ Needs very long words to hold everything


Number of Addresses (b)
2 addresses
◦ One address doubles as operand and result
◦ a=a+b
◦ Reduces length of instruction
◦ Requires some extra work
◦ Temporary storage to hold some results


Number of Addresses (c)
1 address
◦ Implicit second address
◦ Usually a register (accumulator)
◦ Common on early machines


Number of Addresses (d)
0 (zero) addresses
◦ All addresses implicit
◦ Uses a stack
◦ e.g. c = a + b
◦ push a
◦ push b
◦ add
◦ pop c
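
The zero-address sequence above can be traced with a tiny stack machine. This is a Python sketch, not a model of any particular processor; the function name, the tuple encoding of instructions and the values of a and b are all assumptions for illustration.

```python
def run_stack_program(program, variables):
    """Execute push/add/pop instructions on an operand stack."""
    stack = []
    for op, *arg in program:
        if op == "push":
            stack.append(variables[arg[0]])   # operand address is explicit only here
        elif op == "add":
            right = stack.pop()               # both operands are implicit: top two items
            left = stack.pop()
            stack.append(left + right)
        elif op == "pop":
            variables[arg[0]] = stack.pop()
        else:
            raise ValueError(f"unknown instruction: {op}")
    return variables


program = [("push", "a"), ("push", "b"), ("add",), ("pop", "c")]
result = run_stack_program(program, {"a": 2, "b": 3})
print(result["c"])  # 5
```

Note that add itself carries no address at all, which is what makes the format "zero address".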

Instruction Set
8086 supports 6 types of instructions.
1. Data Transfer Instructions
2. Arithmetic Instructions
3. Logical Instructions
4. String manipulation Instructions
5. Processor Control Instructions
6. Control Transfer Instructions


Instruction Set
1. Data Transfer Instructions

Instructions that are used to transfer data/address into registers, memory locations and I/O ports.
Generally involve two operands: a source operand and a destination operand of the same size.
Source: register, memory location or immediate data
Destination: register or memory location
The size should be either a byte or a word.
8-bit data can only be moved to an 8-bit register/memory location, and 16-bit data only to a
16-bit register/memory location.


Instruction Set
1. Data Transfer Instructions

Mnemonics: MOV, XCHG, PUSH, POP, IN, OUT …

MOV reg2/mem, reg1/mem
    MOV reg2, reg1      (reg2) ← (reg1)
    MOV mem, reg1       (mem) ← (reg1)
    MOV reg2, mem       (reg2) ← (mem)

MOV reg/mem, data
    MOV reg, data       (reg) ← data
    MOV mem, data       (mem) ← data

XCHG reg2/mem, reg1
    XCHG reg2, reg1     (reg2) ↔ (reg1)
    XCHG mem, reg1      (mem) ↔ (reg1)


Stack operations
Data is placed on the stack using the PUSH instruction and removed from the stack using the
POP instruction.
When an item is pushed onto the stack, the processor decrements the SP register, then writes
the item at the new top of stack.
When an item is popped off the stack, the processor reads the item from the top of stack, then
increments the SP register.
Therefore, the stack grows down in memory (towards lesser addresses) when items are pushed
on the stack and shrinks up (towards greater addresses) when the items are popped from the
stack.


The Stack


PUSH/POP
PUSH copies a word from the specified operand onto the top of the stack; POP removes the word
at the top of the stack and places it in the specified operand.
The operand in both instructions can be a general purpose register, a segment register or a
memory location, except that CS cannot be the destination of POP.

Instruction Set
1. Data Transfer Instructions

Mnemonics: MOV, XCHG, PUSH, POP, IN, OUT …

PUSH reg16/mem
    PUSH reg16      (SP) ← (SP) - 2
                    MA_S = (SS) × 16₁₀ + (SP)
                    (MA_S ; MA_S + 1) ← (reg16)
    PUSH mem        (SP) ← (SP) - 2
                    MA_S = (SS) × 16₁₀ + (SP)
                    (MA_S ; MA_S + 1) ← (mem)

POP reg16/mem
    POP reg16       MA_S = (SS) × 16₁₀ + (SP)
                    (reg16) ← (MA_S ; MA_S + 1)
                    (SP) ← (SP) + 2
    POP mem         MA_S = (SS) × 16₁₀ + (SP)
                    (mem) ← (MA_S ; MA_S + 1)
                    (SP) ← (SP) + 2
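
The order of the steps matters: PUSH decrements SP before writing, POP reads before incrementing SP. A short Python sketch makes this visible. The class name and the SS/SP values are assumptions for illustration, and memory is modelled as a dictionary of bytes.

```python
class Stack8086:
    """Toy model of the 8086 stack: word-sized PUSH and POP only."""

    def __init__(self, ss: int, sp: int):
        self.ss, self.sp = ss, sp
        self.memory = {}

    def _top(self) -> int:
        return ((self.ss << 4) + self.sp) & 0xFFFFF   # MA_S = (SS) x 16 + (SP)

    def push(self, word: int) -> None:
        self.sp = (self.sp - 2) & 0xFFFF              # decrement first
        addr = self._top()
        self.memory[addr] = word & 0xFF               # low byte at MA_S
        self.memory[addr + 1] = (word >> 8) & 0xFF    # high byte at MA_S + 1

    def pop(self) -> int:
        addr = self._top()
        word = self.memory[addr] | (self.memory[addr + 1] << 8)
        self.sp = (self.sp + 2) & 0xFFFF              # increment afterwards
        return word


stack = Stack8086(ss=0x3000, sp=0x0100)   # assumed example values
stack.push(0x1234)
print(hex(stack.sp))      # 0xfe: the stack grew towards lower addresses
print(hex(stack.pop()))   # 0x1234
print(hex(stack.sp))      # 0x100
```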


Instruction Set
1. Data Transfer Instructions

Mnemonics: MOV, XCHG, PUSH, POP, IN, OUT …

IN A, DX
    IN AL, DX        PORTaddr = (DX);  (AL) ← (PORT)
    IN AX, DX        PORTaddr = (DX);  (AX) ← (PORT)
IN A, addr8
    IN AL, addr8     (AL) ← (addr8)
    IN AX, addr8     (AX) ← (addr8)

OUT DX, A
    OUT DX, AL       PORTaddr = (DX);  (PORT) ← (AL)
    OUT DX, AX       PORTaddr = (DX);  (PORT) ← (AX)
OUT addr8, A
    OUT addr8, AL    (addr8) ← (AL)
    OUT addr8, AX    (addr8) ← (AX)


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
ADD reg2/mem, reg1/mem
    ADD reg2, reg1      (reg2) ← (reg1) + (reg2)
    ADD reg2, mem       (reg2) ← (reg2) + (mem)
    ADD mem, reg1       (mem) ← (mem) + (reg1)

ADD reg/mem, data
    ADD reg, data       (reg) ← (reg) + data
    ADD mem, data       (mem) ← (mem) + data

ADD A, data
    ADD AL, data8       (AL) ← (AL) + data8
    ADD AX, data16      (AX) ← (AX) + data16


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
ADC reg2/mem, reg1/mem
    ADC reg2, reg1      (reg2) ← (reg1) + (reg2) + CF
    ADC reg2, mem       (reg2) ← (reg2) + (mem) + CF
    ADC mem, reg1       (mem) ← (mem) + (reg1) + CF

ADC reg/mem, data
    ADC reg, data       (reg) ← (reg) + data + CF
    ADC mem, data       (mem) ← (mem) + data + CF

ADC A, data
    ADC AL, data8       (AL) ← (AL) + data8 + CF
    ADC AX, data16      (AX) ← (AX) + data16 + CF
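
A typical use of ADC is adding numbers wider than 16 bits: ADD handles the low words and ADC folds the resulting carry into the high words. The Python sketch below illustrates this under assumed helper names; it models only the result and CF, not the other flags.

```python
def add16(a: int, b: int, carry_in: int = 0):
    """16-bit add returning (result, CF). carry_in = 0 models ADD, CF models ADC."""
    total = (a & 0xFFFF) + (b & 0xFFFF) + carry_in
    return total & 0xFFFF, total >> 16


def add32(a: int, b: int):
    """32-bit addition built from a 16-bit ADD followed by a 16-bit ADC."""
    a &= 0xFFFFFFFF
    b &= 0xFFFFFFFF
    low, cf = add16(a & 0xFFFF, b & 0xFFFF)     # ADD on the low words
    high, cf = add16(a >> 16, b >> 16, cf)      # ADC on the high words
    return (high << 16) | low, cf


print(add32(0x0001FFFF, 0x00000001))  # (131072, 0), i.e. 0x00020000 with CF = 0
```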


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
SUB reg2/mem, reg1/mem
    SUB reg2, reg1      (reg2) ← (reg2) - (reg1)
    SUB reg2, mem       (reg2) ← (reg2) - (mem)
    SUB mem, reg1       (mem) ← (mem) - (reg1)

SUB reg/mem, data
    SUB reg, data       (reg) ← (reg) - data
    SUB mem, data       (mem) ← (mem) - data

SUB A, data
    SUB AL, data8       (AL) ← (AL) - data8
    SUB AX, data16      (AX) ← (AX) - data16


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
SBB reg2/mem, reg1/mem
    SBB reg2, reg1      (reg2) ← (reg2) - (reg1) - CF
    SBB reg2, mem       (reg2) ← (reg2) - (mem) - CF
    SBB mem, reg1       (mem) ← (mem) - (reg1) - CF

SBB reg/mem, data
    SBB reg, data       (reg) ← (reg) - data - CF
    SBB mem, data       (mem) ← (mem) - data - CF

SBB A, data
    SBB AL, data8       (AL) ← (AL) - data8 - CF
    SBB AX, data16      (AX) ← (AX) - data16 - CF


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
INC reg/mem
    INC reg8        (reg8) ← (reg8) + 1
    INC reg16       (reg16) ← (reg16) + 1
    INC mem         (mem) ← (mem) + 1

DEC reg/mem
    DEC reg8        (reg8) ← (reg8) - 1
    DEC reg16       (reg16) ← (reg16) - 1
    DEC mem         (mem) ← (mem) - 1

DEC updates ZF (but not CF), so in a loop it can be followed by a conditional jump such as JNZ,
or by an unconditional JMP.

Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
MUL reg/mem (unsigned)
    MUL reg      For byte : (AX) ← (AL) × (reg8)
                 For word : (DX)(AX) ← (AX) × (reg16)
    MUL mem      For byte : (AX) ← (AL) × (mem8)
                 For word : (DX)(AX) ← (AX) × (mem16)

IMUL reg/mem (signed)
    IMUL reg     For byte : (AX) ← (AL) × (reg8)
                 For word : (DX)(AX) ← (AX) × (reg16)
    IMUL mem     For byte : (AX) ← (AL) × (mem8)
                 For word : (DX)(AX) ← (AX) × (mem16)


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…

DIV reg/mem (unsigned)
    DIV reg      For 16-bit ÷ 8-bit :
                 (AL) ← (AX) ÷ (reg8)   Quotient
                 (AH) ← Remainder
    DIV mem      For 16-bit ÷ 8-bit :
                 (AL) ← (AX) ÷ (mem8)   Quotient
                 (AH) ← Remainder
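
The byte form of DIV can be sketched in Python as below. The function name and operand values are assumptions for illustration. On the 8086 a zero divisor, or a quotient too large for AL, raises the divide error interrupt (type 0); the sketch stands in for that with Python exceptions.

```python
def div8(ax: int, divisor: int):
    """Unsigned 16-bit / 8-bit DIV. Returns (AL, AH) = (quotient, remainder)."""
    ax &= 0xFFFF
    divisor &= 0xFF
    if divisor == 0:
        raise ZeroDivisionError("divide error: divisor is zero")
    quotient, remainder = divmod(ax, divisor)
    if quotient > 0xFF:
        raise OverflowError("divide error: quotient does not fit in AL")
    return quotient, remainder


al, ah = div8(0x0064, 0x07)   # 100 / 7
print(al, ah)                 # 14 2
```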


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…

IDIV reg/mem (signed)
    IDIV reg     For 16-bit ÷ 8-bit :
                 (AL) ← (AX) ÷ (reg8)   Quotient
                 (AH) ← Remainder
    IDIV mem     For 16-bit ÷ 8-bit :
                 (AL) ← (AX) ÷ (mem8)   Quotient
                 (AH) ← Remainder


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…
CMP reg2/mem, reg1/mem
    CMP reg2, reg1      Modify flags ← (reg2) - (reg1)
                        If (reg2) > (reg1) then CF=0, ZF=0, SF=0
                        If (reg2) < (reg1) then CF=1, ZF=0, SF=1
                        If (reg2) = (reg1) then CF=0, ZF=1, SF=0
    CMP reg2, mem       Modify flags ← (reg2) - (mem)
                        If (reg2) > (mem) then CF=0, ZF=0, SF=0
                        If (reg2) < (mem) then CF=1, ZF=0, SF=1
                        If (reg2) = (mem) then CF=0, ZF=1, SF=0
    CMP mem, reg1       Modify flags ← (mem) - (reg1)
                        If (mem) > (reg1) then CF=0, ZF=0, SF=0
                        If (mem) < (reg1) then CF=1, ZF=0, SF=1
                        If (mem) = (reg1) then CF=0, ZF=1, SF=0


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…

CMP reg/mem, data
    CMP reg, data       Modify flags ← (reg) - data
                        If (reg) > data then CF=0, ZF=0, SF=0
                        If (reg) < data then CF=1, ZF=0, SF=1
                        If (reg) = data then CF=0, ZF=1, SF=0
    CMP mem, data       Modify flags ← (mem) - data
                        If (mem) > data then CF=0, ZF=0, SF=0
                        If (mem) < data then CF=1, ZF=0, SF=1
                        If (mem) = data then CF=0, ZF=1, SF=0


Instruction Set
2. Arithmetic Instructions
Mnemonics: ADD, ADC, SUB, SBB, INC, DEC, MUL, DIV, CMP…

CMP A, data
    CMP AL, data8       Modify flags ← (AL) - data8
                        If (AL) > data8 then CF=0, ZF=0, SF=0
                        If (AL) < data8 then CF=1, ZF=0, SF=1
                        If (AL) = data8 then CF=0, ZF=1, SF=0
    CMP AX, data16      Modify flags ← (AX) - data16
                        If (AX) > data16 then CF=0, ZF=0, SF=0
                        If (AX) < data16 then CF=1, ZF=0, SF=1
                        If (AX) = data16 then CF=0, ZF=1, SF=0
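
CMP performs the subtraction only to set the flags; neither operand changes. The Python sketch below, with an assumed function name and assumed operand values, derives CF, ZF and SF for unsigned operands. It also shows that SF is simply the top bit of the difference, so the SF values tabulated above describe the common case of small operands rather than every possible pair.

```python
def cmp_flags(dest: int, src: int, bits: int = 16) -> dict:
    """Flags set by CMP dest, src (computes dest - src and discards the result)."""
    mask = (1 << bits) - 1
    dest &= mask
    src &= mask
    result = (dest - src) & mask
    return {
        "CF": int(dest < src),          # a borrow was needed
        "ZF": int(result == 0),         # operands are equal
        "SF": result >> (bits - 1),     # top bit of the difference
    }


print(cmp_flags(5, 3))        # {'CF': 0, 'ZF': 0, 'SF': 0}
print(cmp_flags(3, 5))        # {'CF': 1, 'ZF': 0, 'SF': 1}
print(cmp_flags(4, 4))        # {'CF': 0, 'ZF': 1, 'SF': 0}
print(cmp_flags(0xFFFF, 1))   # {'CF': 0, 'ZF': 0, 'SF': 1}
```

For unsigned comparisons, CF and ZF alone are enough to decide greater, less or equal.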


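Eg: SUM OF ‘N’ CONSECUTIVE NUMBERS
        MOV SI, 2000     ; SI points to the location holding N
        MOV CL, [SI]     ; CL = N (loop counter)
        MOV AX, 0000     ; clear the sum in AL, and AH too since AX is stored below
        MOV BL, 01       ; BL = first number to add
NEXT:   ADD AL, BL       ; AL = AL + BL (the sum must fit in 8 bits)
        INC BL           ; next consecutive number
        DEC CL           ; one iteration fewer; ZF = 1 when CL reaches 0
        JNZ NEXT         ; repeat until CL = 0 (LOOP is a mnemonic, so the label is NEXT)
        MOV DI, 2002     ; DI points to the result location
        MOV [DI], AX     ; store the sum
        HLT              ; stop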
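
The loop can be traced step by step with a small Python model. This is a sketch that mirrors the register usage of the program above; the function name is an assumption, and 8-bit wraparound is modelled with a mask.

```python
def sum_first_n(n: int) -> int:
    """Model of the loop: AL accumulates 1 + 2 + ... + N in 8 bits."""
    al, bl, cl = 0, 1, n & 0xFF
    while True:
        al = (al + bl) & 0xFF   # ADD AL, BL
        bl = (bl + 1) & 0xFF    # INC BL
        cl = (cl - 1) & 0xFF    # DEC CL
        if cl == 0:             # JNZ falls through once ZF = 1
            break
    return al


print(sum_first_n(5))    # 15
print(sum_first_n(22))   # 253, the largest sum that still fits in AL
print(sum_first_n(23))   # 20, because 276 wraps around in 8 bits
```

For larger N the program would need a 16-bit accumulator, for example by adding into AX instead of AL.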