Computer Fundamentals Course Overview

This document provides an introduction to a course on computer fundamentals. It outlines the aims of the course, which are to give an understanding of how computers work, introduce assembly-level programming, and prepare students for future courses. The document then provides an overview of the course content, which will cover the history of computing, how a simple computer operates, input/output, and MIPS assembly language. It also references recommended reading materials and provides a brief timeline of important developments in the history of computing.

Uploaded by

pratibha

Computer Fundamentals

6L for CST/NST 1A
Michaelmas 2010
MWF @ , Arts School A
Aims & Objectives
• This course aims to:
– give you a general understanding of how a
computer works
– introduce you to assembly-level programming
– prepare you for future courses. . .
• At the end of the course you'll be able to:
– describe the fetch-execute cycle of a computer
– understand the different types of information
which may be stored within a computer memory
– write a simple assembly language program
2
Recommended Reading
• This course doesn't follow a particular book
exactly, but any of the following are useful:
– Computer Organization & Design (4th Ed),
Patterson and Hennessy, Morgan Kaufmann 2008
• also used in CST Part 1B "Computer Design"
– Digital Design and Computer Architecture, Harris
and Harris, Morgan Kaufmann 2007
• also used in CST Part 1A "Digital Electronics"
– Structured Computer Organization (5th Ed),
Tanenbaum, Prentice-Hall 2005
• good general overview book; somewhat broader in
scope, and somewhat simpler to digest than above
3
Course Outline
• We'll cover the following topics:
– A Brief History of Computing
– Operation of a Simple Computer
– Input / Output
– MIPS Assembly Language
• This course is new this year, but derives from
Part I of pre-2010 CST 1A Operating Systems
– This will help in finding e.g. past exam questions
• Feel free to ask questions during the lecture
– or after it, or via email – see course web page
4
A Chronology of Early Computing
• (several BC): abacus used for counting
• 1614: logarithms discovered (John Napier)
• 1622: invention of the slide rule (Robert Bissaker)
• 1642: First mechanical digital calculator (Pascal)
• Charles Babbage (U. Cambridge) invents:
– 1812: "Difference Engine"
– 1833: "Analytical Engine"
• 1890: First electro-mechanical punched card data-
processing machine (Hollerith)
• 1905: Vacuum tube/triode invented (De Forest)

5
The War Years…
• 1935: the relay-based IBM 601 reaches 1 MPS.
• 1939: ABC - first electronic digital computer (Atanasoff
& Berry)
• 1941: Z3 - first programmable computer (Zuse)
• Jan 1943: the Harvard Mark I (Aiken)
• Dec 1943: Colossus built at "Station X" – Bletchley Park
• 1945: ENIAC (Eckert & Mauchly, U. Penn):
– 30 tons, 1000 square feet, 140 kW,
– 18K vacuum tubes, 20×10-digit accumulators,
– 100KHz, circa 300 MPS.
– Used to calculate artillery firing tables.
– (1946) blinking lights for the media. . .
• But "programming" is via plug-board: tedious and slow
6
The Von Neumann Architecture

• 1945: von Neumann drafts "EDVAC" report

– design for a stored-program machine
– Eckert & Mauchly mistakenly unattributed

7
Further Progress…
• 1947: "point contact" transistor invented
(Shockley, Bardeen & Brattain)
• 1949: EDSAC, the world's first stored-program
computer (Wilkes & Wheeler)
– 3K vacuum tubes, 300 square feet, 12 kW,
– 500KHz, circa 650 IPS, 225 MPS.
– 1024 17-bit words of memory in mercury
ultrasonic delay lines – early DRAM ;-)
– word "operating system"!
• 1954: TRADIC, first electronic computer
without vacuum tubes (Bell Labs)
8
The Silicon Age
• 1954: first silicon (junction) transistor (TI)
• 1959: first integrated circuit (Kilby & Noyce, TI)
• 1964: IBM System/360, based on ICs.
• 1971: Intel 4004, first micro-processor (Ted
Hoff):
– 2300 transistors, 60 KIPS.
• 1978: Intel 8086/8088 (used in IBM PC).
• 1980: first VLSI chip (> 100,000 transistors)
• Today: ~800M transistors, 45nm, ~3 GHz.
9
Languages and Levels

• Computers programmable with variety of different languages.

– e.g. ML, java, C/C++, python, perl, FORTRAN, Pascal, . . .
• Can describe the operation of a computer at a number of
different levels; however all levels are functionally equivalent
• Levels relate via either (a) translation, or (b) interpretation.
10
Layered Virtual Machines
High-Level Language, e.g. ML      Virtual Machine M5 (Language L5)
Compiled Language (e.g. C++)      Virtual Machine M4 (Language L4)
Assembly Language Programs        Virtual Machine M3 (Language L3)
Operating System Level            Virtual Machine M2 (Language L2)   [Software]
Computer Organization Level       Virtual Machine M1 (Language L1)   [Hardware]
Digital Logic Level               Actual Machine M0 (Language L0)

• Consider a set of machines M0, M1, . . . Mn:

– Machine Mi understands only machine language Li
– Levels 0, -1 covered in Digital Electronics, Physics,
– Level 2 will be covered in CST 1A Operating Systems
• This course focuses on levels 1 and 3
• NB: all levels useful; none "the truth".
11
Digital Electronics in a Slide
• Take an electric circuit but treat "high" voltages as 1,
and "low" voltages as 0
• Using transistors, can build logic gates
– Deterministic functions of inputs (1s and 0s)
• Circuit diagrams use symbols as short hand, e.g.

– AND: output is 1 only if both inputs are 1
– OR: output is 1 if either input is 1
– NOT: output is 1 only if input is 0
– XOR: output is 1 only if inputs are different

• Using feedback (outputs become inputs) we can build

other stuff (latches, flip-flops, ...)
• Low-level circuit diagrams are not examinable
12
A (Simple) Modern Computer

13
A (Simple) Modern Computer
• Processor (CPU): executes programs
• Memory: stores programs & data
• Bus: connects everything together
• Devices: for input and output

14
Registers and the Register File
R00 0x5A R08 0xEA02D1F
R01 0x102034 R09 0x1001D
R02 0x2030ADCB R10 0xFFFFFFFF
R03 0x0 R11 0x1020FC8
R04 0x0 R12 0xFF0000
R05 0x2405 R13 0x37B1CD
R06 0x102038 R14 0x1
R07 0x20 R15 0x20000000

• Computers all about operating on information:

– information arrives into memory from input devices
– memory is a large "byte array" which can hold anything we want
• Computer conceptually takes values from memory, performs
whatever operations, and then stores results back
• In practice, CPU operates on registers:
– a register is an extremely fast piece of on-chip memory
– modern CPUs have between 8 and 128 registers, each 32/64 bits
– data values are loaded from memory into registers before operation
– result goes into register; eventually stored back to memory again.
15
Memory Hierarchy

• Use cache between main memory & registers to hide "slow" DRAM

• Cache made from faster SRAM: more expensive, and hence smaller.
– holds copy of subset of main memory.
• Split of instruction and data at cache level:
– "Harvard" architecture.
• Cache <-> CPU interface uses a custom bus.
• Today have ~8MB cache, ~4GB RAM.
16
Static RAM (SRAM)

• Relatively fast (currently … − … ns).

• This is the Digital Logic view:
• Some wires, some gates, and some "D-latches"
17
Static RAM (SRAM)
• Address Inputs: say which bits we want
• Data Inputs (when we want to store)
• Data Outputs (when we want to read)
• /wr if we want to write (store) data
• /oe if we want to output (read) data
• Each D-latch box can store 1 bit
18
SRAM Reality
• Data held in cross-coupled
inverters.
• One word line, two bit lines.
• To read:
– precharge both bit and bit-bar, and
then strobe word
– bit-bar discharged if there was a 1 in
the cell;
– bit discharged if there was a 0.
• To write:
– precharge either bit (for 1) or
bit-bar (for 0),
– strobe word.

19
Dynamic RAM (DRAM)

• Use a single transistor to store a bit.

• Write: put value on bit lines, strobe word line.
• Read: pre-charge, strobe word line, amplify, latch.
• "Dynamic": refresh periodically to restore charge.
• Slower than SRAM: typically … ns − … ns.
20
DRAM Decoding

• Two stage: row, then column.

• Usually share address pins: RAS & CAS select decoder or mux.
• FPM, EDO, SDRAM faster for same row reads.
21
The Fetch-Execute Cycle

• A special register called PC holds a memory address

– on reset, initialized to 0.
• Then:
1. Instruction fetched from memory address held in PC into instruction buffer (IB)
2. Control Unit determines what to do: decodes instruction
3. Execution Unit executes instruction
4. PC updated, and back to Step 1
• Continues pretty much forever...
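
The four steps above can be sketched as a loop. This is only a toy illustration in Python, not a model of any real machine: the three-instruction set (`LOADI`, `ADD`, `HALT`), the tuple encoding of instructions, and the names `run`, `pc`, `regs` and `program` are all assumptions invented for this sketch.

```python
def run(memory):
    """Toy fetch-execute loop over a list of instruction tuples (hypothetical ISA)."""
    pc = 0                      # on reset, PC is initialized to 0
    regs = [0] * 4              # a tiny register file
    while True:
        ib = memory[pc]         # 1. fetch instruction at address PC into the instruction buffer
        op, *args = ib          # 2. control unit decodes the instruction
        if op == "LOADI":       # 3. execution unit executes it
            d, const = args
            regs[d] = const
        elif op == "ADD":
            d, a, b = args
            regs[d] = regs[a] + regs[b]
        elif op == "HALT":      # a real CPU has no such exit; added so the sketch terminates
            return regs
        else:
            raise ValueError(f"unknown opcode {op!r}")
        pc += 1                 # 4. update PC and go back to step 1


program = [("LOADI", 0, 5), ("LOADI", 1, 7), ("ADD", 2, 0, 1), ("HALT",)]
result = run(program)           # register file after the program stops
```

Branches would fit into step 4 by writing a new value into `pc` instead of incrementing it.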
22
The Execution Unit

• The "calculator" part of the processor.

• Broken into parts (functional units), e.g.
– Arithmetic Logic Unit (ALU).
– Shifter/Rotator.
– Multiplier.
– Divider.
– Memory Access Unit (MAU).
– Branch Unit.
• Choice of functional unit determined by signals from control unit.
23
Arithmetic Logic Unit (ALU)

• Part of the execution unit.

• Inputs from register file; output to register file.
• Performs simple two-operand functions:
– a + b; a – b; a AND b; a OR b; etc
• Typically perform all possible functions; use
function code to select (mux) output.
24
Number Representation

• n-bit register $b_{n-1}b_{n-2}\ldots b_1b_0$ can represent $2^n$ different values.

• Call $b_{n-1}$ the most significant bit (msb), $b_0$ the least significant bit (lsb).
• Unsigned numbers: $val = b_{n-1}2^{n-1} + b_{n-2}2^{n-2} + \cdots + b_1 2^1 + b_0 2^0$
– e.g. $1101_2 = 2^3 + 2^2 + 2^0 = 8 + 4 + 1 = 13$.
• Represents values from 0 to $2^n - 1$ inclusive.
• For large numbers, binary is unwieldy: use hexadecimal (base 16).
• To convert, group bits into groups of 4, e.g.
– $1111101010_2 = 0011|1110|1010_2 = 3EA_{16}$.
• Often use "0x" prefix to denote hex, e.g. 0x3EA.
• Can use dot to separate large numbers into 16-bit chunks, e.g.
– [Link]
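
The weighted-sum rule and the group-into-fours conversion can be checked with a few lines of Python. This is a minimal sketch; the function names `unsigned_value` and `to_hex` are made up for illustration.

```python
def unsigned_value(bits):
    """Value of a bit string b_{n-1}...b_0 read as an unsigned number."""
    n = len(bits)
    # bits[0] is the msb, so it carries weight 2^(n-1)
    return sum(int(b) << (n - 1 - i) for i, b in enumerate(bits))


def to_hex(bits):
    """Convert a bit string to hex by grouping bits into fours from the right."""
    padded = bits.zfill((len(bits) + 3) // 4 * 4)          # left-pad with zeros
    groups = [padded[i:i + 4] for i in range(0, len(padded), 4)]
    return "".join("0123456789ABCDEF"[unsigned_value(g)] for g in groups)
```

With these definitions, `unsigned_value("1101")` gives 13 and `to_hex("1111101010")` gives `"3EA"`, matching the slide.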
25
Signed Numbers
• What about signed numbers? Two main options:
• Sign & magnitude:
– top (leftmost) bit flags if negative; remaining bits make value.
– e.g. byte $10011011_2 \to -0011011_2 = -27$.
– represents range $-(2^{n-1} - 1)$ to $+(2^{n-1} - 1)$ ...
– ... and the bonus value $-0$ (!)
• 2's complement:
– to get $-x$ from $x$, invert every bit and add 1.
– e.g. $+27 = 00011011_2 \Rightarrow -27 = (11100100_2 + 1) = 11100101_2$.
– treat $1000\ldots000_2$ as $-2^{n-1}$
– represents range $-2^{n-1}$ to $+(2^{n-1} - 1)$
• Note:
– in both cases, top-bit means "negative".
– both representations depend on n;
• In practice, all modern computers use 2's complement...
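
The two signed representations can be compared directly in code. The sketch below assumes n-bit patterns held in ordinary Python integers; the helper names `negate`, `as_signed` and `sign_magnitude` are invented for this example.

```python
def negate(x, n):
    """2's complement negation in an n-bit register: invert every bit and add 1."""
    mask = (1 << n) - 1
    return ((x ^ mask) + 1) & mask


def as_signed(x, n):
    """Interpret an n-bit pattern as a 2's complement signed value."""
    return x - (1 << n) if x & (1 << (n - 1)) else x


def sign_magnitude(x, n):
    """Interpret an n-bit pattern as sign & magnitude (top bit flags negative)."""
    magnitude = x & ((1 << (n - 1)) - 1)
    return -magnitude if x >> (n - 1) else magnitude
```

Note that `sign_magnitude(0b10000000, 8)` evaluates to 0: this is the "negative zero" pattern, which 2's complement instead uses for the extra value -128.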
26
Unsigned Arithmetic
• Unsigned addition (using 5-bit registers)

    carries  0 1 1 1 0               carries  1 1 1 0 0
             0 0 1 0 1    5                   1 1 1 1 0   30
           + 0 0 1 1 1    7                 + 0 0 1 1 1    7
    Cn = 0   0 1 1 0 0   12          Cn = 1   0 0 1 0 1    5   Wrong! (by 32 = 2^5)

• Carry bits C0 (=Cin), C1, C2, … Cn (=Cout)
– usually refer to Cn as C, the carry flag
– In addition, if C is 1, we got the wrong answer
• Unsigned subtraction: if C is 0, we "borrowed"
– +27 is 11011, so -27 is 00101

    carries  1 1 0 0                 carries  0 1 1 0
             1 1 1 1 0   30                   0 0 1 1 1    7
           + 0 0 1 0 1  -27                 + 1 0 1 1 0  -10
    C = 1    0 0 0 1 1    3          C = 0    1 1 1 0 1   29   Wrong! (again by 2^5)
27
Signed Arithmetic
• In signed arithmetic, C on its own is useless…
– Instead use overflow flag, V = Cn ⊕ Cn-1

    carries  0 1 1 1 0               carries  1 1 1 0 0
             0 0 1 0 1    5                   0 1 0 1 0   10
           + 0 0 1 1 1    7                 + 0 0 1 1 1    7
    Cn = 0   0 1 1 0 0   12          Cn = 0   1 0 0 0 1  -15
                                     Cn and Cn-1 are different => V=1: wrong (by 32 = 2^5)

    carries  1 0 0 0 0               carries  0 1 1 0 0
             0 1 0 1 0   10                   1 0 1 1 0  -10
           + 1 1 0 0 1   -7                 + 1 0 1 1 0  -10
    Cn = 1   0 0 0 1 1    3          Cn = 1   0 1 1 0 0   12
    C is set… but answer is correct  V=1 => wrong

– Negative flag N = Cn-1 (i.e. msb) flips on overflow
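
The flag rules on the last two slides can be reproduced with a short function. This is a sketch under stated assumptions: registers are n-bit patterns held in Python integers, and the name `add_with_flags` and the dictionary layout of the flags are invented here, not taken from the course.

```python
def add_with_flags(a, b, n=5, carry_in=0):
    """Add two n-bit patterns; return (result, flags) with C, V, N and Z."""
    mask = (1 << n) - 1
    total = (a & mask) + (b & mask) + carry_in
    result = total & mask
    c_out = total >> n                         # Cn: carry out of the top bit
    # Cn-1: carry into the top bit, found by adding only the low n-1 bits
    low = (1 << (n - 1)) - 1
    c_into_msb = ((a & low) + (b & low) + carry_in) >> (n - 1)
    flags = {
        "C": c_out,
        "V": c_out ^ c_into_msb,               # V = Cn xor Cn-1
        "N": result >> (n - 1),                # msb of the result
        "Z": int(result == 0),
    }
    return result, flags
```

For the slide's 5-bit examples, `add_with_flags(0b01010, 0b00111)` (10 + 7) sets V but not C, while `add_with_flags(0b01010, 0b11001)` (10 + -7) sets C but not V, which is why unsigned code looks at C and signed code looks at V.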

28
Arithmetic and Logical Instructions
Mnemonic C/Java Equivalent Mnemonic C/Java Equivalent
and d ← a, b d = a & b; add d ← a, b d = a + b;
xor d ← a, b d = a ^ b; sub d ← a, b d = a - b;
orr d ← a, b d = a | b; rsb d ← a, b d = b - a;
bis d ← a, b d = a | b; shl d ← a, b d = a << b;
bic d ← a, b d = a & (~b); shr d ← a, b d = a >> b;

• Both d and a must be registers; b can be a register or, in most

machines, can also be a (small) constant
• Typically also have addc and subc, which handle carry or borrow
(for multi-precision arithmetic), e.g.
add d0, a0, b0 // compute "low" part
addc d1, a1, b1 // compute "high" part
• May also get:
– Arithmetic shifts: asr and asl(?)
– Rotates: ror and rol
29
1-bit ALU Implementation

• 8 possible functions:
1. a AND b, a AND (NOT b)
2. a OR b, a OR (NOT b)
3. a + b, a + b with carry
4. a − b, a − b with borrow
• To make an n-bit ALU, connect n 1-bit ALUs together (use carry-lookahead on adders)
30
Conditional Execution
• Seen C,N,V flags; now add Z (zero), logical NOR of all bits in output.
• Can predicate execution based on (some combination) of flags, e.g.
subs d, a, b // compute d = a - b
beq proc1 // if equal, goto proc1
br proc2 // otherwise goto proc2

– Java equivalent approximately:

if (a==b) proc1() else proc2();
• On most computers, mainly limited to branches; but on ARM (and
IA64), everything conditional, e.g.
sub d, a, b // compute d = a - b
moveq d, #5 // if equal, d = 5;
movne d, #7 // otherwise d = 7;

– Java equivalent: d = (a==b) ? 5 : 7;

• "Silent" versions useful when don't really want result, e.g. teq, cmp

31
Condition Codes
Suffix    Meaning                  Flags
EQ, Z     Equal, zero              Z == 1
NE, NZ    Not equal, non-zero      Z == 0
MI        Negative                 N == 1
PL        Positive (incl. zero)    N == 0
CS, HS    Carry, higher or same    C == 1
CC, LO    No carry, lower          C == 0
HI        Higher                   C == 1 && Z == 0
LS        Lower or same            C == 0 || Z == 1
VS        Overflow                 V == 1
VC        No overflow              V == 0
GE        Greater than or equal    N == V
GT        Greater than             N == V && Z == 0
LT        Less than                N != V
LE        Less than or equal       N != V || Z == 1
• CS/HS, CC/LO, HI, LS: used to compare unsigned numbers (recall C==0 means we borrowed)
• GE, GT, LT, LE: used to compare signed numbers (note must check both N and V)
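
To see why the unsigned and signed suffixes test different flags, the table can be evaluated on the flags produced by a subtraction. This is a hedged sketch: `compare`, `holds` and the `CONDITIONS` dictionary are names made up for this example, and the subtraction is modelled as a + NOT(b) + 1 on n-bit patterns, so that C == 0 means "borrowed" as on the slide.

```python
CONDITIONS = {
    "EQ": lambda f: f["Z"] == 1,
    "NE": lambda f: f["Z"] == 0,
    "HS": lambda f: f["C"] == 1,
    "LO": lambda f: f["C"] == 0,
    "HI": lambda f: f["C"] == 1 and f["Z"] == 0,
    "LS": lambda f: f["C"] == 0 or f["Z"] == 1,
    "GE": lambda f: f["N"] == f["V"],
    "GT": lambda f: f["N"] == f["V"] and f["Z"] == 0,
    "LT": lambda f: f["N"] != f["V"],
    "LE": lambda f: f["N"] != f["V"] or f["Z"] == 1,
}


def compare(a, b, n=8):
    """Return the flags set by computing a - b on n-bit patterns."""
    mask = (1 << n) - 1
    nb = ~b & mask                              # bitwise NOT of b
    total = (a & mask) + nb + 1                 # a + NOT(b) + 1 == a - b
    result = total & mask
    low = (1 << (n - 1)) - 1
    c_into_msb = ((a & low) + (nb & low) + 1) >> (n - 1)
    c_out = total >> n
    return {"C": c_out, "V": c_out ^ c_into_msb,
            "N": result >> (n - 1), "Z": int(result == 0)}


def holds(cond, a, b, n=8):
    """True if condition `cond` holds after comparing a with b."""
    return CONDITIONS[cond](compare(a, b, n))
```

The byte pattern 0xF6 is 246 unsigned but -10 signed, so `holds("HI", 0xF6, 0x07)` and `holds("LT", 0xF6, 0x07)` are both true: the same bits, compared under two interpretations.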
32
Loads and Stores
• Have variable sized values, e.g. bytes (8-bits), words (16-bits),
longwords (32-bits) and quadwords (64-bits).
• Load or store instructions usually have a suffix to determine the
size, e.g. b for byte, w for word, l for longword.
• When storing > 1 byte, have two main options: big endian and little
endian; e.g. storing 0xDEADBEEF into memory at address 0x4

• If read back a byte from address 0x4, get 0xDE if big-endian, or 0xEF
if little-endian.
– If you always load and store things of the same size, things are fine.
• Today have x86 little-endian; Sparc big-endian; Mips & ARM either.
• Annoying. . . and burns a considerable number of CPU cycles on a
daily basis. . .
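
The byte-order difference is easy to demonstrate. The sketch below uses Python's built-in `int.to_bytes` and `int.from_bytes`; the helper `load_byte` and the choice of base address 0x4 (taken from the slide's example) are illustrative assumptions.

```python
value = 0xDEADBEEF

# The four bytes as they would sit in memory at addresses 0x4..0x7
big = value.to_bytes(4, byteorder="big")
little = value.to_bytes(4, byteorder="little")


def load_byte(memory_bytes, base, address):
    """Read one byte from a memory region that starts at address `base`."""
    return memory_bytes[address - base]


first_big = load_byte(big, 0x4, 0x4)          # byte at 0x4 on a big-endian machine
first_little = load_byte(little, 0x4, 0x4)    # byte at 0x4 on a little-endian machine
```

Reading the whole word back with the same byte order it was stored in (for example `int.from_bytes(little, "little")`) always recovers the original value, which is the slide's point about loading and storing things of the same size.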
33
Accessing Memory
• To load/store values need the address in memory.
• Most modern machines are byte addressed: consider memory a big
array of $2^A$ bytes, where A is the number of address lines in the bus.
• Lots of things considered "memory" via address decoder, e.g.

• Typically devices decode only a subset of low address lines, e.g.

Device Size Data Decodes
ROM 1024 bytes 32-bit A[2:9]
RAM 16384 bytes 32-bit A[2:13]
UART 256 bytes 8-bit A[0:7]
34
Addressing Modes
• An addressing mode tells the computer where the data for an
instruction is to come from.
• Get a wide variety, e.g.
– Register: add r1, r2, r3
– Immediate: add r1, r2, #25
– PC Relative: beq 0x20
– Register Indirect: ldr r1, [r2]
– Base + Displacement: str r1, [r2, #8]
– Indexed: movl r1, (r2, r3)
– Absolute/Direct: movl r1, $0xF1EA0130
– Memory Indirect: addl r1, ($0xF1EA0130)
• Most modern machines are load/store ⇒ only support first five:
– allow at most one memory ref per instruction
– (there are very good reasons for this)

• Note that CPU generally doesn't care what is being held within the
memory – up to programmer to interpret whether data is an
integer, a pixel or a few characters in a novel...
35
Representing Text
• Two main standards:
1. ASCII: 7-bit code holding (English) letters, numbers,
punctuation and a few other characters.
2. Unicode: 16-bit code supporting practically all international
alphabets and symbols.
• ASCII default on many operating systems, and on the early
Internet (e.g. e-mail).
• Unicode becoming more popular (especially UTF-8!)
• In both cases, represent in memory as either strings or
arrays: e.g. "Pub Time!" in ASCII:
– N (here 2) bytes hold length, followed by characters
– Byte per character, terminated with 0

36
Floating Point
• In many cases need very large or very small numbers
• Use idea of "scientific notation", e.g. $n = m \times 10^e$
– m is called the mantissa
– e is called the exponent.
e.g. $C = 3.01 \times 10^8$ m/s.

• For computers, use binary i.e. $n = m \times 2^e$, where m includes
a "binary point".
• Both m and e can be positive or negative; typically
– sign of mantissa given by an additional sign bit, s
– exponent is stored in a biased (excess) format
⇒ use $n = (-1)^s m \times 2^{e-b}$, where $0 \le m < 2$, and b is the bias
• e.g. with a 4-bit mantissa and a 3-bit bias-3 exponent, you
can represent positive range $[0.001_2 \times 2^{-3}, 1.111_2 \times 2^{4}]$
= [ (1/8)(1/8), (15/8)(16) ] = [ 1/64 , 30 ]
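
The range of this toy format can be verified by enumerating every positive value it encodes. This is a minimal sketch using Python's `fractions.Fraction` for exact arithmetic; it assumes the 4-bit mantissa has the form x.xxx (one bit before the binary point), and the variable names are invented for the example.

```python
from fractions import Fraction

BIAS = 3

# Every positive value m * 2^(e - BIAS): mantissa x.xxx is k/8, stored exponent e is 0..7
values = {Fraction(k, 8) * Fraction(2) ** (e - BIAS)
          for k in range(1, 16)       # 4-bit mantissa, excluding zero
          for e in range(8)}          # 3-bit stored exponent

smallest, largest = min(values), max(values)
encodings = 15 * 8                    # number of distinct (mantissa, exponent) bit patterns
```

Here `len(values)` comes out smaller than `encodings`, because several bit patterns denote the same number (for example 0.010 × 2^0 and 0.001 × 2^1). Normalising the mantissa removes exactly this kind of redundancy.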
37
IEEE Floating Point
• To avoid redundancy, in practice modern computers use IEEE
floating point with normalised mantissa $m = 0.xx \ldots x_2$
⇒ $n = (-1)^s((1 + m) \times 2^{e-b})$
• Both single precision (32 bits) and double precision (64 bits)

• IEEE fp reserves e = 0 and e = max:

– ±0 (!): both e and m zero.
– ±∞: e = max, m zero.
– NaNs: e = max, m non-zero.
– denorms: e = 0, m non-zero
• Normal positive range $[2^{-126}, \sim 2^{128}]$ for single, or $[2^{-1022}, \sim 2^{1024}]$ for
double precision.
• NB: still only $2^{32}$/$2^{64}$ values, just spread out.
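
The field layout and the reserved exponent values can be inspected with Python's standard `struct` module. This sketch handles single precision only (1 sign bit, 8 exponent bits with bias 127, 23 fraction bits); the function names `decode_single` and `classify` are made up for illustration.

```python
import struct


def decode_single(x):
    """Split a float, rounded to IEEE single precision, into (s, e, m) bit fields."""
    (bits,) = struct.unpack(">I", struct.pack(">f", x))
    s = bits >> 31
    e = (bits >> 23) & 0xFF           # stored (biased) exponent, bias b = 127
    m = bits & 0x7FFFFF               # 23 fraction bits of the mantissa
    return s, e, m


def classify(x):
    """Name the IEEE category of x using the reserved exponents e = 0 and e = max."""
    s, e, m = decode_single(x)
    if e == 0:
        return "zero" if m == 0 else "denorm"
    if e == 0xFF:
        return "inf" if m == 0 else "nan"
    return "normal"
```

For example, `decode_single(-2.5)` returns `(1, 128, 0x200000)`: sign 1, true exponent 128 - 127 = 1, and fraction 0.25, so the value is -(1 + 0.25) × 2^1.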
38
Data Structures
• Records / structures: each field stored as an offset from a
base address
• Variable size structures: explicitly store addresses (pointers)
inside structure, e.g.
datatype rec = node of int * int * rec
| leaf of int;
val example = node(4, 5, node(6, 7, leaf(8)));
• Imagine example is stored at address 0x1000:

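(Figure: memory layout of example, starting at 0x1000. Annotations: "magic" node
tag => 4 words; last word of each node points to next node; leaf tag says we're done…)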
39
Instruction Encoding
• An instruction comprises:
a. an opcode: specifies what to do.
b. zero or more operands: where to get values
• Old machines (and x86) use variable length encoding for
high code density; most other modern machines use fixed
length encoding for simplicity, e.g. ARM ALU instructions:
31 28 27 26 25 24 21 20 19 16 15 12 11 0

Cond 00 I Opcode S Ra Rd Operand 2

and r13, r13, #255

1110 00 1 0000 0 1101 1101 000011111111

bic r03, r03, r02

1110 00 0 1110 0 0011 0011 000000000010

cmp r01, r02

1110 00 0 1010 1 0001 0000 000000000010
40
Fetch-Execute Cycle Revisited

1. CU fetches & decodes instruction and generates (a) control signals and (b) operand
information.
2. In EU, control signals select functional unit ("instruction class") and operation.
3. If ALU, then read 1–2 registers, perform op, and (probably) write back result.
4. If BU, test condition and (maybe) add value to PC.
5. If MAU, generate address ("addressing mode") and use bus to read/write value.
6. Repeat ad infinitum
41
A (Simple) Modern Computer

Devices: for input

and output

42
Input/Output Devices
• Devices connected to processor via a bus (e.g. PCI)
• Includes a wide range:
– Mouse,
– Keyboard,
– Graphics Card,
– Sound card,
– Floppy drive,
– Hard-Disk,
– CD-Rom,
– Network card,
– Printer,
– Modem
– etc.
• Often two or more stages involved (e.g. USB, IDE, SCSI,
RS-232, Centronics, etc.)
43
UARTs

• UART = Universal Asynchronous Receiver/Transmitter:

– stores 1 or more bytes internally
– converts parallel to serial
– outputs according to RS-232
• Various baud rates (e.g. 1,200 – 115,200)
• Slow and simple. . . and very useful.
• Make up "serial ports" on PC
• Max throughput 14.4KBytes; variants up to 56K (for
modems).
44
Hard Disks
• Whirling bits of
(magnetized) metal. . .
• Bit like a double-sided
record player: but
rotates 3,600–12,000
times a minute ;-)
• To read/write data:
– move arms to cylinder
– wait for sector
– activate head
• Today capacities are
around ~500 GBytes
(=500 × $2^{30}$ bytes)

45
Graphics Cards

• Essentially some RAM (framebuffer) and some digital-to-analogue

circuitry (RAMDAC) – latter only required for CRTs
• (Today usually also have powerful GPU for 3D)
• Framebuffer holds 2-D array of pixels: picture elements.
• Various resolutions (640x480, 1280x1024, etc) and color depths:
8-bit (LUT), 16-bit (RGB=555), 24-bit (RGB=888), 32-bit (RGBA=888)
• Memory requirement = x × y × depth
• e.g. 1280x1024 @ 32bpp needs 5,120KB for screen
• => full-screen 50Hz video requires 250 MBytes/s (or 2Gbit/s!)
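
The memory and bandwidth figures follow directly from the formula x × y × depth. A quick check in Python, with the helper name `framebuffer_bytes` invented for this sketch and KB/MB taken as 1024-based units:

```python
def framebuffer_bytes(width, height, bits_per_pixel):
    """Memory requirement = x * y * depth, converted from bits to bytes."""
    return width * height * bits_per_pixel // 8


size = framebuffer_bytes(1280, 1024, 32)
size_kb = size // 1024                            # KB needed for one screen
bandwidth_mb_per_s = size * 50 / (1024 * 1024)    # full-screen updates at 50Hz
```

This gives 5,120 KB per frame and 250 MBytes/s at 50Hz, in agreement with the slide.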
46
Buses

• Bus = a collection of shared communication wires:

– low cost
– versatile / extensible
– potential bottle-neck
• Typically comprises address lines, data lines and control lines
– and of course power/ground
• Operates in a master-slave manner, e.g.
1. master decides to e.g. read some data
2. master puts address onto bus and asserts "read"
3. slave reads address from bus and retrieves data
4. slave puts data onto bus
5. master reads data from bus
47
Bus Hierarchy

• In practice, have lots of different buses with different

characteristics e.g. data width, max #devices, max length.
• Most buses are synchronous (share clock signal).
48
Synchronous Buses

Figure shows a read transaction which requires three bus cycles

1. CPU puts addr onto address lines and, after settle, asserts control lines.
2. Device (e.g. memory) fetches data from address.
3. Device puts data on data lines, CPU latches value and then finally
deasserts control lines.
• If device not fast enough, can insert wait states
• Faster clock/longer bus can give bus skew
49
Asynchronous Buses

• Asynchronous buses have no shared clock; instead use handshaking, e.g.

– CPU puts address onto address lines and, after settle, asserts control lines
– next, CPU asserts /SYN to say everything ready
– once memory notices /SYN, it fetches data from address and puts it onto bus
– memory then asserts /ACK to say data is ready
– CPU latches data, then deasserts /SYN
– finally, Memory deasserts /ACK
• More handshaking if multiplex address & data lines
50
Interrupts
• Bus reads and writes are transaction based: CPU requests
something and waits until it happens.
• But e.g. reading a block of data from a hard-disk takes ~2ms, which
might be over 10,000,000 clock cycles!
• Interrupts provide a way to decouple CPU requests from device
responses.
1. CPU uses bus to make a request (e.g. writes some special values to
addresses decoded by some device).
2. Device goes off to get info.
3. Meanwhile CPU continues doing other stuff.
4. When device finally has information, raises an interrupt.
5. CPU uses bus to read info from device.
• When interrupt occurs, CPU vectors to handler, then resumes using
special instruction, e.g.

51
Interrupts (2)
• Interrupt lines (~… − …) are part of the bus.
• Often only 1 or 2 pins on chip ⇒ need to encode.
• e.g. ISA & x86:

1. Device asserts IRx

2. PIC asserts INT
3. When CPU can interrupt, strobes INTA
4. PIC sends interrupt number on D[0:7]
5. CPU uses number to index into a table in memory which
holds the addresses of handlers for each interrupt.
6. CPU saves registers and jumps to handler
52
Direct Memory Access (DMA)
• Interrupts are good, but even better is a device which
can read and write processor memory directly.
• A generic DMA "command" might include
– source address
– source increment / decrement / do nothing
– sink address
– sink increment / decrement / do nothing
– transfer size
• Get one interrupt at end of data transfer
• DMA channels may be provided by devices themselves:
– e.g. a disk controller
– pass disk address, memory address and size
– give instruction to read or write
• Also get "stand-alone" programmable DMA controllers.
53
Computer Organization: Summary
• Computers made up of four main parts:
1. Processor (including register file, control unit and execution
unit – with ALU, memory access unit, branch unit, etc),
2. Memory (caches, RAM, ROM),
3. Devices (disks, graphics cards, etc.), and
4. Buses (interrupts, DMA).
• Information represented in all sorts of formats:
– signed & unsigned integers,
– strings,
– floating point,
– data structures,
– instructions.
• Can (hopefully) understand all of these at some level, but
gets pretty complex...
• Next up: bare bones programming with MIPS assembly…
54
What is MIPS?
• A Reduced Instruction Set Computer (RISC)
microprocessor:
– Developed at Stanford in the 1980s [Hennessy]
– Designed to be fast and simple
– Originally 32-bit; today also get 64-bit versions
– Primarily used in embedded systems (e.g. routers,
TiVo's, PSPs…)
– First was R2000; later R3000, R4000, …
• Also used by big-iron SGI machines (R1x000)

55
MIPS Instructions
• MIPS has 3 instruction formats:
– R-type - register operands
– I-type - immediate operands
– J-type - jump operands
• All instructions are 1 word long (32 bits)
• Examples of R-type instructions:
add $8, $1, $2 # $8 <= $1 + $2
sub $12, $6, $3 # $12 <= $6 - $3
and $1, $2, $3 # $1 <= $2 & $3
or $1, $2, $3 # $1 <= $2 | $3
• Register 0 ($0) always contains zero
add $8, $0, $0 # $8 <= 0
add $8, $1, $0 # $8 <= $1
56
R-Type Instructions
• These take three register operands ($0 .. $31)
• R-type instructions have six fixed-width fields:
31 26 25 21 20 16 15 11 10 6 5 0

opcode Rs Rt Rd shamt funct

6 bits 5 bits 5 bits 5 bits 5 bits 6 bits

opcode basic operation of the instruction

Rs the first register source operand
Rt the second register source operand
Rd: the register destination operand; gets result of the operation
shamt shift amount (0 if not shift instruction)
funct This field selects the specific variant of the operation and is
sometimes called the function code; e.g. for opcode 0,
if (funct == 32) => add ; if (funct == 34) => sub

57
I-Type Instructions
31 26 25 21 20 16 15 0

opcode Rs Rt immediate value

6 bits 5 bits 5 bits 16 bits

• I = Immediate
– Value is encoded in instruction & available directly
– MIPS allows 16-bit values (only 12-bits on ARM)
• Useful for loading constants, e.g:
– li $7, 12 # load constant 12 into reg7
• This is a big win in practice since >50% of
arithmetic instructions involve constants!
• MIPS supports several immediate mode
instructions: opcode determines which one…
58
Immediate Addressing on MIPS
• or, and, xor and add instructions have immediate
forms which take an "i" suffix, e.g:
ori $8, $0, 0x123 # puts 0x00000123 into r8
ori $9, $0, -6 # puts 0x0000fffa into r9
addi $10, $0, 0x123 # puts 0x00000123 into r10
addi $11, $0, -6 # puts 0xfffffffa into r11
# (note sign extension...)
• lui instruction loads upper 16 bits with a constant
and sets the least-significant 16 bits to zero
lui $8, 0xabcd # puts 0xabcd0000 into r8
ori $8, $8, 0x123 # sets just low 16 bits
# result: r8 = 0xabcd0123
• li pseudo-instruction (see later) generates lui/ori
or ori code sequence as needed...
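
The difference between zero extension (`ori`) and sign extension (`addi`) of the 16-bit immediate can be modelled in a few lines. This is a sketch of the arithmetic only, not of the MIPS hardware: the Python functions below are hypothetical stand-ins that take register values and return the 32-bit result.

```python
MASK32 = 0xFFFFFFFF


def zero_extend16(imm):
    """Keep the low 16 bits; upper 16 bits become zero."""
    return imm & 0xFFFF


def sign_extend16(imm):
    """Keep the low 16 bits; copy bit 15 into the upper 16 bits."""
    imm &= 0xFFFF
    return (imm | 0xFFFF0000) if imm & 0x8000 else imm


def ori(rs, imm):
    return (rs | zero_extend16(imm)) & MASK32


def addi(rs, imm):
    return (rs + sign_extend16(imm)) & MASK32


def lui(imm):
    """Load the constant into the upper 16 bits; low 16 bits are zero."""
    return (imm & 0xFFFF) << 16


r8 = ori(lui(0xABCD), 0x123)    # the lui/ori pair used to build a 32-bit constant
```

Note that the `ori` must take the register just written by `lui` as its source for the two halves to combine into 0xabcd0123.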
59
J-Type Instruction
• Last instruction format: Jump-type (J-Type)
31 26 25 0

opcode target address (in #instructions)

6 bits 26 bits

• Only used by unconditional jumps, e.g.

j dest_addr # jump to (target<<2)
• Cannot directly jump more than $2^{26}$
instructions away (see later…)
• Branches use I-type, not J-type, since must
specify 2 registers to compare, e.g.
beq $1, $2, dest # goto dest iff $1==$2
60
Big Picture
High level Language:
    x = a - b + c - d;

Assembly Language:
    sub $10, $4, $5
    sub $11, $6, $7
    add $12, $10, $11

Machine Code (fields: opcode, Rs, Rt, Rd, shamt, funct):
    0  4  5 10  0 34
    0  6  7 11  0 34
    0 10 11 12  0 32

    000000 00100 00101 01010 00000 100010
    000000 00110 00111 01011 00000 100010
    000000 01010 01011 01100 00000 100000

Assumes that a, b, c, d are in $4, $5, $6, $7 somehow
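
The step from assembly to machine code is just bit packing of the six R-type fields. The sketch below is illustrative: `encode_rtype` and the `FUNCT` table are names invented here, and only the two function codes given on the R-type slide (32 for add, 34 for sub) are included.

```python
FUNCT = {"add": 32, "sub": 34}    # function codes for opcode 0


def encode_rtype(mnemonic, rd, rs, rt, shamt=0):
    """Pack `mnemonic $rd, $rs, $rt` into a 32-bit R-type instruction word."""
    opcode = 0
    return ((opcode << 26) | (rs << 21) | (rt << 16)
            | (rd << 11) | (shamt << 6) | FUNCT[mnemonic])


words = [
    encode_rtype("sub", 10, 4, 5),      # sub $10, $4, $5
    encode_rtype("sub", 11, 6, 7),      # sub $11, $6, $7
    encode_rtype("add", 12, 10, 11),    # add $12, $10, $11
]
```

Printing each word with `format(w, "032b")` reproduces the three rows of machine code shown above, with the field boundaries at bits 26, 21, 16, 11 and 6.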

61
MIPS Register Names
• Registers are used for specific purposes, by convention
• For example, registers 4, 5, 6 and 7 are used as parameters or
arguments for subroutines (see later)
• Can be specified as $4, $5, $6, $7 or as $a0, $a1, $a2 and $a3
• Other examples:
$zero $0 zero
$at $1 assembler temporary
$v0, $v1 $2, $3 expression eval & result
$t0...$t7 $8...$15 temporary registers
$s0...$s7 $16...$23 saved temporaries
$t8, $t9 $24, $25 temporary
$k0, $k1 $26, $27 kernel temporaries
$gp $28 global pointer
$sp $29 stack pointer
$fp $30 frame pointer
$ra $31 return address
62
Our first program: Hello World!
.text # begin code section
.globl main
main: li $v0, 4 # system call for print string
la $a0, str # load address of string to print
syscall # print the string
li $v0, 10 # system call for exit
syscall # exit

.data # begin data section

str: .asciiz "Hello world!\n"
# NUL terminated string, as in C

• Comments after # to aid readability

• Assembly language 5-20x line count of high level languages
• (And empirical wisdom is that development time strongly related to
number of lines of code...)
63
Assembler Directives
• On previous slide saw various things that weren't assembly code
instructions: labels and directives
• These are here to assist assembler to do its job ...
• ... but do not necessarily produce results in memory
• Examples:
main: tell assembler where program starts
str: user-friendly[er] way to refer to a memory address

.text tells assembler that following is part of code area

.data following is part of data area
.ascii str insert ASCII string into next few bytes of memory
.asciiz str ...as above, but add null byte at end
.word n1,n2 reserve space for words and store values n1, n2 etc. in them
.half n1,n2 reserve space for halfwords and store values n1, n2 in them
.byte n1,n2 reserve space for bytes and store values n1, n2 in them
.space n reserve space for n bytes
.align m align the next datum on $2^m$ byte boundary, e.g. .align 2
aligns on word boundary
64
Pseudo Instructions
• Assemblers can also support other things that look like
assembly instructions… but aren't!
– These are called pseudo-instructions and are there to
make life easier for the programmer
– Can be built from other actual instructions
• Some examples are:
Pseudo Instruction        Translated to
move $1,$2                add $1, $0, $2
li $1, 678                ori $1, $0, 678
la $8, 6($1)              addi $8, $1, 6
la $8, label              lui $1, label[31:16]
                          ori $8, $1, label[15:0]
b label                   bgez $0, label
beq $8, 66, label         ori $1, $0, 66
                          beq $1, $8, label
65
Accessing Memory (Loads & Stores)
• Can load bytes, half-words, or words
lb $a0,c($s1) # load byte; $a0 = Mem[$s1+c]
lh $a0,c($s1) # load half-word [16 bits]
lw $a0,c($s1) # load word [32 bits]
– gets data from memory and puts into a register
– c is a [small] constant; can omit if zero
• Same for stores using sb, sh, and sw
• lw, sw etc are I-type instructions:
– destination register ($a0), source register ($s1), and
16-bit immediate value (constant c)
• However assembler also allows lw/sw (and la)
to be pseudo-instructions e.g.
lw $a0, addr ---> lui $1, addr[31:16]
lw $a0, addr[15:0]($1)
66
Control Flow Instructions
Assembly language has very few control structures…
• Branch instructions: if <cond> then goto <label>
beqz $s0, label # if $s0==0 goto label
bnez $s0, label # if $s0!=0 goto label
bge $s0, $s1, label # if $s0>=$s1 goto label
ble $s0, $s1, label # if $s0<=$s1 goto label
blt $s0, $s1, label # if $s0<$s1 goto label
beq $s0, $s1, label # if $s0==$s1 goto label
bgez $s0, label # if $s0>=0 goto label

• Jump instructions: (unconditional goto):

j label # goto instruction at “label:”
jr $a0 # goto instruction at Memory[$a0]

• We can build while-loops, for-loops, repeat-until loops, and

if-then-else constructs from these…
67
if-then-else
if ($t0==$t1) then /*blockA */ else /* blockB */

beq $t0, $t1, blockA # if equal goto A

j blockB # ... else goto B

blockA:
… instructions of blockA …
j exit

blockB:
… instructions of blockB …

exit:
… next part of program …
68
repeat-until
repeat … until $t0 > $t1

… initialize $t0, e.g. to 0 …

loop:
… instructions of loop …
add $t0, $t0, 1 # increment $t0
ble $t0, $t1, loop # if <= $t1, loop

• Other loop structures (for-loops, while-loops,

etc) can be constructed similarly
69
Jump Instructions
• Recall J-Type instructions have 6-bit opcode
and 26-bit target address
– in #instructions (words), so effectively $2^{28}$ bytes
• Assembler converts very distant conditional
branches into inverse-branch and jump, e.g.
beq $2, $3, very_far_label
/* next instruction */
• … is converted to:
bne $2, $3, L1; # continue
j very_far_label; # branch far
L1:
/* next instruction */
70
Indirect Jumps
• Sometimes we need to jump (or branch) more than $2^{28}$
bytes – can use indirect jump via register
jr $t1 # transfer control to
# memory address in $t1
• Can also use to build a jump table
• e.g. suppose we want to branch to different locations
depending on the value held in $a0
.data
jtab: .word l1, l2, l3, l4, l5, l6
.text
main: ... instructions setting $a0, etc ...
lw $t7, jtab($a0) # load address ($a0 = byte offset, a multiple of 4)
jr $t7 # jump
l1: ... instructions ...
l2: ... instructions ...
l3: ... instructions ... (and so on...)
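The same idea in a high-level language is a table of functions indexed by a value. A minimal sketch, assuming (as in the lw above) that $a0 holds a byte offset into a table of 4-byte entries; the case_* functions and dispatch are hypothetical names:

```python
def case_one():
    return "l1"


def case_two():
    return "l2"


def case_three():
    return "l3"


# Plays the role of "jtab: .word l1, l2, l3": one 4-byte entry per target.
JTAB = [case_one, case_two, case_three]
WORD_SIZE = 4


def dispatch(a0):
    """a0 is a byte offset into the table, as in lw $t7, jtab($a0)."""
    if a0 % WORD_SIZE != 0 or not 0 <= a0 // WORD_SIZE < len(JTAB):
        raise ValueError(f"bad jump-table offset: {a0}")
    target = JTAB[a0 // WORD_SIZE]   # lw $t7, jtab($a0)
    return target()                  # jr $t7


print(dispatch(0), dispatch(8))
```

The assembly version performs no bounds check, so a bad value in $a0 would load an arbitrary word and jump to it.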
The Spim Simulator

• "1/25th the performance at none of the cost"
• Simulates a MIPS-based machine with some
basic virtual hardware (console)
• Installation
1. From the Patterson & Hennessy textbook CD
2. From the internet
[Link]
• Versions for Windows, Mac and Linux

PC Spim
[Screenshot: PCSpim main window, annotated with: controls to reset machine, load asm programs, run them, etc; the .text section (program); the .data section and the stack; diagnostic messages]

Using SPIM
• Combines an assembler, a simulator and BIOS
• Assembly language program prepared in your
favourite way as a text file
• Label your first instruction as main, e.g.
main: add $5, $3, $4 # comment
• Read program into SPIM which will assemble it and
may indicate assembly errors (1 at a time!)
• Execute your program (e.g. hit F5)
• Results output to window which simulates console
(or by inspection of registers)
• Let's look at an example...
SPIM System Calls
• As you'll have noticed, SPIM allows us to use
special code sequences, e.g.
li $a0, 10 # load argument $a0=10
li $v0, 1 # call code to print integer
syscall # print $a0
– will print out 10 on the console
• The syscall instruction does various things
depending on the value of $v0
– this is very similar to how things work in a modern
PC or Mac BIOS, albeit somewhat simpler
• We'll see why these are called system calls
later on in the course…
SPIM System Call Codes
Procedure      code $v0   argument
print int      1          $a0 contains number
print float    2          $f12 contains number
print double   3          $f12 contains number
print string   4          $a0 address of string
read int       5          res returned in $v0
read float     6          res returned in $f0
read double    7          res returned in $f0
read string    8          $a0 buffer, $a1 length
exit program   10         /* none */
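One way to picture what syscall does with this table is a dispatcher keyed on $v0. The sketch below is a toy model, not SPIM's implementation; it covers only the integer and exit codes, and the regs dictionary, console and keyboard objects are assumptions made for illustration:

```python
import io


def syscall(regs, console, keyboard):
    """Toy dispatcher on regs["v0"]; returns False when the program should stop."""
    code = regs["v0"]
    if code == 1:                      # print int: number is in $a0
        console.write(str(regs["a0"]))
    elif code == 5:                    # read int: result returned in $v0
        regs["v0"] = int(keyboard.readline())
    elif code == 10:                   # exit program
        return False
    else:
        raise NotImplementedError(f"code {code} is not modelled here")
    return True


console, keyboard = io.StringIO(), io.StringIO("42\n")
regs = {"v0": 1, "a0": 10}
syscall(regs, console, keyboard)       # li $a0,10 / li $v0,1 / syscall
regs["v0"] = 5
syscall(regs, console, keyboard)       # read int from the simulated keyboard
print(console.getvalue(), regs["v0"])
```

Note that read int overwrites the call code in $v0 with its result, which is why each call in the assembly examples reloads $v0 first.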
Example: Print numbers 1 to 10
.data
newln: .asciiz "\n"
.text
.globl main
main:
li $s0, 1 # $s0 = loop counter
li $s1, 10 # $s1 = upper bound of loop
loop:
move $a0, $s0 # print loop counter $s0
li $v0, 1
syscall
li $v0, 4 # syscall for print string
la $a0, newln # load address of string
syscall
addi $s0, $s0, 1 # increase counter by 1
ble $s0, $s1, loop # if ($s0<=$s1) goto loop
li $v0, 10 # exit
syscall
Example: Increase array elems by 5
.text
.globl main
main:
la $t0, Aaddr # $t0 = pointer to array A
lw $t1, len # $t1 = length (of array A)
sll $t1, $t1, 2 # $t1 = 4*length
add $t1, $t1, $t0 # $t1 = address(A)+4*length
loop:
lw $t2, 0($t0) # $t2 = A[i]
addi $t2, $t2, 5 # $t2 = $t2 + 5
sw $t2, 0($t0) # A[i] = $t2
addi $t0, $t0, 4 # i = i+1
bne $t0, $t1, loop # if $t0 != $t1 (not yet at end) goto loop
# ... exit here ...
.data
Aaddr: .word 0,2,1,4,5 # array with 5 elements
len: .word 5
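The pointer arithmetic in this example can be mirrored over a byte array. A sketch, assuming little-endian 32-bit words (add_five_to_words is an illustrative name, and the byte order is an assumption, not something the slide states):

```python
import struct


def add_five_to_words(memory, base, length):
    """Walk `length` 32-bit words starting at byte offset `base`, adding 5 to each."""
    t0 = base                    # la  $t0, Aaddr
    t1 = base + 4 * length       # sll/add: address one past the last element
    while t0 != t1:              # bne $t0, $t1, loop
        (t2,) = struct.unpack_from("<i", memory, t0)   # lw   $t2, 0($t0)
        struct.pack_into("<i", memory, t0, t2 + 5)     # addi + sw
        t0 += 4                                        # addi $t0, $t0, 4
    return memory


memory = bytearray(struct.pack("<5i", 0, 2, 1, 4, 5))
add_five_to_words(memory, 0, 5)
print(struct.unpack("<5i", memory))
```

Unlike the assembly, this version tests before the first pass; the slide's loop tests at the bottom, so it relies on the array having at least one element.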

Procedures
• Long assembly programs get very unwieldy!
• Procedures or subroutines (similar to methods
or functions) allow us to structure programs
• Makes use of a new J-type instruction, jal:
• jal addr # jump-and-link
– stores (current address + 4) into register $ra
– jumps to address addr
• jr $ra
– we've seen this before: an indirect jump
– after a jal, this will return back to the main code
Example Using Procedures
.data
newline: .asciiz "\n"
.text
print_eol: # procedure to print "\n"
li $v0, 4 # load system call code
la $a0, newline # load string to print
syscall # perform system call
jr $ra # return
print_int: # prints integer in $a0
li $v0, 1 # load system call code
syscall # perform system call
jr $ra # return
main:
li $s0, 1 # $s0 = loop counter
li $s1, 10 # $s1 = upper bound
loop: move $a0, $s0 # print loop counter
jal print_int #
jal print_eol # print "\n"
addi $s0, $s0, 1 # increment loop counter
ble $s0, $s1, loop # continue unless $s0>$s1
li $v0, 10 # exit (otherwise execution runs off the end)
syscall
Non-leaf Procedures
• Procedures are great, but what if we have
procedures invoking procedures?
procA: … instructions to do stuff procA does …
li $a0, 25 # prep to call procB
jal procB # $ra = next address (overwrites procA's return address)
jr $ra # meant to return to caller, but $ra now points here: INFINITE LOOP!
procB: … instructions to do stuff procB does …
jr $ra # return to caller
main:
li $a0, 10 # prep to call procA
jal procA # $ra = next address
… rest of program …
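The failure can be reproduced with a toy machine that has a single return-address register. This is a sketch only: the instruction tuples and the run function are invented for illustration, and addresses are list indices, so the link value is pc + 1 rather than pc + 4:

```python
def run(program, entry, max_steps=20):
    """Run a toy program of ('jal', target) / ('jr',) / ('halt',) steps."""
    pc, ra, trace = entry, None, []
    for _ in range(max_steps):
        op = program[pc]
        trace.append(pc)
        if op[0] == "jal":
            ra, pc = pc + 1, op[1]   # link: remember the next instruction
        elif op[0] == "jr":
            pc = ra                  # return to whatever ra holds now
        elif op[0] == "halt":
            return trace, True
        else:
            pc += 1
    return trace, False              # step limit hit: halt was never reached


program = [
    ("jal", 2),   # 0 main:  jal procA
    ("halt",),    # 1        ... rest of program ...
    ("jal", 4),   # 2 procA: jal procB   (overwrites ra with 3)
    ("jr",),      # 3        jr $ra      (ra is 3, so this jumps to itself)
    ("jr",),      # 4 procB: jr $ra
]
trace, finished = run(program, entry=0)
print(finished, trace[:6])
```

The trace settles on address 3 and never reaches the halt at address 1, because the return address for main was lost when procA executed its own jal.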
The Stack
• Problem was that there's only one $ra!
– generally need to worry about other regs too
• We can solve this by saving the contents of
registers in memory before doing procedure
– Restore values from memory before return
• The stack is a way of organizing data in memory
which is ideally suited for this purpose
– Has so-called last-in-first-out (LIFO) semantics
– push items onto the stack, pop items back off
• Think of a pile of paper on a desk
– pushing an item is adding a piece of paper
– popping is removing it
– size of pile grows and shrinks over time
The Stack in Practice
• Register $sp holds address of top of stack
– In SPIM this is initialized to [Link]
• A 'push' stores data, and decrements $sp
• A 'pop' reads back data, and increments $sp
# $a0 holds 0xFEE
# 'push' $a0
sub $sp, $sp, 4
sw $a0, 0($sp)
# 'pop' $a0
lw $a0, 0($sp)
add $sp, $sp, 4

Stack contents (offsets relative to $sp before the push):
Higher Addresses
   8($sp)  0xEACD0000
   4($sp)  0x00000001
   0($sp)  0x20003CFC  <- $sp
  -4($sp)  0x00000FEE
  -8($sp)
 -12($sp)
Lower Addresses
• We use the stack for parameter passing, storing return
addresses, and saving and restoring other registers
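A toy model of the push and pop sequences, with a dictionary standing in for memory. The Stack class and the starting address 0x1000 are illustrative assumptions, not SPIM's actual layout:

```python
class Stack:
    """Toy model of a downward-growing stack of 32-bit words."""

    def __init__(self, top):
        self.sp = top        # plays the role of $sp
        self.memory = {}     # address -> word; stands in for RAM

    def push(self, value):
        self.sp -= 4                     # sub $sp, $sp, 4
        self.memory[self.sp] = value     # sw  $a0, 0($sp)

    def pop(self):
        value = self.memory[self.sp]     # lw  $a0, 0($sp)
        self.sp += 4                     # add $sp, $sp, 4
        return value


stack = Stack(top=0x1000)   # 0x1000 is an arbitrary illustrative address
stack.push(0xFEE)
stack.push(0x123)
print(hex(stack.sp))                         # two words below the starting point
print(hex(stack.pop()), hex(stack.pop()))    # last in, first out
```

As on the slide, popped data is not erased from memory; only $sp moves, so the old value stays below the stack pointer until overwritten.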
Fibonacci… in assembly!

fib(0) = 0
fib(1) = 1
fib(n) = fib(n-1) + fib(n-2)

0, 1, 1, 2, 3, 5, 8, 13, 21, …

li $a0, 10 # call fib(10)

jal fib #
move $s0, $v0 # $s0 = fib(10)

• fib is a recursive procedure with one argument $a0
• need to store argument $a0, temporary register $s0 for
intermediate results, and return address $ra
Fibonacci: core procedure
fib: sub $sp,$sp,12 # save registers on stack
sw $a0, 0($sp) # save $a0 = n
sw $s0, 4($sp) # save $s0
sw $ra, 8($sp) # save return address $ra
bgt $a0,1, gen # if n>1 then goto generic case
move $v0,$a0 # output = input if n=0 or n=1
j rreg # goto restore registers
gen: sub $a0,$a0,1 # param = n-1
jal fib # compute fib(n-1)
move $s0,$v0 # save fib(n-1)
sub $a0,$a0,1 # set param to n-2
jal fib # and make recursive call
add $v0, $v0, $s0 # $v0 = fib(n-2)+fib(n-1)
rreg: lw $a0, 0($sp) # restore registers from stack
lw $s0, 4($sp) #
lw $ra, 8($sp) #
add $sp, $sp, 12 # decrease the stack size
jr $ra
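To see why the saves and restores matter, the procedure can be mirrored in Python with shared "registers" and an explicit stack. This is a sketch under stated assumptions: regs, stack and call_fib are illustrative names, and $ra is left to Python's own call mechanism:

```python
regs = {"a0": 0, "s0": 0, "v0": 0}
stack = []


def fib():
    # Save registers on the stack ($ra is implicit in Python's own call stack).
    stack.append((regs["a0"], regs["s0"]))
    if regs["a0"] > 1:                 # bgt $a0, 1, gen
        regs["a0"] -= 1                # param = n-1
        fib()                          # compute fib(n-1)
        regs["s0"] = regs["v0"]        # save fib(n-1)
        regs["a0"] -= 1                # param = n-2 ($a0 was restored to n-1)
        fib()                          # compute fib(n-2)
        regs["v0"] += regs["s0"]       # $v0 = fib(n-2) + fib(n-1)
    else:
        regs["v0"] = regs["a0"]        # output = input if n=0 or n=1
    regs["a0"], regs["s0"] = stack.pop()   # restore registers


def call_fib(n):
    regs["a0"] = n
    fib()
    return regs["v0"]


print(call_fib(10))
```

The second decrement only yields n-2 because each call restores $a0 before returning, and the addition only works because each call restores $s0; dropping either restore breaks the result.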

Optional Assembly Ticks
• Tick 0: download SPIM (some version) and
assemble + run the hello world program
• Tick 1: write an assembly program which takes an
array of 10 values and swaps the values (so e.g.
A[0] := A[9], A[1] := A[8], … A[9] := A[0])
• Tick 2: write an assembly program which reads in
any 10 values from the keyboard, and prints them
out lowest to highest
• Tick 3 (*hard*): write an optimized version of the
Fibonacci code presented here. You may wish to do
custom stack frame management for the base
cases, and investigate tail-recursion.
– see what Fibonacci number you can compute in 5
minutes with the original and optimized versions