0% found this document useful (0 votes)

6 views47 pages

2MIPS Assembly

The document discusses benchmarks for evaluating system performance, emphasizing the importance of real application testing and various benchmark suites like SPEC. It details the evolution of SPEC benchmarks from CPU89 to CPU2000, highlighting specific programs and their functions. Additionally, it covers MIPS architecture, including its instruction set, memory organization, and design principles, along with examples of MIPS assembly code.

Uploaded by

jaydipshiroya12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views47 pages

2MIPS Assembly

Uploaded by

jaydipshiroya12

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Benchmarks

 Performance best determined by running a real application

 use programs typical of expected workload
 or, typical of expected class of applications
e.g., compilers/editors, scientific applications, graphics, etc.

 Small benchmarks
 nice for architects and designers
 easy to standardize
 can be abused!

 Benchmark suites
 Perfect Club: set of application codes
 Livermore Loops: 24 loop kernels
 Linpack: linear algebra package
 SPEC: mix of code from industry organization
SPEC (System Performance
Evaluation Corporation)
 Sponsored by industry but independent and self-managed –
trusted by code developers and machine vendors
 Clear guides for testing, see www.spec.org
 Regular updates (benchmarks are dropped and new ones added
periodically according to relevance)
 Specialized benchmarks for particular classes of applications
 Can still be abused…, by selective optimization!
SPEC History
 First Round: SPEC CPU89
 10 programs yielding a single number
 Second Round: SPEC CPU92
 SPEC CINT92 (6 integer programs) and SPEC CFP92 (14 floating
point programs)
 compiler flags can be set differently for different programs
 Third Round: SPEC CPU95
 new set of programs: SPEC CINT95 (8 integer programs) and SPEC
CFP95 (10 floating point)
 single flag setting for all programs
 Fourth Round: SPEC CPU2000
 new set of programs: SPEC CINT2000 (12 integer programs) and
SPEC CFP2000 (14 floating point)
 single flag setting for all programs
 programs in C, C++, Fortran 77, and Fortran 90
CINT2000 (Integer component
of SPEC CPU2000)
Program Language What It Is
164.gzip C Compression
175.vpr C FPGA Circuit Placement and Routing
176.gcc C C Programming Language Compiler
181.mcf C Combinatorial Optimization
186.crafty C Game Playing: Chess
197.parser C Word Processing
252.eon C++ Computer Visualization
253.perlbmk C PERL Programming Language
254.gap C Group Theory, Interpreter
255.vortex C Object-oriented Database
256.bzip2 C Compression
300.twolf C Place and Route Simulator
CFP2000 (Floating point
component of SPEC CPU2000)
Program Language What It Is
168.wupwise Fortran 77 Physics / Quantum Chromodynamics
171.swim Fortran 77 Shallow Water Modeling
172.mgrid Fortran 77 Multi-grid Solver: 3D Potential Field
173.applu Fortran 77 Parabolic / Elliptic Differential Equations
177.mesa C 3-D Graphics Library
178.galgel Fortran 90 Computational Fluid Dynamics
179.art C Image Recognition / Neural Networks
183.equake C Seismic Wave Propagation Simulation
187.facerec Fortran 90 Image Processing: Face Recognition
188.ammp C Computational Chemistry
189.lucas Fortran 90 Number Theory / Primality Testing
191.fma3d Fortran 90 Finite-element Crash Simulation
200.sixtrack Fortran 77 High Energy Physics Accelerator Design
301.apsi Fortran 77 Meteorology: Pollutant Distribution
SPEC CPU2000 reporting
 Refer SPEC website www.spec.org for documentation
 Single number result – geometric mean of normalized ratios for
each code in the suite
 Report precise description of machine
 Report compiler flag setting
CINT2006 for Opteron X4 2356
Name Description IC×109 CPI Tc (ns) Exec time Ref time SPECratio
perl Interpreted string processing 2,118 0.75 0.40 637 9,777 15.3
bzip2 Block-sorting compression 2,389 0.85 0.40 817 9,650 11.8
gcc GNU C Compiler 1,050 1.72 0.40 724 8,050 11.1
mcf Combinatorial optimization 336 10.00 0.40 1,345 9,120 6.8
go Go game (AI) 1,658 1.09 0.40 721 10,490 14.6
hmmer Search gene sequence 2,783 0.80 0.40 890 9,330 10.5
sjeng Chess game (AI) 2,176 0.96 0.40 837 12,100 14.5
libquantum Quantum computer simulation 1,623 1.61 0.40 1,047 20,720 19.8
h264avc Video compression 3,102 0.80 0.40 993 22,130 22.3
omnetpp Discrete event simulation 587 2.94 0.40 690 6,250 9.1
astar Games/path finding 1,082 1.79 0.40 773 7,020 9.1
xalancbmk XML parsing 1,058 2.70 0.40 1,143 6,900 6.0
Geometric mean 11.7

High cache miss rates

SPEC CPU2000 Benchmark Sample Result

Source: Sun Microsystems

W1100z uses AMD Opteron
100 series CPU

8
SPEC ‘95
Does doubling the clock rate double the performance?
10
Can a machine with a slower clock rate10 have better performance?

9 9

8 8

7 7

6 6

SPECfp
SPECint

5 5

4 4

3 3

2 2

1 1

0 0
50 100 150 200 250 50 100 150 200 250
Clock rate (MHz)
Clock rate (MHz) Pentium Pentium
Pentium Pro Pentium Pro

9
Specialized SPEC Benchmarks

 I/O
 Network
 Graphics
 Java
 Web server
 Transaction processing (databases)
Instructions: Language of the Machine
Instructions: Overview
 Language of the machine
 More primitive than higher level languages, e.g., no
sophisticated control flow such as while or for loops
 Very restrictive
 e.g., MIPS arithmetic instructions
 We’ll be working with the MIPS instruction set architecture
 inspired most architectures developed since the 80's
 used by NEC, Nintendo, Silicon Graphics, Sony
 the name is not related to millions of instructions per second !
 it stands for microcomputer without interlocked pipeline stages !
 Design goals: maximize performance and minimize cost and
reduce design time
MIPS Arithmetic

 All MIPS arithmetic instructions have 3 operands

 Operand order is fixed (e.g., destination first)

 Example:

C code: A = B + C compiler’s job to associate

variables with registers
MIPS code: add $s0, $s1, $s2
MIPS Arithmetic
 Design Principle 1: simplicity favors regularity.
Translation: Regular instructions make for simple hardware!

 Simpler hardware reduces design time and manufacturing cost.

Allowing variable number
 Of course this complicates some things... of operands would
simplify the assembly
C code: A = B + C + D; code but complicate the
E = F - A; hardware.

MIPS code add $t0, $s1, $s2

(arithmetic): add $s0, $t0, $s3
sub $s4, $s5, $s0

 Performance penalty: high-level code translates to denser

machine code.
MIPS Arithmetic
 Operands must be in registers – only 32 registers provided
(which require 5 bits to select one register). Reason for small
number of registers:

 Design Principle 2: smaller is faster. Why?

 Electronic signals have to travel further on a physically larger chip
increasing clock cycle time.
 Smaller is also cheaper!
Registers vs. Memory
 Arithmetic instructions operands must be in registers
 MIPS has 32 registers
 Compiler associates variables with registers
 What about programs with lots of variables (arrays, etc.)? Use
memory, load/store operations to transfer data from memory to
register – if not enough registers spill registers to memory
 MIPS is a load/store architecture

Control Input
Memory
Datapath Output

Processor I/O
Memory Organization
 Viewed as a large single-dimension array with access by address
 A memory address is an index into the memory array
 Byte addressing means that the index points to a byte of
memory, and that the unit of memory accessed by a load/store
is a byte
0 8 bits of data
1 8 bits of data

2 8 bits of data

3 8 bits of data

4 8 bits of data

5 8 bits of data
6 8 bits of data

...
Memory Organization
 Bytes are load/store units, but most data items use larger words
 For MIPS, a word is 32 bits or 4 bytes.

0 32 bits of data
4 32 bits of data Registers correspondingly hold 32 bits of data
8 32 bits of data
12 32 bits of data

...

 232 bytes with byte addresses from 0 to 232-1

 230 words with byte addresses 0, 4, 8, ... 232-4
 i.e., words are aligned
 what are the least 2 significant bits of a word address?
Load/Store Instructions
 Load and store instructions
 Example:
C code: A[8] = h + A[8];
value offset address
MIPS code (load): lw $t0, 32($s3)
(arithmetic): add $t0, $s2, $t0
(store): sw $t0, 32($s3)

 Load word has destination first, store has destination last

 Remember MIPS arithmetic operands are registers, not memory
locations
 therefore, words must first be moved from memory to registers
using loads before they can be operated on; then result can be
stored back to memory
A MIPS Example
 Can we figure out the assembly code?

swap(int v[], int k);

{ int temp;
temp = v[k];
v[k] = v[k+1];
v[k+1] = temp;
} $5=k
$4=Base address of v[]
swap:
muli $2, $5, 4
add $2, $4, $2
lw $15, 0($2)
lw $16, 4($2)
sw $16, 0($2)
sw $15, 4($2)
jr $31
So far we’ve learned:
 MIPS
 loading words but addressing bytes
 arithmetic on registers only

 Instruction Meaning

add $s1, $s2, $s3 $s1 = $s2 + $s3

sub $s1, $s2, $s3 $s1 = $s2 – $s3
lw $s1, 100($s2) $s1 = Memory[$s2+100]
sw $s1, 100($s2) Memory[$s2+100]= $s1
Machine Language
 Instructions, like registers and words of data, are also 32 bits long
 Example: add $t0, $s1, $s2
 registers are numbered, e.g., $t0 is 8, $s1 is 17, $s2 is 18

 Instruction Format R-type (“R” for aRithmetic):

000000 10001 10010 01000 00000 100000

op rs rt rd shamt funct
opcode – first second register shift function field -
operation register register destin- amount selects variant
source source ation of operation
operand operand operand

6 bits 5 bits 5 bits 5 bits 5 bits 6 bits

MIPS Encoding: R-Type

31 26 25 21 20 16 15 11 10 6 5 0

opcode rs rt rd shamt funct

rt
add $4, $3, $2

31 26 25 21 20 16 15 11 10 6 5 0
0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0
opcode rs rt rd shamt funct

0 0 0 0 0 0 0 0 0 1 1 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0

Encoding = 0x00622020
23
Machine Language
 Consider the load-word and store-word instructions,
 what would the regularity principle have us do?
 we would have only 5 or 6 bits to determine the offset from a base
register - too little…

 Design Principle 3: Good design demands a compromise

 Introduce a new type of instruction format
 I-type (“I” for Immediate) for data transfer instructions
 Example: lw $t0, 1002($s2)

100011 10010 01000 0000001111101010

6 bits 5 bits 5 bits 16 bits
op rs rt 16 bit offset
MIPS Encoding: I-Type
31 26 25 21 20 16 15 0

opcode rs rt Immediate Value

rt
Immediate
lw $5, 3000($2)

31 26 25 21 20 16 15 0
1 0 0 0 1 1 0 0 0 1 0 0 0 1 0 1 0 0 0 0 1 0 1 1 1 0 1 1 1 0 0 0
opcode rs rt Immediate Value

1 0 0 0 1 1 0 0 0 1 0 0 0 1 0 1 0 0 0 0 1 0 1 1 1 0 1 1 1 0 0 0

Encoding = 0x8C450BB8
25
MIPS Encoding: I-Type
31 26 25 21 20 16 15 0

opcode rs rt Immediate Value

rt
Immediate
sw $5, 3000($2)

31 26 25 21 20 16 15 0
1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 1 0 0 0 0 1 0 1 1 1 0 1 1 1 0 0 0
opcode rs rt Immediate Value

1 0 1 0 1 1 0 0 0 1 0 0 0 1 0 1 0 0 0 0 1 0 1 1 1 0 1 1 1 0 0 0

Encoding = 0xAC450BB8
The immediate value is signed 26
Control: Conditional Branch
 Decision making instructions
 alter the control flow,
 i.e., change the next instruction to be executed

 MIPS conditional branch instructions:

bne $t0, $t1, Label I-type instructions

beq $t0, $t1, Label
beq $t0, $t1, Label
000100 01000 01001 0000000000011001 (= addr.100)

 Example: if (i==j) h = i + j; word-relative addressing:

25 words = 100 bytes;
bne $s0, $s1, Label also PC-relative (more…)
add $s3, $s0, $s1
Label: ....
Addresses in Branch
 Instructions:
bne $t4,$t5,Label Next instruction is at Label if $t4 != $t5
beq $t4,$t5,Label Next instruction is at Label if $t4 = $t5
 Format:
I op rs rt 16 bit offset

 16 bits is too small a reach in a 232 address space

 Solution: specify a register (as for lw and sw) and add it to

offset
 use PC (= program counter), called PC-relative addressing, based
on
 principle of locality: most branches are to instructions near current
instruction (e.g., loops and if statements)
Addresses in Branch
 Further extend reach of branch by observing all MIPS
instructions are a word (= 4 bytes), therefore word-relative
addressing:
 MIPS branch destination address = (PC + 4) + (4 * offset)
Because hardware typically increments PC early
in execute cycle to point to next instruction

 so offset = (branch destination address – PC – 4)/4

BEQ/BNE uses I-Type
31 26 25 21 20 16 15 0

opcode rs rt Signed Offset Value

(encoded in words, e.g. 4-bytes)

rs
Offset
beq $0, $9, 40 Encoded by
40/4 = 10
rt

31 26 25 21 20 16 15 0
0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0
opcode rs rt Immediate Value

0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0

Encoding = 0x1009000A
30
Control: Unconditional Branch (Jump)

 MIPS unconditional branch instructions:

j Label
 Example:
if (i!=j) beq $s4, $s5, Lab1
h=i+j; add $s3, $s4, $s5
else j Lab2
h=i-j; Lab1: sub $s3, $s4, $s5
Lab2: ...
 J-type (“J” for Jump) instruction format
 Example: j Label # addr. Label = 100 word-relative
addressing:
25 words = 100 bytes
000010 00000000000000000000011001
6 bits 26 bits
op 26 bit number
Addresses in Jump
 Word-relative addressing also for jump instructions

J op 26 bit address

 MIPS jump j instruction replaces lower 28 bits of the PC with

A00 where A is the 26 bit address; it never changes upper 4 bits
 Example: if PC = 1011X (where X = 28 bits), it is replaced with
1011A00
 there are 16(=24) partitions of the 232 size address space, each
partition of size 256 MB (=228), such that, in each partition the upper
4 bits of the address is same.
 if a program crosses an address partition, then a j that reaches a
different partition has to be replaced by jr with a full 32-bit address
first loaded into the jump register
 therefore, OS should always try to load a program inside a single
partition
Constants
 Small constants are used quite frequently (50% of operands)
e.g., A = A + 5;
B = B + 1;
C = C - 18;

 Solutions? Will these work?

 create hard-wired registers (like $zero) for constants like 1
 put program constants in memory and load them as required

 MIPS Instructions:
addi $29, $29, 4
slti $8, $18, 10
andi $29, $29, 6
ori $29, $29, 4

 How to make this work?

Immediate Operands
 Make operand part of instruction itself!

 Design Principle 4: Make the common case fast

 Example: addi $sp, $sp, 4 # $sp = $sp + 4

001000 11101 11101 0000000000000100

6 bits 5 bits 5 bits 16 bits

op rs rt 16 bit number
How about larger constants?
 First we need to load a 32 bit constant into a register
 Must use two instructions for this: first new load upper immediate
instruction for upper 16 bits
lui $t0, 1010101010101010 filled with zeros

1010101010101010 0000000000000000

 Then get lower 16 bits in place:

ori $t0, $t0, 1010101010101010
1010101010101010 0000000000000000

0000000000000000 1010101010101010
ori
1010101010101010 1010101010101010

 Now the constant is in place, use register-register arithmetic

So far
 Instruction Format Meaning
add $s1,$s2,$s3 R $s1 = $s2 + $s3
sub $s1,$s2,$s3 R $s1 = $s2 – $s3
lw $s1,100($s2) I $s1 = Memory[$s2+100]
sw $s1,100($s2) I Memory[$s2+100] = $s1
bne $s4,$s5,Lab1 I Next instr. is at Lab1 if $s4 != $s5
beq $s4,$s5,Lab2 I Next instr. is at Lab2 if $s4 = $s5
j Lab3 J Next instr. is at Lab3

 Formats:
R op rs rt rd shamt funct
I op rs rt 16 bit address
J op 26 bit address
Logical Operations
Shift Logical Left (SLL $S1,$S2,10)
Shift Logical Right (SRL $S1,$S2,10)
AND (AND $S1,$S2,$S3)
OR (OR $S1,$S2,$S3)
NOR (NOR $S1,$S2,$S3)
ANDI (ANDI $S1,$S2,100)
ORI (ORI $S1,$S2,100)
Control Flow
 We have: beq, bne. What about branch-if-less-than?

 New instruction:
if $s1 < $s2 then
$t0 = 1
slt $t0, $s1, $s2 else
$t0 = 0

 Can use this instruction to build blt $s1, $s2, Label

 how? We generate more than one instruction – pseudo-instruction
 Slti is also available
 The assembler needs a register to manufacture instructions
from pseudo-instructions

 There is a convention (not mandatory) for use of registers

Policy-of-Use Convention for
Registers
Name Register number Usage
$zero 0 the constant value 0
$v0-$v1 2-3 values for results and expression evaluation
$a0-$a3 4-7 arguments
$t0-$t7 8-15 temporaries
$s0-$s7 16-23 saved
$t8-$t9 24-25 more temporaries
$gp 28 global pointer
$sp 29 stack pointer
$fp 30 frame pointer
$ra 31 return address

Register 1, called $at, is reserved for the assembler; registers 26-27,

called $k0 and $k1 are reserved for the operating system.
Assembly Language vs.
Machine Language
 Assembly provides convenient symbolic representation
 much easier than writing down numbers
 regular rules: e.g., destination first

 Machine language is the underlying reality

 e.g., destination is no longer first

 Assembly can provide pseudo-instructions

 e.g., move $t0, $t1 exists only in assembly
 would be implemented using add $t0, $t1, $zero

 When considering performance you should count actual number

of machine instructions that will execute
Procedure Calling
 Steps required
1. Place parameters in registers
2. Transfer control to procedure
3. Acquire storage for procedure
4. Perform procedure’s operations
5. Place result in register for caller
6. Return to place of call
Procedure Call Instructions
 Procedure call: jump and link
jal ProcedureLabel
 Address of following instruction put in $ra

 Jumps to target address

 Procedure return: jump register

jr $ra
 Copies $ra to program counter

 Can also be used for computed jumps

 e.g., for case/switch statements

MIPS Addressing Modes
1. Immediate addressing
op rs rt Immediate

2. Register addressing
op rs rt rd ... funct Registers
Register

3. Base addressing
op rs rt Address Memor y

Register + Byte Halfword Word

4. PC-relative addressing
op rs rt Address Memor y

PC + Word

5. Pseudodirect addressing
op Address Memor y

PC Word
Overview of MIPS
 Simple instructions – all 32 bits wide
 Very structured – no unnecessary baggage
 Only three instruction formats

R op rs rt rd shamt funct

I op rs rt 16 bit address

J op 26 bit address
Summarize MIPS:

MIPS operands
Name Example Comments
$s0-$s7, $t0-$t9, $zero, Fast locations for data. In MIPS, data must be in registers to perform
32 registers $a0-$a3, $v0-$v1, $gp, arithmetic. MIPS register $zero always equals 0. Register $at is
$fp, $sp, $ra, $at reserved for the assembler to handle large constants.
Memory[0], Accessed only by data transfer instructions. MIPS uses byte addresses, so
30
2 memory Memory[4], ..., sequential words differ by 4. Memory holds data structures, such as arrays,
words Memory[4294967292] and spilled registers, such as those saved on procedure calls.
MIPS assembly language
Category Instruction Example Meaning Comments
add add $s1, $s2, $s3 $s1 = $s2 + $s3 Three operands; data in registers

Arithmetic subtract sub $s1, $s2, $s3 $s1 = $s2 - $s3 Three operands; data in registers

add immediate addi $s1, $s2, 100 $s1 = $s2 + 100 Used to add constants
load word lw $s1, 100($s2) $s1 = Memory[$s2 + 100] Word from memory to register
store word sw $s1, 100($s2) Memory[$s2 + 100] = $s1 Word from register to memory
Data transfer load byte lb $s1, 100($s2) $s1 = Memory[$s2 + 100] Byte from memory to register
store byte sb $s1, 100($s2) Memory[$s2 + 100] = $s1 Byte from register to memory
load upper immediate lui $s1, 100 $s1 = 100 * 2
16 Loads constant in upper 16 bits

branch on equal beq $s1, $s2, 25 if ($s1 == $s2) go to Equal test; PC-relative branch
PC + 4 + 100

branch on not equal bne $s1, $s2, 25 if ($s1 != $s2) go to Not equal test; PC-relative
PC + 4 + 100
Conditional
branch set on less than slt $s1, $s2, $s3 if ($s2 < $s3) $s1 = 1; Compare less than; for beq, bne
else $s1 = 0

set less than slti $s1, $s2, 100 if ($s2 < 100) $s1 = 1; Compare less than constant
immediate else $s1 = 0

jump j 2500 go to 10000 Jump to target address

Uncondi- jump register jr $ra go to $ra For switch, procedure return
tional jump jump and link jal 2500 $ra = PC + 4; go to 10000 For procedure call
Summary
 Instruction complexity is only one variable
 lower instruction count vs. higher CPI / lower clock rate

 Design Principles:
 simplicity favors regularity
 smaller is faster
 good design demands compromise
 make the common case fast

07-08 - CO - B2Ch2 - MIPS Instruction Set
100% (1)
07-08 - CO - B2Ch2 - MIPS Instruction Set
62 pages
TM-2263 AVEVA Marine (12 Series) Programmable Macro Language (Basic) Rev 1.0 PDF
No ratings yet
TM-2263 AVEVA Marine (12 Series) Programmable Macro Language (Basic) Rev 1.0 PDF
178 pages
Equation Group Questions and Answers
100% (2)
Equation Group Questions and Answers
44 pages
Instructions: Language of The Machine: Computer Architecture - CS401
No ratings yet
Instructions: Language of The Machine: Computer Architecture - CS401
65 pages
Posiflex OPOS Driver Installation V13xx
No ratings yet
Posiflex OPOS Driver Installation V13xx
11 pages
Chapter 2:instructions: Language of The Computer
No ratings yet
Chapter 2:instructions: Language of The Computer
81 pages
2 Mips Architecture
No ratings yet
2 Mips Architecture
70 pages
Introduction To System On Chip
No ratings yet
Introduction To System On Chip
110 pages
Mips Instruction Format
No ratings yet
Mips Instruction Format
41 pages
Network System Design
No ratings yet
Network System Design
828 pages
MIPS 32 in Brief
No ratings yet
MIPS 32 in Brief
29 pages
BigData - Oozie
No ratings yet
BigData - Oozie
5 pages
MIPS
No ratings yet
MIPS
27 pages
Setup
No ratings yet
Setup
544 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
65 pages
Instruction Set Architecture MIPS PDF
No ratings yet
Instruction Set Architecture MIPS PDF
46 pages
Ashish Assignment
No ratings yet
Ashish Assignment
29 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
70 pages
פרק ד - אסמבלי
No ratings yet
פרק ד - אסמבלי
176 pages
Instructions: Language of The Computer: Computer Organization and Design
No ratings yet
Instructions: Language of The Computer: Computer Organization and Design
99 pages
CH7 - Instructions - Language of The Computer
No ratings yet
CH7 - Instructions - Language of The Computer
92 pages
4-8-01 - ISA-Part I-0408
No ratings yet
4-8-01 - ISA-Part I-0408
33 pages
CorePythonforEveryone MadhusudanMothe
No ratings yet
CorePythonforEveryone MadhusudanMothe
137 pages
4 3 01 ISA Part I Annotated
No ratings yet
4 3 01 ISA Part I Annotated
27 pages
Chapter2 Instructions Architecture Set
No ratings yet
Chapter2 Instructions Architecture Set
105 pages
05 - Instruction Set Architecture I
No ratings yet
05 - Instruction Set Architecture I
63 pages
Lecture 1c
No ratings yet
Lecture 1c
107 pages
Computer Architecture Lecture Notes Spring 2005 Dr. Michael P. Frank Competency Area 3: Programming and Coding Methods
No ratings yet
Computer Architecture Lecture Notes Spring 2005 Dr. Michael P. Frank Competency Area 3: Programming and Coding Methods
68 pages
Computer Architecture Chapter 2: MIPS: Dr. Phạm Quốc Cường
No ratings yet
Computer Architecture Chapter 2: MIPS: Dr. Phạm Quốc Cường
27 pages
Ca03 2014 PDF
No ratings yet
Ca03 2014 PDF
84 pages
WP Lesson Plan
No ratings yet
WP Lesson Plan
2 pages
Instructions: Language of The Computer: Omputer Rganization and Esign
No ratings yet
Instructions: Language of The Computer: Omputer Rganization and Esign
73 pages
1062 PDF
No ratings yet
1062 PDF
84 pages
2015Sp CS61C L05 Kavs M1
No ratings yet
2015Sp CS61C L05 Kavs M1
36 pages
Instructions: Language of The Machine
No ratings yet
Instructions: Language of The Machine
47 pages
LECTURE2
No ratings yet
LECTURE2
26 pages
6 Mips Datapath
No ratings yet
6 Mips Datapath
55 pages
Lecture 6 - 10 - On - MIPS - ISA
No ratings yet
Lecture 6 - 10 - On - MIPS - ISA
42 pages
Topic3 CH-02
No ratings yet
Topic3 CH-02
72 pages
Mips Isa
No ratings yet
Mips Isa
32 pages
Lecture 4
No ratings yet
Lecture 4
32 pages
Lect2 PDF
No ratings yet
Lect2 PDF
51 pages
Lecture 03 Assembly I
No ratings yet
Lecture 03 Assembly I
54 pages
Seminar Lab
No ratings yet
Seminar Lab
26 pages
Amdahl's Law: Execution Time After Improvement
No ratings yet
Amdahl's Law: Execution Time After Improvement
23 pages
Computer Architecture Lec 10
No ratings yet
Computer Architecture Lec 10
41 pages
CSCE 5610 Computer System Architecture: Content
No ratings yet
CSCE 5610 Computer System Architecture: Content
20 pages
Constraints
No ratings yet
Constraints
31 pages
CS3350B Computer Architecture MIPS Introduction: Marc Moreno Maza
No ratings yet
CS3350B Computer Architecture MIPS Introduction: Marc Moreno Maza
24 pages
Lecture 5 Digital Computing Systems
No ratings yet
Lecture 5 Digital Computing Systems
30 pages
11 Risc Cisc and Assemblers I
No ratings yet
11 Risc Cisc and Assemblers I
39 pages
EECE 417 Computer Systems Architecture: Department of Electrical and Computer Engineering Howard University
No ratings yet
EECE 417 Computer Systems Architecture: Department of Electrical and Computer Engineering Howard University
38 pages
CS220 Lecture 14
No ratings yet
CS220 Lecture 14
28 pages
IBM FlashCopy Backup Solution For SAP HANA TDI 1.6
No ratings yet
IBM FlashCopy Backup Solution For SAP HANA TDI 1.6
23 pages
Lecture 3
No ratings yet
Lecture 3
43 pages
L4 Mips 1
No ratings yet
L4 Mips 1
25 pages
Slides D CODchap2Slides
No ratings yet
Slides D CODchap2Slides
37 pages
CS220 Lecture 14
No ratings yet
CS220 Lecture 14
15 pages
Coal W2
No ratings yet
Coal W2
37 pages
03 AT Mega 16 Pin Diagram and Ports Microcontroller
No ratings yet
03 AT Mega 16 Pin Diagram and Ports Microcontroller
20 pages
Assemby and MIPS ISA
No ratings yet
Assemby and MIPS ISA
91 pages
Lecture 7 8 ISA
No ratings yet
Lecture 7 8 ISA
45 pages
Windows Virtual Desktop: User Login Instruction Guide
No ratings yet
Windows Virtual Desktop: User Login Instruction Guide
20 pages
Smt. Sharchchandrika Suresh Patil Institute of Technology Chopda
No ratings yet
Smt. Sharchchandrika Suresh Patil Institute of Technology Chopda
15 pages
COE480 Lecture2
No ratings yet
COE480 Lecture2
23 pages
Module 3 (02-02-2025)
No ratings yet
Module 3 (02-02-2025)
68 pages
CS122: Computer Architecture & Organization: Semester I, 2011
No ratings yet
CS122: Computer Architecture & Organization: Semester I, 2011
27 pages
How To Add A Linux
No ratings yet
How To Add A Linux
8 pages
Edge Adaptive Image Steganography Based On LSB Matching Revisited Code
No ratings yet
Edge Adaptive Image Steganography Based On LSB Matching Revisited Code
24 pages
Registers vs. Memory: Control
No ratings yet
Registers vs. Memory: Control
12 pages
Stored Program Concept Instructions
No ratings yet
Stored Program Concept Instructions
7 pages
Lec 4
No ratings yet
Lec 4
32 pages
Java Sample Program
No ratings yet
Java Sample Program
6 pages
A Review Paper On Improving Security of ATM System
No ratings yet
A Review Paper On Improving Security of ATM System
4 pages
Computer Organizatin - MIPS Assembly Part 2
No ratings yet
Computer Organizatin - MIPS Assembly Part 2
6 pages
How To Migrate From MPLS To SD-WAN: Network Transformation Strategy
No ratings yet
How To Migrate From MPLS To SD-WAN: Network Transformation Strategy
7 pages
2 Ce
No ratings yet
2 Ce
49 pages
DELL Precision 5680 Spec Sheet
No ratings yet
DELL Precision 5680 Spec Sheet
8 pages
CS232: Computer Architecture II: Spring 2012
No ratings yet
CS232: Computer Architecture II: Spring 2012
14 pages
Ma&re Lesson Plan
No ratings yet
Ma&re Lesson Plan
3 pages
Instruction Set Architecture: Mips Section 2.1-2.5
No ratings yet
Instruction Set Architecture: Mips Section 2.1-2.5
27 pages
Writing C++ Scripts
No ratings yet
Writing C++ Scripts
4 pages
ACTIVITY 2 Compare Data With A Hash
No ratings yet
ACTIVITY 2 Compare Data With A Hash
4 pages
CLK 0026
No ratings yet
CLK 0026
2 pages
Rtdi 310 Uk
No ratings yet
Rtdi 310 Uk
2 pages
Shafiq Ahmed Resume
No ratings yet
Shafiq Ahmed Resume
1 page
How To Tether The HTC Pro2 To A Laptop
No ratings yet
How To Tether The HTC Pro2 To A Laptop
2 pages
MAC 7 - Windows 7 Ultimate - x64 - Prince NRVL
No ratings yet
MAC 7 - Windows 7 Ultimate - x64 - Prince NRVL
3 pages
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
Neo Geo Architecture: Architecture of Consoles: A Practical Analysis, #23
From Everand
Neo Geo Architecture: Architecture of Consoles: A Practical Analysis, #23
Rodrigo Copetti
No ratings yet

2MIPS Assembly

Uploaded by

2MIPS Assembly

Uploaded by

Benchmarks

 Performance best determined by running a real application

High cache miss rates

Source: Sun Microsystems

 All MIPS arithmetic instructions have 3 operands

C code: A = B + C compiler’s job to associate

 Simpler hardware reduces design time and manufacturing cost.

MIPS code add $t0, $s1, $s2

 Performance penalty: high-level code translates to denser

 Design Principle 2: smaller is faster. Why?

 232 bytes with byte addresses from 0 to 232-1

 Load word has destination first, store has destination last

swap(int v[], int k);

add $s1, $s2, $s3 $s1 = $s2 + $s3

 Instruction Format R-type (“R” for aRithmetic):

000000 10001 10010 01000 00000 100000

6 bits 5 bits 5 bits 5 bits 5 bits 6 bits

opcode rs rt rd shamt funct

 Design Principle 3: Good design demands a compromise

100011 10010 01000 0000001111101010

opcode rs rt Immediate Value

opcode rs rt Immediate Value

 MIPS conditional branch instructions:

bne $t0, $t1, Label I-type instructions

 Example: if (i==j) h = i + j; word-relative addressing:

 16 bits is too small a reach in a 232 address space

 Solution: specify a register (as for lw and sw) and add it to

 so offset = (branch destination address – PC – 4)/4

opcode rs rt Signed Offset Value

 MIPS unconditional branch instructions:

 MIPS jump j instruction replaces lower 28 bits of the PC with

 Solutions? Will these work?

 How to make this work?

 Design Principle 4: Make the common case fast

 Example: addi $sp, $sp, 4 # $sp = $sp + 4

001000 11101 11101 0000000000000100

 Then get lower 16 bits in place:

 Now the constant is in place, use register-register arithmetic

 Can use this instruction to build blt $s1, $s2, Label

 There is a convention (not mandatory) for use of registers

Register 1, called $at, is reserved for the assembler; registers 26-27,

 Machine language is the underlying reality

 Assembly can provide pseudo-instructions

 When considering performance you should count actual number

 Jumps to target address

 Procedure return: jump register

 Can also be used for computed jumps

 e.g., for case/switch statements

Register + Byte Halfword Word

jump j 2500 go to 10000 Jump to target address

You might also like