
COA - Unit2 Floating Point Arithmetic 3

The document discusses floating point representations and arithmetic. It begins with an example of decimal division and then covers IEEE 754 floating point number representations using sign-magnitude notation. It explains how numbers are represented with a sign bit, exponent field, and fraction field. It also discusses details like exponent biasing and special values like infinity and NaN. Finally, it provides examples of floating point addition and multiplication algorithms and shows MIPS instructions for floating point operations.

Uploaded by

Devika csbs


Floating Point

• Today’s topics:
   - Division
   - IEEE 754 representations
   - FP arithmetic
Divide Example
• Divide $7_{\text{ten}}$ ($0000\ 0111_{\text{two}}$) by $2_{\text{ten}}$ ($0010_{\text{two}}$)

Iter  Step                             Quot  Divisor    Remainder
0     Initial values                   0000  0010 0000  0000 0111
1     Rem = Rem - Div                  0000  0010 0000  1110 0111
      Rem < 0 → +Div, shift 0 into Q   0000  0010 0000  0000 0111
      Shift Div right                  0000  0001 0000  0000 0111
2     Same steps as 1                  0000  0001 0000  1111 0111
                                       0000  0001 0000  0000 0111
                                       0000  0000 1000  0000 0111
3     Same steps as 1                  0000  0000 0100  0000 0111
4     Rem = Rem - Div                  0000  0000 0100  0000 0011
      Rem >= 0 → shift 1 into Q        0001  0000 0100  0000 0011
      Shift Div right                  0001  0000 0010  0000 0011
5     Same steps as 4                  0011  0000 0001  0000 0001
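The iterations in the table can be sketched in C roughly as follows. This is a minimal sketch, not the textbook's hardware: the names `div_result` and `restoring_divide` are made up for illustration, and the operands are assumed to be unsigned 4-bit values as in the example.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical result type: 4-bit quotient and the final remainder. */
typedef struct {
    uint8_t quotient;
    uint8_t remainder;
} div_result;

/* Restoring (shift-subtract) division on 4-bit operands. The divisor
   starts in the upper half of an 8-bit register and moves right once
   per iteration, for 4 + 1 = 5 iterations, as in the table. */
div_result restoring_divide(uint8_t dividend, uint8_t divisor)
{
    assert(dividend < 16 && divisor > 0 && divisor < 16);

    int16_t rem = dividend;                  /* signed, so Rem < 0 is visible */
    uint16_t div = (uint16_t)divisor << 4;   /* 0010 becomes 0010 0000 */
    uint8_t quot = 0;

    for (int iter = 1; iter <= 5; iter++) {
        rem -= div;                          /* Rem = Rem - Div */
        if (rem < 0) {
            rem += div;                      /* add the divisor back */
            quot = (uint8_t)(quot << 1);     /* shift 0 into Q */
        } else {
            quot = (uint8_t)((quot << 1) | 1);   /* shift 1 into Q */
        }
        div >>= 1;                           /* Shift Div right */
    }

    div_result r = { (uint8_t)(quot & 0x0F), (uint8_t)rem };
    return r;
}
```

Calling `restoring_divide(7, 2)` walks through the same five iterations as the table and ends with quotient 0011 and remainder 0000 0001.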
Hardware for Division

[Figure: division hardware. Source: H&P textbook]

A comparison requires a subtract; the sign of the result is examined; if the result is negative, the divisor must be added back

Similar to multiply, results are placed in Hi (remainder) and Lo (quotient)
Efficient Division

Divisions involving Negatives

• Simplest solution: convert to positive and adjust sign later

• Note that multiple solutions exist for the equation:

Dividend = Quotient x Divisor + Remainder

+7 div +2    Quo = +3    Rem = +1
-7 div +2    Quo = -3    Rem = -1
+7 div -2    Quo = -3    Rem = +1
-7 div -2    Quo = +3    Rem = -1

Convention: Dividend and remainder have the same sign

Quotient is negative if signs disagree
These rules fulfil the equation above
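The "convert to positive and adjust sign later" approach can be sketched in C as below. The names `sdiv_result` and `signed_divide` are hypothetical, and the sketch assumes operands small enough that `abs()` cannot overflow (so not `INT_MIN`).

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical result type for the sketch. */
typedef struct {
    int quotient;
    int remainder;
} sdiv_result;

/* Divide the magnitudes, then fix the signs:
   - the quotient is negative if the operand signs disagree
   - the remainder takes the sign of the dividend */
sdiv_result signed_divide(int dividend, int divisor)
{
    assert(divisor != 0);

    int q = abs(dividend) / abs(divisor);
    int r = abs(dividend) % abs(divisor);

    if ((dividend < 0) != (divisor < 0))
        q = -q;
    if (dividend < 0)
        r = -r;

    sdiv_result out = { q, r };
    return out;
}
```

Since C99, the built-in `/` and `%` operators truncate toward zero, so they follow the same convention: `-7 / 2` is `-3` and `-7 % 2` is `-1`.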
Floating Point

• Normalized scientific notation: single non-zero digit to the left of the decimal (binary) point – example: $3.5 \times 10^{9}$

• $1.010001_{\text{two}} \times 2^{-5} = (1 + 0 \times 2^{-1} + 1 \times 2^{-2} + \dots + 1 \times 2^{-6})_{\text{ten}} \times 2^{-5}$

• A standard notation enables easy exchange of data between machines and simplifies hardware algorithms – the IEEE 754 standard defines how floating point numbers are represented
Sign and Magnitude Representation

Sign     Exponent   Fraction
1 bit    8 bits     23 bits
S        E          F

• More exponent bits → wider range of numbers (not necessarily more numbers – recall there are infinite real numbers)

• More fraction bits → higher precision

• Register value = $(-1)^S \times F \times 2^E$

• Since we are only representing normalized numbers, we are guaranteed that the number is of the form 1.xxxx..
Hence, in the IEEE 754 standard, the 1 is implicit
Register value = $(-1)^S \times (1 + F) \times 2^E$
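As a rough illustration of the three fields, the sketch below pulls S, E and F out of a C `float`. It assumes that `float` is the 32-bit IEEE 754 single-precision format on the target machine; `fp_fields` and `split_float` are names invented for this example.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical container for the raw fields of a single-precision value. */
typedef struct {
    uint32_t sign;      /* S: 1 bit */
    uint32_t exponent;  /* E: 8 bits, exactly as stored in the register */
    uint32_t fraction;  /* F: 23 bits, the implicit leading 1 is not stored */
} fp_fields;

fp_fields split_float(float x)
{
    /* Assumption of this sketch: float occupies 32 bits. */
    assert(sizeof(float) == sizeof(uint32_t));

    uint32_t bits;
    memcpy(&bits, &x, sizeof bits);   /* copy the bit pattern unchanged */

    fp_fields f;
    f.sign     = bits >> 31;
    f.exponent = (bits >> 23) & 0xFF;
    f.fraction = bits & 0x7FFFFF;
    return f;
}
```

`memcpy` is used instead of a pointer cast so that the bit pattern is read without breaking C's aliasing rules.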

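• Largest number that can be represented: almost $2.0 \times 2^{128} \approx 6.8 \times 10^{38}$

• Smallest number that can be represented: $1.0 \times 2^{-127} \approx 5.9 \times 10^{-39}$

(These limits assume every exponent code is usable. IEEE 754 reserves the smallest and largest exponent codes, so the actual single-precision limits for normalized numbers are about $3.4 \times 10^{38}$ and $1.2 \times 10^{-38}$.)

• Overflow: when representing a number larger in magnitude than the largest one above;
Underflow: when representing a non-zero number smaller in magnitude than the smallest one above

• Double precision format: occupies two 32-bit registers:

Largest:    Smallest:

Sign     Exponent   Fraction
1 bit    11 bits    52 bits
S        E          F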
Details

• The number “0” has a special code so that the implicit 1 does not
get added: the code is all 0s
(it may seem that this takes up the representation for 1.0, but
given how the exponent is represented, we’ll soon see that
that’s not the case)
(see discussion of denorms (pg. 222) in the textbook)

• The largest exponent value (with zero fraction) represents +/- infinity

• The largest exponent value (with non-zero fraction) represents NaN (not a number) – for the result of 0/0 or (infinity minus infinity)

• Note that these choices impact the smallest and largest numbers
that can be represented
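The special codes described above can be told apart by looking only at the exponent and fraction fields. The following is a minimal sketch for single precision; `fp_kind`, `classify_bits` and `float_from_bits` are hypothetical names, and the code assumes a 32-bit IEEE 754 `float`.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical labels for the kinds of single-precision bit patterns. */
typedef enum {
    KIND_ZERO,      /* exponent all 0s, fraction zero */
    KIND_DENORM,    /* exponent all 0s, fraction non-zero */
    KIND_NORMAL,    /* any other exponent: implicit 1 applies */
    KIND_INFINITY,  /* exponent all 1s, fraction zero */
    KIND_NAN        /* exponent all 1s, fraction non-zero */
} fp_kind;

fp_kind classify_bits(uint32_t bits)
{
    uint32_t exponent = (bits >> 23) & 0xFF;
    uint32_t fraction = bits & 0x7FFFFF;

    if (exponent == 0)
        return fraction == 0 ? KIND_ZERO : KIND_DENORM;
    if (exponent == 0xFF)
        return fraction == 0 ? KIND_INFINITY : KIND_NAN;
    return KIND_NORMAL;
}

/* Reinterpret a 32-bit pattern as a float (assumes a 32-bit float). */
float float_from_bits(uint32_t bits)
{
    float x;
    assert(sizeof x == sizeof bits);
    memcpy(&x, &bits, sizeof x);
    return x;
}
```

Because the all-0s and all-1s exponent codes are taken, the largest finite pattern is `0x7F7FFFFF` (exponent field 254) and the smallest normalized pattern is `0x00800000` (exponent field 1). Passing either to `float_from_bits` shows the resulting limits.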
Exponent Representation

• To simplify sorting, the sign is placed as the first bit

• For a similar reason, the representation of the exponent is also modified: in order to use integer compares, it is preferable to have the smallest exponent as 00…0 and the largest exponent as 11…1

• This is the biased notation, where a bias is subtracted from the exponent field to yield the true exponent

• IEEE 754 single-precision uses a bias of 127 (so the 8-bit exponent field, 0 to 255, maps to true exponents between -127 and 128); double precision uses a bias of 1023

Final representation: $(-1)^S \times (1 + \text{Fraction}) \times 2^{(\text{Exponent} - \text{Bias})}$
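The final representation can be evaluated directly from a bit pattern. The sketch below does this for normalized single-precision values only; `decode_single` is a made-up name, and the reserved exponent fields 0 and 255 are deliberately rejected rather than handled.

```c
#include <assert.h>
#include <stdint.h>

/* Evaluate (-1)^S x (1 + Fraction) x 2^(Exponent - Bias) for a normalized
   single-precision bit pattern. The result is returned as a double so that
   every single-precision value is represented exactly. */
double decode_single(uint32_t bits)
{
    const int bias = 127;
    uint32_t sign     = bits >> 31;
    int      exponent = (int)((bits >> 23) & 0xFF);
    uint32_t fraction = bits & 0x7FFFFF;

    /* Sketch limitation: zero, denorms, infinity and NaN are excluded. */
    assert(exponent != 0 && exponent != 255);

    /* 1 + Fraction, where the 23-bit field is scaled by 2^23 = 8388608 */
    double value = 1.0 + (double)fraction / 8388608.0;

    /* Multiply or divide by 2 once per step of the true exponent. */
    int e = exponent - bias;
    for (; e > 0; e--)
        value *= 2.0;
    for (; e < 0; e++)
        value /= 2.0;

    return sign ? -value : value;
}
```

For example, the pattern `0x3F800000` has S = 0, exponent field 127 and fraction 0, so the formula gives $(1 + 0) \times 2^{0} = 1.0$.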
Examples

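Final representation: $(-1)^S \times (1 + \text{Fraction}) \times 2^{(\text{Exponent} - \text{Bias})}$

• Represent $-0.75_{\text{ten}}$ in single and double-precision formats
($-0.75_{\text{ten}} = -1.1_{\text{two}} \times 2^{-1}$, so the exponent field is $-1 + \text{Bias}$)

Single: (1 + 8 + 23)
1 0111 1110 1000…000

Double: (1 + 11 + 52)
1 0111 1111 110 1000…000

• What decimal number is represented by the following single-precision number?
1 1000 0001 01000…0000
Answer: -5.0 (exponent field 129 gives $2^{129 - 127} = 2^{2}$; significand $1.01_{\text{two}} = 1.25$)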
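These encodings can be checked mechanically. The sketch below builds the bit patterns from sign, true exponent and fraction, and compares them with what the machine stores. It assumes `float` and `double` are the IEEE 754 single and double formats; all four function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Pack a normalized single-precision value: the bias of 127 is added here. */
uint32_t encode_single(unsigned sign, int true_exponent, uint32_t fraction)
{
    assert(sign <= 1 && true_exponent >= -126 && true_exponent <= 127);
    assert(fraction < (1u << 23));
    return ((uint32_t)sign << 31)
         | ((uint32_t)(true_exponent + 127) << 23)
         | fraction;
}

/* Pack a normalized double-precision value: the bias is 1023. */
uint64_t encode_double(unsigned sign, int true_exponent, uint64_t fraction)
{
    assert(sign <= 1 && true_exponent >= -1022 && true_exponent <= 1023);
    assert(fraction < ((uint64_t)1 << 52));
    return ((uint64_t)sign << 63)
         | ((uint64_t)(true_exponent + 1023) << 52)
         | fraction;
}

/* Read back the stored bit patterns (assumes 32-bit float, 64-bit double). */
uint32_t bits_of_float(float x)
{
    uint32_t b;
    assert(sizeof x == sizeof b);
    memcpy(&b, &x, sizeof b);
    return b;
}

uint64_t bits_of_double(double x)
{
    uint64_t b;
    assert(sizeof x == sizeof b);
    memcpy(&b, &x, sizeof b);
    return b;
}
```

For -0.75 the inputs are sign 1, true exponent -1 and a fraction whose top bit is set: `encode_single(1, -1, 0x400000)` for single precision and `encode_double(1, -1, (uint64_t)1 << 51)` for double precision.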
FP Addition

• Consider the following decimal example (can maintain only 4 decimal digits and 2 exponent digits)

$9.999 \times 10^{1} + 1.610 \times 10^{-1}$

Convert to the larger exponent:
   $9.999 \times 10^{1} + 0.016 \times 10^{1}$
Add:
   $10.015 \times 10^{1}$
Normalize:
   $1.0015 \times 10^{2}$
Check for overflow/underflow
Round:
   $1.002 \times 10^{2}$
Re-normalize

If we had more fraction bits, these errors would be minimized
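The decimal steps above can be mimicked with integers. This is only a sketch under stated assumptions: operands are positive, a value is stored as a 4-digit significand `sig` (in thousandths) with exponent `exp`, digits shifted out during alignment are simply dropped, and rounding is half-up. The names `dec4` and `dec4_add` are invented for the example.

```c
#include <assert.h>

/* Hypothetical 4-digit decimal format: value = (sig / 1000) x 10^exp,
   with 1000 <= sig <= 9999 so that there is one non-zero leading digit. */
typedef struct {
    int sig;
    int exp;
} dec4;

dec4 dec4_add(dec4 a, dec4 b)
{
    assert(a.sig >= 1000 && a.sig <= 9999);
    assert(b.sig >= 1000 && b.sig <= 9999);

    /* Convert to the larger exponent: make a the larger operand, then
       shift b right. Shifted-out digits are lost (1.610 -> 0.016). */
    if (a.exp < b.exp) {
        dec4 t = a;
        a = b;
        b = t;
    }
    for (int shift = a.exp - b.exp; shift > 0 && b.sig > 0; shift--)
        b.sig /= 10;

    /* Add the aligned significands. */
    int sum = a.sig + b.sig;
    int exp = a.exp;

    /* Normalize and round: drop one digit (half-up) while the sum has
       more than 4 digits; repeating the test covers re-normalization. */
    while (sum > 9999) {
        sum = (sum + 5) / 10;
        exp++;
    }

    /* Check for overflow/underflow: only 2 exponent digits are available.
       This sketch just asserts instead of producing a special value. */
    assert(exp >= -99 && exp <= 99);

    dec4 result = { sum, exp };
    return result;
}
```

With `a = {9999, 1}` and `b = {1610, -1}`, the sum of aligned significands is 10015, which normalizes and rounds to `{1002, 2}`, matching $1.002 \times 10^{2}$.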
FP Multiplication

• Similar steps:
   - Compute exponent (careful!)
   - Multiply significands (set the binary point correctly)
   - Normalize
   - Round (potentially re-normalize)
   - Assign sign
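The five multiplication steps can be sketched on single-precision bit patterns as follows. This is an illustration under stated assumptions, not a complete implementation: inputs and result must be normalized (no zero, denorm, infinity or NaN), overflow and underflow are only asserted, and round-to-nearest-even is chosen as the rounding rule. The name `fp_mul_bits` is hypothetical.

```c
#include <assert.h>
#include <stdint.h>

uint32_t fp_mul_bits(uint32_t a, uint32_t b)
{
    uint32_t ea = (a >> 23) & 0xFF;
    uint32_t eb = (b >> 23) & 0xFF;
    assert(ea != 0 && ea != 255 && eb != 0 && eb != 255);

    /* 1. Compute exponent (careful!): both fields include the bias,
          so it must be subtracted once from their sum. */
    int exp = (int)ea + (int)eb - 127;

    /* 2. Multiply significands with the implicit 1 restored. Each has
          23 fraction bits, so the 48-bit product has 46 fraction bits. */
    uint64_t sa = (a & 0x7FFFFF) | 0x800000;
    uint64_t sb = (b & 0x7FFFFF) | 0x800000;
    uint64_t prod = sa * sb;

    /* 3. Normalize: the product lies in [1, 4); if it is >= 2 (bit 47
          set), keep one bit fewer and bump the exponent. */
    int shift = 23;
    if (prod & ((uint64_t)1 << 47)) {
        shift = 24;
        exp++;
    }

    /* 4. Round to nearest, ties to even, keeping 24 significant bits. */
    uint64_t kept    = prod >> shift;
    uint64_t dropped = prod & (((uint64_t)1 << shift) - 1);
    uint64_t half    = (uint64_t)1 << (shift - 1);
    if (dropped > half || (dropped == half && (kept & 1)))
        kept++;
    if (kept == ((uint64_t)1 << 24)) {   /* rounding carried out: re-normalize */
        kept >>= 1;
        exp++;
    }

    /* Overflow/underflow are outside the scope of this sketch. */
    assert(exp >= 1 && exp <= 254);

    /* 5. Assign sign: negative if exactly one operand is negative. */
    uint32_t sign = (a ^ b) & 0x80000000u;
    return sign | ((uint32_t)exp << 23) | ((uint32_t)kept & 0x7FFFFF);
}
```

For instance, multiplying the patterns for 1.5 (`0x3FC00000`) and -5.0 (`0xC0A00000`) gives exponent field 127 + 129 - 127 = 129 and significand 1.5 × 1.25 = 1.875, which is the pattern for -7.5.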
MIPS Instructions

• The usual add.s, add.d, sub, mul, div

• Comparison instructions: c.eq.s, c.neq.s, c.lt.s, …
These comparisons set an internal bit in hardware that is then inspected by branch instructions: bc1t, bc1f

• Separate register file $f0 - $f31: a double-precision value is stored in (say) $f4-$f5 and is referred to by $f4

• Load/store instructions (lwc1, swc1) must still use integer registers for address computation
Code Example

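float f2c (float fahr)
{
    return ((5.0/9.0) * (fahr - 32.0));
}

(argument fahr is stored in $f12)

lwc1  $f16, const5($gp)     # $f16 = 5.0
lwc1  $f18, const9($gp)     # $f18 = 9.0
div.s $f16, $f16, $f18      # $f16 = 5.0 / 9.0
lwc1  $f18, const32($gp)    # $f18 = 32.0
sub.s $f18, $f12, $f18      # $f18 = fahr - 32.0
mul.s $f0, $f16, $f18       # $f0 = (5.0/9.0) * (fahr - 32.0)
jr    $ra                   # return, result in $f0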
