Floating Point Representation
Floating Point Representation
06/07/21 1
Floating Decimal Point : Scientific Form
3
0.003678 is written as 3.678 10
256.78 is written as 2.5678 10 2
2
Example
The form is
sign mantissa 10exponent
or
m 10e
Example: For
2.5678 10 2
1
m 2.5678
e2
3
Floating Point Format for Binary
Numbers
y m2 e
4
Example
9 bit-hypothetical word
the first bit is used for the sign of the number,
the second bit for the sign of the exponent,
the next four bits for the mantissa, and
the next three bits for the exponent
0 0 1 0 1 1 1 0 1
mantissa exponent
Sign of the Sign of the
number exponent
5
Machine Epsilon
Defined as the measure of accuracy and found
by difference between 1 and the next number
that can be represented
6
Example
Ten bit word
Sign of number
Sign of exponent
Next four bits for exponent
Next four bits for mantissa
0 0 0 0 0 0 0 0 0 0 1 10
Next
number 0 0 0 0 0 0 0 0 0 1 1.0001 2 1.0625 10
mach 1.0625 1 2 4
7
Relative Error and Machine Epsilon
The absolute relative true error in representing
a number will be less then the machine epsilon
Example
0.0283210 1.1100 2 25
1.1100 2 2 0110 2
0 1 0 1 1 0 1 1 0 0
Sign of the exponent mantissa
Sign of the
number
exponent
1.1100 2 2 0110 2
0.0274375
0.02832 0.0274375
a
0.02832
0.034472 2 4 0.0625
8
IEEE 754 Standards for Single
Precision Representation
http://numericalmethods.eng.usf.edu
IEEE-754 Floating Point
Standard
• Standardizes representation of
floating point numbers on
different computers in single and
double precision.
• Standardizes representation of
floating point operations on
different computers.
One Great Reference
What every computer scientist (and even if
you are not) should know about floating point
arithmetic!
http://www.validlab.com/goldberg/paper.pdf
IEEE-754 Format Single Precision
s
.
Value ( 1) 1 m 2 2 e ' 127
12
Example#1
1 1 0 1 0 0 0 1 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
1 1.625 2162127
1 1.625 235 5.5834 1010
13
Example#2
Represent -5.5834x1010 as a single
precision floating point number.
? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
5.5834 10 1 1. ? 2
10 1 ?
14
Exponent for 32 Bit IEEE-754
8 bits would represent
0 e 255
Bias is 127; so subtract 127 from
representation
127 e 128
15
Exponent for Special Cases e
Actual range of
1 e 254
e 0 and e 255 are reserved for special numbers
Actual range of e
126 e 127
Special Exponents and Numbers
e 0 all zeros
e 255 all ones
s e m Represents
0 all zeros all zeros 0
1 all zeros
all zeros -0
0 all ones all zeros
1 all ones all zeros
0 or 1 all ones non-zero NaN
IEEE-754 Format
1.1........1 2 2
127
3.40 10 38
Machine epsilon
mach 2 23
1.19 10 7
18
THE END