0% found this document useful (0 votes)

39 views19 pages

Floating Point To Fixed Point Conversion

This document discusses converting algorithms from floating-point to fixed-point domains for hardware implementation. It describes fixed-point data types and how they are represented in binary. The document then presents a simple method for floating-point to fixed-point conversion in fewer steps than MATLAB's fixed-point toolbox. Several examples demonstrate performing arithmetic operations like addition and multiplication using both methods, and show they produce the same results, with the proposed method being faster.

Uploaded by

nayeem4444

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views19 pages

Floating Point To Fixed Point Conversion

Uploaded by

nayeem4444

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

1

FloatingpointtoFixedpoint
conversion
FixedPointDesign
2

FixedPointDataTypes
In a digital hardware, numbers are stored in binary words. A binary word is a fixedlength
sequence of bits (1's and 0's). How hardware components or sofware functons interpret this
sequenceof1'sand0's isdenedbythedata type.Binarynumbersarerepresentedaseither
fixedpoint or floatingpoint data types. In order to implement an algorithm such as
communication algorithms, the algorithm should be converted to the fixedpoint domain and
thenitshouldbedescribedwithHardwareDescriptionLanguage(HDL).InHDLcodingprocess,
it is necessary to indicate the size of the variables and registers. The registers should be large
enoughtorepresentthevalueofparameterswiththedesiredprecision.
Fixedpointdatatypehelpsustoknowwhathappensinthehardware.Intheotherwords
when an algorithm is represented in floatingpoint domain, all of the variables have 64 bits(in
MATLABprogramming).Soalloftheoperationsaredonewithlargenumberofbits.Weknow
that it is impossible to implement an algorithm with large number of flip flops. Because large
number of flip flops need a larger area, and more power consumption. In order to solve this
problem the algorithm should be converted to the fixedpoint domain. In the fixedpoint
domainapair(W,F)isconsideredforeachoftheparametersinthealgorithm,whereWisthe
word length of the parameters and F is the fractional length of the parameters. It is obvious
that larger W and F results in a better performance and lower bit error rate (BER) but the
designneedsalargesiliconarea.OntheotherhandsmallerWandFresultinalargerBERbut
lessarea.Soweshouldchoosesuitablevaluesof(W,F)foreachparameterinthealgorithm.For
this reason a simulation should be ran for the algorithm to get the dynamic range of the
parameters. Simulation results indicate the dynamic rangeof the variables and the number of
bitsforWandF,whichareusedtorepresentthevariableswiththedesiredprecision.
According to the previous section, a fixedpoint data type is characterized by the word
length in bits, the position of the binary point, and whether it is signed or unsigned. The
positionofthebinarypointisthemeansbywhichfixedpointvaluesarescaledandinterpreted.
Forexample,abinaryrepresentationofageneralizedfixedpointnumber(eithersignedor
unsigned)isshownbelow:
0
b
1
b
2
b
3
b
1 wl
b
2 wl
b

FixedPointDesign
3

Where:
b

istheithbinarydigit
wlisthewordlengthinbits
b
wI-1
isthelocationofthemostsignificant,orhighest,bit(MSB)
b
0
isthelocationoftheleastsignificant,orlowest,bit(LSB).
The binary point is shown three places to the left of the LSB. In this example, therefore, the
numberissaidtohavethreefractionalbits,orafractionlengthofthree.
Fixedpointdatatypescanbeeithersignedorunsigned.Signedbinaryfixedpointnumbers
aretypicallyrepresentedinoneoftheseways:
Sign/magnitude
One'scomplement
Two'scomplement
Two's complement is the most common representation of signed fixedpoint numbers and is
theonlyrepresentationusedbyFixedPointToolboxinMATLAB.
Fixedpointnumberscanbeencodedaccordingtothefollowingscheme:
Rea| -ua|ue = 2
-ract|una|-|ength
xtured |nteger(1)
wherestorcJ intcgcris the raw binary number, in which the binary point assumed to be at
thefarrightoftheword.
Conversion of an algorithm from floatingpoint domain to fixedpoint domain can be done
throughtheMATLABfixedpointtoolbox.
FixedPoint Toolbox provides fixedpoint data types in MATLAB and enables algorithm
developmentbyprovidingfixedpointarithmetic.FixedPointToolboxenablesyoutocreatethe
followingtypesofobjects:
fi Defines a fixedpoint numeric object in the MATLAB workspace. Each fi object is
composedofvaluedata,afimathobject,andanumerictypeobject.
fimathGovernshowoverloadedarithmeticoperatorsworkwithfiobjects
fiprefDefinesthedisplay,logging,anddatatypeoverridepreferencesoffiobjects
numerictypeDefinesthedatatypeandscalingattributesoffiobjects
quantizerQuantizesdatasets
FixedPointDesign
4

Normallycomplicatedalgorithmshavemanyvariablessothenumberoffixedpointobjects
growssignificantly.Moreover,insomecasesalongtimesimulationisneededtoobtaintheBER
curves of the algorithm. In the above cases fixedpoint simulation with MATLAB fixedpoint
toolboxneedsalargeamountofmemory,time,andCPUusageandinmostofthecasesitwill
crash.
In order to solve the above problem a simple method for floatingpoint to fixedpoint
conversion is proposed in this tutorial. Simulation results with this method and simulation
results with the MATLAB fixedpoint toolbox are the same, but the simulation with the
proposed method is significantly faster than the other. For example one iteration of KBest
algorithm simulation with MATLAB fixedpoint toolbox, takes 237 seconds but simulation with
the proposed method, needs only 36 seconds. So in a longtime simulation for example 5000
iterationMATLABfixedpointtoolboxdoesntworkwell.

FloatingpointtoFixedpointconversion:
Inthispartasimplemethodforfloatingpointtofixedpointconversionwilldescribe.Then
we consider the various arithmetic operations and mention a lot of examples for them and
finallycomparetheirresultswiththeresultsofMATLABfixedpointtoolbox.
In order to convert a floatingpoint value to the corresponding fixedpoint vlaue use the
followingsteps.
Considerafloatingpointvariable,o :
Step 1: Calculate b = o 2
P
, where F is the fractional length of the variable. Note thatbis
representedindecimal.
Step 2:Roundthevalueofbtothenearestintegervalue.Forexample:
rounJ(S.S6) = 4
rounJ(-1.9) = -2
rounJ(-1.S) = -2
Step 3:Convertbfromdecimaltobinaryrepresentationandnamethenewvariablec.
Step 4: Now, we assume that c, needsnbits to represent the value ofbin binary. On the
otherhandweobtainthevaluesofWandF,fromthesimulation.SothevalueofWshouldbe
FixedPointDesign
5

equalorlargerthann.IfSmallvalueischosenforW,weshouldtruncatec.IfWislargerthan
n,(W n)zerobitsaddtotheleftmostofc.
Now consider the simulation is ran carefully and suitable values of (W,F) are obtained. It
means that W is equal or larger than n. So (W n) zero are added to leftmost of c. Then we
select F bits ofcfrom positon 0 to F1 as the fractional part of the fixedpoint variable.
Thereforetheconversionfromfloatingpointtofixedpointisfinishedbyfindingthepositionof
binary point inc.In order to verify the result, we can do the same conversion with MATLAB
fixedpoint toolbox. The results of both methods are the same, but the proposed method is
faster.BecauseinMATLABmethodweshouldcallalargenumberoffixedpointfunctionsand
fixedpointobjects,whicharetimeconsumingandtheyneedalargememory.
Inthefollowingsectionvariousexamplesarementionedfordifferentarithmeticoperation
such as addition, subtraction, multiplication, and norm. In each case the operation is done
throughthebothmethodsandshownthattheresultsarethesame.
Note:
In the following examples Method 1 shows the MATLAB fixedpoint toolbox and
Method 2showstheabovemethod.
The dot in the binary representation is used to separate the fractional part and the
integerpartofthevariable.Butitisntapartofthevariable.

Example 1)
This example shows that the value of (W,F) should choose carefully from the simulation
(accordingtothedynamicrangeofvariables).

Method 1:i (S.61S,1,7,4) = S.62Sconverttobinarywithbin()u11.1u1u

i (S.61S,1,1u,7) = S.6u94converttobinarywithbin()u11.1uu111u
i (S.61S,1,1S,12) = S.61Sconverttobinarywithbin()u11.1uu111uu1111

FixedPointDesign
6

Example 2)
This example shows the conversion of a floatingpoint value to fixedpoint value and then
find the corresponding binary value and finally shows the conversion of a binary value to
correspondingrealvalueby(1).

Method 1:
i (S.61S,1,1S,12) = S.61Sconverttobinarywithbin() u11.1uu111uu1111(w, F) = (1S,12)
(u111uu111uu1111)
b
= (14799)
d
converttodecimalby(1)14799 2
-12
= S.61S

Example 3)
Thisexampleshowsconversionofafloatingpointvaluetocorrespondingfixedpointvalue
intwomethods.Bothpositiveandnegativevaluesarecoveredinthisexample.
o = S.u1S,(w, F) = (8,S)

Method 1:
i (S.u1S,1,8,S) = S.uuconverttobinarywithbin()uuu11.uuu

Method 2:
Step1:b = o 2
P
= S.u1S 2
+3
= 24.1u4u
Step2: rounJ(24.1u4u) = 24
Step3:c = Jcc2bin(b) = 11uuu
Step4:c = uuu11.uuu
Inbothmethods:rcol :oluc = intcgcr :oluc 2
-P

FixedPointDesign
7

Example 4)
o = 9.S14S2,(w, F) = (12,7)
Method 1:
i (9.S14S2,1,12,7) = 9.S1S6converttobinarywithbin()u1uu1.1uuuu1u

Method 2:
Step1:b = o 2
P
= 9.S14S2 2
+7
= 1217.8S29
Step2: rounJ(1217.8S29) = 1218
Step3:c = Jcc2bin(b) = u1uu11uuuu1u
Step4:c = u1uu1.1uuuu1u

Example 5)
o = -9.uS14,(w, F) = (14,9)
Method 1:
i (-9.uS14 ,1,14,9) = -9.uSu8converttobinarywithbin()1u11u.1111uu11u

Method 2:
Step1:b = o 2
P
= -9.uS14 2
+9
= -46S4.S
Step2: rounJ(-46S4.S) = -46S4
Step3:c = Jcc2bin(b) = 1u11u1111uu11u
Step4:c = 1u11u.1111uu11u
FixedPointDesign
8

Example 6) Multiplication 1
This example shows the conversion of a floatingpoint multiplication to fixedpoint
multiplication.Inordertoperformthisconversion:
1
st
:Eachofoperandsareconvertedtofixedpointonlybystep1andstep2.
2
nd
:Performthemultiplicationwithnewvalues.
3
rd
:Applythestep3andstep4onthemultplicatonresult.

o = S.61S , (w, F) = (8,4) , b = 2 , (w, F) = (S,2)

Note:
(W,F)fortheresultofmultplicatonis(13,6).
Method 1:
J = i(S.61S,1,8,4) = S.62S,c = i(2,1,S,2) = 2
mult = J c = 7.2S converttobinarywithbin()c = uuuu111.u1uuuu
Note:
Notethatifthemultiplicationisperformedbeforefixedpointconversion,theresultwillbe
differentwiththeaboveresult.Itis bettertoperformfixedpointconversionforeachoperand,
thenperformtheoperation.
Method 2:
Step1:J = o 2
P
= S.61S 2
+4
= S7.8u8
Step2: rounJ(S7.8u8) = S8

Step1:c = b 2
P
= 2 2
+2
= 8
Step2: rounJ(8) = 8
c = o b
FixedPointDesign
9

mult = rounJ(J) rounJ(c) = S8 8 = 464

Step3:c = Jcc2bin(mult) = u111u1uuuu
Step4:c = uuuu111.u1uuuu

Example 7) Multplicaton 2.
o = 2.1S , (w, F) = (8,S ) , b = S.24S6 , (w, F) = (12,9)
Note:
(W,F)fortheresultofmultplicatonis(20,14).
Method 1:
J = i(2.1S,1,8,S) = 2.12S,c = i(S.24S6,1,12,9) = S.2461
mult = J c = 6.8979 converttobinarywithbin()c = uuu11u.111uu1u1111uuu
Method 2:
Step1:J = o 2
P
= 2.1S 2
+5
= 68.16
Step2: rounJ(68.16) = 68

Step1:c = b 2
P
= S.24S6 2
+9
= 1661.7472
Step2: rounJ(1662) = 1662
c = o b
mult = rounJ(J) rounJ(c) = 68 1662 = 11Su16
Step3:c = Jcc2bin(mult) = u11u111uu1u1111uuu
Step4:c = uuu11u.111uu1u1111uuu

FixedPointDesign
10

Example 8) Additon.1
This example shows the conversion of a floatingpoint addition to fixedpoint addition. In
ordertoperformthisconversion:
1
st
: Align the binary point of operands by adding zero in the right side of the operand, which
hassmallerfractionallength.
2
nd
:Eachofoperandsareconvertedtofixedpointonlybystep1andstep2.
3
rd
:Performtheadditionwithnewvalues.
4
th
:Applythestep3andstep4ontheadditionresult.

o = S.61S , (w, F) = (7,S ) , b = 2.S , (w, F) = (7,2)

Note:
Itisnecessarytoconsideronebitforcarry.Sothewordlengthoftheadditionresultisthe
larger wordlength of operands plus one. The fractionallength of the addition is the larger
fractionallength of operands. So the step 1 is done with final fractionallength (fractional
lengthofaddition).Thereforeinthisexample(W,F)ofadditonisequalto(8,3).
Method 1:
J = i(S.61S,1,7,S) = S.62S,c = i(2.S,1,7,2) = 2.2S
oJJ = J +c = S.87Su converttobinarywithbin()c = uu1u1.111
Method 2:
Step1:J = o 2
P
= S.61S 2
+3
= 28.9u4
Step2: rounJ(28.9u4) = 29

Step1:c = b 2
P
= 2.S 2
+3
= 18.4
Step2: rounJ(18.4) = 18
c = o +b
FixedPointDesign
11

oJJ = rounJ(J) +rounJ(c) = 29 +18 = 47

Step3:c = Jcc2bin(oJJ) = 1u1111
Step4:c = uu1u1.111

Example 9) Additon.2
Thisexampleshowsthedifferentbetweenthefollowingtwowaysinfixedpointsimulation:
a Perform the operation in floatingpoint domain and then convert the result to the
fixedpoint domain.
b Convert the operands to the fixedpoint domain and then perform the operation in
fixedpoint domain.
InordertoshowthisnotetheExample8,whichisdonewiththesecondwayisperformedagain
inthefirstway.
In order to have an efficient fixedpoint simulation, it is necessary to perform the second way.

1
st
way:
oJJ = (S.61S +2.S) = S.91S,
oJJ_i = i(oJJ, 1,7,2) = 6 converttobinarywithbin()c = uu11u.uu
2
nd
way:
J = i(S.61S,1,7,2) = S.S,c = i(2.S,1,7,2) = 2.2S
oJJ = J +c = S.7S converttobinarywithbin()c = uu1u1.11

FixedPointDesign
12

Example 10) Addition.3

o = -9.61S , (w, F) = (1u,S ) , b = -S.421 , (w, F) = (8,S)

Method 1:
J = i(-9.61S,1,1u,S) = -9.62S,c = i(-S.421,1,8,S) = -S.4u6S
oJJ = J +c = -1S.uS1S converttobinarywithbin()c = 11uu1u.11111
(W,F)=(11,5)

Method 2:
Step1:J = o 2
P
= -9.61S 2
+5
= -Su7.616
Step2: rounJ(-Su7.616) = -Su8

Step1:c = b 2
P
= -S.421 2
+5
= -1u9.472
Step2: rounJ(-1u9.472) = -1u9
c = o +b
oJJ = rounJ(J) +rounJ(c) = (-Su8) +(-1u9) = -417
Step3:c = Jcc2bin(oJJ) = 11uu1u11111
Step4:c = 11uu1u.11111

FixedPointDesign
13

Example 11) Addition.4

o = -9.61S , (w, F) = (1u,S ) , b = +S.421 , (w, F) = (8,S)

Method 1:
J = i(-9.61S,1,1u,S) = -9.62S,c = i(+S.421,1,8,S) = S.4u6S
oJJ = J +c = -6.2188 convert to binary with bin() c = 111uu1.11uu1
(W,F)=(11,5)

Method 2:
Step1:J = o 2
P
= -9.61S 2
+5
= -Su7.616
Step2: rounJ(-Su7.616) = -Su8

Step1:c = b 2
P
= S.421 2
+5
= 1u9.472
Step2: rounJ(1u9.472) = 1u9
c = o +b
oJJ = rounJ(J) +rounJ(c) = (-Su8) +(1u9) = -199
Step3:c = Jcc2bin(oJJ) = 111uu111uu1
Step4:c = 111uu1.11uu1

FixedPointDesign
14

Example 12) Addition.5

o = +9.61S , (w, F) = (1u,S ) , b = -S.421 , (w, F) = (8,S)

Method 1:
J = i(+9.61S,1,1u,S) = +9.62S,c = i(-S.421,1,8,S) = -S.4u6S
oJJ = J +c = +6.2188 converttobinarywithbin()c = uuu11u.uu111
(W,F)=(11,5)

Method 2:
Step1:J = o 2
P
= +9.61S 2
+5
= +Su7.616
Step2: rounJ(+Su7.616) = Su8

Step1:c = b 2
P
= -S.421 2
+5
= -1u9.472
Step2: rounJ(-1u9.472) = -1u9
c = o +b
oJJ = rounJ(J) +rounJ(c) = (+Su8) +(-1u9) = +199
Step3:c = Jcc2bin(oJJ) = uuu11uuu111
Step4:c = uuu11u.uu111

FixedPointDesign
15

Example 13) Norm calculation

This example shows the conversion of a floatingpoint norm calculation to a fixedpoint
normcalculation.
o = S.2S +4.26i , (w, F) = (8,4)

Method 1:
b = i(S.2S +4.26i , 1, 8, 4) = S.2Suu + 4.2Suui
c = obs(b) = S.S7Suconverttobinarywithbin()bin(c) = u1u1.u11u

Method 2:
Step1:J = Rc{b] 2
P
= S.2S 2
+4
= S2
c = Im{b] 2
P
= 4.26 2
+4
= 68.16
Step2: rounJ(S2) = S2
rounJ(68.16) = 68

Step3: = obs(S2 +68i) = 8S.6uS7

Step4: rounJ(8S.6uS7) = 86
Step5:Jcc2bin(86) = u1u1u11u
Step6:g = u1u1.u11u
Note:
In the hardware implementation the norm operation is done by CORDIC. So in an efficient
fixedpoint conversion it is better to replace the corresponding command (i.e. abs() ) with
CORDIC.Butintheabovecodethedifferencebetweenthemisnegligible.
FixedPointDesign
16

Floatingpointtofixedpointconversionofanalgorithm
In this section conversion of an algorithm from the floatingpoint to the fixedpoint is
shown.Soasimplecodeisconvertedfromthefloatingpointdomaintothefixedpointdomain.
Thecorrespondingequation,whichisdescribedinthefollowingMATLABcodesis:

Portiol FucliJcon Distoncc(PFD) = |Z -RCS|

2
]=N
T
]=1

Method1:

f unct i on PED = Fi xedPED2( R, S, C, Z) ;

R_f i = f i ( R, 1, 12, 10) ; %f i - obj ect def i ni t i ons
C_f i = f i ( C, 1, 14, 13) ;
S_f i = f i ( S, 1, 4, 0) ;
Z_f i = f i ( Z, 1, 16, 12) ;

RCS = R*S*C; %The cor r espondi ng f l oat i ng- poi nt oper at i on
RCS_f i = R_f i *C_f i *S_f i ; %Per f or mt he mul t i pl i cat i on i n f i xed- poi nt
domai n
RCS_f i _ = f i ( RCS_f i , 1, 16, 12) ; %Li mi t t he ( W, F) of t he r esul t

PED_i nt er 1 = Z- RCS; %The cor r espondi ng f l oat i ng- poi nt oper at i on
PED_i nt er 1_f i = Z_f i - RCS_f i _;
PED_i nt er 1_f i _ = f i ( PED_i nt er 1_f i , 1, 16, 12) ; %Li mi t t he ( W, F) of t he
r esul t
PED_i nt er 2_f i = abs( PED_i nt er 1_f i _) ; %Per f or mt he nor mcal cul at i on

f or j =1: l engt h( R( : , 1) ) %Cal cul at e t he power oper at i on
PED_i nt er 3_f i ( j , 1) =PED_i nt er 2_f i ( j , 1) *PED_i nt er 2_f i ( j , 1) ;
end
FF=f i mat h;
PED_i nt er 4_f i = f i ( PED_i nt er 3_f i , 1, 16, 12) ; %Li mi t t he ( W, F) of t he
r esul t
PED = f i ( sum( PED_i nt er 4_f i ) , 1, 16, 12) ; %Per f or mt he Sumoper at i on

NOTE:
In order to perform the summation operation in the above equation you can call the
abovefuncton(i.e.FixedPED2)inaloopwithapropervaluefortheloopcounter,which
isN
1
inthisequation.Thisprocessdoesntaffectonyourfixedpointconversion.

FixedPointDesign
17

Method2:

f unct i on PED = Fi xedPED3( R, S, C, Z) ;

R_Fr ac=8; %The Fr act i onal Lengt h and
R_Wor dLengt h=12; %The Wor d Lengt h of t he par amet er s ( W, F)
S_Fr ac=0;
S_Wor dLengt h=4;
C_Fr ac=14;
C_Wor dLengt h=15;
Z_Fr ac=12;
Z_Wor dLengt h=16;

RCS_Fr ac=R_Fr ac+S_Fr ac+C_Fr ac;
%PED_i nt er 1_Fr ac=max( Z_Fr ac, RCS_Fr ac) ;

R_f i 0=R*2^R_Fr ac; %St ep1 i n t he Met hod2
S_f i 0=S*2^S_Fr ac;
C_f i 0=C*2^C_Fr ac;
Z_f i 0=Z*2^Z_Fr ac;

R_f i =r ound( R_f i 0) ; %St ep2 i n t he Met hod2
S_f i =r ound( S_f i 0) ;
C_f i =r ound( C_f i 0) ;
Z_f i =r ound( Z_f i 0) ;

RCS_f i = R_f i *S_f i *C_f i ; %Per f or mi ng t he mul t i pl i cat i on
RCS_f i 1=RCS_f i *2^( - RCS_Fr ac) ; %Cal cul at i on of t he r eal - val ue of t he
RCS_f i 1 by ( 1)
RCS = R*S*C; %The cor r espondi ng f l oat i ng- poi nt
oper at i on

RCS_Fr ac = Z_Fr ac; %Equal i ze t he Fr act i onal Lengt h of t he
t wo oper ands
RCS_f i 2 = RCS_f i 1 *2^( RCS_Fr ac) ; %St ep1 i n t he Met hod2
RCS_f i 3 = r ound( RCS_f i 2) ; %St ep2 i n t he Met hod2

i f ( RCS_Fr ac<Z_Fr ac) %The t wo oper ands of t he addi t i on, shoul d
have t he same Fr act i onal l engt h.
RCS_f i 4=RCS_f i 3*2^( Z_Fr ac- RCS_Fr ac) ;
Z_f i 1=Z_f i ; %I n gener al Thi s condi t i on i s
%used t o equal i ze t he f r act i onal
el se %( RCS_Fr ac>=Z_Fr ac) %l engt h of t he t wo oper ands.
Z_f i 1=Z_f i *2^( RCS_Fr ac- Z_Fr ac) ; %But i n t hi s code, i n t he
%pr evi ous l i nes t hi s act i on i s
RCS_f i 4=RCS_f i 3; %done wi t h " RCS_Fr ac = Z_Fr ac; "
end
FixedPointDesign
18

PED_i nt er 1_f i = Z_f i 1- RCS_f i 4;

PED_i nt er 1_Fr ac = Z_Fr ac; %Updat e t he f r act i onal l engt h of
%t he r esul t of subt r act i on
PED_i nt er 1 = Z- RCS; %The cor r espondi ng f l oat i ng- poi nt
%oper at i on

f or j =1: l engt h( R( : , 1) )
PED_i nt er 2_f i ( j , 1) =abs( PED_i nt er 1_f i ( j , 1) ) ; %Per f or mi ng t he nor m
%cal cul at i on
PED_i nt er 2( j , 1) =abs( PED_i nt er 1( j , 1) ) ; %The cor r espondi ng
f l oat i ng- poi nt oper at i on
end

PED_i nt er 2_Fr ac=PED_i nt er 1_Fr ac; %Updat e t he f r act i onal l engt h of
%t he r esul t of nor mcal cul at i on

PED_i nt er 3_f i =PED_i nt er 2_f i . ^2; %Per f or mi ng t he power oper at i on
PED_i nt er 3=PED_i nt er 2. ^2; %The cor r espondi ng f l oat i ng- poi nt
%oper at i on

PED_i nt er 3_Fr ac=PED_i nt er 2_Fr ac*2; %Updat e t he f r act i onal l engt h of
%t he r esul t of power cal cul at i on

PED_i nt er 4_f i =PED_i nt er 3_f i *2^( - PED_i nt er 3_Fr ac) ; %Cal cul at i on of t he
r eal - val ue of t he PED_i nt er 4_f i by ( 1)

PED_i nt er 4_Fr ac=PED_i nt er 3_Fr ac- 8; %Updat e t he f r act i onal l engt h
f or t he next st ep
%NOTE:
%I f t he f r act i onal l engt h of t he
%t he i nt er medi at e var i abl es
%i ncr ease si gni f i cant l y, we can
%l i mi t i t wi t h t he f ol l owi ng met hod.
%Important NOTE:
%Cal cul at i on of t he r eal val ue
%i s done wi t h t he ol d F, but t he
%st ep1 of Met hod2 i s
%done wi t h t he new F.

PED_i nt er 5_f i =PED_i nt er 4_f i *2^( PED_i nt er 4_Fr ac) ; %St ep1 i n t he Met hod2
PED_i nt er 6_f i =r ound( PED_i nt er 5_f i ) ; %St ep2 i n t he Met hod2
PED_i nt er 6_Fr ac=PED_i nt er 4_Fr ac; %Updat e t he f r act i onal l engt h
%f or t he next st ep

PED1 =sum( PED_i nt er 6_f i ) ; %Per f or mt he sumoper at i on

FixedPointDesign
19

PED1_Fr ac=PED_i nt er 6_Fr ac; %Updat e t he f r act i onal l engt h f or

%t he next st ep
PED=PED1*2^( - PED1_Fr ac) ; %Cal cul at i on of t he r eal - val ue of
%t he PED by ( 1)
PED0 = sum( PED_i nt er 3) ; %The cor r espondi ng f l oat i ng- poi nt oper at i on

Introduction To Fixed Point Math
No ratings yet
Introduction To Fixed Point Math
8 pages
1 Lakhs Number
83% (6)
1 Lakhs Number
2,128 pages
IBPS PO Model Paper Canara Bank Probationary Officer Exam 2009 Solved Paper 1
No ratings yet
IBPS PO Model Paper Canara Bank Probationary Officer Exam 2009 Solved Paper 1
59 pages
A Level ZIMSEC Computer Science Notes
No ratings yet
A Level ZIMSEC Computer Science Notes
10 pages
4.3 2-D Discrete Cosine Transforms: N N K N N K N N X K K X
No ratings yet
4.3 2-D Discrete Cosine Transforms: N N K N N K N N X K K X
19 pages
L7 - Floating Point Representation
No ratings yet
L7 - Floating Point Representation
39 pages
COA - Unit 2 Data Representation 1
No ratings yet
COA - Unit 2 Data Representation 1
59 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
Kancharla Srinivasa Rao
No ratings yet
Kancharla Srinivasa Rao
2 pages
wp491 Floating To Fixed Point
No ratings yet
wp491 Floating To Fixed Point
14 pages
Data Types T4 Floating Point Arithmetic
No ratings yet
Data Types T4 Floating Point Arithmetic
34 pages
m7 Fixed Point
No ratings yet
m7 Fixed Point
19 pages
Computer Oriented Numerical Methods!
No ratings yet
Computer Oriented Numerical Methods!
160 pages
5vp ITP Num Formats
No ratings yet
5vp ITP Num Formats
15 pages
Ch4-Machine Level Representation of Data-2019
No ratings yet
Ch4-Machine Level Representation of Data-2019
44 pages
Fixed Point Notation
No ratings yet
Fixed Point Notation
2 pages
11 09 1213-01-00ad 60ghz Impairments Modeling
No ratings yet
11 09 1213-01-00ad 60ghz Impairments Modeling
29 pages
Fixed Point Conversion
No ratings yet
Fixed Point Conversion
50 pages
Asembly Language
No ratings yet
Asembly Language
42 pages
Cockrum Fall 2008 Final Paper
No ratings yet
Cockrum Fall 2008 Final Paper
15 pages
Fixed Point Representation
No ratings yet
Fixed Point Representation
8 pages
13.3 Floating Point Numbers Notes 2024
No ratings yet
13.3 Floating Point Numbers Notes 2024
8 pages
Add04 Numbers
No ratings yet
Add04 Numbers
28 pages
Fixed Point Representation
No ratings yet
Fixed Point Representation
3 pages
Uu (Projectname) TestCases
No ratings yet
Uu (Projectname) TestCases
9 pages
Shi Wal 95 A
No ratings yet
Shi Wal 95 A
8 pages
4 Floating Point Inclass
No ratings yet
4 Floating Point Inclass
33 pages
Fixed Point Numbers
No ratings yet
Fixed Point Numbers
12 pages
CAO 2 Unit 1
No ratings yet
CAO 2 Unit 1
59 pages
Computer Arithmetic
No ratings yet
Computer Arithmetic
9 pages
2 3-FloatingPtNumbers
No ratings yet
2 3-FloatingPtNumbers
44 pages
Simplified Soft-Output Demapper For Binary Interleaved COFDM With Application To HIPERLAN/2
No ratings yet
Simplified Soft-Output Demapper For Binary Interleaved COFDM With Application To HIPERLAN/2
6 pages
Unit 5 - Share
No ratings yet
Unit 5 - Share
38 pages
Lect07 Floating Point
No ratings yet
Lect07 Floating Point
13 pages
Subject Name-Subject Code-PCECS302 Topic-The Fixed Point Representation Method
No ratings yet
Subject Name-Subject Code-PCECS302 Topic-The Fixed Point Representation Method
5 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
5 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
5 pages
Lecture 5 Fixed Point Vs Floating Point Q-Format Number Representation
No ratings yet
Lecture 5 Fixed Point Vs Floating Point Q-Format Number Representation
5 pages
20 Fixed Point Binary Numbers
No ratings yet
20 Fixed Point Binary Numbers
9 pages
Data Storage in Computer System: BITS Pilani
No ratings yet
Data Storage in Computer System: BITS Pilani
30 pages
SW Lab 3 Fixed Point Simulation EE 462
No ratings yet
SW Lab 3 Fixed Point Simulation EE 462
7 pages
Spra 948
No ratings yet
Spra 948
13 pages
Indic On Paper I Eee
No ratings yet
Indic On Paper I Eee
5 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
NAChapter 1
No ratings yet
NAChapter 1
24 pages
4.4 - 1 New Floating Point
No ratings yet
4.4 - 1 New Floating Point
22 pages
Binary Tutorial
No ratings yet
Binary Tutorial
10 pages
Fixed Point Math F-Lemieu
No ratings yet
Fixed Point Math F-Lemieu
5 pages
Computer Architecture and Organization: Lecture 6: Floating Points
No ratings yet
Computer Architecture and Organization: Lecture 6: Floating Points
20 pages
Floating-Point To Fixed-Point Code Conversion With Variable Trade-Off Between Computational Complexity and Accuracy Loss
No ratings yet
Floating-Point To Fixed-Point Code Conversion With Variable Trade-Off Between Computational Complexity and Accuracy Loss
6 pages
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
No ratings yet
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
18 pages
Lecture9 - Fixed Point
No ratings yet
Lecture9 - Fixed Point
36 pages
Floating-Point To Fixed-Point Conversion For Audio
No ratings yet
Floating-Point To Fixed-Point Conversion For Audio
10 pages
Fixed Point
No ratings yet
Fixed Point
3 pages
Fixed Point Numbers
No ratings yet
Fixed Point Numbers
20 pages
A Fixed-Point Type For Octave
No ratings yet
A Fixed-Point Type For Octave
5 pages
Fixed-Point Algorithm Development
No ratings yet
Fixed-Point Algorithm Development
6 pages
Computations in Mechanical Engineering: Numbers and Vectors
No ratings yet
Computations in Mechanical Engineering: Numbers and Vectors
18 pages
Advanced Computational Methods: ENGR 680
No ratings yet
Advanced Computational Methods: ENGR 680
19 pages
Fixed Versus Floating Point
No ratings yet
Fixed Versus Floating Point
5 pages
Lab # 06 PDF
No ratings yet
Lab # 06 PDF
12 pages
Floating Point To Fixed
No ratings yet
Floating Point To Fixed
15 pages
C Programming Language
From Everand
C Programming Language
Younish Pathan
No ratings yet
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
From Everand
Bilinear Interpolation: Enhancing Image Resolution and Clarity through Bilinear Interpolation
Fouad Sabry
No ratings yet

Floating Point To Fixed Point Conversion

Uploaded by

Floating Point To Fixed Point Conversion

Uploaded by

1

Method 1:i (S.61S,1,7,4) = S.62Sconverttobinarywithbin()u11.1u1u

o = S.61S , (w, F) = (8,4) , b = 2 , (w, F) = (S,2)

mult = rounJ(J) rounJ(c) = S8 8 = 464

o = S.61S , (w, F) = (7,S ) , b = 2.S , (w, F) = (7,2)

oJJ = rounJ(J) +rounJ(c) = 29 +18 = 47

Example 10) Addition.3

Example 11) Addition.4

Example 12) Addition.5

Example 13) Norm calculation

Step3: = obs(S2 +68i) = 8S.6uS7

Portiol FucliJcon Distoncc(PFD) = |Z -RCS|

PED_i nt er 1_f i = Z_f i 1- RCS_f i 4;

PED1_Fr ac=PED_i nt er 6_Fr ac; %Updat e t he f r act i onal l engt h f or

You might also like