Ad 0680613
Ad 0680613
RM-5766-SA
DECEMDER IPOR
PREFACE
This Memorandum was written for the Office of the Assistant Sec-
Har.,'are C ot Analysis.
empirical equations.
and of the properties of the equations he uses. For the most part, the
of the author's knowledge does not exist in any single source. The
selection of the material itself are the unique features of this pre-
mathematical training.
not been included. Only those forms and methods that have already
are for the most part suitable for the desk calculator, high-speed
digital computers are widely available today, and should be used when
possible.
_V--
SUMMARY
while at the same time demonstrating selected methods for Iv;!i mechan-
the properties of the straight line, the exponential, the power func-
viding insights into the impact of the parameter values on the form of
detail.
cases with more than two variables are considered. By using the method
influence of any others, and thus to clarify the meaning of the coeffi-
purposes, the user of this material will want to combine his readings
facilitate Lhis.
-vii-
CONTENTS
PREFACE . . . . . . . . . . . .
SUMMIARY .* v
Sect ion
I. CURVE FITTINC AND EMPIRICAL EQUATIONS......... .. .. .. ... 1
Introduction . . . . . . . . . . . . . . . . . . . . . .
Some Basic Analytic Ceomet.................2
Straight Line .. ...................... 3
Parabola. ......................... 9
ExpUlnfntliai................................22
Power Function ....................... 38
APPENDIX
A. Derivation of 'he Normal Equations for a Least-squares
Fit of a Straight Line, A Parabola, and A Three-
variable Linear Equation. ................. 1-15
BIl3LItX;RAPHY.............................H4
-lx--
TABLES
10. Using the Method of Least Squares to Fit thle Power Function:
Worksheet ........................... 84
11. Using the Method of Least Squares to Fit the Power Function
with thle ('ons talr L Work)iet t...............89
F IGURES
2. Straight line drawn through two p,,ints, .r". and x,,,,) ....
b. A parabola . . . . . . . . . . . . . . . . . . . . . . . . .
10. Points used to ill us trate fit t ing the parabola ....... .
I &urves
C illustrating the' relat 1onship bet'n . e ,
I, =", and .r - iE' and their 1ovarithit t r,inst or!.at ions .)s
INTRODUCTION
the one between the weight and cost of an aircraft and the manhours
on the subject would lead one to believe that there are as many
only a limited number of unique problems, and for that reason are not
plain curve fitting it.a general sense and to present only those
analyst.
tional forms, the general shape of the curves portrayed by each, and
the relationship between the shape of the curve, its location, and
below:
These suffice for most cost analysis problems with two variables. Although
there are other forms that are frequently used, they generally can be re-
Straight Line
the line. The main feature of any straight line is the slope or
tilt of the line. If the line rises reading from left to right as
if the line falls reading from left to right, the slope is said to be
negative. The actual value of the slope (b) is the ratio of a change
the vertical dashed line to the length of the horizontal dashed line
in Fig. 1.
the slope (b), we would be able to deduce the equation of the line.
(y -Y)
b - T -
(x I)
-4-
P(,Y
xly1
or
line. If we are given one point and the slope, we would immediately
be able to substitute in the above and have the equation of the line.
slope is not known directly, but two points on the liti are given.
)
(Y 2 -
(x -x)'
2 1
and this value may in turn be substituted into the Point-Slope formula.
-5-1
2p
I x
The modified form of the Point-s lope formula for the equation of a
straight line is
( ,. - ,t
t'his particular form is probably used more than any other in titt ng
straight lines.
'here are other instances when, as in Fig. , tCi' [I,pe and the
cept, is the point where the lint crosses the 1. axis. I11is point is
them to writt the equation ot the line. ihe Poiiitl opc tormtil.l
1 r
-6-
(y - a) = b(x - 0)
(x 2 , y€2 )
slope = b
P(a a)
~X
The remaining case is where both the .r and the : intercpts are
the li:e slopes downward from left to right so that we would expect
y a - X.
(0,a)
called the Intercept form. Another form of this equation which re-
recognizable.
The next figure, Fig. 5, shows two special cases of the straight
line, the line parallel to the -- axis and the line parallel to the
I
infinite. The equations for these two lines are quitt simple. I;"
the first case where the line is parallel to the .- a':is, the equal ion
is
where , is the constant distance from the x axis. In the second case
lar kind ,I i :tormation availa!,,> -t tie t i.-e, for t!'ere are ti.nes
-9-
when each is useful, The most generally useful, however, is the modi--
Parabola
a fixed point and a fixed line. Figure 6 shows such a curve. The fixed
point F is .nown a- the focuS of the oa- hola and the fixed line
however, we will posit ion the 'urve as sho ,r- in F g. 7. Ihe vertex
i,---- trx
//l is I-
-10-
when the value of y is 0 (which is the case at the origin), the di-
rectrix is the same distance to the left of the origin as the focus
x = -p/2.
LI
f/
/
/,
= 2
/
/
/
0O)
2
FP = (7 0 (x - /2)
PL = N + p/2)
j - 2
2 2
N(Y - 0) + (x - p/2) 2 )2
When both sides of this expression are squared and the result expanded,
we have
22
y + X px + - = X + X +2
4 4'
i' y22 2,
d = 1(YI
- Y2 ) + (Xl - x2)
(x 2Y 2 )
-12-
which simplifies to
2
y2 2px,
2 2
p/2) + (x)
FP -
2 ,
PL = y + p/2)
p/ 2 + ()=V( + r2
2. 2
4 x2
+ 21+
2 y-py + = y
2 + Py + P42__
2
x = 20y'
which is the equation for a parabola with its vertex at the origin,
symmetrical to the 4 axis, and opening upward. Notice that the ninety-
degree rotation, as was made between Fig. 7 and Fig. 8, caused the x
identical.
When the vertex of the parabola is shifted away from the origin,
x x' +h or x' =x - h,
where x and y refer to the original axes; x' and y' refer to the axes
whose center coincides with the vertex of the parabola; and h and k
are the coordinates of the origin of the x', y' axes measured from
2
(y - k) = 2p(x -h),
- 2k 9 + k = 2px- 2ph,
2 2=0.A+
Y 2px - 2ky + 2ph + k 2 0.
be written as follows:
2
y + Dx + Ey + F = 0,
where D = -2p,
E = -2k,
F = 2plz + k
This is the standard form of the equation for all parabolas symmetrical
2 h2
x 2xh + - 2py + 2pk O.
2 ' F
x + D + Ex + F'= 0,
where D = -2p,
E' = -2h,
F' = 2pk + h2
This is the standard form of the equation for all parabolas symmetrical
equations:
1 2 E F
1 2 E' F'
DX D Y
Ay 2 + By + C =x,
or
Ax2 + 8X + C =y
-16-
wl ivrc v /),
C = F/D or F'/D.
'hese are the forms of the parabola that are most commonly used
at least three points must be known to define the curve. Given three
axis would be the correct form to fit. The standard form of the
= 2.,'+ 5,"+
12 * (4,12)
I0
O4
!ic'. t S--l'used
juts t i lust rate I itt in g the par1LboIa
5
-17-
2 = OA + OB + C,
4 = 9A + 3B4 + C,
12 = 16A + 4B + C.
this knowledge to adjust the two remaining equations will reduce the
are
2 = 9A + 3B,
10 = 16A + 4F.
Probably the simplest for only two equations is the determinant meth-
od. As the number of variables and thc number of equations get larger,
however, other methods are preferred. In fact, when four or more equa-
to the desk calculator, but not particularly well suited for illus-
ing tile first equation from The second, A is eliminated. These steps
follow:
-18-
and
A + B -L
29
6
We next substitute the value of B into the first of the two variable
+ 87
- A94+54'+ -4
108 + 783
486
Ii
A 11
6'
29
6'
S2.
I
-19-
+, ( 2() + 2.
Fihe requ ired ar i thmet ic show'S Us that the va lues of the coefficients
calculated are in fact correct. Fihe equation that we have been looking
for is therefore
.= .++ 2.
them, allows us to draw the curve sho,n in Fig. 11. Contrary to our
\j ) 4,,
1
not examined the charac teristics of this curve over thle rele'vant v:Iiules,
of x, the fact that 1,is negat ive between 1'K and r 2 would not
have been noticed and coul 2! have led to absurd cost tstimates.
concerned with extrapolation beyond the range of thle exist ing data.
of data, wcmgeneral ly use o)nly, a limited segment of thle ent ire curve.
Figure 12 illustrates how this fact can lead to !rouble Thie boxed-in
segments of thle curve show thle part of thle CuICvC used to describe thle
boxes shown in FigE. 12a, 12b, and 12c indicates thle kind of trouble
one can geT. into by using this type of curve for making excrapolat ions
+ +,ii-
for 'using thte r.U.drat ic fornu li. Frirst the equit ion mutlb re-
-+ +
x
02b
I PI'r ibo'i 'pv'ilinr to~ th right
--B 4 B A (A)(C
x)
Use of this formula will probably result in two solutions because the
E',po~jnnia I
of two exp~nential equations that differ from each ot~ier onlyv with
respect to the value of 1,ar, 7hown in Fig. 13. In each case a has
been set equal to 1. As will be shown, only the levt-l of the exp'o-
? y
8
6 - 6
4-
2 2
3 2 if 1 2 3 3 2 1 13
0 0
For our purposes, the fact that the exponential curve rises from
left to right When K is greater than i and from right t. left when b
-24-
=
both curves pass through the point 1, x 0.
to raise the curve while lower values cause a downward shift. When x
.0
is equal to 0, b0 1, and the exponential becnmes
! =a.
8 2 8
2(2x) Y 1
6 6
(2
-4 4
2 2
3 2 1 1 2 3 2 1 1 2 3
0 0
L
-25- 1
topics. The system of exponents is based entirely on five basic laws
and four definitions. The first definition states that the expression
3
Sa x 7 x,
etc.
illustrate:
2 3 2+3 5
2 3
a7 = aa, 1 = 2A JXa (I
5
(a x a)(a x a x a) = (a x a x a x a x a) = a
worthwhile exercise for the reader to do so. All five laws are sum-
-,rized below:
mX
maa+
IL aa = a
i /
() =a /1
v. (a/b) n = a Zlb .
-26-
when a is greater than 0 and x is any rational number, but also provides
expressed as
X
yZ =a,
=
x logay
used. When the base is 10, the logarithms are called common (logs)
the abbreviation log where 10 is the base and in where e is the base.
in y = x,
or in exponential form,
x
=e
Figure 15(a) shows us that these two equations have exactly the same
In x=,
and
X =
of coordinate axes.
y
8
y e or x Iny
tx
y
8
or y e
-- L I -- I I
4 2 2_ 4 6 8
2 !
(15b) y = - I or x = In and
y Ii
16.5 =e
and
In 16.5
In 16.5 = 2.83,
or
16.5 = e2.83
2.83
=(e ),
and
S=e 2.83x
is proportional to, but not equal to, the value of the expression at
S - k/<'X
11
8 ,Y
x 1y
2 1 1
Two Points, P' aind P~ ,,haIZiVk beVen1 seect ed, one a t e it Ivr vend of
the curve. It is obV iou, that the slopes at these two points are k!it-
e rent . Let us ass ume that the curve extended an -quaI d i,t anci fror
each P in either direct ion (shown on the gray h as the h \potenUSe of thc
the straight line, we see that, having made this assumption, the coor-
. - - 9 )()= 8.19{
2 .00,
= . I ,( ,l = 6.063.
-
The formula for calculating the slope of a line given two points is
2
2 "' 1
and on substitution
8.000 -6.063
2.00 - 1 .6)0
me nt
The ? coordi nates were obtained by subs t itut in:g the .7'c'ord inat VS
in the expression ', = 2(2Y); values to three decimal1 pla,es were ob-
tained by solving, a procedure that improves the agreement between the
estimated slopes.
-32-
indicating that tho slope of the curve can be evaluated at any point
check this we use point P 2 in exactly the way we did P1 and derive
M
X 2 . -0.4, Y2 - 1.516,
M
,rI = -0.8, Y, = 1.149;
1.516 - 1.149
-0.4 + 0.8
S 0.918.
').918 = 1.320k,
4- = 0.695;
eyist. The value of the expression at each of the two points was calcu-
were not read from the curve. It is also interesting to note that for
close to the value of Z" estimated above. The fact that our results
assumption that the curve was linear over the relevant range.
-33-
of the form y
X , b is equal to
e
- e and the natural logarithm of e is
into logarithmic form, we take the logs of each side. Although the
a+bx
logarithm of y presents no problem, the logarithmic form of e may
appear to. When we remember the Laws of exponents and the fact that
of y to the base e, and as such becomes the natural form of the right-
In . • a + bx.
In y - a + bx.
we find that this is another dorm of the exponential which can be con-
I . ..
write
In ( /.,) = ,, +
-35-
Given the two points, 1,7 and 4,1 on x," coordinates, we chocse
rithms as follows:
In = in a + x In 1.
in Y. = in a + x, In b,
In Y2 = in a + x 2 in h.
Taking the logarithms of y, and 7)2 and substituting the logs '
the us and The xs in the above equations results in two equations with
1.9459 = in a + I In b,
0.0000 = in a + 4 in b.
in b -- 0.6486.
-36-
This valup can 4.n turn be substituted into the first equation above
1.9459 = in a - 0.6486,
In a = 2.5945,
The expression has been evaluated, and the results plotted in Fig. 17
pass exactly through the two pc*nts as required. We can simplify tie
in a = 2.59 = in i3.4,
in b -0.649 = In 0.523,
= 13.4(0.523)x
0.523 as e; hus.
13.4 = eI,
0.523 = e
-37-
14
12
10
6 y = 13.4(0.523)x
or
Y 2.59 - 0,649,,
(4,)
0 X
0 2 4 6 8 10 12
1I ""-,- r - In 13.4
and
s In (0.523.
-38-
We really did not need to use the table again because the Ins of these
s = in 0.523 = -0.649,
= in 13.4 = 2.59.
have
2.5 -O0.649 x
y=(e )(e ) .
Using the first and third laws of exponents we convert this expres-
sion to
y = e (2.59 - 0.649x)
In y = 2.59 - 0.649x,
Power Function
y - ax
x as below:
y =b x .
of the curve and has little influence on its shape. Having done this
greater than 0; the exponent b can assume any value, positive, nega-
y = 1,
used, the result would have been a straight line parallel to the x
=x) results, because any number raised to the first power is the
00
CI
-4r
.....
.....
..
-41-
not the same, as shown by Fig. 19. At the upper end all of the curves
curves become higher, the reverse of what happened when r was greater
than 1. When P is greater than I the curves are concave upwards; when
the cur-es are concave upwards. As before, however, each of the curves
than I the curves wih the lower values of b lie above those with higher
in a downward direction.
*[
This is true because a decimal raised to a power greater than
I gives a sniler number. Also, a decimal raised to a positive power
less than I gives a larger number than itself.
-42-
1.0
00.2
0.8
0.7
0.6)
0.6 -
0.9.
0.54.
0.4 1.4
0.3 1 .7
0.1
01
0 0.1 0.1 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
x*
-43-
CL
S a)
cc
u
-4-.- -4
HH/
C,
* I I
/ . 4
In most cost analvsis applications only that part of the c urve
tity, we are not concerned with the cost of le-s than one unit. But
both because the curve might be useful for other applications 31%!
log =log. + l;
og
curvo-f itt ing techn iques ire simpler when handlIing li near relaition-
power funct ion. F igure 22 snow!- the complete I aitv ot p wer funuc-
nnalvt ical lv, aqqune that ,e have two c'urvirs, one .,! Percent i!.hvr
4.0/
2.0 0.4
i 0.0)
1.0 •
0.9
0.8 -0.2
0.7
0.6 - .
0.5
0.4
0.30.
0.21
t:= -I .6
0.1 I I I
0.1 0.2 0.3 0.4 0.6 0.8 1.0 2.0 3.0 4.0 6.0 8.0 10.0
yU = ax b
2 = I 5ax
I
-47-
In the second equation let the constant terms log 1.5 + log a be set
+
log -I = 1 a b log x,
=
log Y 2 log a' + h log x.
The only difference between the two occurs in the constant terms log
a and log a'. When these two equations are plotted on logarithmic grids
there will be two parallel lines with the spacing between them equal to
linear case. "o review the method, see the discussion of the straight
line.
p. 3ff.
F4
the method of selected points, the method of'averages, and the method
of least squares. Of course one can always draw the curves freehand,
but even so the equation of the line must be determined by using one
found using the method of selected poinLs. With the use of a straight
edge, a line is drawn through the points such that the points are uni-
forinly dibtributed around the line. Two points are then read from the
= a + bx.
P1 (4, 8.3) and P 2 (26, 24). The two equations were therefore
8.3 = a + 4b,
24.0 = a + 261.
h = 0.714.
30
0
P2(26, 24)
200
'11-Freehand curve
Y = 5.44 + 0.714x
100
10 -
Pl (4, 8.3)
0 I I
0 10 20 30
x
= 5.44.
y = 5.44 + 0.714x.
the data Is to compare the calculated values with the original values
100
(8.00 - 6.87) .0 = 14.1.
8.00
Table I
8 2 6.87 14.1
6 4 8.30 -38.3
12 6 9.72 19.0
10 10 12.58 -25.8
14 12 14.01 -0.1
18 if 16.86 6.3
20 22 21.15 -5.8
28 24 22.58 19.4
26 2b 24.00 7.7
22 30 26.86 -22.1
15.9 av
After the percent deviations are calculated for each data point,
tion, disregarding the sign, and dividing the total by the number of
data points.
Notice that the placement of the line in the example was quite
different manner. Had the line been extended to the left until it
Still another method using the two points given and the modified point
"- = (X Xl
(24 -8.3\
4
- 8.3 = (24 8.3)• (x - ),
\26 4'
= 5.44 + 0.7!4x.
are divided so that two approximately equal groups are formed. If the
if odd, the extra point will have to be placed in one groul, or the
L' + i X.
,z and ,
In the example shown in Fig. 24 and Table 2, the data are ordered
for both x and y are calculated for each group, they define the two
points (6.8, 10) and 23.6, 22.8) which are then plotted and a straight
a and b.
10 = a + 6.8,
22.8 = a + 23.61.
20 - (23.6, 22.8)
= 4.82 + 0.762.r
10 -
Xo p (6.8, 10)
F 10i 20 30
12.8 = 16.8b,
b = 0.762;
10 ::a + (6.8)(0.762),
,' = 4.82.
Therefore,
y = 4.82 + 0.762x.
in Table 2.
Table 2
2 6.3, 20.7
6 4 7.87 -31.2
12 6 9.39 21.7
10 to 12.44 -24.4
14 12 13.96 0.3
10 av 6.8 av -- --
18 16 17.01 5.5
20 22 21.58 -7.9
28 24 23.11 17.5
26 26 24.63 5.3
22 30 27.b8 -25.8
22.8 av ___ _ av
23.6 8I
23.av_........ -! 6.0 av -
:- 4-
the data are used to establish ten separate linear equations as follows:
8 = a + 2!) 18 = a + 16b
6 = a + 4b 20 = a + 22K
12 = a + 6b 28 = a + 24i
10 a + lOL" 26 = a+ 261K
group are added to nbtain the required two equations which are solved
50 = + 14,
b4 - 84,;
- 0.762;
50 - . + (34'(0.762,
- 4.82.
he chosen.
re lects the Cr it er ion used o deter-, ,ine the des ired equat ion. !'he
I ine is chosen suich that the suim of the squared deviat i,ns of the
points from the 'ine is iinimized. Fhe way thi s criterion is used
4-...
such that tih' :;uM of t' -. square. d istanses of tihe daita fror: tha line
will )e minimal.
as
.ere the equat iot is of the form , .i j?, the distances are
measured parallel to tiue axis. Coenversely, where the equation s ol
the f,,rm x - a + " , the distances are measured parallel to he x axis.
Ed2 2
.22 2 z. + 2a x + 2i X 0;
ana with the obvious simplifications made, the resultant two equations
ia= Xla
L
74 U
(7X + X
line. Each of the values other than a and b can readily be deterimined
from the data, leaving two equations with two unknowns that can be
solved simultaneously.
the first two columns of Table 3. (The ordering of the values is not
The next step is to square each entry in the column headed x and enter
Table 3
the product in the column headed X. The entries in the column headed
-7 are the products of the x and values for each point. After these
the number of data points; the remaining values equired by the normal
equations are the totals of the appropriate columns. When these are
164 = 10 + 152,
= 5.66
= 0.707.
. = 5.66 + 0.707x.
Figure 25 shows the data and the straight line described by this
adding the results, dividing the total oy N (the number of data points),
Y N-2
30
20
10 -
10
0 I I
0 10 20 30
x
In our example
- ± 77.869
t 8
S = ±3.120.
V (S/y)100,
or
V= 312 100;
16.4
19o02%,
Sumar
selected points is the easiest to use, but when the required line is
not obvious from the data, the choice is strictly a matter of judg-
ment and the results are not easily reproducible. The method of aver-
of the odd data point. The method of least squares requires more in
selecting and fitting the line; for that and other reasons it is the
An average percent deviation can be used to show how well the line
the same data, as displayed in Fig. 26, are strikingly similar, the
T11E PARABOLA
The same three methods used to fit the straight line may be used
to fit the parabola. To lit the parabola, both the method of selected
44-
4.
C C
4.j:
Q 0 OL
U) -4
mJ zc C'~
-.1 0 .4
c~ 'C
0.
c-o
points and the method of averages rely even more on the judgment of
the cost analyst than was the case with respect to the straight line.
For thts and other reasons, the method of least squares is preferred. j
Measuring goodness of fit is the same as it was for the straight line
and will not be discussed again. There are two forms of the parabola
£
y = a + 7~x + C-x
2
x = a + by + cp
or, in term s of y,
-b t b2 - 4c(a - x)
Y =2c
2
Cx
a + hx +
b 12- 4c(a x)
2c
to the u axis and in the second it is parallel to the x axis. Each form
2
x = a + by + cy
2
cy + by + a =O,
-b ± Vb 2 - 4c(a - x)
y 2c
Parabola Form I
freehand curve roughly the shape of a parabola through the data, and
-63-
read three points from the curve, two at the extremities and one some-
mum point (vertex), it would be best to read the third point from the
= a + bx + cx
2
In -he example shown in Fig. 29, numbers for which are given in Table
4, we draw the freenand curve indicated by the solid line and read from
Table 4
3 1 2.00 -33.3
2 3 2.99 50.0
4 4 3.56 -11.0
5 8 6.37 27.4
7 10 8.08 15.4
10 11 9,01 -9.9
12 14 12.13 1.1
15 17 15.70 4.7
18 18 17.00 -5.6
17 19 j 1 .35 7.9
-- _ 16.o av
Lhe curve the points (18, 17), (12, 10) and (1, 2). The required
324
17 = a + 18b + c',
10 = g + 12b + 144 c,
2 =a+ Z~+
When we subtract the second from the first and the third from the
Rcond, we get
7 = 6b + 180e,
8 = 1Wh + 143c.
20
/
/
o/
(18, 17)
15 -
Freehand Curve
1 1.582 + 0.391
+ 0.0259.
/P (12, 10)
,
3Ile oj/
, 0
*x
0 5 10 15 20
0 -
29 = 1122 ,
= 0.0259.
7 = 6, + (180)(0.0259),
I = 0. 391.
2 = a + ; + .',
a - 2 - 0.391 - 0.0259,
. 1. 582.
The graph c this equat ion is shown by the dashed line in i ii . .Q.
As with the straight line, the chances are small that two analysts
the data in ascending order of one of the variables, form three groups
and for each group. We substitute the average points in the equation
,I + !)X + -. r"
Three groups are forme,- by assigning 3 points to the first and last
Table 5
(calc) { Percent
Deviation
3 1 2.34 22.0
2 3 3.15 -57.5
4 4 3.64 9.0
3 av 2.67 av
5 8 6.20 -'4.0
7 10 7.83 -11.9
10 it 8.73 -12.7
12 14 11.78 1.8
8.5 av 10.75 av
15 17 15.36 -2.4
18 18 16.67 7.4
17 19 18.04 -6.1
16.67 av 18 av -- 15.5 av
three points, (18, 16.67), (10.75, 8.5), and (2.67, 3), and plot them
20
P (18, 16.67)
15
2
2.018 + 0.290.r + 0.0291X
10 0
P (10.75, 8.5)
2.67, 3)
(
0
50 15 0
- + + ",
a = 2.018,
= 0.290,
= 0.029!,
=
9 2.018 + 0.290r + 0.0291x 2 .
but some a...biguity does exist because of having to assign any odd data
2
= a + hx -4-ox
x + c 2
Y = Na +
2 3
xy = a x + 1 x + C 3'x
2 a x2 x3 4
~xi~~x +~'x
+lx
-69-
those for the straight line. As in previous 2iscussions, all of the in-
formation required to solve the equations can be obtained from the data.
data as in the first two columns, making entries in the other columns
'Fable 6
2 3 J 4
xx
2 Percent
Deviation
3 1 1 1 1 33 2.2F 24.0
2 3 9 27 81 6 18 3.04 -52.0
4 4 16 64 256 16 64 3.51 12.0
5 8 64 512 4,096 40 320 6.03 -20.6
7 10 100 1,000 10,000 70 700 7.66 -9.4
10 11 121 1,331 14,641 110 1,210 8.58 14.2
12 14 196 2,744 38,416 168 2,352 11.69 2.6
15 17 289 4,913 83,521 255 4,335 15.37 -2.5
18 18 324 5,832 104,976 324 5,832 16.72 7.1
17 19 361 6,859 130,321 323 6,137 18.13 -6.6
number of data points and the column totals provide the other neces-
Theref arc
1.993,
3=0.254,
0.0314.
20
Is
2
0 1993 + 0,.'54x + 0.0314x
y 10
5 10 15 20
0
of least squares
Fig. 31--Parabola form 1 fitted using the method
-71-
2
1.993 -t 0.254X + 0.0314x
it is usually preferred.
20
Selected Points
16
Averages
12
Least Squares..
40
* Data Points
0 I I I I
0 4 8 12 16 20
x
Parabola Form 2
2
x - a + by + cy
The only difference between this and the equation for the parabola
before until a, b, and c have been calculated. At that point the equa-
The desired result will typically be one of the two possible solutions;
all three curve fitting methods can be used here also, we shall only
fit this kind of parabola are the same as for form 1 but with x and y
interchanged as
x ia + b
h Yy + y
-ya y + b I y + C Y
2 a d + b I + c y4
! 0
-73-
Table 7
x ,X
....-- 4 x" _ (calc)
(cal Percent
. ...... . . Deviation
2 4 8 16 2 4 0.76 62.0
4 2 16 64 256 8 32 3.99 0.3
6 3 36 216 1,296 18 108 5.80 3.3
8 5 64 512 4,096 40 320 8.40 -5.0
10 9 100 1,000 10,000 90 900 12.13 -21.3
12 6 144 1,728 20,736 72 864 9.46 21.2
15 12 225 3,375 50,625 180 2,700 14.33 4.5
16 16 256 4,096 65,536 256 4,096 16.84 -5.3
1' 18 324 5,832 104,976 324 5,832 17.97 0.2
19 20 361 6,859 130,321 380 7,220 19.04 -0.2
The solution is
(z = 0.913,
b = 0.0783,
= 0.0485;
in :.:
-74-
2
0.0485y + 0.0783y + (0.913 - x) 0,
which simplifies to
This equation has been graphed in Fig. 33. The reason for selecting the
form 2 parabola is that as larger and larger values of x are used, the
would not have been true had a form I parabola been used. This problem
,
is discussed in the section of this Memorandum on analytic geometry.
THE EXPONENTItL
or
Y lox ,
20 .
'A
151
-0.0783 + vrO.11TT4 T
0.0971
10/
5/
0 5/0152
IX
0U 10 15 20
For outtr
pqurpions alwmore usieflrep o te exonentale ishe
than1,
epedin on he alu ofa, n oaceeae tart
greater or less than x, depending on the value of b. For illustrative
although those who prefer to use base e ray do so, sin,:e the procedures
ting a curve of this type. There are iterative methods that can be
log = , + ;X.
straight line. It should be noted, however, that this method does not
yield the same least-squares solution for a and i,as the exponenli
also higher than would be the case had the least-squares criterion been
Table 8, Notice that the first step is to obtain the logarithms of the
y valupq. From that point on, the calculations required are as iadl-
cated in the column headings. The normal equations are the qame as
log = "a + x 9w
.r log =a + x'2
Table i
Percent
Y ,
log ,ilog x xlog . log :, (calc) 1- (calc) Deviatin
- 1.494,
= -0.1072.
or
Si01.494-0.1072x
The graph of this solution is shown in Fig. 34. Because only the left-
value of y may help. The equation that results when the constant is
used is
I loa+bx
or
a+bx +
Y 10
The value of the constant can be found by trial and error, but is more
/* .
0 0
0 'C
-4I
-80-,
A .0
AA
nrU- 0 r- A % I4
0 A0
A+a
A1
0 A '.
%00c A0 1 1 D
tA
I
-81-
are selected such that two lie at the extremities of the curve and the
third lies halfway between. If the first two points have coordinates
coordinates for each of these points are read from the curve aud sub-
3
i"1
2 - '3
To illu.trate, the three points read from the curve in Fig. 35a are
P3 (3 3) = (5.5, 7.2),
and
(29)(3)- (7.2)2
= 29 + 3 - (2)(7.2)'
= 2.0.
and the logs of (z4- ) are determined. The two steps are shown in
Table 9. From that point on, the steps are the sanmc as used in the
a = 1.559,
h = -0.1522,
a = 2.00,
-82-
Table 9
or
y = i01.559-0.1542x + 2.00.
The extent to which the addition of the constant a improved the situa-
the straight semi-log example. The same data were used in both cases.
-83-
= 'lx
solution for fitting the power function. Iterative methods can be used
tuting the log of y for y and the log of x for x, the appropriate
values for the example shown in Table 10 and Lg. 36a may be calculated
log a = 1.79q4,
1,= -0.7694.
in terms of the logs rather than the actual values of y. The line will
be placed ;uch that the relative, not the absolute, deviations have
been minimized.
Table 10
l2 Percent
yog X I log x log x log x log log c Deviation
= 63.01x
-0.7694
80
60
0 7691
40 u =63.01x-
-,L6,1-'79
20
0 _ I
0 20 40 60 80 100
x
100
80 (b) logarithmic scales
60
40
30
0
20
6
5
4
to the equatior
ax
as in
b
ax + c,
Y14 2 - 3
+
YI :j - 2 3
three points failing on the curez. two at the extremities and one in
x3 - 2"
and 37(b). The extent to which the addition of the constant a improved
See Appendix B.
-87-
P2 = (x 2 , Y 2 ) = (50, 3.2),
9 127 - 32
+
1 2 3
(3.2)(46)- (9.5)(9.5)
= 3.2 + 46 - 2(9.5)
A= 1.89.
V log (:4- = .a + l og x,
(y - ) = a log x + (log x) 2
(log x)[ (log
8 78
5.9598 - 10. 5a + 14.3457b,
a2 1.9318,
K = -1.0494.
The equation is
log - 1.9318
t - 1.0494 log x,
or as a power function,
-83.35x 1 0 4 9 4 +1.89.
-89-
C 'C .f (. .~ . .~ .N .~0
u ~ - .c7o. N 0 -7a 0
."
2
I
a, r-- aM7 ' T LJ n M*~ IrN
0 a', r - C 'C) - -7
r''C Le's
E-. -7 10 -
00 ON r- r
S C ; 0'l C D* -cN (N o 0 I
zr '- z D - 1 l
- Cj 3
00
z ' - .7 L(
-4.
X OI
AM A
z -0 (
.f - 0 7 3' - (- -
- A - 0 00 000
-0 '4 1 m c*
100
80 (a) logarithmic scale
60
30 -
20 -
10
8
6
2N
.8
.6
.5 I I I I I I I I I
2 3 4 5 6 8 10 20 30 40 60 80 100
x
60
41)i
ILI .: - 85. 3cr 1.8Q
0 20 40 60 80 t)A
37
Fig. -- Power functiotr with cons-ant i. fitced
using tht method of least gquares
-91-
. = a + b +),b,'+
We have already learned that the constant- term, a, %.as the value of
when was eqal3 to 0I. C"e f ur the!- Iearned' tha t bwas colI led
the, slope oif the straight lint, and that , i;evendinlz wctw b a
'And
write
and
1 1.23 + b13.2' 3
where subs ipts indicate the variable whose effect has be,: n r'Liminated.
ting that a is the value of X1 , once the effects of X 2 and X3 have been
that b12. is the net slope of the relationship between X, and X in-
point in the subscript identify which two variables are being related;
those to the right identify Lhe variables whose effects have been
relationships.
X I and each of the independent variables, and given that the two
-93-
1. = al + b + 2"3
i 1 .23 b12.3'2 b 3.
2 '
3
two separate straight lines described above. Each describes the im-
at the same time, to illustrate one way that this type of relationship
this case, we will begin with the answer and use a curve-fitting tech-
number of prospective prinLers and find that, on the average, for each
a charge of $0.10 per illustration; and $0.04 per printed page. The
linear relationship:
tions and the same number of printed pages. Further, for each com-
tions and printed pages, and determine the printing cost of eaich.
Table 12
1 1 18 1.82
2 2 4 1.36
3 2 10 '.60
4 2 20 2.00
5 3 15 1.90
6 4 13 1.92
7 5 7 1.78
8 5 16 2.14
9 I 6 6 1.84
10 6 2 1.68
11 7 1 1.74
12 .7 7 1.98
-95-
Taking Report No. 3 as an example, we can see that the cost of $1.60
is arrived at as follows:
allowed us to calculate the costs above, we have only the data contained
(which is unlike the usual case) we will assume that we know the price
the number of illustrations (I) on one graph avd the cost (C) against
the number cf printed pages (P) on the other. The first two diagrams
tionship that might exist between cost and the number of illustra-
tions is obviously distorted by the fact that reports with the same
For example, there are three reports each with two illustrations,
but one has four, one has ten, and one twenty printed pages. The
cost and number of printed pages shown in Fig. 38b. Even with all of
in Fig. 38b and to use the results to improve the relationship between
cost and the number of illustrations. For simplicity we will use the
method of averages to fit the straight line and the point-slope formula
to ,,rite the required equation. The details of these and other re-
quired computations are shown in Table 13. When using the method of
averages, the data are first ordered according to the value of the in-
are two independent variables involved and because the data cannot be
ordered according to both of them at the same time, two separate set-
are shown on the lower half of Table 13. Since the sequence requires
stepping back and forth between the upper and the lower half, the steps
are indicated by the numbers shown in circles at the head of each column.
straight line (between cost and number of printed pages) are shown in
the lower half of the table in Column 1. The coordinates of the two
C (a) Cost vs illustrations C (b) Cost vs printed pages
$2.20 . $2.20 $2.
2.00 - * 2.00 - 2.
00
1.80 - 1.80 - 0
1.60 - 1.60 - 1
1.40 - 1.40 C 1.659 + 0.0157 P I
1.20 - 1.20 1
1.00 1.UO
" _____________ I I I 1_! I ! L II, I i p
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
2.00 2.00 -2
1.80 1 80 - a.
1.60 F 1.60 -1
3
1.40 C = 1.212 + 0.0763 r 1.40
C4 1. 156 + 0.0342 P
1 1.20 * 0 1
.0
1.0 1.
_ _I _ i I I I I I I I I P
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
2.00 - 2.00 - 2.
2.
1.80 1.80 1.
1.
1.60 1.60 - 1.
1.
1.40 -1.40 .1
L I I L- -_ t ... I I I i-A...L......L..'
I I P
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
.00- 2.00k
.80- 1.80 -
.60 - 1.60.
.00 1.00
I I I I I I I I I P
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
.00 2.00 -
.80 - 1.80
.60 - 1.60
.40 - 1.40
120 -1.20 Z0 - 1.0C:6 1.074 + 0.0369 P
.00 1.00 -
LI ! I I I I I r" , I I i I I P
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
2.00 2.00 -
1.60 1.80 -
1.60 - 1.60
1.00 1.00
-A1 I A 1 I I
L P I 1.
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 12 14 16 18 20
Expression: C =
Data
Report
()(b) (c)
No. of Illue- No. of Printed
(P )
(dl
Coat per
(D 1 G 0
No. t reiona (I PR.e Report ( (c) i (c) (C) (C )
38
(Figo 38a) (Fig. c) (Fil. 38e) (Fig. 38g)
I1'-00545r
3 5 1"212+0.0763r C -7-0.07631 C5-1.100+0.0899
.,.3~0.5 2 4 5
.(Fi.
38b) (Fig. 38d) (Fig. 38f)
Ii 7 1 1.74 1.72 1.36 1.71 1.21 1.71
9 6 2 1.68 1.65 1.35 1.62 1.22 1.61
2 4 1.36 1.30 1.25 1.25 1.21 I22
10 6 6 1.84 1.75 1.51 1.67 1.38 1.63
A
-98-
Table 13
Calculations
0 0 0 7) 0 0 0
L() (C6) I (C 1- (,-,) 0)
( (CO0 L )1 W 12) (13)
08" 6C00+0.
.C_0.0899. (7_1o+c.o26r -- 0.09261 .1.03540.095 10-0.09541 ."-1.21 0.0981" . -".0981: .0114.0.091
10 0 -C
(J C ) (5 3)
c----I.0?.0.0!•0.042F
1 -0-0.069P 09 -0,0379F . . 1540.o .Y, 4.* 348 P 1. 0.0391 0
-99-
points (PP,CI) and (P2 1 Q2) are (4.50, 1.73) and (15.33, 1.90) re-
1 - :1*"
1-2 2 1 2 _ :
2 1 + ( 2 -'1
or
= a .
where
12 21
'2- ' 1
1. 3) 1 3) (1 1
are entered n th' table, they ;h,'.ld lt" done at that t ijrn and in-
=1.659 + "~.0i.5
9lis equat ion are estimates rath-er tiian actual-; When this equat ion
The value I or 0.0157 means that for each printed page we must add
1 .57 cents to the cost . We can reduce the cost of each case by this
(1.702),
costs.
that indicated in Fig. 38a. What has happened is this: Although the
The next step is to follow our logic and determine the relation-
ship between and I using the results to further clean up the re-
upper portion of Table 13. This fitted line can be seen plotted in
.1 =1.433 + 0.05451.
Table 13, this time to eliminate the effect of the number of illus-
2 - - 0.0545:,
-132-
9
where indicates that the cost has been adjusted for the second time.
of Table 13, and the results plotted against the number of printed
pages as in Fig. 38d. A co;pariso. of Fig. 38d with Fig. 38b shows
tions has improved the relatlinship between total cost and number of
tinued first with respect to one of the indepeudent variables and then
used to further adjust the cost; the adjusted cost is then related to
the other independent variable and the process repeated again. The
in Fig. 38k which was arrived at on the 12th adjustment can be de-
C 12 = 1.0113 + 0.03977'.
= 1.00 + 0.04P.
in Fig. 38m is also quite close to the relevant part of the original
equation:
-10 3-
3 1.0144 + 0.09811,
as compared to
= 1.00 + O.iO.
exactly. But this would have meant carrying the calculations to more
both methods result in the same solution, the method of least squares
is next applied to the same problem. Data are calculated in Table 14,
and the accompanying graphs plotted in Fig. 39. Normal equations for
T=a + 7p
2+ LT
S,' a +b 1 +2 b P2.
I ~~
~ 21.76 12a + 50b, 4 ~
119b 2Lhswrsott
C P = 24 22 20 18 16 14 12 10 8 6
$2.20
2.00 P 2
1.80
1.60
1.40
1.00 -
0 1 2 3 4 5 6 7 8 9 10
-.- 0 0 LtD 00 - N t 00 Z co z G
.. jI
-. ~~ ~0, ~
Cc- N ~ L
-o- ci' 0.0
'4
00
o C- . - N'
E-4I -
H ,.-fn
randum we will use the method of least squares to fit the straight
line, the exponential, the power function, and the parabola to a set
least squares has already been described. We remember that the linear
equation
X1 a + b3X 2 + b3X
3
X = a + b2X 2 9
and
X1 a + b3X 9
3
written b12.3 and b13.2 to show that in the first case the effect of
X 3 was eliminated, and that in the second case the effect of X 2 was
demonstrate how this could bL done. Further, it was shown that the
less eff,'rt.
-107-
good beginning.
served. But when each point is identified with its X3 value and con-
tour lines connecting all points with equal values of X3 are drawn,
Tab l 15
23.0 av
the data. The least-squares normal equations and the resulting linear
relationship follow:
+
L I Na + b2 X2 b
'Yx a X 2 + b2 X+ h x x
x1x a X +b x +b X2
XX a L 3 2L 2 3 3 '3
17
170
150
160
130
140
130
100
900
180
10
90
80
400
30
20
10
00
40 10
310 2 040 5 60 7 80 9 10
Fg 0 -- c t e i g - X v ,
-110-
170 -100
0
160 - de 9J
100 93
150 - d
140 -
130 - .--
100 .00 0
120 - 93 * -71
110 -
100 -
90 - ----
7171. -- 01
60 e
50 Op 27
2-
40 -490.-10 - ~0
.0 38 001--2 -o - .
7 27- - -
7
20 7
20 e
10- 1. .
01 1 1 1 1 1 1 . I I J ,
0 10 20 30 40 50 60 70 80 90 100
170 93
0
160 100/
4/ 71
150II
140 / /
130
1 /I/I I.5 38•
120 1 38.
110 , 11
100 /, / /
,/ /
90 I,/
/
/
/ /
/
I' / /
80
I0
#
I
/ / i
/
1/ ,/ /
70 // / // /0
60 - 93
/ / /
/ / /
/ /
82! e / /
40 609 7
27 -b 5
30 18 - -
20e
Joo t
0 16
;- 0 i ti I I I I I I
0 10 20 30 40 50 6(0 70 80 90 100 x
vary over its relevant range while holding ; constant at the values
indicated in Fig. 43. The deviations of the points from th:e appro-
very well. A tabular presentation of the results was shown i: Table 15.
poorness of f;t achieved with the linear form, a nonlinear form seems
straighten anything out. 'We will try this and see what happens.
= 2
or
an'l
var iab le quat ion is anai ogoIs to the equa t iotts .resentvi, in the be-
5
eC pp. 'Jj4 .
-113-
17o
X 0
3 7
1250
1440
10 ZK
I71 i- '
r e20tin
-114-
Either
or
equations used before are appropriate here, given that the logarithms
When the requir-.d values are calculated and this set of equations
=
log X I 0.16555 + 0.26963 log X 2 + 0.73198 log X3 ,
or
X0 1.464026963 0.73198
, 1 .4 4 2 3
'
How well this equation does the job is shown in Fig. 44 and Table
bly even a little worse. The most striking shortcoming is that the
required curve should be concave upwards, and these curves are concave
-115-
Table 16
Observiontionen
A, Y' X (c)alc.
X (cl Percent
observation
1 2 -3 1Xi ja1 Deviation
1 7.31 5 5 7.34 -0.03 -0.4
2 37.67 5 49 39.01 -1.34 -3.6
3 67.37 5 71 51.18 16.19 24.0
4 121.31 5 00 65.76 55.55 45.8
5 20.93 16 27 34.51 -13.58 -64.9
6 24.77 27 27 39.74 -i4.97 -60.4
7 33.57 27 38 51.03 -17.4 -52.0
8 22.78 38 16 29.71 -6.93 30.4
9 29.16 38 27 43.58 -14.42 -L9.5
10 11.8.26 38 93 107,75 10.51 8.9
24.3 av
place. Figure 18 (the general shape of the power function for values
one except that tie fit Is better. In such a case we might well have
the sample.
170
160
150 X 3 = 100
140 - X3 - 93
130 -
120 -
X3 = 71
110
100
90 X3 = 49
80
X 3 = 38
70
60x 3 = 27
50
40 /X 3 = 16
303
20 -
5-
10
!0 - 2
0 10 20 30 40 50 60 70 80 90 100
erty we desire; refer back to Fig. 14. The two variable exponentials
would be
Y
"2
1 = 2
and
x
X1 = a 3
X+b2X
2
and
a+b X.
X =e
taken, we have
in a + b2X
222
and
in X a + b3X 3 ,
In X I . a + b2X2 + b3X3P
in X1 = Na + b2 Y + b3 X3
X In X . a X + b X 2 +b XX
L2 1 2 2 L 2 +b 3 L 2 3'
ZX 3 In X 1 = a [X 3 +b 2 [X 2 X 3 +b 3 L X .
or
and Fig. 45. We note from observing the scatter diagrams and the
closer to fitting t!,e data than does either the linear or the power
to be explained.
2
y - a + bx + cx
and for our purposes we need one in three. Fortunately, we may proceed
-11 9-
Table 17
18.1 av
I = + 2' V + Il,).
and
1 " + 1 33
' 3'3 '
X1 + b2 2 '22 +b3' 3 + 3 3,
X3= 1W')
X 93
3
150
140
160
120
110
100
90
80 x3 = 49
70
X 3-38
60
503
40
10
0 10 20 30 40 50 60 70 80 90 100 X2
X1 = Na + b X + 2 + X + '
.. .. 2 3+ 3 3 3
V A V "2 + 2 X2 + 2 + b A +
+ +
1IX ,2 +2 2 +c 2 2 3 X23
2 3 L a23
2 2 .3
XX 3 = a 1 2X32 + b22 7x 22 X 3 + 2C L
2 XX
2 3 + b 37
32 3X + 3 _ v X3
2'3 2 3 3 + 3 X3'
Tx2 ~ 2 2 22 3 _
X1 3 3 L 2 3 2 2 X3 + b 3 , X3 3 3'
be useO. With a computer, the task becomes a 6iLple one, and the
2
I 5.006 + 0.2498 X + 0.002301 ' + 0.1499 X + 0.01000 X2
2 2 '3
Table 18 and Fig. 46 indicate that we have iideed found the cor-
rect empirical equation. However, even with fits as good as this one,
tions beyond the range of the data should be made with extreme 'a.,tion.
parabolas.)
PI
-122-
Table 18
!1 7.31 5 7.31 . ..
2 37.67 5 49 37.67 -
3 67.37 5 71 6.37
4 121.31 5 100 121.31 ......
5 20.93 16 27 20.93 ......
6 24.77 27 27 24.77
7 33. 57 27 38 33.57
8 22.78 38 16 22.78
9 1 29.16 38 27 29.16.
10 118.26 38 93 118.26 .....
11 39.62 60 27 39.62 ---
12 45.68 71 27 45.68
13 149.34 71 100 149.34 ......
14 41.97 82 5 1.97 ......
15 59.48 93 27 59.48 ......
16 148.58 93 93 148.58
17 163.14 93 1001 163.14 ......
18 73.14 100 38 73.14 ......
19 114.06 100 71| 114.06 ......
20 153.44 100 93 153.44
For exampie,
X'1 + ?
33 3 3
to form
16
X3 = 9
150
130
1260
100
130
38
110
10
90
10
0 10 20 10 40 50 60 70 80 90 100
with X )an equation like the one above might then be an appropriate
choice.
-125-
.kppendix A
A Straight Line
.-= + .-'
where L and ; are the parameters to be determined such that the sum
mum. The carets are placed over those values that are to be estimates.
with the subscript assigned according to the data point we are using,
1^
(A + SX.)
2. = Lyi - (a + 8X')]
22 = y 22 - 2&y. - 26x.. + a
^2
+ 2ix. + 2
x.
1 71b
Ut
the s,:m of the squared deviations. The expression that follows repre-
Q v!.
, . . . ~= . -5 x + + 2 ,:3 x.
X' + x.
i:: i -l >'1 ':41:=
tain these two partial derivatives and to equate them to 0. The par-
2
= -2 c + na + 2B x
3 Ed n nn x
_ ==-2 X + 2a x + 2S \ .
=I i=1 i=I
n2 n
j=l yi fa + i=1X
n n 2
which are the required normal equations. All of the information in-
from the data; this will result in two equations in two unknowns (
A Parahola
2
+ Ix +
2 - (; + i .+ Y 2.),--
2.
2 2 - 2 -2 ..
.= ,.- - " + - + 2 x. + 21x,.
, 2-,7. 2" 2. 2' 2 2.
+ 2 2 4
+ x . + 2x., + ' x..
-
2 <1 )j/j?: - 2J ," 2 . ' : 2{ ax.2-.?4 n- 2 + 2A X.
2. 2 2 3 2 4
+
2
xy.+ y x "+" ,l..X+2
Jd2 PI N N2
7 . 2 iil,
..... ij.! + 2
C1+ 2. F
?,z x. + 2 )
=l ,
-128-
2
n n n n
- -2 Y x.n. + 2Y x. + 2 x2 + ' i
i=Y i==
22 3 2 4
n 2 n n n
yx = -2; XY + + x. LX
+ Vxi.
.
Yi + i + =
These are the normal equations for fitting a parabola using the least-
squares criterion. The sums and sums of products are calculated di-
rectly from the data and substituted into t-he normal equeaiOin leaving
three equations and three unknowns. These three equations are solved
simultaneously for &, B and , As for the straight line, the solutions
•
= (;+ BlX 1 + 2
normal equations for the straight line and for the parabola. The
d2 + [- + 'Yx 1 + 2
-129-
Tho above expression is expanded and summed across all the data points,
r, i' and 6 are taken and equated to 0. The resulting normal equa-
tions are
Y = 6 + 1 '2 X2 2
-130-
Appendix B
CA
BC'
EB
DE'
CA EB
BC =DE'(1
the lengths of the above line segments and the results are substituted
in Eq. 1, we have
Since we are free to selecL the three points (xI, y1), (x2, Y2 and
(Xv ',3) in any way we wish, we do so in such a way that the denominators
-131-
-J31
g 2
I I x
or
log x I + log x 2
logx 3 2 (4)
As can be seen, log x 3 is the average of, or half way between, log x]
and log x2 .
x3 XLX2
1
?3 '= - ) ;
- ')(7
(91 .,
S2 2
2 :2 - 2 C, + 12
:14 2 - '12 - I +
2
-133-
2
a = y-YlY2 -2-- Y3 3. (5)
Yj+ y-) -2
have been
x . x- ,
xI + x2
3 2
BIBLIOGRAF
.Nldli.J'lss
ec~t,),Mc(rawHiil
R.R., 'a) Ne Yok, q55
DOCUMENT CONTROL DATA________ __________
3 REPORT TITLE
SOM4E CURVE-FITTTNG FUNDAMENTALS
Petruschell, R. L.
5. REPORT DATE
December 1968 1147
6a. TOTAL No. OF PAGES
I-
ftbNo. OF REFS.