Chapter 3 Numerical Technique
Chapter 3 Numerical Technique
Chapter 3 Numerical Technique
Descriptive
Statistics:
Numerical Methods
1
4.1 Measures of Central Location
The central data point reflects the locations of all
the actual data points.
How?
With two data points,
the central location
With one data point should fall in the middle
clearly the central between them (in order
location is at the point to reflect the location of
itself. both of them).
2
4.1 Measures of Central Location
The central data point reflects the locations of all
the actual data points.
How?
But if the third data point
appears
If the third dataon the appears
point left hand-side
in the center
of the midrange,
the measure of central itlocation
should will
“pull”
remain
in the the central
center, but…location
(click)to the left.
3
4.1 Measures of Central Location
4
The Arithmetic Mean (average)
5
The Arithmetic Mean
6
The Arithmetic Mean
Example 1
Find the mean rate of return for a portfolio equally invested in five
stocks having the following annual rate of returns: 11.2%, 8.07%,
5.55%, 13.7%, 21%.
Solution
11.2 + 8.07 + 5.55 + 13.7 + 21
x= = 9.764%
5
7
2 workers make the same product
Worker 1 spends 2m making 1 product
Worker 2 spends 6m making 1 product
Find the average time making 1 product of 2
workers:
a, They work in 6 hrs
b, Each worker works in half of the working day.
c, Each of the worker makes half of the total
products.
8
Find the average price, quantity, exchange rate of the
export product of company X
Formula:
- Step 1: Express the rate of change (R) as (1+R)
Growth
10 25 15 10 10
rate (%)
Growth
10 10 15 25 15
rate (%)
Example
Average growth rate
Rg = 5+3+2
1.10 ´1.15 ´1.25 -1= 0.14 ~14%
5 3 2
Example
◼The number of employees in a small bank over the
period 2000-2006 is presented in the table below:
Example 2
The salaries of seven employees Suppose an additional salary of $31,000
were recorded (in 1000s): 28, 60, 26, is added to the group of salaries recorded
32, 30, 26, 29. before. Find the median salary.
Find the median salary.
Odd number of observations Even number of observations
26,26,28,29,30,32,60 26,26,28,29,29.5,
30,32,60,31
There are seven salaries (K = 7). There are eight salaries (K = 8).
The (k+1)/2th salary of the ordered The two salaries in the middle are 29 (in
array is the number at the the (k/2)th =4th location), and 30 (in the
(7+1)/2th = 4th location. [(k/2)+1]th=5th location. 20
The median is 29. The median is the average number – 29.5.
The Mode
The Mode of a set of measurements is the value that
occurs most frequently.
A Set of data may have one mode (or modal class), or
two or more modes.
For large data sets
The modal class the modal class is
much more relevant
than a single-value
mode.
21
The Mode
Example 3
▪ The manager of a men’s clothing store observes the waist
size (in inches) of trousers sold last week: 31, 34, 36, 33, 28,
34, 30, 34, 32, 40.
▪ The mode of this data set is 34 in.
Mode Mean 23
Median
Relationship among Mean, Median, and Mode
If a distribution is symmetrical, the mean, median
and mode coincide
26
4.2 Measures of Variability
Measures of central location fail to tell the whole
story about the distribution.
A question of interest still remains unanswered:
27
Think of a sample portfolio composed of three stocks.
200 shares
100 shares ARR = 15% 100 shares
ARR = 10% ARR = 20%
29
The Range
The range of a set of measurements is the difference
between the largest and smallest measurements.
Its major
But,advantage is measurements
how do all the the ease withspread
whichout?
it can be
computed.
? ? ?
The range cannot assistRange
in answering this question
Its major shortcoming
Smallest
is its failure to provide
Largest
information measurement
on the dispersion ofmeasurement
the values between
the two end points.
30
Deviation measure dispersion?
Consider two small populations:
9-10= -1
A measure of dispersion
Can the sum of deviations from the mean 11-10= +1
be ashould agree with
good measure this
of dispersion? 8-10= -2
observation.
A 12-10= +2
8 9 10 11 12 Sum = 0
…but
Themeasurements
mean of both in B
4-10 = - 6
arepopulations
more dispersed
is 10...
then those in A. 16-10 = +6
B 7-10 = -3
4 7 10 13 16 13-10 = +3
31
Sum = 0
The Variance
This measure reflects the dispersion of all the
measurement values.
The variance of a population of N measurements
x1, x2,…,xN having a mean is defined as
N (
i =1 i
x − ) 2
2 =
N
The variance of a sample of n measurements
x1, x2, …,xn having a mean x is defined as
ni =1( x i − x )2
s2 =
n −1 32
The Variance
33
The Variance
Let us calculate the variance of the two populations
2 2 2 2 2
( 8 − 10 ) + ( 9 − 10 ) + (10 − 10 ) + (11 − 10 ) + (12 − 10 )
2A = =2
5
2 2 2 2 2
( 4 − 10 ) + ( 7 − 10 ) + (10 − 10 ) + (13 − 10 ) + (16 − 10 )
B2 = = 18
5
34
The Variance
Example 6
Find the variance of the following set of numbers,
representing annual rates of returns for a group of mutual
funds. Assume the set is (i) a sample, (ii) a population: -2, 4,
5, 6.9, 10
Solution
i6=1 x i − 2 + 4 + 5 + 6.9 + 10 23.9
x= = = = 4.78
5 5 5
n 2
2 ( x − x ) 1
s = i =1 i
= ( −2 − 4.78) 2 + ( 4 − 4.78) 2 + ... + (10 − 4.78) 2
n −1 5 −1
= 19.59 percent 2 Assuming a sample 35
The Variance
Example 6 - solution continued
n 2
2 ( x − x ) 1
= i =1 i
= ( −2 − 4.78) 2 + ( 4 − 4.78) 2 + ... + (10 − 4.78) 2
n 5
= 15.6736 percent 2 Assuming a population
36
Variance…
4.37
Sample: 17, 15, 23, 7, 9, 13
Sample Mean
Sample Variance
4.38
Standard Deviation
2
Sample standard deviation : s = s
2
Population standard deviation : =
39
Standard Deviation
The daily percentage of defective items in two weeks of
production (10 working days) were calculated for two
production lines?
Which line provides good items more consistently?
Line 1: 8.3, 6.2, 20.9, 2.7, 33.6, 42.9, 24.4, 5.2, 3.1, 30.05
Line 2: 12.1, 2.8, 6.4, 12.2, 27.8, 25.3, 18.2, 10.7, 1.3, 11.4
40
The Empirical Rule for a Bell Shaped Data Set …
Approximately 68% of all observations fall
within one standard deviation of the mean.
10% 90%
lie here
Lower decile
45
A demostration of Commonly used
percentiles - optional
Commonly used percentiles:
First (lower) decile = 10th percentile
First (lower) quartile, Q1, = 25th percentile
Median, = 50th percentile
Third quartile, Q3, = 75th percentile
Ninth (upper) decile = 90th percentile
P
L P = (n + 1)
100
where L P is the location of the P th percentile
48
Determining Percentiles and their
Location
Example: Compute 25th percentile of the following data
set: 2.7, 3.1, 5.2, 6.2, 8.3, 20.9, 24.4, 30.05, 33.6, 42.9
Finding the location of the 25th percentile:
P 25
LP = (n + 1) = (10 + 1) = 2.75
100 100
▪ Finding the value of the 25th percentile.
The 25th percentile is located at location 2.75,
that is, at .75 the distance from 3.1 to 5.2.
5.2
Therefore,
3.1
P25 = 3.1 + .75(5.2 – 3.1) = 4.675
492.75
2 3
Quartiles and Variability
Q1 Q2 Q3 Q1 Q2 Q3
Positively skewed Negatively skewed
histogram histogram
50
Inter-quartile Range
Interquartile range = Q3 – Q1
51
Box Plot
A box plot is a pictorial display that provides the
main descriptive measures of the measurement set:
• L - the largest measurement
• Q3 - The upper quartile An outlier is defined as any value
• Q2 - The median that is more than 1.5(Q3 – Q1)
away from the box.
• Q1 - The lower quartile
• S - The smallest measurement
1.5(Q3 – Q1) 1.5(Q3 – Q1)
Whisker Whisker
S Q1 Q2 Q3 L
52
1. Determine the first, second, and third quartiles of
the following data.
14.7 17.7 15.9 12.2 10.0 14.7 10.5
14.1 13.9 15.3 18.5 13.9 15.1
53
A set of data whose histogram is bell shaped has a
standard deviation and mean of 4 and 50, respectively.
Approximately what proportion of observations
a. are less than 46?
b. are less than 58?
c. are greater than 54?
54
Standard deviation and mean of a data set were
30 and 120, respectively. What can you say about
the proportions of observations that lie between:
a. 150 and 90
b. 60 and 180
c. 30 and 210
55
Chose: 500,000 residents pay 350$
No of
Median
Place residents Mean ($) Mode ($) STD
($)
(mill)
A 2.50 450 75 87 75
B 1.75 385 97 109 52
C 0.95 367 358 360 18
D 0.98 365 310 340 20
E 1.35 353 348 352 10
56