Statistics and Probability Notes
Statistics and Probability Notes
Measures Of Dispersion ̅=
MEAN ABSOLUTE DEVIATION (MAD) OR AVERAGE = 4,740
DEVIATION (AD) = 592.50
It is the average distance between each data value x x- ̅ |x- ̅
and the mean. 550 - 42.70 42.70
420 -172.5 172.5
A. Average Deviation for Ungrouped Data 560 -32.5 32.5
∑ ̅ 500 -92.5 92.5
AD= 700 107.5 107.5
Where: x- the value of any particular observations or 670 77.5 77.5
measurement. 860 267.5 267.5
x- sample mean or mean of data 480 112.5 112.5
N- the total number of frequency in a data
∑ ∑ ̅ )=0 ∑ ̅ =905
f- the frequency
Ex.1) The daily rates of a sample of eight employees at GNS Inc.
∑
are ₱550, ₱420, ₱560, ₱500, AD= = = 113.125 = 113.13
₱700, ₱670, ₱860, and ₱480. Find the average deviation. Hence, the average deviation of the data set is ₱113.13
Sol:
STEP 1: Compute the mean of the data set. B. Average Deviation for Grouped Data
STEP 2: Subtract the mean from each of the value in the ∑ ̅
data set. AD=
STEP 3:Get the absolute value of ̅, then get the sum. Where: x- the value of any particular observations or
STEP 4: Solve for the average deviation. measurement.
̅- sample mean or mean of data 850-999 9 924.5 8,320.50
N- the total number of frequency in a data 1,000-1,149 15 1,074 16,117.50
f- the frequency 1,150-1,299 9 1,224 11,020.50
1,300-1,449 5 1,374 6,872.50
Ex.1) The data below shows the frequency distribution of ∑
the amounts of electric consumption of a typical N=40
household in Batangas City for the month of January
2009. Find the average deviation.
̅= ∑
Amount STEP 2: Subtract the mean of the frequency
of 700- 850- 1,000- 1,150- 1,300- distribution.
Electric 849 999 1,149 1,299 1,449 STEP 3: Get the absolute values of ̅.
Bill STEP 4: Obtain the product of ̅ and f, and
Number then add.
of 2 9 15 9 5 STEP 5: Solve for the average deviation.
Families
Solution: Class
f x x- ̅ | x- ̅| f| x- ̅|
STEP 1: Compute the mean of the frequency Interval
distribution 700-849 2 774.5 -322.5 322.5 645.0
Class Class Class 850-999 9 924.5 -172.5 172.5 1,552.5
fX
Interval Frequency(f) Mark(x) 1,000- 15 1,074 -22.5 22.5 337.5
700-849 2 774.5 1,549.00 1,149 9 1,224 127.5 127.5 1,147.5
1,150- 5 1,374 277.5 277.5 1,387.5 Alternative Sol:
1,299 ∑ ∑
∑ ∑
1,300- = ;s=√
1,449
Where: sample variance
∑
s = sample standard deviation
N=40 ̅ X= value of any particular observations or measurement
∑ sum of all Xs
∑ sum of all the square of Xs
∑ ̅
AD= = = 126.75 ̅= sample mean
Hence, the average deviation of the data set is ₱126.75 n= sample population
Ex.1) The daily rates of a sample of eight employees at GMS
Inc. ₱550, ₱420, ₱560, ₱500,
Sample Variance And Sample ₱700, ₱670, ₱860, and ₱480. Find the variance and
standard deviation.
Standard Deviation For Ungrouped Sol:
Data STEP 1: Compute the mean of the data set
STEP 2: Subtract the mean from each of the value in the data
∑ ̅ ∑ ̅ set
√ STEP 3: Square the ̅, then get the sum
STEP 4: Solve for variance and the standard deviation Alternative Sol:
̅= STEP 1: Get the sum of the data set
= 4,740 STEP 2: Square the values in the data set and get the sum
STEP 3: Obtain the values of the variance and standard
= 592.50
deviation
x x- ̅ ̅ X
550 - 42.70 1,806.25 550 302,500
420 -172.5 29,756.25 420 176,400
560 -32.5 1,056.25 560 313,600
500 -92.5 8,556.25 500 250,000
700 107.5 11,556.25 700 490,000
670 77.5 6,006.25 670 448,000
860 267.5 71,556.25 860 739,900
480 112.5 12,656.25 480 230,400
∑ =4,740 ∑ ̅ =0 ∑ ̅ =142,950 ∑ =4,740 ∑ =2,951,400
∑ ̅ ∑
= ∑
= = 20,421.43
∑ ̅
√ =√ √ √ ∑
∑
s=√ =√
√ =√ =142.90 Three Types of Kurtosis
1. Leptokurtic-are distribution where values dustered
Variance – is a mathematical exception of the average heavily in the center, there are tall distribution with
squared deviation from the mean narrow humps and long and high tails. It kurtosis is
Standard Deviation- is calculated as the square root of variance positive, (kurtosis > 0) and it denotes a high degree of
peakedness.
Kurtosis – from the greek word “Krytos” or “Kurtos”, 2. Mesokurtic-are intermediate which are neither too
meaning bulging. peaked nor too flat. The values are immediately
In statistics it is a statistical measure used to describe the distributed about the center. Its kurtosis is zero,
distribution of observed data around the mean. (kurtosis=0).
Formula: 3. Platykurtic-are flat distribution which values more
̅
Kurt={* + ⌈∑ ( ) ⌉}- evenly distributed about the center with broad humps
and shot tails. It kurtosis is negative, (kurtosis < 0) and it
Where:
denotes a low degree of peakedness.
Kurt=kurtosis
Types of Kurtosis
n=sample size
Ex.1) the daily of a sample of eight employees at GMS Inc. are
x=value of particular observations or measurement
₱550, ₱420, ₱560, ₱500, ₱700, ₱670, ₱860 and ₱480.
s=standard deviation
Determine the kurtosis and describe the type.
Solve:
Step1. Compute the mean of the data set.
∑ 670 77.5 6,006.25 0.5423
̅
860 267.5 71,556.25 1.8719
480 -112.5 12.656.25 -0.7872
∑
∑ ̅ =0
= ̅ =142,950
= ₱592.50
Step2. Subtract the mean from each of the value in the Step7. Obtain the value of Kurtosis.
data set. Kurt=,* +⌈ ⌉-- =0.3724
Step3. Square the x- ̅, then get the sum.
Step4. Solve for standard deviation TYPE: Leptokurtic
∑ ̅
S=√ =√ =√ Skewness
̅
Step5. Obtain the values of Coefficient of Skewness-measures the general shape of
̅ the distribution or the lack of a symmetry of a distribution. It
Step6. Compute for the value of ( ) , then get the sum. ranges from -3 to +3 and it relates the difference between the
mean and the median to the standard deviation. The direction
̅ of the long tail of the distribution points the direction of the
x x- ̅ ̅
skewness.
550 -42.5 1,806.25 -0.2974
420 -172.5 29,756.25 -1.2071
560 -32.5 1,056.25 -0.2274
500 -92.5 8,556.25 -0.6473
700 107.5 11,556.25 0.7523
A. TYPES OF DITRIBUTION FIGURE: Type of Distribution
1. Symmetrical Distribution-the mean, median
and mode are similar and are at the center of the B. Pearson’s Coefficient of Skewness
distribution. The coefficient of skewness is positive when the
2. Positively Skewed Distribution (Right- median is less than the mean and the tail of the
distribution is skewed to the right. When the median is
Skewed Distribution) - When most of the more than the mean, the coefficient of skewnessis
values in the data fall to the left of the mean and group negative and the tail of the distribution is skewed in the
at the lower end of the distribution; the tail is to the left direction.
right. The mean is to the left of the median, and the Formula:
̅ ̃
mode is to the right of the median. Sk=
3. Negatively Skewed Distribution (Left- where: Sk=coefficient of the skewness
̅ =sample mean
Skewed Distribution) - When the mass of the
S=samplestandard deviation
data values fall to the right of the mean and the group
at the upper end of the distribution with the tail to the Ex.1) A motorcycle dealership pays its dalesperson a
left. The mean is to the right of the meadian, and the salary plus a commission on sales. The mean monthly
mode is to the left of the median. commission is ₱8,800, the median is ₱9,000, and the
standard deviation is ₱1,200. Determine the coefficient of
skewness. Comment on the shape of distribution.
Sol: ̅=₱8,800; ̃=₱9,000; s=₱1,200
̅ ̃ Step3. Divide x- ̅ by s from each of the value in the data
Sk= = = = -0.50
set.
Comment: there is a slight negative skewness in the
distribution of commission on sales. Step4. Cube the value of and get the sum.
x x- ̅ . /
C. Software Coefficient of Skewness
Formula: 550 -42.5 -0.30 -0.03
420 -172.5 -1.21 -1.77
Sk= [∑ ( ) ] 560 -32.5 -0.23 -0.01
Where: sk=coefficient of skewness 500 -92.5 -0.65 -0.27
n=sample population 700 107.5 0.75 0.42
x=value of any particular observations or 670 77.5 0.54 0.16
measurement 860 267.5 1.87 6.54
=sample mean 480 -112.5 -0.79 -0.49
S=sample standard deviation ∑ =4,740 ∑ ̅ =0 ( ) = 4.55
Ex.1) the daily rates of a sample of eight employees at GMS Inc.
are ₱550, ₱420, ₱560, ₱500, ₱700, ₱670, ₱860 and ₱480.
Determine the coefficient of skewness. Step5. Solve for the coefficient of skewness.
Solution: Sk= [∑ ( ) ]
Step1: Determine the value of the mean and the standard
deviation. = (4.55)=o.19047619(4.55)=0.87
=592.90; s=142.90
Step2. Subtract the mean from each of the value in the
data set.
TYPE: Positively Skewed Distribution and 1% of the data lies outside the 3 standard deviation
above and below the mean.