Probability Sam
Probability Sam
HH
ALL POSSIBLE HT THERE ARE 36
OUTCOMES TH POSSIBLE OUTCOMES
TT
SCORES : A B C F
CLASSES
X 10 15 8 2
Y 13 16 7 4
Z 8 11 6 5
CALCULATE THE PROBABILITY:
1. P ( A or B or F ) 2. P ( X or C ) 3. P ( Y or Z )
4. P ( Z or A ) 5. P ( C/X ) 6. P ( Z/F )
Example
In Lab: A, there are 10 units Acer computer, 5 units Compact
and 15 units Toshiba. In lab: B, there are 15 units Acer
computers, 10 unit Compact and 10 units Toshiba. In lab: C,
there are 10 units Acer, 10 units Compact and 10 units
Toshiba.
A. If each lab is selected one computer randomly, what is the
probability that all of the computers are Toshiba.
B. If one computer is selected at random from one lab, what is
the probability that the computer is Acer or Compact.
BAYES’S RULE FOR CONDITIONAL PROBABILITY
P ( Ai ) P ( B/Ai )
P ( Ai/B ) = ----------------------------------------------------------
P ( A1) P ( B/A1 ) + -------+ P ( An ) P ( B/An )
EXAMPLE 1:
A car repair firm employs three paint sprayers, Tomy, Dicky and
Harry. Tomy is responsible for painting 25 % of all the cars
produced, Dicky for 35 % and Harry for the remaining 40 %. On
the basis of frequent quality inspections it is discovered that, on
average, 5 % of the car sprayed by Tomy fall below the minimum
standard, while for Dicky is 8 % and for Harry 10 %. If a car is
selected at random is judged to be sub-standard. What is the
probability that it was sprayed by Harry?
b. If a car selected at random is judged to be standard, What is the
probability that it was sprayed by Tomy or Dicky?
Example:2
n x n–x
C π q
x
Example:
The four engines of a commercial aircraft are design so that they each operate
independently. Test, carried out over a long period of time, show that there is a one-in-a
hundred chance of in-flight failure of a single engine.
What is the probability that on a given flight:
a. No failures occur?
b. No more than two failures occur?
c. At least two failures occur.
d.. At most 2 of the engines run well.
E. Less than three of the engines run well.
f. Two of the engines run well.
BINOMIAL DISTRIBUTION
PROBABILITY OF A STUDENT WILL PASS IN THE
FINAL STATISTICS EXAM = 0.80. IF 6 STUDENTS
ARE SELECTED AT RANDOM, WHAT IS THE
PROBABILITY OF OBTAINING:
2. Zi ~ N ( 0, 1 )
Xi - µ
Zi = ---------------
σ
Zi MEASURES THE DISTANCE OF ANY
PARTICULAR VALUE OF X FROM THE µ,
MEASURED IN UNITS OF THE STANDARD
DEVIATION.
THE SIGN OF Z MIGHT BE POSITIVE, NEGATIVE
OR ZERO. IT DEPENDS ON THE DIFFERENCE
BETWEEN Xi VALUE AND ITS MEAN ( µ ).
FINDING AREAS UNDER THE NORMAL CURVE
4. PROBABILITY ( 5 QUESTIONS ):
1. INDEPENDENT TO CONDITIONAL PROB
2. BAYES THEOREM
3. BINOMIAL DISTRIBUTION
4. NORMAL DISTRIBUTION
5. BINOMIAL TO NORMAL
MID EXAM QUESTIONS
BINOMIAL DISTRIBUTION ( 2 QUEST)
NORMAL DISTRIBUTION ( 2 QUEST )
Non-Random sampling
Convenience sampling
Judgment sampling
Snowball sampling
HOW CAN SAMPLES BE USED TO MAKE ESTIMATES?
A SAMPLE IS A TOOL TO INFER SOMETHING ABOUT
POPULATION OR STATISTICS CAN BE USED TO FIND
SOMETHING ABOUT A CHARACTERISTIC OF
POPULATION OR A PARAMETER.
X - tv ( sd/√n) ≤ u ≤ X + tv ( sd/√n )
Level of Confidence
Required level value of α value of Z table
of confidence
90 % 10 % ± 1.65
95 % 5% ± 1.96
99 % 1% ± 2.58
Example : The prices at which certain type of instant coffee was
being sold on a given day were collected from a random
sample of 45 shops around the country. The mean price was $
1.95 with a standard deviation of $ 0.27. Compute a 80 %
confidence interval for the population mean.
3. A CONFIDENCE INTERVAL FOR A
PROPORTION OF POPULATION ( π )
A PROPORTION OF POPULATION ( π ) CAN BE
ESTIMATED BY THE FORMULA:
P = X/n
P = THE FRACTION OR PERCENT INDICATING THE PART
OF THE SAMPLE HAVING A PARTICULAR TRAIT OF
INTEREST.
n = SAMPLE SIZE, X IS A PART OF SAMPLE SIZE WHICH
HAVE A PARTICULAR TRAIT OF INTEREST.
CONFIDENCE INTERVAL FOR POPULATION PROPORTION
(π)
P ± Zv √ P( 1- P ) / n page:300 no:18
Application
The distribution of household that favor a certain bath soap in
West Java Province (use a 90% confidence level).
Brand Number of households
Palmolive 1500
Lux 1800
Zest 1500
Beauty 1000
Minty 1200
Other brands 2000
A. Calculate confidence interval for the proportion of households
that favor LUX bath soap for their families.
B. What is the sample size should be selected if the maximum
error not more than 0.5%.
2. Confidence interval for the population mean ( µ ) when the
sample size ( n < 30 ) t–table ( p:722 )
( X1 – X2 ) ± Zv σx
σx = pooled standard deviation.
X1 = the sample mean of X1
X2 = the sample mean of X2
n1 = the sample size of X1
n2 = the sample size of x2
Example
A study was made to estimate the difference in salaries
of college professors in the private and state colleges
of Virginia. A random sample of 100 professor in the
private colleges showed an average of $ 15.000 per
month with a standard deviation of $ 1200. A random
sample of 200 professors in state colleges showed an
average salary of $ 16.000 with a standard deviation
of $ 1400. Find a 90% confidence interval for the
difference between the average salaries of professors
teaching in state and private colleges in Virginia.
5. Estimating the difference between two population
means ( µ1- µ2 ), n< 30
( X1 - X2) ± tv SP
Compute 95% confidence interval for the difference between the average
quality for the two brands.
EXAMPLE: POPULATION PROPORTION
Conclusion
1.Testing the Population Mean ( µ ), n ≥ 30
Example: Mr X, a mouthwash distributor, has stated that the
average cost to process a sales order is $ 13,25. Miss Y, cost
controller, fears that the average cost of processing is more
than $13.25. She is interested in taking action if cost are high,
but she does not care if the actual average cost is below the
assumed value. A random sample of 100 orders had a sample
mean of $13.35, assuming the std deviation is $ 0.50. Conduct
a test at 5 % level of significance and can you conclude that
the average cost of a sales order is more than $13.25?
TEST STATISTIC FOR µ ( n ≥ 30)
Ho=
Ha=
CRITICAL VALUES ( Z TABLE )
ACCEPTANCE & REJECTION AREAS
TEST STATISTIC: X - µ
Zh = ---------
SD/√n
. TEST STATISTIC IS FOUND FROM SAMPLE
INFORMATION.
. DECISION: ACCEPT OR REJECT Ho.
. CONCLUSION:
2. TEST STATISTIC FOR π
(POPULATION PROPORTION)
TEST STATISTIC:
P–π
Zh = ---------------
π ( 1 –π )/n
TESTING FOR A PROPORTION
POPULATION ( π )
RESEARCH AT THE FEB-UGM INDICATES THAT
50% OF THE STUDENTS CHANGE THEIR MAJOR
OF STUDY AFTER THEIR FIRST YEAR IN A
PROGRAM. A RANDOM SAMPLE OF 100
STUDENTS IN THE BUSINESS PROGRAM
REVEALED THAT 48% HAD CHANGED THEIR
MAJOR AREA OF STUDY AFTER THEIR FIRST
YEAR OF THE PROGRAM. Has there been a
significant decrease in the proportion of students who
change their major after the first year in this program?
Test at the 0.05 level of significance.
Example:2
Distribution of data after promoting Lux on Televion a few months ago.
Brand of bath soap Number of Household
Palmolive 1000
Lux 1500
Zest 1000
Beauty 750
Minty 1250
Maya 1000
Other Brands 3500
Assumption: Before promotion, the market share of Lux = 14%.
a. Can you conclude that the proportion of HH that favor Lux is
different from 14%.
b. Can you conclude that the proportion of HH that favor Lux is higher
than 14 %. ( Alpha = 15 % ).
5. Testing The Population Mean (µ ), n < 30
( Use t-table ) p: 785
Test Statistic:
X -µ
th = ------------
SD / n
Example
The distribution of gasoline consumption of 6 samples of
Honda Astrea can be reported as follow:
Sample Range
1 54 km/l
2 53
3 56
4 52
5 50
6 55
The manufacture claimed that the average of gasoline
consumption was 55 km/l. Can you conclude that this claim is too
overestimate? ( use alpha = 5 % ).
TESTING THE DIFFERENCE BETWEEN TWO
POPULATION MEANS
PARAMETRIC STATISTICS:
1. SPSS: PAIRED SAMPLES
2. EVIEWS: UNPAIRED SAMPLES:
EQUALITY OF VARIANCE TEST
3. EXCEL: DATA ANALYSIS
Sd = Pc ( 1 – Pc ) / n1 + Pc ( 1 – Pc ) / n2
6.TEST STATISTIC FOR (µ1 - µ2), n < 30
( USE- t TABLE )
TEST STATISTIC:
X1 - X2
th = -------------------------------------------
( n1-1) var1+ (n2-1) var2 1 +1
(n1 + n2 – 2) n1 n2
Testing (µ1 - µ2 ), n < 30
Dr. Dony, a psychologist, administered IQ tests to
determine if female FEB students were as smart as
male students. The random sample of 15 females
had a mean score of 131 with std deviation of 17.
The random sample of 13 male students had a
mean of 126 and a std deviation of 14. At 0.01 level
of significance:
a. is there a significant difference in their IQ?
b. Can you conclude that the average female
students IQ is greater than male students IQ?
6. Dependent Samples test
Sample Productivity of Workers
After Training Before Training d ( d-d ) ( d – d )2
1 235 units 228 units 7 2.4 5.76
2 210 205 5 0.4 0.16
3 231 219 12 7.4 54.76
4 242 240 2 -2.6 6.76
5 205 198 7 2.4 5.76
6 230 223 7 2.4 5.76
7 231 227 4 -0.6 0.36
8 210 215 -5 -9.6 92.16
9 225 222 3 -1.6 2.56
10 249 245 4 -0.6 0.36
Can you conclude that the training program increase the
productivity of the employees? ( alpha = 5% ).
∑ d = 46 d = 46/10 = 4.6
SD = ∑ ( d – d )2
(n–1)
Test statistic: d
SD / n
Example:2
Sample of customer Score of: New Menu Old menu
1 36 35
2 48 46
3 50 51
4 76 74
5 55 56
6 60 59
7 71 72
8 66 64
Can you conclude that the new menu is more delicious than
the old one? ( alpha = 1% ).
7: CHY SQUARE TEST
Table: p: 787
Chy- square tests are used in a procedure that
involves the comparison of the differences between
the sample frequencies of the occurrence ( Oij ) and
the hypothetical or theoretical population frequencies
( Eij ) ( Expected value ). ( Goodness of Fit ).
It can also be used to test relationship between
variables ( independency test ).
Critical value of x2 depends on the number of rows
and columns
Chy square table
Chy- square table is always one tailed in the
right side of the curve.
Critical table is X2 ά df ( r -1 )( c - 1 ).
Ho is accepted if test statistics calculated is
less than or equal to its critical table.
Ho is rejected if test statistics calculated is
greater than its critical table.
Chy-Square test
Test statistic:
( Oij – Eij )2
Xh2 =Σ --------------
Eij
Oij = observed frequencies in ith row & jth column
Eij = expected frequencies in ith row & jth column
EXAMPLE
A GARMENT COMPANY IN CAKUNG RECORDS THE
PERFORMANCE OF ITS LABOR PRODUCTIVITY
RANDOMLY.
THE LEVEL OF PRODUCTIVITY
LOW MODERATE HIGH
WORK-SHIFT
MORNING (I) 40 45 50
AFTERNOON(II) 60 55 60
NIGHT(III) 40 30 25
Based on this research, what is your conclusion?
(α = 5%)
CHY SQUARE: INDEPENDENCY TEST