LQ1 Notes
LQ1 Notes
2. Inferential Statistics
• Frequency Histogram
Purpose: To provide a graphical display
that gives a sense of data distribution
across the range of values.
• Range-
• Variance-
n! P ( A) + P ( B )
nCr =
(n − r )!r ! “Ace or King”
n! P( A) + P( B) − P( A & B)
nPr =
(n − r )! “Ace and King”
--------------------------------------------------
“4 blue balls and 3 red balls. Probability
Circular (For unique/people) of getting blue ball. There are 7 total
number of balls. Pick 3 times”
( n − 1)!
W/ Replacement Concepts
Circular for Repeated Patterns: Every time we pick up a ball, it is
(Bracelet(Red/Blue/Red/Blue)) REPLACED by a random another ball.
(n − 1)! 4 4 4
• •
7 7 7
2 W/o Replacement Concepts
Repeating items / terms
Every time we pick up a ball, it is NOT
n! REPLACED by a random another ball.
a !b!c !... 4 3 3
• •
Let [n] be the total number of 7 6 5
variables and [a, b, c, …, z.] be the
For the first try there are still 4 blue balls
repeated variables
and 3 red balls, thus the probability is
Ex. For “MATHEMATICS” given
Total number of words [n]=11 For the second try, provided that we
picked up a blue ball during the first try,
Number of repeated M’s [a] = 2
then the remaining blue balls are 3 and
Number of repeated A’s [b] = 2 3 red balls.
Number of repeated T’s [c] = 2 For the third try, provided that we picked
up a red ball during the second try, then
11! the remaining blue balls are still three
Then… but two remaining for the red balls.
2!2!2!
Intro to Discrete Probability
Distribution
Random Variables
-Numerical Outcomes
(Probabilities) of a random event fig.1
a N − a
Probability n
x
−
P( x) =
x n x
N
Total Random Sample Can also be used for more than two
n groups:
[When people are being used,
automatically w/o replacement] P( x) =
( G1)( G 2 ) (G3)
[Success is the term used for what (TOTAL _ COMBI )
you’re “looking for”]
Random Variables is limited to the
values. It cannot be higher than the
Additional Notes: sample size or success total.
Inside of Interval
Outside of Interval
1
Variance: V [ x] =
2
Test for Means
Null Hypothesis Test Statistic Remarks
1 = 2 but Unknown
1 2 and Unknown
H : 1 − 2 = d s12 s2 2 +
+ df = n1 n2
n1 n2 2
s12 s2 2
2
n1 + n2
n1 − 1 n2 − 1
Paired Observations
d − d
t=
H : = d sd Use t-distribution with
n df = n − 1
pˆ1 − pˆ 2 x1 x2
z= Sample Proportions: pˆ1 = and pˆ 2 =
H : p1 − p2 = 0 1 1
ˆ ˆ +
pq
n1 n2
n1 n2 x1 + x2
Pooled Sample Proportion: pˆ =
n1 + n2
( x1 − x2 ) − d is Known
z=
H : = 12 22 (or sample is large)
+
n1 n2 Use Z-distribution
x − is Unknown
t=
H : = s Use t-distribution with
n df = n − 1
d − d Paired Observations
t=
H : = d sd Use t-distribution with
df = n − 1
n
x
pˆ =
pˆ − p
, where x is the
n
H : p = p z= p q
number of “successes” in
the sample, the sample size
n is n 30
Use the Z-distribution
Notes Prepared by:
𝜎 is Known
𝑥̅ − 𝜇
𝑧=
𝐻 ∶ 𝜇=𝜇 𝜎 (or sample is large)
√𝑛 Use Z-distribution
𝑥̅ − 𝜇 𝜎 is Unknown
𝑡=
𝐻 ∶ 𝜇=𝜇 𝑠
√𝑛 Use t-distribution with 𝑑𝑓 = 𝑛 − 1
𝜎 = 𝜎 but Unknown
(𝑥̅ − 𝑥̅ ) − 𝑑
𝑡= (𝑛 − 1)𝑠 + (𝑛 − 1)𝑠
𝐻 ∶ 𝜇 −𝜇 =𝑑 1 1 𝑠 =
𝑠 + 𝑛 +𝑛 −2
𝑛 𝑛
Use t-distribution with 𝑑𝑓 = 𝑛 + 𝑛 − 2
𝜎 ≠ 𝜎 and Unknown
Use t-distribution with
(𝑥̅ − 𝑥̅ ) − 𝑑
𝑡=
𝐻 ∶ 𝜇 −𝜇 =𝑑 𝑠 𝑠
𝑠 𝑠 +
+ 𝑛 𝑛
𝑛 𝑛 𝑑𝑓 =
𝑠 𝑠
𝑛 𝑛
+
𝑛 −1 𝑛 −1
𝑑̅ − 𝑑 Paired Observations
𝐻 ∶ 𝜇 =𝑑 𝑡=
𝑠
Use t-distribution with 𝑑𝑓 = 𝑛 − 1
√𝑛
𝐻 Critical Region
𝜎 >𝜎 𝜒 >𝜒
𝜎 <𝜎 𝜒 <𝜒
𝜎 ≠𝜎 𝜒 <𝜒 or 𝜒 > 𝜒
𝐻 Critical Region
𝜎 −𝜎 >0 𝑓 > 𝑓 (𝑣 , 𝑣 )
𝜎 −𝜎 <0 𝑓<𝑓 (𝑣 , 𝑣 )
𝜎 −𝜎 ≠0 𝑓>𝑓 (𝑣 , 𝑣 ) or 𝑓 < 𝑓 (𝑣 , 𝑣 )