Statistics class notes
Statistics class notes
Example my drive time from home to USF has a mean of 30 minutes and
a std deviation of 2 minutes
Answer: at most 11
1. Percentiles – give the percent of the data in the distribution that fall
below a particular value.
Ex. 75th percentile – upper quartile (75% fall below and 25% fall above)
25th percentile – lower quartile (25% fall below and 75% fall above)
The z-score tells us the number of std deviations on observation falls from
the mean, and in which direction
Outliers
1. Observation is miscoded
2. The observation comes from a population different than the one
specified
3. Observation is the result of a rare chance
5% white
Pick; red
2. Box Plot Metho: a graphical method that uses the quartiles of the
data as the basis for identifying outliers.
Statistics Notes class 3
Probability
Probability Rules
1. 0<=P(x)<=1
2. SIGMA p(x) =1
Symbols:
Independent Events – the outcome of one event does not change or affect
the probability of observing another event
Probability Tables
Car Inventory
Color
We randomly sample five college students and find one with a tattoo
n=5: x=0,1,2,3,4,5
Two Types:
Must Follow
1. 0=<p(x)=<1
2. Sigma p(x)=1
Ex. n=5 students, (assume p=0.3)
Expected Value of a Discrete Random Variable
Criteria:
The left part: The number of outcomes with x successes and (n-x) failures
The right part: the probability of each outcome that has x successes and
(n-x) failures
a. Is this a binomial?
- n = 20 questions
- Success = guess correctly
- P(success) = 0.5 = p => (1-p) =0.5
- Questions are independent
X = the number of questions guessed correctly
b. Find the probability that you guess exactly half the questions
correctly.
P (x=10) = (20! /10! (20-10)!) * (.5) ^10 * (1-0.5) ^20-10 =
0.176197
c. Find the probability that you pass the exam with at least a C.
P(x>=14) = P(14) + P(15) + … + P(20)
It’s Tough!!!
Cumulative Binomial Probabilities
a. P(x=<8) = 0.905
b. P(x>=4) = P(x=<15) or 1 – P(x=<3)
a. Find the probability that at least 7 but fewer than 12 of the M&Ms
are red
Success = red: n=20, p=0.4
P(7=<x<12)
Ex. Find the probability that more than 7 are not red
a. P(Z=<1.00)=0.8413
b. P(Z>0.32)= 1-0.6255=0.3745
P(-1.50<Z<0.610) =P(Z=<0.61)-P(Z=<-1.5)
=0.6623
d. P(Z=<Zo)=0.8944
Zo=+1.25
e. P(Z<Zo)=0.1056
Zo=-1.25
f. P(Z>Zo)=0.9582; Zo=-1.73
We can use the standard normal distribution to solve all normal curve
questions.
Norm.Dist(x,m,sd,True)
Norm.Inv(prob.;m;st)
Ex. the GPAs of students follow a normal distribution with m=2.8 and
sd=0.4
Z=(x-m)/sd;
z=(3.0-2.8)/0.4=0.50=0.50;1-0.6915=0.3085
=1-Norm.Dist(3.0;2.8;0.4;True)=0.3085
b. Find the proportion of students with GPAs between 2.0 and 2.5
Z1=-0.75
Z2=-2.00
=Norm.Dist(2.5;2.8;0.4;True)- Norm.Dist(2.0;2.8;0.4;True)
c. Identify the GPA that identifies the lowest 9% of student GPAs.
Method 1: use the “<=” probability and the cumulative normal table
to find Zo. Zo= -1.34
Solve for Xo=m+ZoSt=2.264