Statistics Review
Statistics Review
Statistics Review
s=
(x
i =1
x) 2
n 1
for grouped data
s=
f (m x )
i =1 i i
n 1
s XY =
1 n ( xi x )( yi y ) n 1 i =1
Correlation Coefficient
r=
or
s XY s X sY
[ nx (x ) 2 ][ ny 2 (y ) 2 ]
2
r=
nxy (x)( y )
a=
n(xy ) (x )( y ) n(x 2 ) (x ) 2
and b = y ax
Z =
Binomial Distribution
n x n x P(X = x) = p q x
One Variable Statistics Review
Section 2.5 Measures of Central tendency Section 2.6 Measures of Spread
MULTIPLE CHOICE 1. A box-and-whisker plot does not show the a) mean b) first quartile c) third quartile d) median
2. Which of the following is not a measure of dispersion in a set of data? a) mean c) variance b) interquartile range d) standard deviation PROBLEM 1. The following table lists the approximate numbers of residents in 21 Canadian cities in 2002. City Calgary Edmonton Halifax Hamilton Kingston Kitchener/Waterloo Lethbridge London Ottawa Regina Saint John a) b) c) d) e) Population 864 700 693 800 117 200 347 500 60 300 276 400 71 200 350 900 348 500 182 800 73 600 City Saskatoon Sault Sainte Marie St. John's Sudbury Thunder Bay Toronto Vancouver Victoria Windsor Winnipeg Population 72 500 193 600 97 500 99 200 122 500 2 571 700 534 600 76 600 213 100 635 200
Find the median, first quartile, and third quartile for these data. Determine the range and interquartile range. Calculate the mean, standard deviation, and variance. What is the z-score for the population of Windsor? What is the z-score for the population of Toronto?
Problems
1. Hans has collected data to study the effect of the total winter snowfall on the height of his corn crop the following summer. a) Complete the table below and use the results to calculate the correlation coefficient, r. Snowfall, x Corn Height, y Year (cm) (cm) x2 y2 xy 1995 1996 1997 1998 1999 Totals 173 165 152 184 178 182 190 207 180 184
b) Explain what this correlation coefficient tells you about the relationship between the amount of winter snowfall and the height of corn plants the following summer.
b)
b) c) d) e)
and . Determine the equation of the line of best fit using a graphing calculator, a spreadsheet, or Fathom. Compare the equations you found in parts a) and b), and account for any differences. What is the correlation coefficient for the set of data? What does this correlation coefficient suggest about the effectiveness of the advertisements?
2. For which of the binomial distributions listed below is the normal distribution not a reasonable approximation? a) n = 50, p = 0.4 b) n = 40, p = 0.12 c) n = 75, p = 0.11 d) n = 40, p = 0.8 SHORT ANSWER 1. Use the normal approximation to find and in a binomial distribution with n = 1000 and p = 0.5. 2. QuenCola, a soft-drink company, knows that it has a 42% market share in one region of the province. QuenColas marketing department conducts a blind taste test with 100 people at a mall in the region. Use a normal approximation to calculate the probability that fewer than 40 of these people will choose QuenCola. 3. QuenCola, a soft-drink company, knows that it has a 42% market share in one region of the province. QuenColas marketing department conducts a blind taste test with 100 people at a mall in the region. Use a normal approximation to calculate the probability that exactly 40 of these people will choose QuenCola. PROBLEM
1. The probability of an airline flight arriving on time is 90%. Use the normal approximation to find the probability that at least 300 of a random sample of 350 flights will arrive on time. Explain each step in the calculation.