0% found this document useful (0 votes)
29 views49 pages

Lab 11

Uploaded by

ahmedtarek544a
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views49 pages

Lab 11

Uploaded by

ahmedtarek544a
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 49

BSC 1202

Probability and statistics


Lab 11
POPULATION AND SAMPLING DISTRIBUTIONS

• Population Distribution:
is the probability distribution of the population data.
• Sampling Distribution:
It lists the various values that can assume and the probability of each value of x
• Suppose there are only five
students in an advanced
statistics class and the midterm
scores of these five students
are:
70 78 80 80 95

• Let x denote the score of a


student
Sampling Distribution
Consider all possible samples of three
scores each that can be selected, without
replacement, from that population.
The total number of possible samples is
5! 5  4  3  2 1
5 C3    10
3!(5  3)! 3  2  1  2  1
• Suppose we assign the letters A, B, C, D, and E to
the scores of the five students so that
• A = 70, B = 78, C = 80, D = 80, E = 95
• Then, the 10 possible samples of three scores each
are
• ABC, ABD, ABE, ACD, ACE,
• ADE, BCD, BCE, BDE, CDE
All Possible
Samples and
Their Means
When the
Sample Size Is 3
Frequency and Relative Frequency
Distributions of x When the Sample Size Is 3
SAMPLING AND NONSAMPLING ERRORS

Definition
Sampling error is the difference between the value of a sample statistic
and the value of the corresponding population parameter. In the case of
the mean,
Sampling error = x
assuming that the sample is random and no nonsampling error has been
made.
Nonsampling errors is the errors that occur in the collection, recording,
and tabulation of data.
Example 7-1
Reconsider the population of five scores given in the previous
example. Suppose one sample of three scores is selected from
this population, and this sample includes the scores 70, 80, and
95. Find the sampling error.
Example 7-1
Reconsider the population of five scores given in the previous
example. Suppose one sample of three scores is selected from
this population, and this sample includes the scores 70, 80, and
95. Find the sampling error.
70  78  80  80  95
  80.60
5
70  80  95
x  81.67
3
Sampling error  x    81.67  80.60  1.07
SAMPLING AND NONSAMPLING ERRORS

• The difference between this sample mean and


the population mean is

x    82.33  80.60  1.73


• This difference does not represent the sampling
error.
• Only 1.07 of this difference is due to the sampling
error.
Nonsampling error  Incorrect x  Correct x
 82.33  81.67  .66
Definition
The mean and standard deviation of the sampling
distribution of x are called the mean and standard
deviation of x and are denoted by  x and  x ,
respectively.
Mean of the Sampling Distribution of
The mean of the sampling distribution
of is always equal to the mean of the
population. Thus,
x  
Standard Deviation of the Sampling Distribution of
The standard deviation of the sampling distribution of
is 
x 
n
where σ is the standard deviation of the population and n
is the sample size. This formula is used when n /N ≤ .05,
where N is the population size.
If the condition n /N ≤ .05 is not satisfied, we use the
following formula to calculate  x :
 N n
x 
n N 1
N n
where the factor N 1 is called the finite population
correction factor
Example 7-2
The mean wage for all 5000 employees who work at
a large company is $27.50 and the standard
deviation is $3.70.
Let x be the mean wage per hour for a random
sample of certain employees selected from this
company. Find the mean and standard deviation of
for a sample size of
(a) 30 (b) 75 (c) 200
Example 7-2: Solution

(a) N = 5000, μ = $27.50, σ = $3.70. In this case,


n/N = 30/5000 = .006 < .05.

 x    $27.50
 3.70
x    $.676
n 30

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 7-2: Solution

(b) N = 5000, μ = $27.50, σ = $3.70. In this case,


n/N = 75/5000 = .015 < .05.

 x    $27.50
 3.70
x    $.427
n 75

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 7-2: Solution

(c) In this case, n = 200 and


n/N = 200/5000 = .04, which is less than.05.

 x    $27.50
 3.70
x    $.262
n 200
Prem Mann, Introductory Statistics, 7/E
Copyright © 2010 John Wiley & Sons. All right reserved
If the population from which the samples are drawn is
normally distributed with mean μ and standard deviation σ,
then the sampling distribution of the sample mean, x , will
also be normally distributed with the following mean and
standard deviation, irrespective of the sample size:

 x   and  x 
n
Example 7-3
 In a recent SAT, the mean score for all examinees was
1020. Assume that the distribution of SAT scores of all
examinees is normal with the mean of 1020 and a
standard deviation of 153. Let x be the mean SAT
x
score of a random sample of certain examinees.
Calculate the mean and standard deviation of and
describe the shape of its sampling distribution when the
sample size is
 (a) 16 (b) 50 (c) 1000

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 7-3: Solution

(a) μ = 1020 and σ = 153.

 x    1020
 153
x    38.250
n 16

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 7-3: Solution

(b)
 x    1020
 153
x    21.637
n 50

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 7-3: Solution

(c)

 x    1020
 153
x    4.838
n 1000

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 7.5

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Sampling From a Population That Is Not Normally Distributed
Central Limit Theorem
According to the central limit theorem, for a large
sample size, the sampling distribution of xis
approximately normal, irrespective of the shape of the
population distribution. The mean and standard deviation
of the sampling distribution of are x

 x   and  x 
n

The sample size is usually considered to be large if


n ≥ 30.
Example 7-4
 The mean rent paid by all tenants in a small city
is $1550 with a standard deviation of $225.
However, the population distribution of rents for
all tenants in this city is skewed to the right.
Calculate the mean and standard deviation of x
and describe the shape of its sampling
distribution when the sample size is
 (a) 30 (b) 100
Example 7-4: Solution
(a) Let x be the mean rent paid by a sample of 30
tenants.
 x    $1550
 225
x    $41.079
n 30
Example 7-4: Solution
(b) Let x be the mean rent paid by a sample of 100
tenants.
 x    $1550
 225
x    $22.500
n 100
APPLICATIONS OF THE SAMPLING
DISTRIBUTION OF x

1. If we take all possible samples of the


same (large) size from a population and
calculate the mean for each of these
samples, then about 68.26% of the
sample means will be within one
standard deviation of the population
mean.
2. If we take all possible samples of the
same (large) size from a population and
calculate the mean for each of these
samples, then about 95.44% of the
sample means will be within two
standard deviations of the population
mean.
3. If we take all possible samples of the
same (large) size from a population and
calculate the mean for each of these
samples, then about 99.74% of the
sample means will be within three
standard deviations of the population
mean.
Example 7-5
Assume that the weights of all packages of a
certain brand of cookies are normally
distributed with a mean of 32 ounces and a
standard deviation of .3 ounce. Find the
probability that the mean weight, x , of a
random sample of 20 packages of this brand
of cookies will be between 31.8 and 31.9
ounces.
Example 7-5: Solution

 x    32 ounces
 .3
x    .06708204 ounce
n 20
z Value for a Value of x

The z value for a value of x is calculated as


x 
z 
x
Example 7-5: Solution

31.8  32
• For x = 31.8: z  2.98
.06708204
31.9  32
• For x = 31.9: z  1.49
.06708204

• P(31.8 < < x 31.9) = P(-2.98 < z < -1.49)


= P(z < -1.49) - P(z < -2.98)
= .0681 - .0014 = .0667
Figure 7.12
Example 7-6
According to Sallie Mae surveys and Credit
Bureau data, college students carried an
average of $3173 credit card debt in 2008.
Suppose the probability distribution of the
current credit card debts for all college
students in the United States is known but its
mean is $3173 and the xstandard deviation is
$750. Let be the mean credit card debt of a
random sample of 400 U.S. college students.
Example 7-6

a) What is the probability that the mean of


the current credit card debts for this
sample is within $70 of the population
mean?
b) What is the probability that the mean of
the current credit card debts for this
sample is lower than the population mean
by $50 or more?
Example 7-6: Solution

μ = $3173 and σ = $750. The shape of the


probability distribution of the population is
unknown. However, the sampling distribution
of is approximately normal because the
sample is large (n > 30).
Example 7-6: Solution

(a)

• P($3103 ≤ x ≤ $3243)
= P(-1.87 ≤ z ≤ 1.87) = .9693 - .0307
= .9386
Figure 7.13 P ($3103  x  $3243)
Example 7-6: Solution

(a) Therefore, the probability that the mean of


the current credit card debts for this
sample is within $70 of the population
mean is .9386.
Example 7-6: Solution

(b)
• For x= $3123:
3123  3173
z  1.33
37.50

• P( x ≤ 3123) = P (z ≤ -1.33)
= .0918
Figure 7.14 P ( x  $3123)
Example 7-6: Solution

(b) Therefore, the probability that the mean


of the current credit card debts for this
sample is lower than the population
mean by $50 or more is .0918.
Thank You!

You might also like