0% found this document useful (0 votes)

5 views31 pages

Part 2

Uploaded by

Fadia Puan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views31 pages

Part 2

Uploaded by

Fadia Puan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 31

Sampling

Distributions
&
Sampling
Techniques
Pop Quiz

– In a cake factory, the standard deviation of sugar per cup is 129 gram. What is
the mean if not more than 62% has less than 440 gram of sugar per cup?
– What if not more than 62% has more than 440 gram of sugar per cup?
– In a vitamin factory, the standard deviation of vitamin C is 16 milligram. What is
the mean if more than 75% should have 51 milligram of vitamin C or more?
Discrete & Continuous
Distributions
– A random variable is discrete if the set of all possible values is at most a finite or a countably infinite number of possible
values.
Examples:
1. Randomly selecting 25 people who consume soft drinks and determining how many people prefer diet
soft drinks
2. Counting the number of people who arrive at a store during a five-minute period

• A random variable is continuous if it can take on values at every point over a given interval.
Examples:
1. Measuring the time between customer arrivals at a retail outlet
2. Measuring the weight of grain in a grain elevator at different points of time

• Discrete distributions (binomial, Poisson, hypergeometric) are constructed from discrete random
variables.
• Continuous distributions (uniform, normal, exponential, and others) are constructed from
continuous random variables.
I. Discrete Distribution
– A histogram is the most common graphical way of describing a discrete
distribution.
• An executive is considering out-of-town business travel for a given Friday. She recognizes
that at least one crisis could occur on the day that she is gone and she is concerned about
that possibility. Table 5.2 shows a discrete distribution that contains the number of crises
that could occur during the day that she is gone and the probability that each number will
occur.
5.2 Describing a Discrete Distribution

Mean, Variance, and Standard Deviation of Discrete Distributions

– The mean or expected value of a discrete distribution is the long run

average of occurrences.

where
long-run average
an outcome
probability of that outcome
• In the long run, the mean or expected number
of crises on a given Friday for this executive is
1.15 crises.
• However, there will never be exactly 1.15
crises.
II.
Continuous
Distributions
6.2 The Normal Distribution
Characteristics of the Normal Distribution

• It is a continuous distribution.
• It is a symmetrical
distribution about its mean.
• It is asymptotic to the
horizontal axis.
• It is unimodal.
• It is a family of curves.
• Area under the curve is 1.
6.2 The Normal Distribution
Probability Density Function of the Normal Distribution

– Shows area under the normal curve for a given mean and standard deviation.
– Since it is difficult to use the formula, common to use a table or computer.
6.2 The Normal Distribution

Standardized Normal Distribution

– The normal distribution is described by its mean and standard deviation.

– All normal distributions can be converted to a single distribution, the z distribution, using
the formula:

– A z score is the number of standard deviations that a value, x, is above or below the
mean.
– The z distribution is a normal distribution with a mean of 0 and a standard deviation of 1.
6.2 The Normal Distribution

Solving for Probabilities Using the Normal Curve

– Example: According to the U.S. Environmental Protection Agency (EPA), on average
there are 4.43 pounds of waste generated per person in the U.S. per day.
– Suppose waste generated per person per day in the U.S. is normally distributed
with a standard deviation of 1.32 pounds.
– If a U.S. person is randomly selected, what is the probability that the person generates more
than 6.00 pounds of waste per day?

– First, find the z value:

– Look the value up in the z table, which gives an area of .3830.

6.2 The Normal Distribution
Solving for Probabilities Using the Normal Curve
– Example, continued.
– .3830 is the area between the mean and the z value of 1.19 (x value of 6).
– Subtract from .5 to get the area in the upper tail.

• There is an 11.7% chance that a randomly

selected person will generate more than 6
pounds of waste per day.
6.2 The Normal Distribution
Using the Computer to Solve for Normal Distribution Probabilities

• Both Excel and Minitab can be used.

• For the waste generation problem

given earlier, if a U.S. person is
randomly selected, what is the
probability that the person generates
between 5.30 and 6.50 pounds of
waste per day?

• Both programs give the probability,

0.1965.
III. Sampling Techniques

Reasons for Sampling

– The sample can save money.
– The sample can save time.
– For given resources, the sample can broaden the scope of the study.
– Because the research process is sometimes destructive, the sample can save product.
– If accessing the population is impossible, the sample is the only option.
Reasons for Taking a Census
• Eliminate the possibility that a randomly selected sample may not be
representative of the population.
• For the safety of the consumer.
• To benchmark data for future studies.
Frame
• List, map, or directory used in the sampling process to represent the
population.
• Also called the working population.
7.1 Sampling
Frame
– A frame is overregistered if it contains units that are not in the target
population.
– A frame is underregistered if it does not include some units that are in the
population.
Types of Sampling Designs

14-15
7.1 Sampling

Random Versus Nonrandom Sampling

– In random sampling, every unit of the population has the same chance of being selected.
– In nonrandom sampling, not every unit of the population has the same chance of being
selected.
– Generally NOT an appropriate technique for gathering data for statistical analysis

Simple Random Sampling

– Each unit in the frame is numbered from 1 to N (the size of the population.
– A random number table or generator is used to select n items into the sample.
7.1 Sampling
Simple Random Sampling, continued.
Example: From the population frame of companies in Table 7.3, select a simple
random sample of six companies.
– First, the companies were numbered from 1 to 30.
7.1 Sampling
Example, continued:
– From the table of random number, two digit numbers are selected, discarding any that are over 30.
– In the table below, the first two digits are 91, which is unusable.
– The second two digits are 56, also unusable, as is 74, the next two digits
– The fourth set of two digits are 25, which corresponds with Occidental Petroleum.
7.1 Sampling
Example, continued:
– Continue moving across the rows until six two-digit numbers are selected.
– Sample will be:
– (25) Occidental Petroleum
– (27) Procter & Gamble
– (01) Alaska Airlines
– (04) Bank of America
– (02) Alcoa
– (29) Sears
7.1 Sampling
Stratified Random Sampling
– Population is divided into nonoverlapping subpopulations (strata).
– Researcher selects a random sample from each.
– Can reduce sampling error, because sample will more closely match the population.
– More costly than a simple random sample.
– Strata are usually chosen based on available information about the population.

• Within each group, there should be

homogeneity.

• Between each group, there should be

heterogeneity.
7.1 Sampling

Systematic Sampling
– Every kth item is selected to produce a sample of size n from a population of size N.

Example: A business researcher wanted to sample Texas manufacturers as part of a management study.
– Wanted to sample 1,000 companies.
– Frame-- most recent edition of the Texas Manufacturers Register® which listed 26,000 manufacturing
companies in alphabetic order.
– The value of k was 26 (26,000/1,000).
– Use random number table to choose the first element in the study.
7.1 Sampling

Cluster (or Area) Sampling

– Dividing population into nonoverlapping areas.

– Clusters that are internally heterogeneous.
– Example: states, cities
– If clusters are too large, a second set of clusters can be taken from the initial cluster (two-stage
sampling).

– Advantages: convenience, cost

– Disadvantages: may be less efficient than simple random sampling if the elements of the cluster
are similar
7.1 Sampling

Nonrandom Sampling
– Any method that does not involve a random selection process.

Convenience Sampling
– Selected for the convenience of the researcher.

Judgment Sampling
– Chosen by the judgement of the researcher.
– Since the probability of an element being selected cannot be determined, cannot determine
sampling error.
– Can be biased due to systematic errors in judgment.
7.1 Sampling

Quota Sampling
– Population subclasses, such as age or gender, are used as strata.
– Can be useful if no frame is available for the population.
– Can be less costly.
– But nonrandom, and thus probabilities cannot be calculated.

Snowball Sampling
7.1 Sampling

Sampling Error
– Occurs when the sample is not representative of the population.

Nonsampling Error
– All other errors other than sampling error.
– Missing data
– Recording errors
– Measurement errors
– Input processing errors
– Analysis errors
– Response errors
– And many more!
7.2 Sampling Distribution of
Suppose that a small, finite population contains only N = 8 numbers:
54 55 59 63 64 68 69 70

– Distribution of the population data:

– Suppose that all possible samples of size n = 2 are taken from this population.
7.2 Sampling Distribution of
Population:
54 55 59 63 64 68 69 70

All possible samples of n = 2:

– Then take the means of all of the samples.

7.2 Sampling Distribution of
Means of the samples:

Distribution of the means of the samples:

7.2 Sampling Distribution of
– Distribution of the mean of the samples looks different from the original
distribution

– Similarly, the histogram of a Poisson distribution and its samples are different.
7.2 Sampling Distribution of
The Central Limit Theorem
– If random samples of size n are repeatedly drawn from a population that has a mean of μ and a
standard deviation of σ, the sample means,, are approximately normally distributed for sufficiently
large sample sizes (n ≥ 30), regardless of the shape of the population distribution. If the population
is normally distributed, the sample means are normally distributed for any size sample.

– It can be shown that the mean of the sample means is the population mean:

– The standard deviation of the sample means (the standard error of the mean) is:
7.2 Sampling Distribution of

Advanced Statistics Concepts
No ratings yet
Advanced Statistics Concepts
96 pages
Theory of Estimation by P.G.dixit, Nirali Publication
No ratings yet
Theory of Estimation by P.G.dixit, Nirali Publication
186 pages
MBAL - User Guide
100% (1)
MBAL - User Guide
366 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
46 pages
Est&Hypgp 7
No ratings yet
Est&Hypgp 7
292 pages
Sampling Inference
No ratings yet
Sampling Inference
83 pages
Point and Interval Estimate
No ratings yet
Point and Interval Estimate
135 pages
Sampling & Sampling Distribution: by Asif Hanif
No ratings yet
Sampling & Sampling Distribution: by Asif Hanif
25 pages
Week 3
No ratings yet
Week 3
56 pages
SAMPLING AND ESTIMATION Notes and Examples
No ratings yet
SAMPLING AND ESTIMATION Notes and Examples
20 pages
CH06
No ratings yet
CH06
48 pages
Sampling Design: Basic Concepts and Procedure: Sampling Frame. Known. Random Samples
No ratings yet
Sampling Design: Basic Concepts and Procedure: Sampling Frame. Known. Random Samples
18 pages
Lab Manual - DWH
No ratings yet
Lab Manual - DWH
21 pages
Chapter 7 Sampling and Sampling Distributions
No ratings yet
Chapter 7 Sampling and Sampling Distributions
44 pages
Statistics For Managers Using Microsoft Excel: 5 Edition
No ratings yet
Statistics For Managers Using Microsoft Excel: 5 Edition
43 pages
Sampling Sta414
No ratings yet
Sampling Sta414
44 pages
Math
No ratings yet
Math
10 pages
Sampling & Sampling Distributions
No ratings yet
Sampling & Sampling Distributions
34 pages
5sampling Methods
No ratings yet
5sampling Methods
78 pages
Bus 6
No ratings yet
Bus 6
45 pages
Chapter7 Sampling Distribution
No ratings yet
Chapter7 Sampling Distribution
37 pages
Sampling Distribution of The Sample Mean and Central Limit Theorem
No ratings yet
Sampling Distribution of The Sample Mean and Central Limit Theorem
24 pages
Math Presentation Chapter 13 (Koushik)
No ratings yet
Math Presentation Chapter 13 (Koushik)
37 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
35 pages
Data Pipelines From Zero To Solid
No ratings yet
Data Pipelines From Zero To Solid
58 pages
Om Prakash Jena, Bharat Bhushan, Utku Kose - Machine Learning and Deep Learning in Medical Data Analytics and Healthcare Applications (2022, CRC Press) - Libgen - Li
No ratings yet
Om Prakash Jena, Bharat Bhushan, Utku Kose - Machine Learning and Deep Learning in Medical Data Analytics and Healthcare Applications (2022, CRC Press) - Libgen - Li
292 pages
Sampling Distribution
No ratings yet
Sampling Distribution
53 pages
Ba1 7
No ratings yet
Ba1 7
37 pages
Statistics and Probability Q3
No ratings yet
Statistics and Probability Q3
6 pages
Sampling and Sampling Distribution
100% (2)
Sampling and Sampling Distribution
43 pages
Sampling, Sampling Distributions and Estimation
No ratings yet
Sampling, Sampling Distributions and Estimation
8 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
30 pages
Business Stat CH 1
No ratings yet
Business Stat CH 1
15 pages
Lecture 3 Sampling and Sampling Distribution - Probability and Non-Probability Sampling
No ratings yet
Lecture 3 Sampling and Sampling Distribution - Probability and Non-Probability Sampling
16 pages
Statistics For Management II
No ratings yet
Statistics For Management II
99 pages
g12 Important Questions Database Concepts
No ratings yet
g12 Important Questions Database Concepts
7 pages
Attachment
No ratings yet
Attachment
36 pages
1 Chapter
No ratings yet
1 Chapter
29 pages
Day 4 Data Collection Methods-1
No ratings yet
Day 4 Data Collection Methods-1
25 pages
Module 2
No ratings yet
Module 2
148 pages
Samplin Distn
No ratings yet
Samplin Distn
37 pages
Chapter Seven
No ratings yet
Chapter Seven
35 pages
Slide TSP203 New Chap008
No ratings yet
Slide TSP203 New Chap008
24 pages
Lectorial Slides 6a
No ratings yet
Lectorial Slides 6a
30 pages
Statistics For Managenent II
No ratings yet
Statistics For Managenent II
73 pages
Chapter - 1.sampling and Sampling Distrabution
No ratings yet
Chapter - 1.sampling and Sampling Distrabution
43 pages
Kollu Hemanth - Java Resume
No ratings yet
Kollu Hemanth - Java Resume
5 pages
Lecture 5 Statistics
0% (1)
Lecture 5 Statistics
52 pages
Chapter 5 Statistics
No ratings yet
Chapter 5 Statistics
11 pages
Samplig & Sampling Distribution
No ratings yet
Samplig & Sampling Distribution
5 pages
Week 11: Sampling Distribution
No ratings yet
Week 11: Sampling Distribution
9 pages
Bootcamp in Data Analytics (AnalytixLabs)
No ratings yet
Bootcamp in Data Analytics (AnalytixLabs)
40 pages
Reviewer in Statistics and Probability
No ratings yet
Reviewer in Statistics and Probability
7 pages
Stat II Chapter One
No ratings yet
Stat II Chapter One
5 pages
AI&ML Lab Manual
No ratings yet
AI&ML Lab Manual
31 pages
Sample Design and Sampling Procedures
No ratings yet
Sample Design and Sampling Procedures
43 pages
Brief Lecture Notes
No ratings yet
Brief Lecture Notes
13 pages
BY WMS Training-Day 3
No ratings yet
BY WMS Training-Day 3
27 pages
SQL-Transactions Theory and Hands-On Exercises
No ratings yet
SQL-Transactions Theory and Hands-On Exercises
85 pages
Power BI Resume 04
No ratings yet
Power BI Resume 04
6 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
7 pages
E-Present XLSB
No ratings yet
E-Present XLSB
34 pages
Chap 008 Samplinig (Presentation)
No ratings yet
Chap 008 Samplinig (Presentation)
24 pages
Chapter 2-Part 1 Applied Statistics
No ratings yet
Chapter 2-Part 1 Applied Statistics
30 pages
Stat 11
No ratings yet
Stat 11
12 pages
CIS Oracle Database 12c Benchmark v3.0.0
No ratings yet
CIS Oracle Database 12c Benchmark v3.0.0
5 pages
Big Data Answers
No ratings yet
Big Data Answers
14 pages
Inferential Statistics 1 (G4)
No ratings yet
Inferential Statistics 1 (G4)
43 pages
Stat For Comp (7-9)
No ratings yet
Stat For Comp (7-9)
22 pages
DEX450 BuildApplicationsProgrammatically Exercises
No ratings yet
DEX450 BuildApplicationsProgrammatically Exercises
91 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
18 pages
Applets: Unit - V
No ratings yet
Applets: Unit - V
19 pages
The Complete Management System That With Your School: Grows
No ratings yet
The Complete Management System That With Your School: Grows
8 pages
APIs Vs Interfaces
No ratings yet
APIs Vs Interfaces
8 pages
ADB Chapter Two
No ratings yet
ADB Chapter Two
22 pages
Data Warehousing SS G515 Me Software Systems BITS Pilani, Dubai Campus
No ratings yet
Data Warehousing SS G515 Me Software Systems BITS Pilani, Dubai Campus
46 pages
DP200 - PracticeTests 2 AnswersAndExplanation
No ratings yet
DP200 - PracticeTests 2 AnswersAndExplanation
107 pages
Resume Anitha
No ratings yet
Resume Anitha
3 pages
Gate Scholorship Work - October: Sampling Fundamentals
No ratings yet
Gate Scholorship Work - October: Sampling Fundamentals
13 pages
Dbit DBMS
No ratings yet
Dbit DBMS
23 pages
Sridhara GT Fujitsu
No ratings yet
Sridhara GT Fujitsu
5 pages
BDA Unit-5
No ratings yet
BDA Unit-5
44 pages
All MCQ and FIB and TF Question Answer Class X IT 2024
No ratings yet
All MCQ and FIB and TF Question Answer Class X IT 2024
11 pages
3 - Microsoft PL-300 Free Practice Exam & Test Training
No ratings yet
3 - Microsoft PL-300 Free Practice Exam & Test Training
3 pages
Chapter 4 Mail Merge
No ratings yet
Chapter 4 Mail Merge
9 pages
DBMS Syllabus
No ratings yet
DBMS Syllabus
2 pages

Part 2

Uploaded by

Part 2

Uploaded by

Sampling

Mean, Variance, and Standard Deviation of Discrete Distributions

– The mean or expected value of a discrete distribution is the long run

Standardized Normal Distribution

– The normal distribution is described by its mean and standard deviation.

Solving for Probabilities Using the Normal Curve

– First, find the z value:

– Look the value up in the z table, which gives an area of .3830.

• There is an 11.7% chance that a randomly

• Both Excel and Minitab can be used.

• For the waste generation problem

• Both programs give the probability,

Reasons for Sampling

Random Versus Nonrandom Sampling

Simple Random Sampling

• Within each group, there should be

• Between each group, there should be

Cluster (or Area) Sampling

– Dividing population into nonoverlapping areas.

– Advantages: convenience, cost

– Distribution of the population data:

All possible samples of n = 2:

– Then take the means of all of the samples.

Distribution of the means of the samples:

You might also like