Binomial Distributions For Sample Counts

The document discusses binomial distributions and their applications. Binomial distributions model the number of successes in a series of fixed trials where each trial has two possible outcomes (success/failure) and a constant probability of success. They are used when assessing the occurrence of an event rather than its magnitude. The mean and standard deviation of a binomial distribution are defined. Changing the probability of success affects the shape of the distribution. Sample proportions can estimate population proportions. As sample size increases, the sampling distribution of the sample proportion approximates a normal distribution.

Uploaded by

Vishnu Venugopal

Binomial distributions for sample counts

Binomial distributions are models for some categorical variables, typically representing the number of successes in a series of n trials.
The observations must meet these requirements:
The total number of observations n is fixed in advance.
Each observation falls into just 1 of 2 categories: success and failure.
The outcomes of all n observations are statistically independent.
All n observations have the same probability of success, p.
We record the next 50 births at a local hospital. Each newborn is either a
boy or a girl; each baby is either born on a Sunday or not.
Applications for binomial distributions
Binomial distributions describe the possible number of times that
a particular event will occur in a sequence of observations.
They are used when we want to know about the occurrence of an
event, not its magnitude.
In a clinical trial, a patient's condition may improve or not. We study the
number of patients who improved, not how much better they feel.
Is a person ambitious or not? The binomial distribution describes the
number of ambitious persons, not how ambitious they are.
In quality control we assess the number of defective items in a lot of
goods, irrespective of the type of defect.
Reminder: Sampling variability
Each time we take a random sample from a population, we are likely to get a
different set of individuals and calculate a different statistic. This is called sampling
variability.
If we take a lot of random samples of the same size from a given population, the
variation from sample to sample (the sampling distribution) will follow a
predictable pattern.
Binomial mean and standard deviation
The center and spread of the binomial distribution for a count X are defined by the mean $\mu$ and standard deviation $\sigma$:
$$\mu = np, \qquad \sigma = \sqrt{npq} = \sqrt{np(1 - p)}$$
where q = 1 - p is the probability of failure.
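As a quick check of the mean and standard deviation formulas, the sketch below evaluates them for the 50 recorded births. The value p = 0.5 for "boy" is an assumption made only for illustration, and `binomial_mean_sd` is a hypothetical helper name, not something from the slides.

```python
import math


def binomial_mean_sd(n: int, p: float) -> tuple[float, float]:
    """Return (mean, standard deviation) of a binomial count X ~ B(n, p)."""
    q = 1.0 - p                      # probability of failure
    return n * p, math.sqrt(n * p * q)


# Assumed for illustration only: 50 births, P(boy) = 0.5
mean, sd = binomial_mean_sd(50, 0.5)
print(f"mean = {mean:.2f}, sd = {sd:.2f}")
```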
Effect of changing p when n is fixed.
a) n = 10, p = 0.25
b) n = 10, p = 0.5
c) n = 10, p = 0.75
For small samples, binomial distributions are skewed when p is different from 0.5.
[Figure: three bar charts, panels a) to c), each plotting P(X = x) against the number of successes (0 to 10).]
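The three panels can be reproduced numerically. The following sketch computes the exact binomial probabilities for n = 10 and each of the three values of p, then measures skewness directly from the probabilities. The helper names `binomial_pmf` and `skewness` are illustrative, not taken from the slides.

```python
from math import comb


def binomial_pmf(n: int, p: float) -> list[float]:
    """P(X = x) for x = 0..n when X ~ B(n, p)."""
    return [comb(n, x) * p**x * (1 - p) ** (n - x) for x in range(n + 1)]


def skewness(pmf: list[float]) -> float:
    """Standardized third central moment of a discrete distribution on 0..n."""
    mean = sum(x * px for x, px in enumerate(pmf))
    var = sum((x - mean) ** 2 * px for x, px in enumerate(pmf))
    third = sum((x - mean) ** 3 * px for x, px in enumerate(pmf))
    return third / var**1.5


for label, p in [("a", 0.25), ("b", 0.5), ("c", 0.75)]:
    pmf = binomial_pmf(10, p)
    mode = max(range(len(pmf)), key=lambda x: pmf[x])  # most likely count
    print(f"{label}) p = {p}: mode = {mode}, skewness = {skewness(pmf):+.3f}")
```

A positive skewness means a longer right tail (panel a), a negative one a longer left tail (panel c), and a value near zero a symmetric shape (panel b).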
Sample proportions
The proportion of successes can be more informative than the count. In statistical sampling the sample proportion of successes, $\hat{p}$, is used to estimate the proportion p of successes in a population.
For any SRS of size n, the sample proportion of successes is:
$$\hat{p} = \frac{\text{count of successes in the sample}}{n} = \frac{X}{n}$$
In an SRS of 50 students in an undergrad class, 10 are Hispanic:
$\hat{p} = (10)/(50) = 0.2$ (proportion of Hispanics in sample)
The 30 subjects in an SRS are asked to taste an unmarked brand of coffee and rate it "would buy" or "would not buy." Eighteen subjects rated the coffee "would buy."
$\hat{p} = (18)/(30) = 0.6$ (proportion of "would buy")
Sampling distribution of the sample proportion
The sampling distribution of $\hat{p}$ is never exactly normal. But as the sample size
increases, the sampling distribution of $\hat{p}$ becomes approximately normal.
The normal approximation is most accurate for any fixed n when p is close to 0.5, and
least accurate when p is near 0 or near 1.
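The claim about accuracy can be checked without simulation. The sketch below compares the exact binomial cumulative probabilities of the count X (which has the same shape as the distribution of the sample proportion X/n) with a normal distribution of matching mean and standard deviation. Two choices here are assumptions of this example rather than of the slides: n = 30, and the use of a continuity correction (evaluating the normal curve at x + 0.5).

```python
from math import comb, sqrt
from statistics import NormalDist


def max_normal_error(n: int, p: float) -> float:
    """Largest gap between the exact binomial CDF and its normal approximation."""
    normal = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))
    cumulative = 0.0
    worst = 0.0
    for x in range(n + 1):
        cumulative += comb(n, x) * p**x * (1 - p) ** (n - x)  # exact P(X <= x)
        approx = normal.cdf(x + 0.5)                          # continuity-corrected
        worst = max(worst, abs(cumulative - approx))
    return worst


for p in (0.05, 0.5, 0.95):
    print(f"n = 30, p = {p}: max error = {max_normal_error(30, p):.4f}")
```

With n held fixed, the error for p = 0.5 should come out much smaller than for p = 0.05 or p = 0.95.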

Estimation
Estimation: a process whereby we select a random sample from a population and use a sample statistic to estimate a population parameter.
Point and Interval Estimation
Point Estimate: a sample statistic used to estimate the exact value of a population parameter.
Confidence interval (interval estimate): a range of values defined by the confidence level within which the population parameter is estimated to fall.
Confidence Level: the likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter.
Inferential Statistics involves Three Distributions:
A population distribution: variation in the larger group that we want to know about.
A distribution of sample observations: variation in the sample that we can observe.
A sampling distribution: the distribution of a statistic across repeated samples. For large samples it is approximately normal, and its mean and standard deviation allow one to infer the parameters from the statistics.
The Central Limit Theorem Revisited
What does this Theorem tell us:
Even if a population distribution is skewed, the sampling distribution of the mean is approximately normal when the sample size is large.
The mean of the sampling distribution equals the population mean.
As the sample size gets larger the standard error of the mean decreases in size (which means that the variability in the sample estimates from sample to sample decreases as n increases).
It is important to remember that researchers do not typically conduct repeated samples of the same population. Instead, they use the knowledge of theoretical sampling distributions to construct confidence intervals around estimates.
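The points about the Central Limit Theorem can be illustrated with a small simulation. This is only a sketch under stated assumptions: the skewed population is taken to be exponential with mean 1 and standard deviation 1, the sample sizes 5 and 50 and the 2000 repetitions are arbitrary, and a fixed seed is used so the run is repeatable.

```python
import random
import statistics


def sample_means(n: int, reps: int, rng: random.Random) -> list[float]:
    """Means of `reps` random samples of size n from a skewed population."""
    # Assumed population: exponential with mean 1 and SD 1 (right-skewed)
    return [statistics.fmean(rng.expovariate(1.0) for _ in range(n))
            for _ in range(reps)]


rng = random.Random(0)          # fixed seed for a repeatable run
results = {}
for n in (5, 50):
    means = sample_means(n, 2000, rng)
    results[n] = (statistics.fmean(means), statistics.stdev(means))
    theory_se = 1.0 / n**0.5    # population SD / sqrt(n)
    print(f"n = {n}: mean of sample means = {results[n][0]:.3f}, "
          f"SD of sample means = {results[n][1]:.3f} (theory {theory_se:.3f})")
```

The mean of the sample means should stay close to the population mean of 1 at both sample sizes, while their spread should shrink roughly in proportion to 1 / sqrt(n).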
A confidence interval is a range of reasonable guesses at a population value,
for example, a mean.
Confidence level = chance that range of guesses
captures the population value.
Most common confidence level is 95%
General Format of a Confidence Interval
estimate +/- margin of error
Accuracy of a mean
A sample of n=36 college women has
mean pulse = 75.3.
The SD of these pulse rates = 8.
How well does this sample mean estimate
the population mean?
Standard Error of Mean
SEM = SD of sample / square root of n
SEM = 8 / square root ( 36) = 8 / 6 = 1.33
Margin of error of mean = 2 x SEM
Margin of Error = 2.66 , about 2.7
Interpretation
95% confidence that the sample mean is
within 2.7 (pulse beats) of the population
mean.
A 95% confidence interval for the
population mean
sample mean +/- margin of error
75.3 +/-2.7 ; 72.6 to 78.0
C.I. for mean pulse of men
n=49
sample mean=70.3, SD = 8
SEM = 8 / square root(49) = 1.1
margin of error=2 x 1.1 = 2.2
Interval is 70.3 +/- 2.2
68.1 to 72.5
Do men and women differ in
mean pulse?
C.I. for women is 72.6 to 78.0
C.I. for men is 68.1 to 72.5
No overlap between intervals
We say that population means differ
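The two pulse-rate intervals and the overlap check can be recomputed as follows. The sketch uses the slides' rule of thumb (margin of error = 2 x SEM) and carries full precision through, so the men's interval comes out near 68.0 to 72.6 rather than the rounded 68.1 to 72.5 shown above. The function name `approx_95_ci` is hypothetical.

```python
import math


def approx_95_ci(mean: float, sd: float, n: int, multiplier: float = 2.0):
    """Approximate 95% CI: mean +/- multiplier * (sd / sqrt(n))."""
    sem = sd / math.sqrt(n)          # standard error of the mean
    margin = multiplier * sem        # margin of error
    return mean - margin, mean + margin


women = approx_95_ci(75.3, 8, 36)
men = approx_95_ci(70.3, 8, 49)

# Two intervals overlap when each one starts before the other ends
overlap = women[0] <= men[1] and men[0] <= women[1]

print(f"women: {women[0]:.2f} to {women[1]:.2f}")
print(f"men:   {men[0]:.2f} to {men[1]:.2f}")
print("intervals overlap" if overlap else "no overlap between intervals")
```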
Confidence Levels:
Confidence Level: the likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter.
95% confidence level: there is a .95 probability that a specified interval DOES contain the population mean. In other words, there are 5 chances out of 100 (or 1 chance out of 20) that the interval DOES NOT contain the population mean.
99% confidence level: there is 1 chance out of 100 that the interval DOES NOT contain the population mean.
Constructing a
Confidence Interval (CI)
The sample mean is the point estimate of the
population mean.
The sample standard deviation is the point
estimate of the population standard deviation.
The standard error of the mean makes it
possible to state the probability that an
interval around the point estimate contains
the actual population mean.
Standard error of the mean: the standard deviation of a sampling distribution.
$$\text{Standard Error} = \sigma_{\bar{x}} = \frac{\sigma_x}{\sqrt{n}}$$
Estimating standard errors
The Standard Error
$$\sigma_{\bar{x}} = \frac{\sigma_x}{\sqrt{n}}$$
Since the standard error is generally not known, we usually work with the estimated standard error:
$$s_{\bar{x}} = \frac{s_x}{\sqrt{n}}$$
Determining a Confidence Interval (CI)
Given a large enough sample, a confidence interval for the population mean may be constructed at any desired confidence level:
$$CI = \bar{X} \pm Z(SE_{\bar{x}}) = \bar{X} \pm Z\left(\frac{s_x}{\sqrt{n}}\right)$$
Where Z is chosen from a standard normal distribution table to obtain a desired degree of confidence.
Confidence Interval Width
$$\bar{X} \pm Z\left(\frac{s_x}{\sqrt{n}}\right)$$
Confidence Level: Increasing our confidence level from 95% to 99% means we are less willing to draw the wrong conclusion: we take a 1% risk (rather than a 5% risk) that the specified interval does not contain the true population mean.
If we reduce our risk of being wrong, then we need a wider range of values, so the interval becomes less precise.

Confidence Interval Width
Confidence Interval Z Values
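Rather than reading Z from a printed table, the values for common confidence levels can be computed from the standard normal distribution. The sketch below uses Python's standard-library `statistics.NormalDist`. The helper name `z_for_confidence` and the choice of the 90%, 95% and 99% levels are illustrative.

```python
from statistics import NormalDist


def z_for_confidence(level: float) -> float:
    """Two-sided critical value Z for a confidence level such as 0.95."""
    alpha = 1.0 - level                        # total risk of missing the parameter
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)


for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} confidence: Z = {z_for_confidence(level):.3f}")
```

The risk is split evenly between the two tails, which is why a 95% level corresponds to the 97.5th percentile of the standard normal distribution.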
Confidence Interval Width
$$\bar{X} \pm Z\left(\frac{s_x}{\sqrt{n}}\right)$$
Sample Size: Larger samples result in smaller standard errors, and therefore, in sampling distributions that are more clustered around the population mean. A more closely clustered sampling distribution indicates that our confidence intervals will be narrower and more precise.

Confidence Interval Width
$$\bar{X} \pm Z\left(\frac{s_x}{\sqrt{n}}\right)$$
Standard Deviation: Smaller sample standard deviations result in smaller, more precise confidence intervals.
(Unlike sample size and confidence level, the researcher plays no role in determining the standard deviation of a sample.)
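The three influences on interval width (confidence level, sample size, and standard deviation) can be compared side by side. In this sketch the baseline numbers s = 10 and n = 100 are made up for illustration, and `ci_width` is a hypothetical helper that returns the full width 2 x Z x s / sqrt(n).

```python
from math import sqrt
from statistics import NormalDist


def ci_width(s: float, n: int, level: float = 0.95) -> float:
    """Full width of a large-sample CI for the mean: 2 * Z * s / sqrt(n)."""
    z = NormalDist().inv_cdf(1.0 - (1.0 - level) / 2.0)
    return 2.0 * z * s / sqrt(n)


base = ci_width(s=10, n=100, level=0.95)          # illustrative baseline
print(f"baseline (s=10, n=100, 95%): {base:.2f}")
print(f"99% confidence:             {ci_width(10, 100, 0.99):.2f}")  # wider
print(f"n = 400:                    {ci_width(10, 400, 0.95):.2f}")  # half as wide
print(f"s = 5:                      {ci_width(5, 100, 0.95):.2f}")   # half as wide
```

Quadrupling the sample size halves the width because n enters through its square root, whereas halving the standard deviation halves the width directly.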

Finding confidence interval of the mean years of education of voters. (Table 9.4, Hamilton)
Mean = 12.97 years
Standard deviation = 2.42 years
Number of cases n = 155
Calculation of 95 percent confidence interval:
$$\bar{X} \pm Z\left(\frac{s_x}{\sqrt{n}}\right) = 12.97 \pm 1.96\left(\frac{2.42}{\sqrt{155}}\right) = 12.97 \pm 0.38$$
So the interval is $12.59 \le \mu \le 13.35$.
Interpretation
Informal: Based on our analysis of this
particular sample, we are about 95% confident
that the mean education among all voters in
this town lies between 12.59 and 13.35 years.
Formal: If we took a large number of random
samples, each with 155 cases, and calculated
confidence intervals in this manner for each
sample, about 95% of those confidence
intervals should include the true population
mean.
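The formal interpretation describes a repeated-sampling experiment, which can be imitated in code. This is a sketch under assumptions that are not in the source: the population is taken to be normal with mean 12.97 and standard deviation 2.42 (the sample values, reused here as if they were the true parameters), 2000 samples are drawn, and a fixed seed makes the run repeatable.

```python
import math
import random
import statistics


def coverage(mu: float, sigma: float, n: int, reps: int,
             z: float = 1.96, seed: int = 0) -> float:
    """Share of simulated 95% intervals that contain the true mean mu."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        # Assumed population: normal(mu, sigma)
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        mean = statistics.fmean(sample)
        se = statistics.stdev(sample) / math.sqrt(n)   # estimated standard error
        if mean - z * se <= mu <= mean + z * se:
            hits += 1
    return hits / reps


rate = coverage(mu=12.97, sigma=2.42, n=155, reps=2000)
print(f"share of intervals containing the true mean: {rate:.3f}")
```

The printed share should land close to 0.95, though any single run will differ slightly from that figure.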
Estimating the standard error of a proportion: based on the Central Limit Theorem, a sampling distribution of proportions is approximately normal, with a mean, $\mu_{\hat{\pi}}$, equal to the population proportion, $\pi$, and with a standard error of proportions equal to:
$$\sigma_{\hat{\pi}} = \sqrt{\frac{(\pi)(1 - \pi)}{n}}$$
Since the standard error of proportions is generally not known, we usually work with the estimated standard error:
$$s_{\hat{\pi}} = \sqrt{\frac{(\hat{\pi})(1 - \hat{\pi})}{n}}$$
Confidence Intervals for Proportions
Determining a Confidence Interval for a Proportion
Large sample confidence intervals for proportions are found as
$$\hat{\pi} \pm Z(SE_{\hat{\pi}}) = \hat{\pi} \pm Z\sqrt{\frac{(\hat{\pi})(1 - \hat{\pi})}{n}}$$
Where Z is chosen from a table of the standard normal distribution to give the desired degree of confidence.
Finding an approximate 95% confidence interval for the proportion favoring school closings.
Sample statistics:
Proportion favoring school closings = 0.431
Number of cases n = 153
Confidence interval for population proportion $\pi$:
$$\hat{\pi} \pm Z(SE_{\hat{\pi}}) = \hat{\pi} \pm Z\sqrt{\frac{(\hat{\pi})(1 - \hat{\pi})}{n}} = 0.431 \pm 1.96\sqrt{\frac{(0.431)(1 - 0.431)}{153}} = 0.431 \pm 0.078$$
So the interval is $0.353 \le \pi \le 0.509$.
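The school-closing calculation can be reproduced in a few lines. The sketch follows the large-sample formula with Z = 1.96. The function name `proportion_ci` is hypothetical.

```python
import math


def proportion_ci(p_hat: float, n: int, z: float = 1.96):
    """Large-sample CI for a proportion: p_hat +/- z * sqrt(p_hat(1 - p_hat)/n)."""
    se = math.sqrt(p_hat * (1.0 - p_hat) / n)   # estimated standard error
    margin = z * se
    return p_hat - margin, p_hat + margin, margin


lower, upper, margin = proportion_ci(0.431, 153)
print(f"margin of error = {margin:.3f}")
print(f"interval: {lower:.3f} to {upper:.3f}")
```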
Interpretation
Informal: Based on our analysis of this one sample we
are about 95% confident that the proportion in favor
of closing schools, among all voters in this town, lies
between 0.353 and 0.509.
Formal: If we took a large number of random
samples, each with 153 cases, and calculated
confidence intervals in this manner for each sample,
about 95% of those confidence intervals should
include the true population proportion $\pi$.
