Ch. 5 Reading Notes

This document summarizes key concepts about sampling from Chapter 5. It discusses how samples are used to make inferences about populations since studying entire populations is often impossible. The quality of a sample depends on how representative it is of the population and how members are selected. Larger, random samples produce more accurate estimates. Probability samples in which each member has a known chance of selection allow statistical inferences, unlike nonprobability samples. Simple random sampling involves giving each member an equal chance of being picked.

Uploaded by

Helena Rocha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views7 pages

Ch. 5 Reading Notes

Uploaded by

Helena Rocha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

PLSC 270

Chapter 5 Notes
03/03/2021

Chapter 5 – Sampling: Introduction

 Ideally, we would study entire populations to answer research questions, but resources
are limited so we sometimes need to use a sample
o Advantage of sample: less costly, less resources. Disadvantage: info based on
sample is less accurate and/or more subject to error than population info
o Problem: can a sample of 1,000 people really say anything/reflect millions of
people?
 Must find the best possible sample to reduce uncertainties and increase
application of study

The Basics of Sampling

 Population might not necessarily mean people, it can be a set of countries, corporations,
gov’t agencies, years, events, etc.
o Either way, population has to be clearly, carefully, fully defined and must be
relevant to research question.
o Ex: It’s impossible to interview everyone for a survey, so you choose a few
members of a population to investigate
 Sample  any subset of units collected in some manner from a population. A subset of
observations or cases drawn from a specified population
o Sample size and how members of sample are chosen determine quality (aka
accuracy and reliability) of inferences about the whole population
 Important to clarify the method of selection and number of observations
to be drawn
o Most interesting attributes in empirical research are numerical or quantitative
indicators (ex: averages, percentages, etc.). Characteristics of interest can be
examined/measured once sample is gathered – sample statistics
 Sample statistic  estimator of a population characteristic of attribute
that is calculated from sample data. Used to approximate the
corresponding population parameters/values

How Do We Use a Sample to Learn about a Population?

 Always assume there will be a margin of error when we report sample statistic
(difference between sample and actual population parameters)
o Ex: if in a random sample of people, 54% approve of Trump, it means that in the
population, approximately 54% of people approve of Trump
 Researchers sacrifice some precision whenever they rely on samples.
How much is sacrificed depends on how sample was drawn and its size
o Loss of precision/accuracy usually comes from chance. But following proper
procedures and certain assumptions, sample is more accurate/precise
 Difficulty: figuring out how far off the estimate is likely to be
 Statistical inference  mathematical theory and techniques for making conjectures
about the unknown characteristics (parameters0 of populations based on samples
o Goal is to make supportable conjectures about the unknown characteristics of a
population based on sample statistics
o Studying statistics involves precisely defining what “supportable” means. Key to
making inferences from sample statistic = sampling distribution

Sampling Distribution
 Sampling error  difference between what you sample says and what the population
actually is. Arises because only a portion of population is observed
o Ex: 80% of sample approves of Trump, but 50% of population approves. Sampling
error is difference between 80% and 50%
o So, you need a way to measure the amount of error/uncertainty in the estimate
to report a margin of error
 How to calculate uncertainty: take lots of different samples (ex: ask 4 groups of 10
people if they approve of Trump), add it up, divide by number of groups:
o Group 1 approval = 80% (0.80), group 2 = 30%, group 3 = 40%, group 4 = 60%. So:
(0.80 + 0.30 + 0.40 + 0.60)/4 = 0.525
 Closer to real value (0.5) – so it’s better to have more samples
o If you plot all the averages of all the samples, you’ll start seeing a normal (bell)
curve. Sampling distribution  theoretical frequency distribution of a statistic
generated from an infinite number of samples drawn from a population
 Sampling is normally distributed for every observable variable, no matter
the concept – basis for inferential statistics. Mean of sampling
distribution = population parameter
 See page 106 graphs
o Expected value  the mean/average value of a sample statistic based on
repeated samples from a population: E(p) = P
 Little p = estimated sample proportion. Equation  the expected )or
long-run or average) value of sample proportions = the population
proportion (P)
 Best guess of the value of the population parameter is the value of the
sample statistic
Sample Size and Margin of Error
 Large samples are more likely to represent population because small samples have
higher chances of excluding certain groups
o Ex: you might have bad luck and not get any women, or black people, or older
people, etc. in the samples
 Larger sample = more likely to include all types of people/be truly
representative of population
o Margin of error drops a lot when sample size increases, but increasing beyond a
certain point won’t give much marginal benefit:
 Will cost more than it will improve accuracy/precision of study to
increase sample beyond a certain point
Sampling Methods
 Samples must be obtained according to certain rules
 Element (aka: unit of analysis)  single occurrence, realization, or instance of the
objects or entities being studied. A particular case or entity about which info is collected
o Ex: presidential approval rating survey – individual American adults (survey
respondents) are the elements
o In simple cases, sampling unit = element. In more complicated sampling designs,
sampling unit may be a collection of elements.
 Sampling unit  the entity listed in a sampling frame
 Sampling frame  a list from which sampling units are drawn into a sample, and it must
be specified clearly. The population from which a sample is drawn. Ideally it is the same
as the total population of interest to a study (which is usually not possible)
 A population can be stratified – subdivided into groups of similar elements – before a
sample is drawn. Each stratum is a subgroup of a population that shares 1 or more
characteristics
o Ex: population = campaign speeches. Strata = dividing speeches into campaign
years (this group of speeches was made in this year)
o Chosen strata are usually characteristics/attributes thought to be related to the
dependent variables under study
 As samples become less representative of population, inferences about the population
become less valid
o Ex: if population has 50 characteristics, and your sample only has 40. More
characteristics = more valid inferences.
o But it’s super hard to include EVERY characteristic of a population
 Closer it is to real population, the better

Types of Samples
 Purpose of samples is to make inferences about the population from a smaller group. If
sampling frame is incomplete/inappropriate, sample bias happens
o Sample bias: whenever some elements of a population are systematically
excluded from a sample. Usually due to incomplete sampling frame or a
nonprobability method of selecting elements
o Sample is unrepresentative of the population of interest and inaccurate
conclusions about the population may be drawn
 Sample bias makes it important to distinguish between probability sample and
nonprobability sample:
o Probability sample: a sample for which each element in the population has a
known probability of being included in the sample
 This knowledge allows a researcher to calculate how accurately the
sample reflects the population
o Nonprobability sample: sample in which each element in the population has an
unknown probability of being selected
 Probability of selection is required for the use of statistical theory to
make inferences.
o Probability samples > nonprobability samples (Because you can use statistical
theory to make inferences on the former but not on the latter)
 1) Simple Random Samples: each element and combination of elements has an equal
chance of being selected
o Ex: drawing names from a hat – each name has equal chance of being drawn. Ex:
assigning numbers strategy
o Requires a list of the members of the population in the forms of a sampling
frame
 Ex: if you’re studying countries and you need to pick a few countries to
study out of all 195 countries, you need a list of all countries to pick from
o Pro of SRS  as the sample gets larger and larger, the sample will share the
characteristics of the population because every element has equal chance of
being selected
 Problem is that obtaining a sampling frame that is the same as the
population is not always easy/possible
 2) Systematic Random Samples: elements are selected from a list at predetermined
intervals
o Sometimes easier than Simple RS. It also requires a list of the target pop, but the
list is randomized to maintain a random sample
o Sampling interval: the “skip” of the number of elements between elements that
are drawn  k = N/n, N = population size and n = desired sample size
 Ex: if we want to pick countries out of 195 countries to study. If we want
a sample size of n = 10, we would divide the total by 10 to get the
sampling fraction (or interval k) – k = 195/10 = 19.5. Round up to 20. So,
starting at a random point, we would take every 20th country until we had
a sample of 10
 Ex: if we start at country #11, the next would be country #31, #51,
etc.
o Useful for when we’re dealing with a long list of population elements. But it can
result in a biased sample
 If elements on the list have been ranked according to a characteristic,
you’ll get biased sample
 If the list contains a patter that corresponds to the sampling interval,
you’ll get bias (doesn’t happen often, but must be considered)
 3) Stratified Sample: probability sample in which elements are divided into groups,
called strata, based on a characteristic, and elements are selected from each stratum in
proportion to its representation in the total population
o Sampling units are divided into strata with each unit appearing in only one
stratum. Then a simple random sample or systematic RS is taken from each
stratum
o Can be proportionate or disproportionate
 Proportionate: use stratified sample in which each stratum is represented
in proportion to its size in the population (ex: divide into states, but São
Paulo is bigger than Acre, so you draw in proportion to population)
 Disproportionate: select a stratified sample in which elements sharing a
characteristic are under or overrepresented (ex: if you’re trying to study a
specific group, you can overrepresent them)
o Characteristics to stratify should have theoretical importance in study – create
strata that are meaningful for the project
 4) Cluster Samples: used when a list of elements doesn’t exist and creating one wouldn’t
be feasible. It’s a probability sample in which sampling frame initially consists of clusters
of elements
o Since only some elements are going to be selected in a sample, it is unnecessary
to secure a list of all elements in the population
o Groups/clusters of elements are identifies and listed as sampling units. Then, a
sample is drawn from this list of sampling units. Then, elements are identified
and sampled in the sampling units only
 Ex: to conduct interviews with people, you need a small sample (because
interviews are time consuming). So you choose 100 random
neighborhoods, then 10 random streets in the neighborhoods then 10
random houses in the streets – conduct interviews in those houses only
o The houses chosen are random, so it’s a random sample, but the cluster process
reduced the geographic spread of respondents and saved resources
 You don’t need to know the total number of people in the city before
starting the cluster process because each house has an equal probability
of being selected
 Probability of your house being selected = probability of your
neighborhood being selected times probability of your street being
selected times probability of your house being selected
 Systematic, stratified and cluster (2, 3 and 4) are often more practical than simple
random sample (1)
o In each case, the probability of being selected is known, so the accuracy of the
sample can be determined
o The type of sample chosen depends on the resources you have and the
availability of an accurate and comprehensive list of elements in a well-defined
target population
 Nonprobability Samples: sample for which each element in the total population has an
unknown probability of being selected.
o Used when probability samples (which are better because they represent a large
population accurately and it’s possible to calculate how close an estimated
characteristic is to the population value) can’t be used (ex: too expensive)
 Sometimes you can learn more by studying carefully selected and
perhaps unusual cases than by studying representative ones
 Ex: studying undocumented immigrants. There isn’t a list of undoc
people, so you just have to work with who you can find, which isn’t
representative
o Convenience sample: a nonprobability sample in which the selection of elements
is determined by the researcher’s convenience.
o Purposive sample: researcher exercises considerable discretion over what
observations to study because the goal is typically to study a diverse and usually
limited number of observations rather than to analyze a sample that represents
the population
o Quota sample: elements are sampled in proportion to their representation in the
population (similar to proportionate stratified sampling)
 Difference is that elements in the quota sample are not chosen in a
probabilistic way – they’re chosen in a purposive or convenient way
 Usually biased
o Snowball sample: respondents are used to identify other people who might
qualify for including in the sample
 These people are interviewed and asked to supply names for further
investigating, and the sample builds like a snowball
 Problem asking people who know each other to join the study
means you’ll probably get people from the same social circles 
similar characteristics
 Continue the process until enough people are interviewed. Very useful
when studying rare/difficult to locate population (like undocumented)

Conclusion
 If cost isn’t a major consideration and the validity of measures will not suffer, it’s
generally better to collect data for the complete target population than use a sample
 If cost/validity dictate that a sample be drawn, a probability sample is usually preferable
to a nonprobability sample
o Accuracy of sample estimates can be determined only for probability samples.
o If the desire to represent a target population accurately is not a major concern or
is impossible to achieve, then a nonprobability sample can be used
 Probability samples yield estimates of the target population. All samples are subject to
sampling error
o No sample, no matter how well drawn, can provide an exact measurement of an
attribute of, or relationship within, the target population
 Statistical theory gives us methods to make inferences about unknown parameters and
for objectively measuring the probabilities of making inferential errors
o This info allows researchers and scientific community to judge the tenability of
many empirical claims
 See page 117 for list of terms with definitions

Sales Incentive Plans
75% (4)
Sales Incentive Plans
28 pages
FT300 Manual
No ratings yet
FT300 Manual
13 pages
Assessment Rubrics PDF
No ratings yet
Assessment Rubrics PDF
16 pages
Insect Pest
No ratings yet
Insect Pest
165 pages
Weights and Measures - Norzagaray - 06 July 2017
No ratings yet
Weights and Measures - Norzagaray - 06 July 2017
146 pages
White and Simm 2003 First Break RP Aversion
No ratings yet
White and Simm 2003 First Break RP Aversion
27 pages
Manual Chem 0303213 June 21 2020
No ratings yet
Manual Chem 0303213 June 21 2020
268 pages
Safety in Clinical Hematology Laboratory
100% (3)
Safety in Clinical Hematology Laboratory
2 pages
Isatis - Neo-Mining Release Notes
No ratings yet
Isatis - Neo-Mining Release Notes
23 pages
ML Usar Manual-2
No ratings yet
ML Usar Manual-2
21 pages
English A2 Speaking Rubric
No ratings yet
English A2 Speaking Rubric
1 page
Data Analytics Classification
No ratings yet
Data Analytics Classification
56 pages
Khalsa Et Al 2008 Psychophysiology
No ratings yet
Khalsa Et Al 2008 Psychophysiology
7 pages
Calibration Certificate: Tejas Engineering
No ratings yet
Calibration Certificate: Tejas Engineering
1 page
Autonomous Navigation System For Hexa-Legged Search and Rescue Robot Using Lidar
No ratings yet
Autonomous Navigation System For Hexa-Legged Search and Rescue Robot Using Lidar
15 pages
A Study On AI Driven Motion Capture and Pose Estimation For Physiotherapy
No ratings yet
A Study On AI Driven Motion Capture and Pose Estimation For Physiotherapy
8 pages
Usp PVT Dissolution Test
No ratings yet
Usp PVT Dissolution Test
7 pages
Active Machine Learning For Heterogeneity Activity
No ratings yet
Active Machine Learning For Heterogeneity Activity
13 pages
3murphy Et Al 2019 Testing The Independence of Self Reported Interoceptive Accuracy and Attention
No ratings yet
3murphy Et Al 2019 Testing The Independence of Self Reported Interoceptive Accuracy and Attention
19 pages
PHOTOMOD. Block Adjustment PDF
No ratings yet
PHOTOMOD. Block Adjustment PDF
76 pages
Trial Paper 1 SPM 2016
100% (2)
Trial Paper 1 SPM 2016
8 pages
(BS EN 12350-5-2009) Bê tông tươi - Kiểm tra độ chảy xòe bê tông bằng bàn dằn
No ratings yet
(BS EN 12350-5-2009) Bê tông tươi - Kiểm tra độ chảy xòe bê tông bằng bàn dằn
14 pages
Animal Classification
No ratings yet
Animal Classification
9 pages
CAM625 2019 s1 Module1
No ratings yet
CAM625 2019 s1 Module1
31 pages
Omega
100% (2)
Omega
6 pages
Understanding Sources of Bias in Diagnostic Accuracy Studies
No ratings yet
Understanding Sources of Bias in Diagnostic Accuracy Studies
8 pages
UNIT 7-Evaluation
No ratings yet
UNIT 7-Evaluation
5 pages
Accuracy, Recall, Precision, F-Score & Specificity, Which To Optimize On
No ratings yet
Accuracy, Recall, Precision, F-Score & Specificity, Which To Optimize On
10 pages
INTECONT® PLUS For Measuring Systems: % Compact Weighing Electronics For
No ratings yet
INTECONT® PLUS For Measuring Systems: % Compact Weighing Electronics For
8 pages
Serfitikat Kalibrasi 23010350-00
No ratings yet
Serfitikat Kalibrasi 23010350-00
2 pages
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6458)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (643)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (1005)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1856)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brene Brown
4/5 (1175)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (582)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (464)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5181)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (629)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2016)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4103)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1139)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1022)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2814)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (836)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2885)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (278)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (280)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4135)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4372)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Tóibín
3.5/5 (2133)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)

Ch. 5 Reading Notes

Uploaded by

Ch. 5 Reading Notes

Uploaded by

PLSC 270

Chapter 5 – Sampling: Introduction

The Basics of Sampling

How Do We Use a Sample to Learn about a Population?

You might also like