0% found this document useful (0 votes)

4 views

1 Chapter 1 Lecture Notes

The document outlines Standard 1 on Sampling and Data, covering key topics such as definitions of statistics, types of data, sampling methods, and experimental design. It emphasizes the importance of understanding statistical vocabulary, levels of measurement, and ethical considerations in research. Various sampling techniques, including random, stratified, and cluster sampling, are discussed along with the significance of recognizing sampling errors and non-sampling errors.

Uploaded by

wqj68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

1 Chapter 1 Lecture Notes

Uploaded by

wqj68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Standard 1:

Sampling and Data

Topics: Objectives:
● Definitions of statistics, probability and Key terms ● Recognize and differentiate between key terms
● Data, Sampling and Variation in Data and ● Apply various types of sampling methods to data
Sampling collection
● Frequency Table and Levels of Measurement ● Create and interpret frequency tables
● Experimental Design and Ethics

Vocabulary:
Average Blinding Categorical Variable

Cluster Sampling Continuous Random Variable Control Group

Convenience Sampling Cumulative Relative Frequency Data

Discrete Random Variable Double-blinding Experimental Unit

Explanatory Variable Frequency Informed Consent

Institutional Consent Institutional Review Board Lurking Variable

Nonsampling Error Numerical Variable Parameter

Placebo Population Probability

Proportion Qualitative Data Quantitative Data

Random Sampling Relative Frequency Representative Sample

Response variable Sample Sampling Bias

Sampling Error Sampling with Replacement Sampling without Replacement

Statistics Stratified Sampling Systematic Sampling

Treatments Variable
Notes Chapter 1:

Statistics is the science of _, ____,

________________, and _______________ data in order to
make a _______________.

There are two types of Data:

●
●

Probability is a mathematical tool used to study ___________________.

It deals with the chance of an event ___________________.
The probability of an event is always between _______ and _________.

In statistics we have a required common vocabulary that we must all use:

Population:

Parameter:

Sample:

Statistics:

●
How Population and Samples Relate:

Example:
We want to know the mean amount of money first year college students spend at Shenandoah
University on school supplies that do not include books. We randomly survey 100 first year students
at the college. Three of those students spent $150, $200, and $225, respectively.

What is the population?

What is the sample?

What is the parameter?

What is the statistic?

Statistical Variables: a characteristics of ______for each _ or ______ in a

_____________________.

Types of Variables:
● Numerical or Quantitative Data:

● Categorical or Qualitative Data:

In Statistics we will measure and work with lots of
Data!!! Know the difference!!

Qualitative data - are the result of _________ or _ attributes of a

population. Hair color, blood type, ethnic group, the car a person drives, and the street a person lives
on are examples of qualitative data. Qualitative data are generally described by __________or
_____________.

Quantitative data - are always ___________ and are the result of _____________ or
____________ attributes of a population. Amount of money, pulse rate, weight, number of people
living in your town, and number of students who take statistics are examples of quantitative data.
Quantitative data may be either __________ or ________________.

Discrete data - all data that are the result of ___________are called ____________________
data. These data take on only certain ___________ values. If you count the number of phone calls
you receive for each day of the week, you might get values such as zero, one, two, or three.

Continuous data - all data that are the result of ____________ are
_________________________ data assuming that we can measure accurately. If you and your
friends carry backpacks with books in them to school, the numbers of books in the backpacks are
discrete data and the weights of the backpacks are continuous data.

Example:

You go to the supermarket and purchase three cans of soup (19 ounces) tomato bisque, 14.1 ounces
lentil, and 19 ounces Italian wedding, two packages of nuts (walnuts and peanuts), four different
kinds of vegetable (broccoli, cauliflower, spinach, and carrots), and two desserts (16 ounces Cherry
Garcia ice cream and two pounds 32 ounces chocolate chip cookies).

What of the data set is quantitative discrete?

What of the data set is quantitative continuous?

What of the data set is qualitative?

Sampling: Gathering information about an entire population often costs too much or is virtually
impossible. Instead, we use a _________ of the population. A sample should have the
_________________________________ as the population it is representing. Most
statisticians use various methods of _______________________ in an attempt to achieve this
goal. The best method of sampling is a ___________________________.
A simple random sample is a straightforward method for selecting a random sample
● Give each member of the population a number.
● Use a random number generator to select a set of labels.
● These randomly selected labels identify the members of your sample.

Example:

Suppose Lisa wants to form a four-person study group from her pre-calculus class, which has 31
members.
● To choose a simple random sample of size three from the other members of her class, Lisa
could put all 30 names in a hat, shake the hat, close her eyes, and pick out three names.
● Lisa could put all of her classmates into an order (maybe alphabetical) and use the Random
Number generator to select her three members. (using two-digit numbers and ignoring
numbers greater than 31)

Stratified Sample:
To choose a stratified sample,
● ___________ the _______________ into groups called strata
● Take a ___________________ number from each stratum.

Example:

You could stratify (group) your college population by department and then choose a proportionate
simple random sample from each stratum (each department) to get a stratified random sample.
Suppose there are 5 departments, and you want to choose a sample of 50. You choose 10 from each
department using SRS sampling.
Cluster Sample:
To choose a cluster sample
● ________the ______________ into clusters (groups)
● ____________ select some of the clusters.
● All the members from these clusters are in the cluster sample.

Example:
If you randomly sample four departments from your college population, the four departments make
up the cluster sample. Divide your college faculty by department. The departments are the clusters.
Number each department, and then choose four different numbers using simple random sampling.
All members of the four departments with those numbers are the cluster sample.

Systematic Sample:
To choose a systematic sample:
● ___________ select a ______________
● Take every ________ piece of data from a listing of the population.

Example:
Suppose you have to do a phone survey. Your phone book contains 20,000 residence listings. You
must choose 400 names for the sample. Number the population 1–20,000 and then use a simple
random sample to pick a number that represents the first name in the sample. Then choose every
fiftieth name thereafter until you have a total of 400 names (you might have to go back to the
beginning of your phone list). Systematic sampling is frequently chosen because it is a simple method.
Convenience Sampling:
A type of sampling that is _______________ is convenience sampling. Convenience sampling
involves using results that are _____________________________.

Example:
A computer software store conducts a marketing study by interviewing potential customers who
happen to be in the store browsing through the available software. The results of convenience
sampling may be very good in some cases and highly biased (favor certain outcomes) in others.

With Replacement or Without Replacement:

True random sampling is done________replacement. That is, once a member is picked, that member goes
_________ into the population and thus may be chosen _______________________________.

However for practical reasons, in most populations, simple random sampling is done _________
replacement. Surveys are typically done without replacement. That is, a member of the population may be
chosen only once.

Most samples are taken from _______ populations and the sample tends to be ______ in comparison to the
population. Since this is the case, sampling without replacement is ___________________ the same as
sampling with replacement because the chance of picking the same individual more than once with
replacement is __________________.

Sampling Errors and Non-Sampling Errors:

When you analyze data, it is important to be aware of sampling _______ and non-sampling errors.
The actual ________of sampling causes sampling errors.

For example:

● The sample may not be large enough.

● Factors not related to the sampling process cause non-sampling errors.
● A defective counting device can cause a non-sampling error.

In reality, a sample will __________be exactly representative of the population so there will always
be ________ sampling error. As a rule, the _______ the sample, the _________ the sampling
error.
Critical Evaluation:
We need to evaluate the statistical studies we read about critically and analyze them
____________________________ the results of the studies. Common problems to be aware of
include:

Example:
A study is done to determine the average tuition that San Jose State undergraduate students pay per semester.
Each student in the following samples is asked how much tuition he or she paid for the Fall semester. What is
the type of sampling in each case?
Variation:

Variation is present in ________ set of data.

Example:

16-ounce cans of beverage may contain more or less than 16 ounces of liquid. In one study, six 16
ounce cans were measured and produced the following amount (in ounces) of beverage: 15.8 16.1
15.2 14.8 15.8 15.9

Measurements of the amount of beverage in a 16-ounce can may vary because different people make
the measurements or because the exact amount, 16 ounces of liquid, was not put into the cans.
Manufacturers regularly run tests to determine if the amount of beverage in a 16-ounce can falls
within the desired range.

Answers and Rounding off in Statistics:

In this course, I would like you to round to ______ decimal places, unless the answer automatically
rounds to one place or is not a decimal.

It is ______ necessary to reduce most fractions in this course. Especially in Probability Topics, the
chapter on probability, it is more helpful to leave an answer as an unreduced _____________
Levels of Measurement:

The way a set of data is __________ is called its level of measurement. Correct statistical
procedures depend on a researcher being familiar with levels of measurement. Not every statistical
operation can be used with every set of data. Data can be classified into ______ levels of
measurement. They are (from lowest to highest level):

• _________ scale level

○ Data that is measured using a nominal scale is ______________ (categorical).
Categories, colors, names, labels and favorite foods along with yes or no responses are
examples of nominal level data. Nominal scale data are _____________________.

○ Example: Trying to order people according to their favorite food does not make any
sense. Putting veggie pizza first and vegan sushi second is not meaningful.

• ________ scale level

○ Data that is measured using an ordinal scale is similar to nominal scale data but there is
a big difference. The ordinal scale data __________ ordered.

○ Example of ordinal scale data is a list of the top five national parks in the United States.
The top five national parks in the United States can be ranked from one to five but we
cannot measure differences between the data.

• ________ scale level

○ Data that is measured using the interval scale is similar to ordinal level data because it
has a __________ ordering but there is a ___________ between data.
○ Example: Temperature scales like Celsius (C) and Fahrenheit (F) are measured by using
the interval scale.

• ________ scale level

○ Data that is measured using the ratio scale takes care of the ____ problem and gives
you the _____ information. Ratio scale data is like interval scale data, but it has a
___________ and ratios can be _________.
○ Example: four multiple choice statistics final exam scores are 80, 68, 20 and 92 (out of a
possible 100 points). The exams are machine-graded. The data can be put in order from
lowest to highest: 20, 68, 80, 92.
○ The differences between the data have meaning. The score 92 is more than the score 68
by 24 points. Ratios can be calculated. The smallest score is 0. So 80 is four times 20.
The score of 80 is four times better than the score of 20.
Level of Put data in Arrange data in Subtract Determine if one data
Measurement categories order data values value is a multiple of
another

Nominal Yes No No No

Ordinal Yes Yes No No

Interval Yes Yes Yes No

Ratio Yes Yes Yes Yes

Frequency:
Twenty students were asked how many hours they worked per day. Their responses, in hours, are as
follows:
5 6 3 3 2 4 7 5 2 3 5 6 5 4 4 3 5 2 5 3
Table below lists the different data values in ascending order and their frequencies.

Relative Frequency:
A relative frequency is the ______ (fraction or proportion) of the number of times a value of the data
_______ in the set of all outcomes to the ______ number of outcomes. To find the relative
frequencies, ________ each frequency by the _______number of students in the sample–in this
case, 20. Relative frequencies can be written as ___________, ____________, or __________.

Cumulative Relative Frequency:

Cumulative relative frequency is the _________________ of the previous relative frequencies. To
find the cumulative relative frequencies, ______ all the previous relative frequencies to the relative
frequency for the current row, as shown in Table. The sum of the values in the relative frequency
column of the table is_____. The last entry of the cumulative relative frequency column is _____,
indicating that ________________ percent of the data has been accumulated.
Experiment:
The purpose of an experiment is to investigate the relationship between two variables. When one
variable causes change in another, we call the first variable the ________________ __________.
The affected variable is called the __________ ____________ In a randomized experiment, the
researcher manipulates values of the explanatory variable and measures the resulting changes in the
response variable. The different values of the explanatory variable are called _________________.
An ________________ ________ is a single object or individual to be measured.

Example:
Researchers want to investigate whether taking aspirin regularly reduces the risk of heart attack. Four
hundred men between the ages of 50 and 84 are recruited as participants. The men are divided
randomly into two groups: one group will take aspirin, and the other group will take a placebo. Each
man takes one pill each day for three years, but he does not know whether he is taking aspirin or the
placebo. At the end of the study, researchers count the number of men in each group who have had
heart attacks.

● What is the population?

● What is the sample?

● What is the experimental units?

● What is the explanatory variable?

● What is the response variable?

● What is the treatments?

Ethics:
The U.S. Department of Health and Human Services oversees federal regulations of research studies
with the aim of protecting participants. When a university or other research institution engages in
research, it must ensure the ________ of all human subjects. For this reason, research institutions
establish oversight committees known as Institutional Review Boards _________. All planned
studies must be _________in advance by the IRB. Key protections that are mandated by law include
the following:
• Risks to participants must be ____________ and ___________with respect to projected
benefits.
• Participants must give _________ consent. This means that the risks of participation must be
________ explained to the subjects of the study. Subjects must consent in __________, and
researchers are required to keep documentation of their consent.
• Data collected from individuals must be guarded carefully to protect their privacy.

Hypothesis Testing: An Intuitive Guide for Making Data Driven Decisions
From Everand
Hypothesis Testing: An Intuitive Guide for Making Data Driven Decisions
Jim Frost
No ratings yet
M.sc. Dissertation
No ratings yet
M.sc. Dissertation
68 pages
Ch 1 Lecture Notes
No ratings yet
Ch 1 Lecture Notes
10 pages
Chapter1 Stats
No ratings yet
Chapter1 Stats
7 pages
CHAPTER 1 and 2
No ratings yet
CHAPTER 1 and 2
18 pages
Statistics for Business and Economics
No ratings yet
Statistics for Business and Economics
6 pages
Chapter 1: Introduction To Statistics
No ratings yet
Chapter 1: Introduction To Statistics
40 pages
Elementary Statistics and Probability Chapter 1 3
No ratings yet
Elementary Statistics and Probability Chapter 1 3
5 pages
ITC REPORT 2
No ratings yet
ITC REPORT 2
30 pages
Chapter 1-Introduction To Statistics
No ratings yet
Chapter 1-Introduction To Statistics
14 pages
Basic Statistics Data Management & Sampling GED0103
No ratings yet
Basic Statistics Data Management & Sampling GED0103
36 pages
Chapter 2 Statistical Data and Sampling
No ratings yet
Chapter 2 Statistical Data and Sampling
8 pages
MATH30-6-Lecture-1-1
No ratings yet
MATH30-6-Lecture-1-1
32 pages
Topic 03 - Basic Statistics
No ratings yet
Topic 03 - Basic Statistics
42 pages
Week 1: To Statistics: 1.1 An Overview of Statistics 1.2 Data Classification 1.3 Sampling Technique and Data Collection
No ratings yet
Week 1: To Statistics: 1.1 An Overview of Statistics 1.2 Data Classification 1.3 Sampling Technique and Data Collection
27 pages
Statistics Class Work # 1-3
No ratings yet
Statistics Class Work # 1-3
8 pages
Mat105 Study Guide
No ratings yet
Mat105 Study Guide
14 pages
Stats Week 1 - Notes
No ratings yet
Stats Week 1 - Notes
7 pages
Chapter 1
No ratings yet
Chapter 1
38 pages
Icte Lesson
No ratings yet
Icte Lesson
19 pages
MATH 103 Module 1 - Statistics Introduction
No ratings yet
MATH 103 Module 1 - Statistics Introduction
30 pages
Probability and Statistics Lesson 1 2
No ratings yet
Probability and Statistics Lesson 1 2
47 pages
NSTA 51516 Slides
No ratings yet
NSTA 51516 Slides
97 pages
Summary of Lectures
No ratings yet
Summary of Lectures
36 pages
Statistics and Propability 1
No ratings yet
Statistics and Propability 1
35 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
18 pages
EM-104-Module
No ratings yet
EM-104-Module
12 pages
Basic Business Statistics: Introduction and Data Collection
No ratings yet
Basic Business Statistics: Introduction and Data Collection
33 pages
Lecture Notes - Prob and Stat
No ratings yet
Lecture Notes - Prob and Stat
229 pages
Data Anal Notes
No ratings yet
Data Anal Notes
10 pages
Chapter 1 INTRODUCTION TO STATISTICS (New)
No ratings yet
Chapter 1 INTRODUCTION TO STATISTICS (New)
34 pages
Chapter 2 Review - Collecting Data SOLUTIONS
No ratings yet
Chapter 2 Review - Collecting Data SOLUTIONS
12 pages
ENGDAN 203 Engineering Data Analysis Topic 1
No ratings yet
ENGDAN 203 Engineering Data Analysis Topic 1
5 pages
02 Review On Statistics
No ratings yet
02 Review On Statistics
20 pages
ES_Chapter 1-1
No ratings yet
ES_Chapter 1-1
31 pages
Statistics - MMW
No ratings yet
Statistics - MMW
15 pages
ch01 Descriptive Statistics
No ratings yet
ch01 Descriptive Statistics
48 pages
Business STAT 2 Class Lectures
No ratings yet
Business STAT 2 Class Lectures
15 pages
Module 1&2 - Statistics Q1W1
No ratings yet
Module 1&2 - Statistics Q1W1
27 pages
PLSC214 Topic 1
No ratings yet
PLSC214 Topic 1
37 pages
Statistics 2023
No ratings yet
Statistics 2023
62 pages
Chapter 1 Statistics: Case Study 1.1
No ratings yet
Chapter 1 Statistics: Case Study 1.1
5 pages
EDA-Lecture 1
No ratings yet
EDA-Lecture 1
52 pages
STATISTICS
No ratings yet
STATISTICS
12 pages
4.1 Statistics Toolkit
No ratings yet
4.1 Statistics Toolkit
35 pages
Statictic Sammy CORRECTED3
No ratings yet
Statictic Sammy CORRECTED3
57 pages
1A Sources of Data
No ratings yet
1A Sources of Data
15 pages
Nature of Statistic
No ratings yet
Nature of Statistic
83 pages
6 Sampling and Basic Descriptive Statistics
No ratings yet
6 Sampling and Basic Descriptive Statistics
38 pages
BUSINESS STATISTICS
No ratings yet
BUSINESS STATISTICS
9 pages
Unit 4 Statistics
No ratings yet
Unit 4 Statistics
33 pages
Chapter Goals: After Completing This Chapter, You Should Be Able To
No ratings yet
Chapter Goals: After Completing This Chapter, You Should Be Able To
32 pages
Statistics and Basic terms
No ratings yet
Statistics and Basic terms
10 pages
Applied Mathematics Notes
No ratings yet
Applied Mathematics Notes
31 pages
Introduction to Biostatistics
No ratings yet
Introduction to Biostatistics
67 pages
Branches of Statistics Quantitative and Qualitative Data
No ratings yet
Branches of Statistics Quantitative and Qualitative Data
2 pages
Matht Reviewer
No ratings yet
Matht Reviewer
7 pages
AE 9 FINAL EXAM
No ratings yet
AE 9 FINAL EXAM
4 pages
Statistics and Data Management
No ratings yet
Statistics and Data Management
8 pages
AA SL - Unit 1a - Representing Data (Statistics)
No ratings yet
AA SL - Unit 1a - Representing Data (Statistics)
74 pages
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
3 Thermal Printer Mechanism PDF
No ratings yet
3 Thermal Printer Mechanism PDF
21 pages
Protection Board
0% (1)
Protection Board
2 pages
Detained+Asylum+Process 1
No ratings yet
Detained+Asylum+Process 1
11 pages
Labor Law Exam
No ratings yet
Labor Law Exam
4 pages
Artificial Intelligence in Medicine 1st Edition Thompson Stephan download
100% (3)
Artificial Intelligence in Medicine 1st Edition Thompson Stephan download
77 pages
Homeostasis Nitrogen Cycle Close Read Rev
No ratings yet
Homeostasis Nitrogen Cycle Close Read Rev
12 pages
Instant Download Make It Jane Bull PDF All Chapters
100% (14)
Instant Download Make It Jane Bull PDF All Chapters
70 pages
INCOMPLETEDOMINANCE
No ratings yet
INCOMPLETEDOMINANCE
37 pages
BT Phone Big Button 4000 Manual en
No ratings yet
BT Phone Big Button 4000 Manual en
64 pages
Download ebooks file House of Glass - Dual Time Mystery Romance Merryn Allingham Et El all chapters
100% (1)
Download ebooks file House of Glass - Dual Time Mystery Romance Merryn Allingham Et El all chapters
40 pages
Stats
No ratings yet
Stats
5 pages
STca Merged
No ratings yet
STca Merged
105 pages
Wholesale Food Business Plan
100% (1)
Wholesale Food Business Plan
6 pages
Confined Space (SOP-CM-PI-034) 밀폐공간 배관작업-영어
No ratings yet
Confined Space (SOP-CM-PI-034) 밀폐공간 배관작업-영어
5 pages
Key Stage 4 Biology
No ratings yet
Key Stage 4 Biology
47 pages
5SYA 2039 - Mounting Instructions For HiPak
No ratings yet
5SYA 2039 - Mounting Instructions For HiPak
7 pages
4daymaximum 0
No ratings yet
4daymaximum 0
1 page
Company Profile
No ratings yet
Company Profile
100 pages
Limited Power of Attorney Form - Texas
No ratings yet
Limited Power of Attorney Form - Texas
1 page
Case Name Antonio Geluz, Petitioner, vs. The Hon. Court of Appeals and OSCAR LAZO, Respondents. Case No. - Date Ponente Facts
No ratings yet
Case Name Antonio Geluz, Petitioner, vs. The Hon. Court of Appeals and OSCAR LAZO, Respondents. Case No. - Date Ponente Facts
1 page
Screenshot 2024-01-17 at 9.09.07 AM
No ratings yet
Screenshot 2024-01-17 at 9.09.07 AM
14 pages
Enter Pre Nu Reship 12
No ratings yet
Enter Pre Nu Reship 12
8 pages
PO857971 - 27 - Pandamart - Platform (KHI)
No ratings yet
PO857971 - 27 - Pandamart - Platform (KHI)
2 pages
Handling POWs
No ratings yet
Handling POWs
124 pages
PRODUCT DEVELOPMENT MANAGEMENT - FINAL PROJECT - Part3
No ratings yet
PRODUCT DEVELOPMENT MANAGEMENT - FINAL PROJECT - Part3
2 pages
Application Form
No ratings yet
Application Form
5 pages
People Vs Espina
No ratings yet
People Vs Espina
1 page
Asthma During Pregnancy Ppt 1-2
No ratings yet
Asthma During Pregnancy Ppt 1-2
23 pages
Exercises For Developing Your Intuition
75% (4)
Exercises For Developing Your Intuition
3 pages

1 Chapter 1 Lecture Notes

Uploaded by

1 Chapter 1 Lecture Notes

Uploaded by

Standard 1:

Sampling and Data

Cluster Sampling Continuous Random Variable Control Group

Convenience Sampling Cumulative Relative Frequency Data

Discrete Random Variable Double-blinding Experimental Unit

Explanatory Variable Frequency Informed Consent

Institutional Consent Institutional Review Board Lurking Variable

Nonsampling Error Numerical Variable Parameter

Placebo Population Probability

Proportion Qualitative Data Quantitative Data

Random Sampling Relative Frequency Representative Sample

Response variable Sample Sampling Bias

Sampling Error Sampling with Replacement Sampling without Replacement

Statistics Stratified Sampling Systematic Sampling

Statistics is the science of _____________, ________________,

There are two types of Data:

Probability is a mathematical tool used to study ___________________.

In statistics we have a required common vocabulary that we must all use:

What is the population?

What is the sample?

What is the parameter?

What is the statistic?

Statistical Variables: a characteristics of ____________for each ___________ or __________ in a

● Categorical or Qualitative Data:

Qualitative data - are the result of ___________________ or ___________ attributes of a

What of the data set is quantitative discrete?

What of the data set is quantitative continuous?

What of the data set is qualitative?

With Replacement or Without Replacement:

Sampling Errors and Non-Sampling Errors:

● The sample may not be large enough.

Variation is present in ________ set of data.

Answers and Rounding off in Statistics:

• _________ scale level

• ________ scale level

• ________ scale level

• ________ scale level

Ordinal Yes Yes No No

Interval Yes Yes Yes No

Ratio Yes Yes Yes Yes

Cumulative Relative Frequency:

● What is the population?

● What is the sample?

● What is the experimental units?

● What is the explanatory variable?

● What is the response variable?

● What is the treatments?

You might also like

Statistics is the science of _, ____,

Statistical Variables: a characteristics of ______for each _ or ______ in a

Qualitative data - are the result of _________ or _ attributes of a