Introduction To Statistics. An Overview of Statistics
Introduction To Statistics. An Overview of Statistics
Leahy
Example 1: A survey of 2000 third- to twelfth-grade students found that they devoted an average of 7 hours
and 38 minutes each day to using electronic media.
Descriptive:
Make an inference:
_______________________.
Example 3:
Would you collect data using a population (a census) or a sample (a sampling) to determine the following?
Why?
d) Amount of time high school students spend on their cell phones each week.
Example 4:
Determine if each of the following is a population parameter or a sample statistic.
a) A survey of 1000 U.S. adults found that 40% think that the Internet is the best way to get news and
information.
b) At a college, 90% of the members of the Board of Trustees approved the contract of the new president.
c) A survey of 733 small business owners found that 17% have a current job opening.
d) The 2182 students who accepted admission offers to Northwestern University in 2009 have an average
SAT score of 1442. (Source: Northwestern University)
-2-
Example 5: In a recent survey, 1500 adults in the United States were asked if they thought
there was solid evidence for global warming. Eight hundred fifty-five of the adults said yes.
(Adapted from: Pew Research Center)
Identify the population and the sample.
Example 6: A large sample of men, aged 48, was studied for 18 years. For unmarried men, approximately 70%
were alive at age 65. For married men, 90% were alive at age 65. (Source: The Journal of Family Issues)
TYPES OF DATA:
Variables can be quantitative: _________________ measurements or counts for
which operations such as addition or averaging make sense.
-3-
Levels of Measurement: What statistical calculations make sense with this particular data?
-4-
*Great Charts on Page 12 of Textbook* A great read on Levels
of measurement:
Example 2:
g) Classroom temperature in °F
**
i) Time of first class (Example: 8:00AM)
-5-
1.3 (Part 1) Experiment Design Three Questions For Statistics:
GOAL: Collect data and use the data to make a decision.
Simulation
Use a mathematical or physical model to reproduce the condition of a situation or process. Useful
when the actual condition would be dangerous or impractical.
Survey
Data is collected by people asking questions. Extra care must be taken in survey design to prevent
biased questioning.
b) Study of the amount of time college students taking a full course load spend watching television.
c) Study of the effect on bone mass of a calcium supplement given to young girls.
g) Study of how often preschoolers choose gender-neutral toys over gender-specific toys.
-6-
HOW TO SET UP AN EXPERIMENT
1. CONTROL
Potential Problems:
Confounding Variables – An experimenter
cannot tell the difference between the effects
of different factors on the variable.
This can ruin an experiment!
2. RANDOMIZATION
Subjects are randomly selected into groups to prevent bias.
3. REPLICATION
BIASED QUESTIONS
These questions are bad. Can you improve them?
-7-
Sampling Techniques
We must be able to create a RANDOM SAMPLE for our experiments, surveys, and samplings. There are several
techniques for creating a random sample.
-8-
Sampling Technique: Systematic Sampling
Features: 1. Assign each element in the population a sequential number, or arrange the elements
in a natural sequential order.
2. Select a random starting point and then select every kth element for the sample.
Examples:
People are lined up to buy tickets to a rock
concert. Every 5th person is chosen for the
sample. The first person is randomly selected
from the first 5 people.
Features: 1. Divide population into (at least 2) groups or classes (strata) that share common
characteristics
2. A simple random sample (SRS) of a certain size is drawn from each stratum
3. Information is carefully adjusted/weighted based on size of stratum
Examples:
A population of all undergraduate
college students is divided into four
strata: freshman, sophomores, juniors,
and seniors. Ten students from each
strata are randomly selected and their
GPAs are recorded.
Every cardholder at the public library is divided into three strata: men, women, and children under 12.
Four cardholders from each strata are selected at random and asked how many times they visit the library
each month.
-9-
Sampling Technique: Cluster Sampling
Example:
We want to take a survey of all the school
children in a large city. Randomly select 4
schools and then include all the children from
each selected school in the sample.
Features: 1. Create a sample by using data from population members that are readily available.
2. Sample runs the risk of being extremely biased
Example:
Newspaper reported interviews the first 5 people leaving a local music score about their opinion on a tax
to support the local symphony.
-10-
Example 3: Identify the sampling technique used: Simple Random Sample (SRS), Convenience, Systematic,
Stratified, or Cluster.
a) A Los Angeles Times reporter gets a reaction to a breaking story by polling people as they pass the
front of his office building.
b) Dr. Smith randomly selects 5 students in his class by assigning each a number and drawing five
numbers from a hat.
c) The Orange County Commissioner of Jurors obtains a list of 55,014 car owners and constructs a group
of possible jurors by selecting every 50th name on the list.
d) In a Harris poll of 1,011 adults, the interview subjects were selected by using a computer to randomly
generate telephone numbers that were then called.
e) A Ford Motor Company researcher has partitioned all registered cars into categories of compact, mid-
sized, and family-size. He is surveying 75 car owners from each category.
f) Motivated by a student who died from binge drinking, Indiana State conducts a study of student
drinking by randomly selecting 10 different classes and interviewing all of the students in each of those
classes.
g) A statistics student obtains height/weight data by interviewing the members of his fraternity.
h) A UCLA researcher surveys all cardiac patients in each of 30 randomly selected hospitals.
-11-