Gathering and Organizing Data

Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

Lesson 12-1 Gathering and Organizing Data

Objectives: Define data and statistics Explain the difference between a population and a sample. Describe four basic methods of sampling Construct a frequency distribution for a data set Draw a stem and leaf plot for a data set

A. Statistics ***Note: Videos begin with example 2!


Data are measurements or observations that are gathered for an event under study. Statistics is the branch of mathematics that involves collecting, organizing, summarizing, and presenting data and drawing general conclusions from that data. Populations and Samples When statistical studies are performed, we usually begin by identifying the population for the study. A population consists of all subjects under study. (i.e. all colleges in the United States) More often than not, its not realistic to gather data from every member of a population. A sample is a representative subgroup or subset of a population.

B. Sampling Methods
We will study four basic sampling methods: 1. In order to obtain a random sample, each subject of the population must have an equal chance of being selected. 2. A systematic sample is taken by numbering each member of the population and then selecting every kth member, where k is a natural number. When using systematic sampling, its important that the starting number is selected at random. 3. When a population is divided into groups where the members of each group have similar characteristics and members from each group are chosen at random, the result is called a stratified sample. 4. When an existing group of subjects that represent the population is used for a sample, it is called a cluster sample.

Lesson 12-1 Gathering and Organizing Data


1. A student in an education class is given an assignment to find out how late typical students at his campus stay up to study. He decides to stop by the union before his 9 A.M. class and ask everyone sitting at a table how late they were up studying the night before. (a) What method of sampling is he using? (b) Do you think hes likely to get a representative sample?

C. Descriptive vs. Inferential


There are two main branches of statistics: descriptive and inferential. Statistical techniques that are used to describe data are called descriptive statistics. For example, a researcher may wish to determine the average age of the full-time students enrolled in your college and the percentage who own automobiles. Statistical techniques used to make inferences are called inferential statistics. For example, every month the Bureau of Labor and Statistics estimates the number of people in the US who are unemployed. Since its would be impossible to survey everyone, they use a sample of adults to see what percent are unemployed. In this case, the information obtained from a sample is used to estimate a population measure.

Another area of inferential statistics is called hypothesis testing. A researcher tries to test a hypothesis to see if there is enough evidence to support it. A third aspect of inferential statistics is determining whether or not a relationship exists between two or more variables. This area of statistics is called correlation and regression.

Frequency Distributions
The data collected for a statistical study are called raw data. In order to describe situations and draw conclusions, the researcher must organize the data in a meaningful way. Two methods that we will use are frequency distributions and stem and leaf plots. The first type of frequency distributions that we will investigate is the categorical frequency distribution. This is used when the data are categorical rather than numerical.

Lesson 12-1 Gathering and Organizing Data


2. Twenty-five volunteers for a medical research study were given a blood test to obtain their blood types. The data follow. Construct a frequency distribution for the data.

Video

D. Frequency Distributions
Another type of frequency distribution that can be constructed uses numerical data and is called a grouped frequency distribution. In a grouped frequency distribution, the numerical data are divided into classes. When deciding on classes, here are some useful guidelines: 1. Try to keep the number of classes between 5 and 15. 2. Make sure the classes do not overlap. 3. Dont leave out any numbers between the lowest and highest, even if nothing falls into a particular class. 4. Make sure the range of numbers included in a class is the same for each one.

Lesson 12-1 Gathering and Organizing Data


3. These data represent the record high temperatures for each of the 50 states in degrees Fahrenheit. Construct a grouped frequency distribution for the data.

Video

Lesson 12-1 Gathering and Organizing Data

E. Stem and Leaf Plots


Another way to organize data is to use a stem and leaf plot (sometimes called a stem plot). Each data value or number is separated into two parts. For a two-digit number such as 53, the tens digit, 5, is called the stem, and the ones digit, 3, is called its leaf. For the number 72, the stem is 7, and the leaf is 2. For a three-digit number, say 138, the first two digits, 13, are used as the stem, and the third digit, 8, is used as the leaf.

4. The data below show the number of games won by the Chicago Cubs in each of the 21 seasons from 19882008, with the exception of 1994, which was a short season because of a player strike. Draw a stem and leaf plot for the data. 97 85 66 79 89 88 67 88 65 67 90 68 76 73 84 78 77 77 93 77 Video

You might also like