01 Introduction
01 Introduction
Statistics
§ Define statistics
§ Become aware of a wide range of
applications of statistics in business
§ Differentiate between descriptive and
inferential statistics
§ Classify numbers by level of data and
understand why doing so is important
§ A branch of mathematics taking and
transforming numbers into useful information
for decision makers
DATA
Data are the different values associated with a
variable.
OPERATIONAL DEFINITIONS
Data values are meaningless unless their variables
have operational definitions, universally accepted
meanings that are clear to all associated with an
analysis.
POPULATION
A population consists of all the items or individuals about which
you want to draw a conclusion.
SAMPLE
A sample is the portion of a population selected for analysis.
PARAMETER
A parameter is a numerical measure that describes a
characteristic of a population.
STATISTIC
A statistic is a numerical measure that describes a characteristic
of a sample.
POPULATION VS. SAMPLE
Population Sample
RD1 Red 12
RD2 Red 10
RD3 Red 13
RD4 Red 10
RD5 Red 13
BL1 Blue 27
BL2 Blue 24
GR1 Green 35
GR2 Green 35
GY1 Gray 15
GY2 Gray 18
GY3 Gray 17
Identifier Color MPG
RD2 Red 10
RD5 Red 13
GR1 Green 35
GY2 Gray 18
§ Data are the facts and figures collected,
summarized, analyzed, and interpreted.
§ The data collected in a particular study
are referred to as the data set.
§ OPERATIONAL DEFINITIONS
§ Data values are meaningless unless their
variables have operational definitions,
universally accepted meanings that are
clear to all associated with an analysis.
§ The elements are the entities on which data are
collected.
§ A variable is a characteristic of interest for the
elements.
§ The set
of measurements collected for a particular
element is called an observation.
§ A data set with n elements contains n observations.
§ The total number of datavalues in a complete data
set is the number of elements multiplied by the
number of variables.
Observation
Variables
Stock Annual Earn/
Company Exchange Sales($M) Sh.($)
Dataram AMEX 73.10 0.86
EnergySouth OTC 74.00 1.67
Keystone NYSE 365.70 0.86
LandCare NYSE 111.40 0.33
Psychemedics AMEX 17.60 0.13
§ Present data
§ e.g., Tables and graphs
§ Characterize data åX i
n
§ e.g., Sample mean =
Descriptive Statistics organizing,
summarizing, and presenting data in an
informative way.
EXAMPLE 1: EXAMPLE 2:
The average test score for According to Consumer
the students in a class, to Reports, there were 2.5
give a descriptive sense of problems per one
the typical scores and their copying machines
variation. reported during 2006.
TYPES OF STATISTICS
§ Descriptive statistics summarize/
characterize the population data by
describing what was observed in the sample
numerically (tabular) or graphically.
§ Numerical descriptors include mean and
standard deviation for continuous data types
(like heights or weights), while frequency
and percentage are more useful in terms of
describing categorical data (like race).
§ Estimation
§ e.g., Estimate the population mean
weight using the sample mean weight
§ Hypothesis testing
§ e.g., Test
the claim that the population
mean weight is 120 pounds
State of
Type of car Birth
Types of Variables
In a Quantitative Variable information is
reported numerically.
Number of children in a
family
• For each of the following, indicate whether
the appropriate variable would be qualitative
or quantitative. If the variable is quantitative,
indicate whether it would be discrete or
continuous.
a) Whether you own a • Qualitative Variable
Panasonic television § two levels: yes/no
set § no measurement
• Quantitative, Continuous
e) Sam’s travel time Variable
from his dorm to § any number
§ time is measured
the student union
§ can take on any value greater
than zero
f) The number of • Quantitative, Discrete
students on Variable
campus who §a countable number
belong to a social § only whole numbers
fraternity or
sorority
Nominal level Data are Gender
classified into categories. But
the ordering of categories is
not meaningful. These are:
–Identifier or name
–Unranked categorization
•Example: gender, car color
Eye
Color
Category of Nominal level variables must
be:
Mutually exclusive
ALL the individual (or object or
measurement) must appear in ONLY ONE
category.
Exhaustive
ALL the individual (or object or
measurement) must appear in
AT LEAST ONE of the categories.
Ordinal level: Orders are meaningful, but
differences are not meaningful.
During a taste test of
4 soft drinks, Coca
Cola was ranked 4
number 1, Dr. Pepper 2
number 2, Pepsi
number 3, and Root
Beer number 4. Can 3
we say Coca Cola is 2 1
better then Pepsi?
§ Ordinal data
§ All characteristics of nominal data plus…
§ Rank-order categories
§ Ranks are relative to each other
§ Example: Low (1), moderate (2) or high (3) risk
Interval level
Both the orders and differences are
meaningful but the ratio is not.
Temperature on the
Fahrenheit scale.
§ All of the characteristics of ordinal data
plus…