Chapter 1 Introduction To Statistics
Chapter 1 Introduction To Statistics
INTRODUCTION TO STATISTICS
POINTS TO HIGHLIGHT
3
I. WHAT IS STATISTICS?
1. Why study statistics?
●Being informed
4
I. WHAT IS STATISTICS?
5
I. WHAT IS STATISTICS?
Example:
●How should you select an online seller in Ebay based on their
6
I. WHAT IS STATISTICS?
1. Why study statistics?
● Making informed Judgments
Your ability to make informed judgments
Decide whether existing information is adequate or
whether additional information is required
If necessary, collect more information in a reasonable
and thoughtful way
Summarize the available data in a useful and informative
manner
Analyze the available data
Draw conclusion, make decision, and assess the risk of7 an
incorrect information
I. WHAT IS STATISTICS?
8
I. WHAT IS STATISTICS?
10
I. WHAT IS STATISTICS?
- Furthermore:
- Collect
- describe
- summarize
- present
- analyze
11
STATISTICAL ANALYSIS PROCESS
Pose a question/ Decide what to
problem measure and Collecting data
how to measure?
Summarizing
data
Interpreting Analyzing
12
results data
EXAMPLES
● Customers’ satisfaction toward a product/service
provided by a company
● Customers’ online shopping behavior
● Emotional intelligence and Job performance
13
APPLICATIONS IN BUSINESS AND ECONOMICS
Economics
Economists use statistical information in
making forecasts about the future of the
economy or some aspects of it.
14
APPLICATIONS IN BUSINESS AND
ECONOMICS
Marketing
Electronic point-of-sale scanners at retail
checkout counters are used to collect data for
a variety of marketing research applications.
15
APPLICATIONS IN BUSINESS AND
ECONOMICS
Example: one Midwest grocery chain analyze
local buying patterns. They discovered that
when men bought diapers on Thursdays and
Saturdays, they also tended to buy beer. The
grocery chain could use this information in
various ways to increase revenue. For
example, they could move the beer display
closer to the diaper display
16
APPLICATIONS IN BUSINESS AND
ECONOMICS
Finance
Financial advisors use price-earnings ratios and dividend yields
to guide their investment recommendations.
17
JOBS OF STATISTICS
18
MAKING SENSE OF NUMERICAL INFORMATION
19
DEALING WITH UNCERTAINTY
20
SAMPLING
E.g: Before bringing a new product to market, market
research survey about the likely level of demand of this
product maybe undertaken?
should the survey cover all potential buyers (population)?
21
Sampling
ANALYZING RELATIONSHIPS
● Let’s consider some examples below:
(i) Does the growth rate of money supply influence the inflation
rate?
(ii) If the price of a product rise by 5%, what is the effect on the
sales of this product?
22
FORECASTING
23
DECISION MAKING IN AN UNCERTAIN
ENVIRONMENT
24
II/ KEY CONCEPTS IN STATISTICS
The frame
A complete listing of all elementary units relevant to a statistical
investigation
25
II/ KEY CONCEPTS IN STATISTICS
2/ Variables and Data
Variables
Characteristics of interest of elementary units
Data
A single observation about a specified characteristic of interest is
called a datum.
Any collection of observations about one or more characteristics of
interest, for one or more elementary units, is called a data set.
A data set may be univariate, bivariate, multivariate
26
EXAMPLE: ASEAN COUNTRIES (2016 DATA)
Population GDP GDP per
Total Area
Country (million (billion capital
(km2)
persons) USD) (USD)
Vietnam 92.701 202.62 2371 332.698
Laos 6.758 15.90 1921 237.955
Malaysia 31.187 296.36 12127 330.803
Thailand 68.864 406.84 5938 513.120
Singapore 5.607 296.97 53053 719,1
Phillipines 103.320 304.90 3280 300.000
Indonesia 261.115 932.26 3635 1.904.569
27
II/ KEY CONCEPTS IN STATISTICS
3/ Population vs Sample
Population
is the WHOLE group of all elementary units of interest
Sample
is a subset of data drawn from the population.
28
Population vs. Sample
Population Sample
a b cd b c
ef gh i jk l m n gi n
o p q rs t u v w o r u
x y z y
29
3
0
Statistics
31
DESCRIPTIVE STATISTICS
- Collect data
- Present data
e.g., Charts and graphs
- Characterize data x i
e.g., Calculate mean = n
32
INFERENTIAL STATISTICS
● Procedures used to draw conclusions or inferences about
the characteristics of a population from information
obtained from the sample.
● Making estimates, testing hypothesis…
● Used when we can not enumerate the whole population
33
INFERENTIAL STATISTICS
Population parameters
Sample statistics
(unknown, but can be
(known) estimated from sample
Inference
evidence
Sample Population
34
INFERENTIAL STATISTICS
Drawing conclusions and/or making decisions
concerning a population based on sample results.
● Estimation
Data
Data can
can be
be classified
classified as
as being
being qualitative
qualitative or
or quantitative.
quantitative.
Depends
Depends on
on whether
whether the
the data
data are
are qualitative
qualitative or
or quantitative,
quantitative,
we
we choose
choose the
the most
most appropriate
appropriate statistical
statistical methods
methods
Gender
Height
Ages Quantitative
Student Evaluation
Grades
QUALITATIVE DATA
38
3
9
EXAMPLES
● Gender:
1. Male 2. Female
● Eye colors:
1.Brown 2.Black 3.Blue 4.Green
● Marital status:
1. Single
2. Married
3. Divorced
4. Widowed
QUANTITATIVE DATA
● Quantitative
●
Quantitativedata
dataisisdata
datathat
thatisisnormally
normallyexpressed
expressednumerically.
numerically.ItItindicates
indicateshow
howmany
manyor
orhow
howmuch:
much:
E.g.
(i) The number of students in a class
(ii) The number of correct answers in a test
(iii)People’s height, weight; students’ GPA
41
4/ SCALES OF MEASUREMENT
Scales
Scales of
of measurement
measurement include:
include:
Nominal Interval
Ordinal Ratio
The
The scale
scale determines
determines thethe amount
amount of
of information
information
contained
contained inin the
the data.
data.
The
The scale
scale indicates
indicates the
the data
data summarization
summarization and
and
statistical
statistical methods
methods that
that are
are most
most appropriate.
appropriate.
4/ SCALES OF MEASUREMENT
Highest Level
Measurements
Ratio/Interval Scale Complete Analysis
Data
Data are
are labels
labels or
or names
names used
used to
to identify
identify aa
characteristic
characteristic of
of the
the elementary
elementary units.
units.
Nominal
There
There isis no
no relative
relative order
order or
or rank
rank between
between these
these
data
data categories
categories
Numeric
Numeric codes
codes are
are assigned
assigned for
for each
each data
data category.
category.
SCALES OF MEASUREMENT
Nominal
ItIt never
never makes
makes sense
sense to
to add,
add, subtract,
subtract, multiply,
multiply,
divide,
divide, rank,
rank, average
average or
or manipulate
manipulate
Used
Used to
to count
count frequency
frequency of
of variables
variables outcomes
outcomes
E.g:
E.g: Numbers
Numbers assigned
assigned for
for aa person’s
person’s gender
gender
Or
Or numbers
numbers assigned
assigned for
for aa person’s
person’s marital
marital status
status
EXAMPLE
Students
Students of
of aa university
university are
are classified
classified by
by the
the school
school in
in which
which
they
they are
are enrolled
enrolled such
such as
as Business,
Business, Humanities,
Humanities, Education,
Education, and
and
so
so on.
on.
AA numeric
numeric code
code could
could be
be used
used for
for the
the school
school variable
variable (e.g.
(e.g. 11
denotes
denotes Business,
Business, 22 denotes
denotes Humanities,
Humanities, 33 denotes
denotes Education,
Education,
and
and so
so on).
on).
47
8
SCALES OF MEASUREMENT
Ordinal
The
The data
data have
have the
the properties
properties of of nominal
nominal data
data and
and
the
the order
order or
or rank
rank of
of the
the data
data isis meaningful.
meaningful.
Numeric
Numeric codes
codes may
may be
be used
used which
which do
do indicate
indicate the
the
rank
rank // order
order of
of data
data categories.
categories.
The
The gap
gap between
between numbers
numbers or
or units
units on
on this
this scale
scale
doesn’t
doesn’t mean
mean equal
equal magnitude
magnitude between
between variable
variable
outcomes
outcomes
SCALES OF MEASUREMENT
Ordinal
49
EXAMPLE
In
In aa running
running or
or swimming
swimming competition
competition or or race,
race, finishers
finishers are
are
ranked
ranked from
from the
the first
first place
place to
to the
the last
last place.
place. Let’s
Let’s assume
assume
that
that there
there were
were six
six people
people attending
attending this
this contest.
contest.
50
EXAMPLE
Let
Let 11 denotes
denotes thethe first
first place
place finishers,
finishers, 22 denotes
denotes thethe second
second
place
place finisher,
finisher, 33 denotes
denotes thethe third
third place
place finisher
finisher and
and so
so on
on
The
The gap
gap between
between these
these numbers
numbers does
does not
not indicate
indicate equal
equal
absolute
absolute magnitude
magnitude (especially
(especially number
number 11 and and number
number 2,2,
number
number 33 andand number
number 4) 4)
51
SCALES OF MEASUREMENT
Interval
Have
Have the
the properties
properties of of ordinal
ordinal data,
data, and
and
the
the intervals
intervals between
between numbers
numbers or or units
units on
on the
the scale
scale
are
are equal
equal over
over all
all level
level of
of the
the scale
scale
Interval
Interval scales
scales provide
provide more
more quantitative
quantitative
information
information
There
There isis no
no zero
zero value
value that
that indicates
indicates
that
that nothing
nothing exists
exists for
for the
the variable
variable at
at the
the zero
zero point.
point. 52
SCALES OF MEASUREMENT
Interval
Addition
Addition and
and subtraction
subtraction are
are permissible
permissible but
but
multiplication
multiplication and
and division
division continue
continue to
to make
make nono
sense
sense
Example: Melissa has an SAT score of 1205, while Kevin has an
SAT score of 1090. Melissa scored 115 points more than
Kevin.
53
EXAMPLE
● Consider the centigrade scale for measuring temperature. This
scale has properties of interval scale due to the following
reasons:
- Each value on the scale can be ranked as larger or smaller than
any other value.
- It has the equal intervals property because the scale is made up
of equal units.
- It has no zero point (water freezes at zero degrees centigrade,
but temperatures can get colder than that).
54
SCALES OF MEASUREMENT
Ratio
Have
Have all
all the
the properties
properties of
of interval
interval data
data
and
and the
the ratio
ratio of
of two
two values
values isis meaningful.
meaningful.
This
This scale
scale must
must contain
contain aa zero
zero value
value that
that indicates
indicates
that
that nothing
nothing exists
exists for
for the
the variable
variable at
at the
the zero
zero point.
point.
Variables
Variables such
such as
as distance,
distance, height,
height, weight,
weight, and
and time
time
use
use the
the ratio
ratio scale.
scale.
55
SCALES OF MEASUREMENT
Ratio
All
All types
types of
of arithmetic
arithmetic operations,
operations, even
even multiplication
multiplication and
and
division
division can
can be
be performed
performed with
with such
such data
data
56
EXAMPLE
57
EXAMPLE
Survey “Students’ opinion on having two canteens inside university campus”
1/ Gender
☐ Male ☐ Female
2/ Your major
☐ Business administration ☐ Banking and Finance
☐ International Commerce ☐ Commercial Law
3/ Your monthly expenditure:
☐ < 1000K VND ☐ 1000K – 2000K VND
☐ 2000K – 5000K VND ☐ >= 5000K VND
4/ With the scores from 1-10, can you rate the services provided by FTU corner
Your score: ….
58
5/ Do you agree that having two canteens is not necessary in FTU
☐ ☐ ☐ ☐ ☐
Strongly Agree Neutral Disagree Strongly
agree Disagree
Requirements:
1/ Answer the questions on your own
2/ Identify the scale of measurement used for each question?
59
III/ DATA ANALYSIS PROCESS
Statistical Methods:
Finding existing data: from print to
Internet
Generating new data: observation,
61
experiment, survey
III/ DATA ANALYSIS PROCESS
After being collected, data should be
preliminarily analyzed
Summarizing data graphically and
numerically
Statistical Methods:
Presenting data: Tables, Charts
and Graphs
Presenting data: descriptive
62 statistic
III/ DATA ANALYSIS PROCESS
Statistical Methods:
Estimation, Hypothesis testing,
Variance analysis, Regression and
Correlation, Time series and Forecasti
Index numbers…
63
III/ DATA ANALYSIS PROCESS
Several questions should be addressed
Step 6: Interpretation of What conclusion can be drawn from th
analysis?
results
How do the results inform us about the
research problem or question?
How the results guide future research?
64
LEARN TO VIEW STATISTICS WITH A CRITICAL
EYE
● There are three kinds of lies…..
○ Lies
○ Damn Lies
○ Statistics
● You need to make statistics work for you, not lie for
you!
65
LEARN TO VIEW STATISTICS WITH A CRITICAL EYE
66
LEARN TO VIEW STATISTICS WITH A CRITICAL EYE
67
LEARN TO VIEW STATISTICS WITH A CRITICAL EYE
68
END OF CHAPTER 1
THANK YOU!
69