Lecture 1 Data Overview and Introduction To SPSS VJU
Lecture 1 Data Overview and Introduction To SPSS VJU
1-2
Uses of Statistics
One of the tools used to make decisions in business.
We apply statistical concepts in our lives.
As a student of business or economics, basic
knowledge and skills to organize, analyze, and
transform data and to present the information.
1-3
LO1-1
1-4
LO1-1
LO1
Understanding of Statistics Helps us
Make Informed Decisions
In making informed decisions, you need to:
1. Determine if the existing information is adequate or
additional information is required.
2. Gather additional information, if needed, in such a way that it
does not provide misleading results.
3. Summarize the information in a useful and informative
manner.
4. Analyze the available information.
5. Draw conclusions and make inferences while assessing the risk
of an incorrect conclusion.
1-5
LO1-1
LO1
Collect
Organize
Present
Interpret
1-6
Who Uses Statistics?
1-7
Types of Statistics – Descriptive Statistics
and Inferential Statistics
1-9
Population vs. Sample
A population is the entire set of individuals or objects of interest, or
the measurements obtained from all individuals or objects of
interest.
1-10
Why Take a Sample Instead of Studying Every
Member of the Population?
1-11
Usefulness of a Sample in Learning about a
Population
Using a sample to learn something about a
population is done extensively in business,
agriculture, politics, and government.
1-12
Types of Variables
A. Qualitative or attribute variable - the characteristic
being studied is nonnumeric.
EXAMPLES: gender, religious affiliation, type of automobile
owned, state of birth, and eye color are examples.
1-14
Summary of Types of Variables
1-15
Four Levels of
Measurement
Nominal level – data that is Interval level – similar to the
classified into categories and ordinal level, with the
cannot be arranged in any additional property that
particular order. meaningful amounts of
differences between data
values can be determined.
There is no natural zero point.
1-16
Nominal-Level Data
Properties:
1. Observations of a qualitative variable can
only be classified and counted.
2. There is no particular order to the labels.
1-17
Ordinal-Level Data
Properties:
1. Data classifications are
represented by sets of
labels or names (high,
medium, low) that have
relative values.
2. Because of the relative
values, the data classified
can be ranked or ordered.
1-18
Interval-Level Data
Properties:
1. Data classifications are ordered according to the amount of the
characteristic they possess.
2. Equal differences in the characteristic are represented by equal
differences in the measurements.
1-19
Ratio-Level Data
Practically all quantitative data is recorded on the ratio level
of measurement.
Ratio level is the “highest” level of measurement.
Properties:
1. Data classifications are ordered according to the amount of the
characteristics they possess.
2. Equal differences in the characteristic are represented by equal
differences in the numbers assigned to the classifications.
3. The zero point is the absence of the characteristic, and the ratio between
two numbers is meaningful.
1-20
Why Know the Level of Measurement of a Data?
1-21
Summary of the Characteristics for Levels of
Measurement
1-22
Ethics and Statistics
1-23
Computer Applications
1-24
Tutorial – Measurement level in SPSS
Distinguish 4 types of scales: Nominal,
Ordinal, Interval, and Ratio
Introduction to SPSS
Application of 4 types of measurement scales
on SPSS software
Data entry in Ms. Excel
Export data from Excel to SPSS
References
26
DATA PROCESSING
Coding:
Except for some quantitative data (in the form of numbers),
which does not need to be encoded, qualitative data (not in the
form of numbers) needs to be converted into numbers.
Data Entry:
Data is entered and stored by at least two different data entry
people. In reality, entering data from the questionnaire into the
computer is entered twice.
You should enter data in Ms Excel and export them to statistical
softwares for analysis such as SPSS, STATA, Eviews
Làm sạch dữ liệu:
Check missing data, check outliers.
27
Design Questionaire – Interviewing employees
Name
Gender
Occupation
Educational level
Income
Expenditure
Level of job preference
28
Data Entry in Excel without coding
Ordinal_number Malee Gender Education Occupation Income Expenditure Job_Pref
1 Le Hoai Anh Female Primary School Farming 10.0 6.0 Very Hate
2 Nguyen Hung Anh Male Secondry School Off-farm 11.0 6.5 Hate
3 Nguyen Trung Anh Male High School Farming 7.0 4.8 Normal
4 Pham Ngoc Anh Female Above High School Off-farm 14.0 9.0 Very Like
5 Nguyen Thi Nguyet Female High School Farming 5.0 3.8 Like
6 Chu Linh Chi Female Secondry School Farming 4.0 2.3 Hate
7 Tran Thi Kim Chi Female Primary School Farming 5.5 3.2 Like
8 Kim Duc Dung Male Above High School Off-farm 8.0 4.8 Very Like
9 Nguyen Cong Duong Male High School Off-farm 7.0 4.5 Normal
10 Nguyen Tien Dat Male Secondry School Farming 2.5 1.8 Very Hate
11 Le Ha Minh Duc Male Primary School Farming 6.5 4.0 Normal
12 Nguyen Thu Ha Female Above High School Farming 4.0 2.5 Like
13 Vu Thi Hong Hanh Female Above High School Off-farm 18.0 9.5 Very Like
14 Nguyen Minh Hien Male High School Off-farm 4.0 2.5 Hate
15 Nguyen do Phuong Male Secondry School Off-farm 5.0 3.0 Hate
16 Tran Viet Hoang Male Secondry School Farming 3.0 2.0 Very Hate
29
Codebook
Gender Male= 1; Female = 0
31
Introduction to IBM SPSS
Data view
33
Variable view
34
Create Data file in SPSS
35
Create Data file in SPSS
36
Create Data file in SPSS
37
Create Data file in SPSS
38
Import Data from Ms Excel
39
Import Data from Ms Excel
40
Import Data from Ms Excel
41
Import Data from Ms Excel
42
Import Data from Ms Excel
43
Import Data from Ms Excel
44
Import Data from Ms Excel
45
Import Data from Ms Excel
46
Data cleaning– (Checking outliers and data missing)
47
Data cleaning– (Checking outliers and data missing)
48
Làm sạch dữ liệu – Kiểm tra số dị biệt (outliers)
49