0% found this document useful (0 votes)
30 views49 pages

Lecture 1 Data Overview and Introduction To SPSS VJU

Uploaded by

doandn77uel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views49 pages

Lecture 1 Data Overview and Introduction To SPSS VJU

Uploaded by

doandn77uel
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 49

Data Overview and

Introduction to IBM SPSS


Lecture 01
LEARNING OBJECTIVES
 List ways that statistics is used.
 Know the differences between descriptive and inferential
statistics.
 Understand the differences between a sample and a
population.
 Explain the difference between qualitative and quantitative
variables.
 Compare discrete and continuous variables.
 Recognize the levels of measurement in data.

1-2
Uses of Statistics
 One of the tools used to make decisions in business.
 We apply statistical concepts in our lives.
 As a student of business or economics, basic
knowledge and skills to organize, analyze, and
transform data and to present the information.

1-3
LO1-1

Why Study Statistics?


Statistics is required for many college programs. Why?
1. Numerical information is everywhere.
2. Statistical techniques are used to make decisions that affect
our daily lives.
3. Knowledge of statistical methods will help you understand
how decisions are made and give you a better
understanding of how they affect you.
No matter what line of work you select, you will find yourself
faced with decisions where an understanding of data
analysis is helpful.

1-4
LO1-1
LO1
Understanding of Statistics Helps us
Make Informed Decisions
In making informed decisions, you need to:
1. Determine if the existing information is adequate or
additional information is required.
2. Gather additional information, if needed, in such a way that it
does not provide misleading results.
3. Summarize the information in a useful and informative
manner.
4. Analyze the available information.
5. Draw conclusions and make inferences while assessing the risk
of an incorrect conclusion.
1-5
LO1-1
LO1

What Is Meant by Statistics?


STATISTICS The science of collecting, organizing,
presenting, analyzing, and interpreting data to assist in
making more effective decisions.

Collect

Organize

Present

Interpret
1-6
Who Uses Statistics?

Statistical techniques are


used extensively by
marketers, accountants,
quality control personnel,
consumers, professional
sports people, hospital
administrators, educators,
politicians, physicians, etc...

1-7
Types of Statistics – Descriptive Statistics
and Inferential Statistics

Descriptive Statistics - methods of organizing,


summarizing, and presenting data in an informative way.

EXAMPLE 1: The United States government reports the population of the


US was 179,323,000 in 1960; 203,302,000 in 1970; 226,542,000 in
1980; 248,709,000 in 1990; 265,000,000 in 2000; and 308,400,000 in
2010.
EXAMPLE 2: There are a total of 46,837 miles of interstate highways in the
United States. The interstate system represents only 1% of the
nation’s total roads but carries more than 20% of the traffic.
EXAMPLE 3: The average person spent $103.00 on traditional Valentine’s
Day merchandise in 2010. This is an increase of $0.50 from 2009.
1-8
Types of Statistics – Descriptive
Statistics and Inferential Statistics

Inferential Statistics: A decision, estimate, prediction,


or generalization about a population, based on a
sample.

Note: In statistics, the word population and sample


have a broader meaning. A population or sample
may consist of individuals or objects.

1-9
Population vs. Sample
A population is the entire set of individuals or objects of interest, or
the measurements obtained from all individuals or objects of
interest.

A sample is a portion, or part, of the population of interest.

1-10
Why Take a Sample Instead of Studying Every
Member of the Population?

1. Prohibitive cost of census


2. Destruction of item being studied may be
required
3. Not possible to test or inspect all members
of a population being studied

1-11
Usefulness of a Sample in Learning about a
Population
Using a sample to learn something about a
population is done extensively in business,
agriculture, politics, and government.

EXAMPLE: Television networks constantly monitor the


popularity of their programs by hiring Nielsen and
other organizations to sample the preferences of TV
viewers.

1-12
Types of Variables
A. Qualitative or attribute variable - the characteristic
being studied is nonnumeric.
EXAMPLES: gender, religious affiliation, type of automobile
owned, state of birth, and eye color are examples.

B. Quantitative variable - information is reported


numerically.
EXAMPLES: balance in your checking account, minutes
remaining in class, or number of children in a family.
1-13
Quantitative Variables - Classifications
Quantitative variables can be classified as either discrete or
continuous.

A. Discrete variables can only assume certain values, and


there are usually “gaps” between values.

EXAMPLE: the number of bedrooms in a house, or the number of


hammers sold at the local Home Depot (1,2,3,…,etc).

B. Continuous variable can assume any value within a


specified range.

EXAMPLE: the pressure in a tire, the weight of a pork chop, or the


height of students in a class.

1-14
Summary of Types of Variables

1-15
Four Levels of
Measurement
Nominal level – data that is Interval level – similar to the
classified into categories and ordinal level, with the
cannot be arranged in any additional property that
particular order. meaningful amounts of
differences between data
values can be determined.
There is no natural zero point.

Ordinal level – data arranged in


some order, but the Ratio level – the interval level with
differences between data an inherent zero starting point.
values cannot be determined Differences and ratios are
or are meaningless. meaningful for this level of
measurement.

1-16
Nominal-Level Data
Properties:
1. Observations of a qualitative variable can
only be classified and counted.
2. There is no particular order to the labels.

1-17
Ordinal-Level Data
Properties:
1. Data classifications are
represented by sets of
labels or names (high,
medium, low) that have
relative values.
2. Because of the relative
values, the data classified
can be ranked or ordered.

1-18
Interval-Level Data
Properties:
1. Data classifications are ordered according to the amount of the
characteristic they possess.
2. Equal differences in the characteristic are represented by equal
differences in the measurements.

Example: Women’s dress sizes


listed on the table.

1-19
Ratio-Level Data
 Practically all quantitative data is recorded on the ratio level
of measurement.
 Ratio level is the “highest” level of measurement.

Properties:
1. Data classifications are ordered according to the amount of the
characteristics they possess.
2. Equal differences in the characteristic are represented by equal
differences in the numbers assigned to the classifications.
3. The zero point is the absence of the characteristic, and the ratio between
two numbers is meaningful.

1-20
Why Know the Level of Measurement of a Data?

 The level of measurement of the data


dictates the calculations that can be done to
summarize and present the data.
 To determine the statistical tests that should
be performed on the data.

1-21
Summary of the Characteristics for Levels of
Measurement

1-22
Ethics and Statistics

 Practice statistics with integrity and honesty.


 “Do the right thing” when collecting,
organizing, summarizing, analyzing, and
interpreting numerical information.
 The real contribution of statistics to society is
a moral one.

1-23
Computer Applications

 Computers are now available to students at


most colleges and universities.
 Spreadsheets (Microsoft Excel)
 Statistical software (Stata, Eview, SPSS, etc.)

1-24
Tutorial – Measurement level in SPSS
 Distinguish 4 types of scales: Nominal,
Ordinal, Interval, and Ratio
 Introduction to SPSS
 Application of 4 types of measurement scales
on SPSS software
 Data entry in Ms. Excel
 Export data from Excel to SPSS
References

26
DATA PROCESSING
 Coding:
 Except for some quantitative data (in the form of numbers),
which does not need to be encoded, qualitative data (not in the
form of numbers) needs to be converted into numbers.
 Data Entry:
 Data is entered and stored by at least two different data entry
people. In reality, entering data from the questionnaire into the
computer is entered twice.
 You should enter data in Ms Excel and export them to statistical
softwares for analysis such as SPSS, STATA, Eviews
 Làm sạch dữ liệu:
 Check missing data, check outliers.
27
Design Questionaire – Interviewing employees

 Name
 Gender
 Occupation
 Educational level
 Income
 Expenditure
 Level of job preference
28
Data Entry in Excel without coding
Ordinal_number Malee Gender Education Occupation Income Expenditure Job_Pref

1 Le Hoai Anh Female Primary School Farming 10.0 6.0 Very Hate
2 Nguyen Hung Anh Male Secondry School Off-farm 11.0 6.5 Hate
3 Nguyen Trung Anh Male High School Farming 7.0 4.8 Normal
4 Pham Ngoc Anh Female Above High School Off-farm 14.0 9.0 Very Like
5 Nguyen Thi Nguyet Female High School Farming 5.0 3.8 Like
6 Chu Linh Chi Female Secondry School Farming 4.0 2.3 Hate
7 Tran Thi Kim Chi Female Primary School Farming 5.5 3.2 Like
8 Kim Duc Dung Male Above High School Off-farm 8.0 4.8 Very Like
9 Nguyen Cong Duong Male High School Off-farm 7.0 4.5 Normal
10 Nguyen Tien Dat Male Secondry School Farming 2.5 1.8 Very Hate
11 Le Ha Minh Duc Male Primary School Farming 6.5 4.0 Normal
12 Nguyen Thu Ha Female Above High School Farming 4.0 2.5 Like
13 Vu Thi Hong Hanh Female Above High School Off-farm 18.0 9.5 Very Like
14 Nguyen Minh Hien Male High School Off-farm 4.0 2.5 Hate
15 Nguyen do Phuong Male Secondry School Off-farm 5.0 3.0 Hate
16 Tran Viet Hoang Male Secondry School Farming 3.0 2.0 Very Hate

29
Codebook
Gender Male= 1; Female = 0

Occupation Farming= 1; Off-farm = 0

Educational level Primary = 1; Secondary= 2; High school=


3; Above high school= 4;

Level of job Very hate = 1; Hate = 2; Normal = 3;


preference Like = 4; Very like = 5 30
Data entry in Excel after coding
Ordinal_number Name Gender Education Occupation Income Expenditure Job_Pref
1 Le Hoai Anh 0 1 1 10.0 6.0 1
2 Nguyen Hung Anh 1 2 0 11.0 6.5 2
3 Nguyen Trung Anh 1 3 1 7.0 4.8 3
4 Pham Ngoc Anh 0 4 0 14.0 9.0 5
5 Nguyen Thi Nguyet 0 3 1 5.0 3.8 4
6 Chu Linh Chi 0 2 1 4.0 2.3 2
7 Tran Thi Kim Chi 0 1 1 5.5 3.2 4
8 Kim Duc Dung 1 4 0 8.0 4.8 5
9 Nguyen Cong Duong 1 3 0 7.0 4.5 3
10 Nguyen Tien Dat 1 2 1 2.5 1.8 1
11 Le Ha Minh Duc 1 1 1 6.5 4.0 3
12 Nguyen Thu Ha 0 4 1 4.0 2.5 4
13 Vu Thi Hong Hanh 0 4 0 18.0 9.5 5
14 Nguyen Minh Hien 1 3 0 4.0 2.5 2
15 Nguyen do Phuong 1 2 0 5.0 3.0 2
16 Tran Viet Hoang 1 2 1 3.0 2.0 1

31
Introduction to IBM SPSS
Data view

33
Variable view

34
Create Data file in SPSS

35
Create Data file in SPSS

36
Create Data file in SPSS

37
Create Data file in SPSS

38
Import Data from Ms Excel

39
Import Data from Ms Excel

40
Import Data from Ms Excel

41
Import Data from Ms Excel

42
Import Data from Ms Excel

43
Import Data from Ms Excel

44
Import Data from Ms Excel

45
Import Data from Ms Excel

46
Data cleaning– (Checking outliers and data missing)

47
Data cleaning– (Checking outliers and data missing)

48
Làm sạch dữ liệu – Kiểm tra số dị biệt (outliers)

49

You might also like