BRM - Data Analysis, Interpretation and Reporting Part I
BRM - Data Analysis, Interpretation and Reporting Part I
Alemseged Gerezgiher
(BSc, MBA, PhD)
05/07/24 1
Part VI (Sub-part I)
Data Analysis, Interpretation
and Reporting
05/07/24 2
Chapter Six: Data Analysis, Interpretation and
Reporting
Hypothesis Testing
Descriptive Analysis
Inferential Analysis
Interpretation, scientific writing and Reporting
05/07/24 3
Data Management and Support Software
Data processing and analysis starts with proper preparation of
the data and making it ready for appropriate analysis.
It involves:
editing,
coding,
classifying and
tabulation of the collected data
i) Editing
Editing of data is the process of examining the collected raw
data to detect errors and omissions.
In general one edits to assure that the data are:
Accurate
05/07/24 7
Data Management…
In a nutshell, data management involves:
Defining variables
Coding values
Entering and editing data
Creating new variables
Recoding variables
Selecting cases (observations)
05/07/24 8
Support Software: the SPSS
05/07/24 9
The basics of windows in SPSS
What is SPSS?
Originally it is an acronym of Statistical Package for the
Social Science but now it stands for Statistical Product and
Service Solutions (IBM)
One of the most popular statistical packages which can
perform highly complex data manipulation and analysis
with simple instructions
A software package useful for conducting statistical
analysis, manipulating data, and generating tables and
graphs.
05/07/24 10
The basics of windows in SPSS
Opening SPSS
Start → All Programs → IBM SPSS Statistics→ SPSS version → SPSS version
Or Double Click on the SPSS icon on your Desktop
05/07/24 11
The basics of windows in SPSS
05/07/24 12
The basics of windows in SPSS
• Two bars appear (menu bar and tool bar) once the window is opened
• Menu Bar:
• The menu bar provides a series of “drop down” commands to perform
most essential SPSS functions. By clicking on a menu command, a further
series of menu options will appear. Many of these submenu commands
will be discussed in further detail in this note.
• File: These are the basic file management operations.
• e.g. opening, saving, and printing files
• Edit: This allows you to perform editing functions on the current data set.
• e.g. cut, copy, clear, undo changes and redo changes
• View: Allows you to change the current view of data, as well as toolbar
options.
•
05/07/24
e.g. grid lines, value labels 13
The basics of windows in SPSS
Data: These functions deal with the configuration, defining, and
management of data.
e.g. insert variables/cases, sort data, merge files
Transform: This allows you to transform the data set you’ve entered.
e.g. calculating new variables, recoding, missing values
Analyze: Includes the main data analysis functions.
e.g. descriptive statistics, t-Tests, ANOVA, correlation, data
reduction
Windows: Allows you to alter the appearance, format, position of the
SPSS windows.
05/07/24 14
The basics of windows in SPSS
Toolbar: The toolbar contains icons to facilitate easy point and click operations.
Because these can be customized, they may vary slightly between systems. Below is
a description of the main icons:
Open files Undo last operation
Save data Redo last operation
Print file Insert case
Find data Insert variable
(1) (2)
05/07/24 15
The basics of windows in SPSS
The default window will have the data editor
There are two sheets in the window:
1. Data view 2. Variable view
05/07/24 16
The basics of windows in SPSS
Data Editor
Spreadsheet-like system for defining, entering, editing,
and displaying data. Extension of the saved file will be
“sav.”
It contains data view and variable view.
Data view
The place to enter data
Columns: variables
Rows: records
Variable view
The place to enter variables
List of all variables
Characteristics of all variables
05/07/24 17
Creating own dataset: The data entry process
It involves two general steps.
Define your variables in Variable View
Enter the data, the values of the variables, in Data
View
• The two parts (Views) of the “Data Editor” are
used in combination to enter data.
• First, variables are defined in “Variable View”.
• Second, the data are entered in “Data View”.
05/07/24 18
The data entry process
Data View has two parts: rows and columns.
Rows represent cases or observations, that is, the
objects on which data have been collected:
For example, rows represent the contents of a single
observation data collection form, the information on an
individual
Columns represent variables or characteristics of the
object of interest:
For example, each column contains the answers to the
questions on the observation data collection form: age,
gender, etc
05/07/24 19
Columns:
variables
Rows:
cases
Under Data
View
05/07/24 20
The data entry process
Definition of variables
10 characteristics are used to define a variable:
Name Values
Type Missing
Width Column
Decimals Align
Label Measure
05/07/24 21
The data entry process: enter variables
05/07/24 22
Enter variables…
Variable View window
This sheet contains information about the data that is stored with the
dataset
Name
The first character of the variable name must be alphabetic
Variable names must be unique, and have to be less than 64 characters.
Spaces are NOT allowed and the name should be consistent
Try to give meaningful variable names
05/07/24 23
Enter variables…
Type
Click on the ‘type’ box. The two basic types of variables for ‘input’
and ‘output’
This column enables you to specify the type of variable.
05/07/24 24
Enter variables…
Internal formats: Output formats:
Numeric Comma
String (alphanumeric) Dot
Date Scientific notation
Dollar
Custom currency
05/07/24 25
Enter variables…
Numeric variables:
Numeric measurements
Codes
Definition of the size of the variable
String variables (alphanumeric) contain words or
characters; strings can include numbers but, taken here as
characters, mathematical operations cannot be applied to
them
The maximum size of a string variable is 255 characters
Date: The input format for date variables must be defined,
such as DD/MM/YYYY, MM/DD/YYYY or MM/DD/YY
Example: 1 January 2020
05/07/24 26
Enter Variables
Width
Width allows you to determine the number of characters
SPSS will allow to be entered for the variable
05/07/24 27
Enter variables…
Decimals
Number of decimals
It has to be less than or equal to 16
3.14159265
05/07/24 28
Enter variables…
Label
You can specify the details of the variable
Descriptors for the variables
You can write characters with spaces up to 256
characters
Used in the output
05/07/24 29
Enter variables…
Values
This is used and to suggest which numbers represent which
categories when the variable represents a category
Value labels are descriptors of the categories of a variable
It uses coding
05/07/24 30
Enter variables…
Values
Click the cell in the values column as shown below
For the value, and the label, you can put up to 60 characters.
After defining the values click add and then click OK.
Click
05/07/24 31
Enter variables…
Missing
Defines missing values
The values are excluded from some analysis
Options:
Up to 3 discrete missing values
A range of missing values plus one discrete missing value
05/07/24 32
Enter variables…
05/07/24 33
Enter variables…
05/07/24 34
Enter variables…
Columns and Aligns
Columns sets the amount of space reserved to display
the contents of the variable in Data View; generally the
default value is adequate
Align sets whether the contents of the variable appear
on the left, centre or right of the cell in Data View
Numeric variables are right-hand justified by default
and string variables left-hand justified by default; the
defaults are generally adequate
05/07/24 35
Enter variables…
Measure
Indicates the nature/level of measurement of the variable
Levels of measurement:
Nominal
Ordinal
Interval
Ratio
05/07/24 36
Enter cases/data
05/07/24 37
Import data from Excel
Select File Open Data
Choose Excel as file type
Select the file you want to import
Then click Open
05/07/24 38
Open Excel files in SPSS
05/07/24 39
Continue
Save this
file as
SPSS
data
05/07/24 40
Saving the file
The file must always be saved in order to save the
work that has been done to date:
File/Save
Move to the target directory
Enter a file name
Save
05/07/24 41
Output Viewer
Output Viewer
Displays output and errors. Extension of the saved file will
be “spv.”
05/07/24 42
Transforming data
Click ‘Transform’ and then click ‘Recode Variable’ or ‘Compute
Variable…’
05/07/24 43
Transforming data: Computing variables
Let’s create a new variable which is the sum of other
variables
Click on “Transform” and then on “Compute”
Enter the new variable name into the target variable box
Enter the formula for this new variable into the “Numeric
Expression” box
Click on OK
05/07/24 44
Computing variables
Example: Adding a new variable named ‘lnheight’ which is
the natural log of height
Type in lnheight in the ‘Target Variable’ box. Then type in
‘ln(height)’ in the ‘Numeric Expression’ box. Click OK
Click
05/07/24 45
Computing Variables…
A new variable ‘lnheight’ is added to the table
05/07/24 46
Transforming Data: Recoding into variables
We can transform variables by recoding which
means to combine categories on an existing
variable into fewer categories
We can transform variables by creating new
variables out of existing variables
We can select particular cases and analyze only
these cases
05/07/24 47
Recoding into variables…
Recoding into different variables
Recoding into the same variable
It is recommended recoding into different variables
and not using the into same variable option
05/07/24 48
Recoding into different variables
Click on “Transform” and then on “Recode” and then
on “into different variables”
Select the variable you want to recode
Start by giving the new variable a new name and
assigning a variable label to the new variable. Click
on “Change”
05/07/24 49
Recoding into different variables…
Recode AGE into four categories and give it the
name of AGE1
Click on “Old and New Values”
Use “Range” (fourth option down) to recode as
follows. Remember to click on “Add” after
entering each recode
18 to 29 = 1
30 to 49 = 2
50 to 69 = 3
70 to 89 = 4
05/07/24 50