0% found this document useful (0 votes)

133 views6 pages

Treatment of Missing Data

The document describes how to perform multiple imputation in SPSS to handle missing data. It explains that SPSS will create 5 imputed datasets and store them in a file called "SPSSImputations". Regression can then be run on each imputed dataset and the results pooled to obtain estimates that account for the uncertainty due to missing values. The syntax provided shows how to specify the multiple imputation and regression steps to perform this analysis in SPSS.

Uploaded by

raj sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

133 views6 pages

Treatment of Missing Data

Uploaded by

raj sharma

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

7/10/2014

Treatment of Missing Data

MULTIPLE IMPUTATION USING SPSS

David C. Howell

USING SPSS TO HANDLE MISSING DATA

SPSS will do missing data imputation and analysis, but, at least for me, it takes some getting used to.
Because SPSS works primarily through a GUI, it is easiest to present it that way. However I will also provide
the script that results from what I do.
The data file is named CancerHead-9.dat and contains the following variables related to child behavior
problems among kids who have a parent with cancer. (The "-9" in the title of the file is there to remind me that
this file used "-9" for missing data, which is a common notation for missing data in SPSS. (You could also
use 999, 99, or whatever set of values you want.) Once the data are read in you go to the Variable View and
enter the missing value (e.g. -9) as the missing data entry for each variable. The "Head" tells me that the
names of the variables are to be found in Line 1. Several of the variables in this example relate to the parent
(patient) with cancer. The other variables relate to the spouse of the patient. The variable names are, in order,
SexP (sex parent), DeptP (parent's depression T score), AnxtP (parent's anxiety T score), GSItP (parent's
global symptom index T score), DeptS, AnxtS, GSItS (same variables for spouse), SexChild, Totbpt (total
behavior problem T score for child). These are a subset of a larger dataset, and the analysis itself has no
particular meaning. I just needed a bunch of data and I grabbed an available file related to a research project
with which I was involved. We will assume that we want to predict the child's Total Behavior Problem T score
as a function of several other variables. I no longer recall whether the missing values were actually missing or
whether I deleted a bunch of values to create an example.
The first few cases are shown below. Notice that variable names are included in the first line. Missing data
are indicated by "-9".
SexP

DeptP

AnxtP

GSItP

DeptS

AnxtS

-9

GSItS

SexChild

Totbpt

-9

-9
-9
52
-9
51

-9

We read in the data as we normally do in SPSS, in my case as a "dat" file. Then from the Analyze menu
choose Multiple Imputation and then select Impute Missing Values. When you have made the necessary
assignments of variables to the role you will have a menu that looks like the following.

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

1/6

7/10/2014

Treatment of Missing Data

Notice that I have included all nine variables in doing the imputations, even though I will only use six of them in
the regressions. I do this because those extra variables may be able to add importantly to the imputed values.
For example, suppose that I had a second measure of depression, but chose not to use in in the final
analysis. That measures would presumably be nicely correlated with DeptP, and would be useful in imputing
missing data for that variable. So I include it here, even though I drop it later.
The important thing to notice here is the section called "Location of Imputed Data." I have taken the default
and specified that the new dataset will be named SPSSImputations. It is important to note that this will NOT
create a file in your directory with that name. It will create a file in your current session to which we will turn very
shortly.
I am not going to present the output from that procedure because it doesn't get us very far. Basically you will
see a list of variables with their means, standard deviations, etc. from the raw data and from the imputed
data. You should look at that, but it is not very exciting.
This step of the procedure doesn't look as if it has done much for us, but in fact it has. It has created five data
sets containing imputed values, and those are held in SPSSImputations. If you go to the Window tag in the
main SPSS Window, it will offer you the choice of going to that data set. You can see this in the following
image. There are other choices in that window because I have created other stuff as I wrote this page, but you
want to select "Untitled[SPSSImputations]-IBM SPSS Statistics Editor." When you make that selection you
will get the following data set. Notice that it looks like the original, but with a new variable called "Imputation_."
This will consist of the numbers 0 to 5, referring to the particular imputation session. (Imputation = 0 refers to
the original data file.) You can see part of that data file below, showing the last few lines of the original data
and the first few lines of the data from imputation 1. The areas shaded in yellow are imputed values where the
value was missing in the original.

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

2/6

7/10/2014

Treatment of Missing Data

Now we are ready to do our analysis, but we do it in kind of a strange way. If you look back at the first window
that I showed you, you will see a note at the bottom referring to a special icon. This means that if you now take
this new data set and go to the standard Analyze menu, you will see that some of the procedures have this
icon next to them. That really means that if you use this data set with that procedure, SPSS will recognize that
you want to combine imputed data sets and will allow you to do so. For example, we want to use linear
regression to predict Totbpt from 5 other variables. You set this up as follows

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

3/6

7/10/2014

Treatment of Missing Data

Noitice that I have added "Imputation_ to the box labeled "Selection Variable" and used the "Rule" to specify
that I want it to use all imputations numbered 1 or more. The partial results of this printout follow.

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

4/6

7/10/2014

Treatment of Missing Data

The important part is the last set of output. It shows you what the regression coefficients, their standard errors,
etc were for the 5 separate imputations, and then it shows you for the "pooled" data. This is the result you
were looking for, and is comparable to what we found in the last bit of printouts for NORM and SAS. The
values will not be exactly the same, but they will be reasonably close.I think that the "error message" in that
last window is not an error message. It is simply saying that I did not chose to include Imputation 0, which was
the original data.

SPSS Syntax
For those who like to work with syntax rather than focussing on the GUI, the syntax for this analysis follows.

*Impute Missing Data Values.

DATASET DECLARE SPSSImputations.
MULTIPLE IMPUTATION SexP DeptP AnxtP AnxtS Totbpt DeptS GSItP GSItS SexChild
/IMPUTE METHOD=AUTO NIMPUTATIONS=5 MAXPCTMISSING=NONE
/MISSINGSUMMARIES NONE
/IMPUTATIONSUMMARIES MODELS DESCRIPTIVES
/OUTFILE IMPUTATIONS=SPSSImputations .

DATASET ACTIVATE DataSet2.

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

5/6

7/10/2014

Treatment of Missing Data

REGRESSION
/SELECT=Imputation_ GE 1
/MISSING LISTWISE
/STATISTICS COEFF OUTS CI(95) R ANOVA
/CRITERIA=PIN(.05) POUT(.10)
/NOORIGIN
/DEPENDENT Totbpt
/METHOD=ENTER SexP DeptP AnxtP DeptS AnxtS
/SAVE SRESID.

Return to Dave Howell's Statistical Home Page

Send mail to: [email protected])

Last revised 12/37/2012

http://www.uvm.edu/~dhowell/StatPages/More_Stuff/Missing_Data/MissingDataSPSS.html

6/6

Monthly Reimbursement Bill Enclosure
No ratings yet
Monthly Reimbursement Bill Enclosure
3 pages
S5 Math Exercise
No ratings yet
S5 Math Exercise
6 pages
Sandeep Julakanti - Resume
No ratings yet
Sandeep Julakanti - Resume
9 pages
Glass Ampoules & Glass Vials Import Sample
No ratings yet
Glass Ampoules & Glass Vials Import Sample
15 pages
Bodybuilding, Drugs and Risk
No ratings yet
Bodybuilding, Drugs and Risk
230 pages
A1 Reynaldi Azhar 24610062 Pertemuan 12 Haisl SPSS
No ratings yet
A1 Reynaldi Azhar 24610062 Pertemuan 12 Haisl SPSS
9 pages
RFQ - Section - III - Technical - Questionnaire
No ratings yet
RFQ - Section - III - Technical - Questionnaire
12 pages
A Circular-Economy-Retrospective
No ratings yet
A Circular-Economy-Retrospective
16 pages
IBM SPSS Missing Values
No ratings yet
IBM SPSS Missing Values
28 pages
Ethical Considerations in Civic Engagement
80% (5)
Ethical Considerations in Civic Engagement
2 pages
300 GPD Water Maker
No ratings yet
300 GPD Water Maker
7 pages
MI - Summary Stat
No ratings yet
MI - Summary Stat
25 pages
MMB-Barriers - DMP Correl - Age - Anova
No ratings yet
MMB-Barriers - DMP Correl - Age - Anova
6 pages
Prakkom Stat 2
No ratings yet
Prakkom Stat 2
8 pages
Missing Data
No ratings yet
Missing Data
71 pages
Business Analytics ST1
No ratings yet
Business Analytics ST1
13 pages
Namra Finance Limited
No ratings yet
Namra Finance Limited
5 pages
One Sample T-Test
No ratings yet
One Sample T-Test
2 pages
10 Data Preparation
No ratings yet
10 Data Preparation
42 pages
Module 1
No ratings yet
Module 1
5 pages
For Ex Project
No ratings yet
For Ex Project
64 pages
Lang Aquisition - Emergent Rubric Original All Criteria
No ratings yet
Lang Aquisition - Emergent Rubric Original All Criteria
4 pages
Unrestricted Warfare A Chinese Doctrine For PDF
75% (4)
Unrestricted Warfare A Chinese Doctrine For PDF
26 pages
Shandon Cytospin 3 Operator Guide
No ratings yet
Shandon Cytospin 3 Operator Guide
68 pages
01 Dealing With Missing Data The Art and Science of Imputation
No ratings yet
01 Dealing With Missing Data The Art and Science of Imputation
26 pages
Uocluongthamso Nhom1
No ratings yet
Uocluongthamso Nhom1
26 pages
How You Can Talk With God
No ratings yet
How You Can Talk With God
5 pages
Excel Statistics: Step by Step
From Everand
Excel Statistics: Step by Step
Stephanie Glen
4/5 (8)
Output
No ratings yet
Output
18 pages
Missing Data
No ratings yet
Missing Data
7 pages
Confirmation - Flight Booking - Etihad
No ratings yet
Confirmation - Flight Booking - Etihad
2 pages
SPSS
No ratings yet
SPSS
6 pages
Daily Lesson Log: Tle - Icttd9 - 12al - Ic - E - 3
No ratings yet
Daily Lesson Log: Tle - Icttd9 - 12al - Ic - E - 3
4 pages
Statistik 3
No ratings yet
Statistik 3
4 pages
Non-random-Thoughts - Is Vedic Astrology Derived From Greek Astrology - (Part 22) (Masonry and Stemmed Cup - From Pandyans To Tiryns)
0% (1)
Non-random-Thoughts - Is Vedic Astrology Derived From Greek Astrology - (Part 22) (Masonry and Stemmed Cup - From Pandyans To Tiryns)
13 pages
La Liberación Del Libro. Una Crítica Del Sistema de Precio Fijo. Pedro Schwartz.
No ratings yet
La Liberación Del Libro. Una Crítica Del Sistema de Precio Fijo. Pedro Schwartz.
79 pages
Missng Data
No ratings yet
Missng Data
8 pages
Missing Data and Data Cleaning - Tagged
No ratings yet
Missing Data and Data Cleaning - Tagged
31 pages
Multiple Imputation in Practice
No ratings yet
Multiple Imputation in Practice
11 pages
LAMPIRAN
No ratings yet
LAMPIRAN
23 pages
Output 10
No ratings yet
Output 10
7 pages
Critical Appreciation of K N Rao Interview
50% (2)
Critical Appreciation of K N Rao Interview
39 pages
Mini Research On Homeless
No ratings yet
Mini Research On Homeless
6 pages
Environmental Law and Jurisprudence
No ratings yet
Environmental Law and Jurisprudence
76 pages
VRTM
No ratings yet
VRTM
161 pages
Analyzing Missing Data
No ratings yet
Analyzing Missing Data
49 pages
Test of Difference - Revised Table With Analysis
No ratings yet
Test of Difference - Revised Table With Analysis
4 pages
Properties of KMnO4 and K2Cr2O7.PDF-65
No ratings yet
Properties of KMnO4 and K2Cr2O7.PDF-65
7 pages
Hindu Capitalism-Why Capitalism Is The Only Economic System Compatible With Indian Culture
No ratings yet
Hindu Capitalism-Why Capitalism Is The Only Economic System Compatible With Indian Culture
150 pages
Lecture 2.3.10
No ratings yet
Lecture 2.3.10
30 pages
2 PDF
No ratings yet
2 PDF
232 pages
Values
No ratings yet
Values
30 pages
IOSA Checklist: ISM Edition 9 - Effective September 1, 2015
No ratings yet
IOSA Checklist: ISM Edition 9 - Effective September 1, 2015
253 pages
FMD PRACTICAL FILE
No ratings yet
FMD PRACTICAL FILE
61 pages
Data Quality Review For Missing Values and Outliers
No ratings yet
Data Quality Review For Missing Values and Outliers
8 pages
Unit of Time
No ratings yet
Unit of Time
2 pages
ACM-F015 Intern's Competency Checklist
No ratings yet
ACM-F015 Intern's Competency Checklist
6 pages
Military History of India - Camels in Indian Warfare
No ratings yet
Military History of India - Camels in Indian Warfare
5 pages
UJI Normalitas
No ratings yet
UJI Normalitas
4 pages
Astrology, Science, and Cau..
No ratings yet
Astrology, Science, and Cau..
7 pages
#6 Adding File Upload To A Form
No ratings yet
#6 Adding File Upload To A Form
10 pages
Death by Thomas Nagel Com
100% (1)
Death by Thomas Nagel Com
10 pages
The Crest-Jewel of Wisdom PDF
No ratings yet
The Crest-Jewel of Wisdom PDF
35 pages
Schafer SMMR 1999 MI Primer
No ratings yet
Schafer SMMR 1999 MI Primer
14 pages
Gavi Gangadhareshwara Temple
No ratings yet
Gavi Gangadhareshwara Temple
5 pages
A Mahabharata Reference To Sidereal System
No ratings yet
A Mahabharata Reference To Sidereal System
8 pages
Kruse
No ratings yet
Kruse
25 pages
Venkatapati Deva Raya - The Great Savior of Southern India - Vajrin
No ratings yet
Venkatapati Deva Raya - The Great Savior of Southern India - Vajrin
5 pages
Huawei S3700 Switch Datasheet (22-Oct-2012)
No ratings yet
Huawei S3700 Switch Datasheet (22-Oct-2012)
12 pages
Geol 194 Syllabus Revised
No ratings yet
Geol 194 Syllabus Revised
4 pages
Is It Necessary To Get Afraid of Sade Sati
No ratings yet
Is It Necessary To Get Afraid of Sade Sati
9 pages
Missing Data Analysis: University College London, 2015
No ratings yet
Missing Data Analysis: University College London, 2015
37 pages
T-Test: T-Test /testval 3 /missing Analysis /VARIABLES x12 - A x12 - B x12 - C x12 - D x12 - e x12 - F /CRITERIA CI (.95)
No ratings yet
T-Test: T-Test /testval 3 /missing Analysis /VARIABLES x12 - A x12 - B x12 - C x12 - D x12 - e x12 - F /CRITERIA CI (.95)
2 pages
11 - Missing Data in SPSS - 1.1
No ratings yet
11 - Missing Data in SPSS - 1.1
26 pages
Imputation: - Applied Multivariate Analysis & Statistical Learning
No ratings yet
Imputation: - Applied Multivariate Analysis & Statistical Learning
17 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Descriptives: Descriptives Variables Roe Der Indeks Tobinsq /statistics Mean Stddev Min Max
No ratings yet
Descriptives: Descriptives Variables Roe Der Indeks Tobinsq /statistics Mean Stddev Min Max
18 pages
Frequencies: Notes
No ratings yet
Frequencies: Notes
10 pages
3071 Output
No ratings yet
3071 Output
22 pages
Descriptives: DESCRIPTIVES VARIABLES Trust Awareness Comfort Freq /statistics Mean Stddev Min Max
No ratings yet
Descriptives: DESCRIPTIVES VARIABLES Trust Awareness Comfort Freq /statistics Mean Stddev Min Max
46 pages
SPSS
No ratings yet
SPSS
92 pages
AFIYASPSS
No ratings yet
AFIYASPSS
11 pages
DADM S5 Imputation of Missing Data
No ratings yet
DADM S5 Imputation of Missing Data
15 pages
T-Test: T-Test /testval 3 /missing Analysis /variables Depressive - Symptom /CRITERIA CI (.95)
No ratings yet
T-Test: T-Test /testval 3 /missing Analysis /variables Depressive - Symptom /CRITERIA CI (.95)
5 pages
OUTPUT1
No ratings yet
OUTPUT1
3 pages
Jaimini Sutram
No ratings yet
Jaimini Sutram
18 pages
Nama: Aida Fajriyatin Formaningrum NIM: G0116006 / A
No ratings yet
Nama: Aida Fajriyatin Formaningrum NIM: G0116006 / A
10 pages
603-8-1 Donders - J Clin Epidemiol 2006 v59 n10 p1087-91
No ratings yet
603-8-1 Donders - J Clin Epidemiol 2006 v59 n10 p1087-91
5 pages
Data-Two Categories: Ram Saran R (1827318:)
No ratings yet
Data-Two Categories: Ram Saran R (1827318:)
4 pages
Missing Data Techniques - UCLA
No ratings yet
Missing Data Techniques - UCLA
66 pages
IBM SPSS Missing Values
100% (1)
IBM SPSS Missing Values
34 pages
Output SPSS
No ratings yet
Output SPSS
9 pages
T-Test: Notes
No ratings yet
T-Test: Notes
3 pages
Analyzing Missing Data: Problems Using Scripts
No ratings yet
Analyzing Missing Data: Problems Using Scripts
49 pages
Imputation
No ratings yet
Imputation
10 pages
How To Deal With Missing Values (DR SEE KIN HAI)
No ratings yet
How To Deal With Missing Values (DR SEE KIN HAI)
4 pages
Quntative Data Analysis SPSS: Formating, Handling, & Manipulation
No ratings yet
Quntative Data Analysis SPSS: Formating, Handling, & Manipulation
22 pages
Spss Outputs
No ratings yet
Spss Outputs
8 pages
Missing Data Imputation Using Singular Value Decomposition
No ratings yet
Missing Data Imputation Using Singular Value Decomposition
6 pages
Abhinn - Spss Lab File
No ratings yet
Abhinn - Spss Lab File
67 pages
Npar Tests: Notes
No ratings yet
Npar Tests: Notes
9 pages
Practical Missing Data Analysis in SPSS
No ratings yet
Practical Missing Data Analysis in SPSS
19 pages
Missing Data
100% (2)
Missing Data
35 pages