Multiple Imputation IN: Mplus
Multiple Imputation IN: Mplus
IN MPLUS
EMPLOYEE DATA
• Data
set containing scores from 480 employees on eight
work-related variables
• Variables:
• 33% of the cases have missing well-being scores, and 33% have
missing satisfaction scores
ANALYSIS EXAMPLE
B1
Well-Being Performance ε
B2
Satisfaction
• ANALYSIS
• MODEL
• OUTPUT
• PLOT
DATA COMMAND
DATA:
file = ‘c:\Data\employee.dat’;
DATA:
file = employee.dat;
VARIABLE COMMAND
• The USEVARIABLES list should include all variables that will be part of
the imputation process
VARIABLE:
! Information about the contents of the data file;
names = id age tenure female wbeing jobsat jobperf turnover iq;
usevariables = age tenure wbeing jobsat jobperf iq;
missing = all (-99);
ANALYSIS COMMAND
ANALYSIS:
! Bayesian estimation;
estimator = bayes;
! Random number seed for MCMC algorithm;
bseed = 48932;
! Convergence criterion (.05 = PSR < 1.05);
bconvergence = .05;
BCONVERGENCE
SUBCOMMAND
• Mplus generates imputed data sets only after the MCMC
algorithm has converged (this feature prevents imputations
from pathological chains)
• Setting
BCONVERGENCE = .05 (the default) specifies that
convergence is achieved once the PSR drops below 1.05
• Smaller
values give a more conservative picture of
convergence but require more computational time
MODEL COMMAND
MODEL:
[jobperf tenure wbeing jobsat turnover iq]; ! Means;
jobperf tenure wbeing jobsat turnover iq; ! Variances;
jobperf tenure wbeing jobsat turnover iq with
jobperf tenure wbeing jobsat turnover iq; ! Covariances;
OUTPUT COMMAND
OUTPUT:
! Tech8 gives the PSR statistic;
tech8;
PLOT COMMAND
• This
command is only operational with the ESTIMATOR =
BAYES option (typically used only for the diagnostic run)
PLOT:
! Graphical convergence diagnostics;
type = plot2;
TECHNICAL 8 OUTPUT
POSTERIOR DISTRIBUTION OF
THE JOB SATISFACTION MEAN
TRACE PLOT OF THE JOB
SATISFACTION MEAN
• ANALYSIS
• DATA IMPUTATION
• OUTPUT
ANALYSIS COMMAND
ANALYSIS:
! Saturated imputation model;
type = basic;
! Random number seed for MCMC algorithm;
bseed = 48932;
! Convergence criterion (.05 = PSR < 1.05);
bconvergence = .05;
DATA IMPUTATION
COMMAND
• Thefollowing commands apply to the final MCMC run that
generates the imputed data sets
DATA IMPUTATION:
! Incomplete variables to be imputed;
! The USEVARIABLES list specifies other variables in the model;
impute = wbeing jobsat;
! Number of imputed data sets;
ndatasets = 50;
! File name prefix for imputed data sets;
save = employeeimp*.dat;
! Between-imputation interval;
thin = 300;
MPLUS IMPUTATION PROGRAM
DATA:
file = employee.dat;
VARIABLE:
names = id age tenure female wbeing jobsat jobperf turnover iq;
usevariables = age tenure wbeing jobsat jobperf iq;
missing = all (-99);
ANALYSIS:
type = basic;
bseed = 48932;
bconvergence = .05;
DATA IMPUTATION:
impute = wbeing jobsat;
ndatasets = 50;
save = employeeimp*.dat;
thin = 300;
OUTPUT:
tech8;
SAVEDATA INFORMATION
Order of variables
AGE
TENURE
WBEING
JOBSAT
JOBPERF
IQ
IMPUTED DATA FORMAT
• The file names use the prefix specified in the SAVE command
(e.g., employeeimp1.dat, employeeimp2.dat, etc.)
• The
program also generates a list file that contains the file
names of the imputed data sets (e.g., employeeimplist.dat)
• The list file serves as input data for all subsequent analyses
• Analyzing
imputed data sets requires a small change to the
DATA command, but the remaining commands are identical to
a complete-data analysis
DATA COMMAND
• Omitted file path (implist file in the same directory as input file)
DATA:
file = employeeimplist.dat; ! List of imputation file names;
type = imputation; ! Imputation data;
MPLUS ANALYSIS PROGRAM
DATA:
file = employeeimplist.dat;
type = imputation;
VARIABLE:
names = age tenure wbeing jobsat jobperf iq;
usevariables = wbeing jobsat jobperf;
ANALYSIS:
estimator = ml;
MODEL:
jobperf on wbeing (b1);
jobperf on jobsat (b2);
MODEL TEST:
b1 = 0;
b2 = 0;
OUTPUT:
standardized;
DESCRIPTIVES
SAMPLE STATISTICS
SAMPLE STATISTICS
Means
JOBPERF WBEING JOBSAT
________ ________ ________
1 6.021 6.296 5.949
Covariances
JOBPERF WBEING JOBSAT
________ ________ ________
JOBPERF 1.570
WBEING 0.676 1.380
JOBSAT 0.263 0.448 1.404
DESCRIPTIVES, CONTINUED
Correlations
JOBPERF WBEING JOBSAT
________ ________ ________
JOBPERF 1.000
WBEING 0.459 1.000
JOBSAT 0.177 0.322 1.000
WALD TEST
(MODEL TEST COMMAND)
• The Wald statistic (a chi-square with 2 degrees of freedom) is akin
to the omnibus F test in OLS regression
Value 136.976
Degrees of Freedom 2
P-Value 0.0000
Two-Tailed
Estimate S.E. Est./S.E. P-Value
JOBPERF ON
WBEING 0.479 0.055 8.783 0.000
JOBSAT 0.035 0.056 0.622 0.534
Intercepts
JOBPERF 2.797 0.402 6.959 0.000
Residual Variances
JOBPERF 1.236 0.088 14.052 0.000
INTERPRETATIONS
• Interpret
and report MI estimates in the same way as a
complete-data analysis
• Controlling
for job satisfaction, a one-point increase in
psychological well-being results in a .479 increase in job
performance, on average
STDYX Standardization
Two-Tailed
Estimate S.E. Est./S.E. P-Value
JOBPERF ON
WBEING 0.449 0.048 9.400 0.000
JOBSAT 0.033 0.053 0.622 0.534
STANDARDIZED ESTIMATES,
CONTINUED
R-SQUARE
Observed Two-Tailed
Variable Estimate S.E. Est./S.E. P-Value