
Troubleshooting problems with SEM models that have “Heywood” cases such as negative variance parameters and non-positive definite covariance matrices

Jeremy Yorgason
Brigham Young University
Introduction
• In SEM, it is fairly common to encounter improper solutions
• Non-positive definite covariance matrices
• Models with negative variance terms
• Negative PSI matrix
• Correlations or other standardized values > 1
• The model is not identified; you need “x” additional constraints for it to be identified

• Why is this important?
• Results from models with these problems cannot be trusted and should not be reported in journal articles
• Standard errors of estimates may be affected (Chen et al., 2001)
• Error messages are diagnostic tools
• It is a good idea to confirm the diagnosis the software is giving you
• The point is to understand what may be going on with your model/data
• This often requires that you look at all of the output for your model
Goals for this Segment of the Workshop
1. How do I recognize the problem?
2. How do I fix the problem?
3. Examples
Causes of Improper Solutions in SEM
1. Specification error in the model
A. Missing a “1” on one of the factor loadings of a latent variable, or on an error term
B. Correlations of variables or errors from the IV to the DV side of a model
C. Excessive error correlations on indicators of a single latent variable
D. Very low factor loadings on a latent variable
E. Omitted paths that should be in a model
2. Model under-identified (negative degrees of freedom)
A. V(V+1)/2 minus the number of free parameters, where V is the number of observed variables (if estimating means/intercepts, use V(V+3)/2); see the sketch after this list
3. Non-convergence
4. Outliers in the data
5. Sample too small for the model being estimated
Kline, 2011; Kolenikov & Bollen, 2012; Chen et al., 2001; Newsom, 2012
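
A minimal sketch of the counting rule in item 2, in Python; the variable counts and the function name are purely illustrative, not from any particular model.

```python
# Degrees of freedom = distinct sample moments minus free parameters.
# A negative result means the model is under-identified.

def model_df(n_observed, n_free_params, with_means=False):
    v = n_observed
    moments = v * (v + 3) // 2 if with_means else v * (v + 1) // 2
    return moments - n_free_params

# Hypothetical example: 10 observed variables, no mean structure.
print(model_df(10, 25))   # 55 - 25 = 30 (identified, in terms of counting)
print(model_df(10, 60))   # 55 - 60 = -5 (under-identified)
```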
Causes of Improper Solutions in SEM
6. Missing data
7. “Sampling fluctuations”
8. Two-indicator latent variables
A. This includes 2nd-order latent variables
9. Non-normally distributed outcome or indicator variables in your model
A. Categorical
B. Count, zero-inflated, etc.
10. Empirical under-identification
A. “Positive degrees of freedom, but there is insufficient covariance information in a portion of the model for the computer to generate valid estimates” (Newsom, 2012)
B. May be caused by some of the above issues

Kline, 2011; Kolenikov & Bollen, 2012; Chen et al., 2001; Newsom, 2012
Signs that there is a problem
Amos:
“XX: Default Model”

“The following variances are negative.”

“This solution is not admissible.”

“The model is probably unidentified. In order to achieve identifiability, it will probably be necessary to impose 1 additional constraint.”

In place of estimates in the Amos output you see “unidentified”
Signs that there is a problem
Mplus:
THE MODEL ESTIMATION TERMINATED NORMALLY

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS -0.762D-17. PROBLEM INVOLVING PARAMETER 59.

MODIFICATION INDICES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED.
More signs that there is a problem
Mplus or other programs:
1. Negative variance estimate (remember, variance = standard deviation squared, so it cannot be negative)
A. Find it in your output
2. Correlations above 1 (remember, a correlation can't be larger than 1 in absolute value)
A. Find it in your output
3. Error variance that is really BIG (999 usually indicates a problem in Mplus, although this is OK if something is “constrained” to be that number)
The sketch below shows how to run these same checks directly on a sample covariance matrix.
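
A minimal sketch of those checks in Python, using simulated data with a deliberately duplicated column to trigger the warning signs; the data and variable count are purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
data = rng.normal(size=(200, 4))
data[:, 3] = data[:, 2]                 # force a linear dependency among two "variables"

cov = np.cov(data, rowvar=False)
corr = np.corrcoef(data, rowvar=False)

print(np.diag(cov) < 0)                 # any negative variances?
off_diag = corr[~np.eye(corr.shape[0], dtype=bool)]
print(np.max(np.abs(off_diag)))         # 1.0 here: a correlation at (or effectively above) 1
print(np.min(np.linalg.eigvalsh(cov)))  # an eigenvalue at or below ~0 means the matrix
                                        # is not positive definite
```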
How can I fix these problems?
1. Look at a diagram of your model and see if you have misspecified your model.
A. Check your syntax (e.g., look for missing semicolons in Mplus)
B. Missing “1” for a factor loading on latent variables
C. Missing “1” on the regression path of an error term
D. Sometimes Amos creates “GHOST” variables. You can't see them, but they are there! Sometimes off the screen, sometimes really, really, really small
E. Sometimes Amos will “double correlate” variables
F. Any correlations across IV/DV lines?
1. Careful: this is something your “modification indices” will suggest to improve model fit. However, don't ever add parameters that go against theory
G. Make sure you have appropriate regression paths in the model (not too few, in this case)
H. Make sure your measurement model is appropriate
1. Factor loadings > .40
2. Error correlations: start with none; correlation between items is captured in the latent variables. Typically you'll use modification indices here
How can I fix these problems?
2. Be attentive to model problems when there are latent variables with only 2 indicators (these can be unstable)
A. Newsom (2012) suggests constraining the two factor loadings to be equal

3. Caution is also warranted when estimating “higher order” latent variables with only two first-order factors, and with certain complex models (e.g., common fate models) that require specific constraints in order for the model to be identified
How can I fix these problems?
4. Either use a large sample, OR check the sample size against the number of parameters being estimated.
• N/q rule (N = sample size, q = parameters in the model; Kline, 2011)
• Count variances, covariances, and means, OR
• Most programs tell you how many parameters are in your model. Amos:
Number of distinct sample moments: 77
Number of distinct parameters to be estimated: 44
Degrees of freedom (77 - 44): 33
• Quick check: 10 people in the sample for every observed (rectangle) variable in the model (see the sketch below)
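
A minimal sketch of these sample-size checks in Python; the sample size and number of observed variables are hypothetical, while the moment and parameter counts are the ones quoted from Amos above.

```python
def n_to_q_ratio(n_cases, n_free_params):
    """Kline's (2011) N/q heuristic: cases per estimated parameter."""
    return n_cases / n_free_params

def meets_quick_check(n_cases, n_observed_vars, cases_per_var=10):
    """Rough rule of thumb: about 10 cases per observed (rectangle) variable."""
    return n_cases >= cases_per_var * n_observed_vars

print(77 - 44)                      # degrees of freedom from the Amos counts above: 33
print(n_to_q_ratio(300, 44))        # ~6.8 cases per parameter for a hypothetical N = 300
print(meets_quick_check(300, 12))   # True for a hypothetical model with 12 observed variables
```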
How can I fix these problems?
5. If your model looks to be specified correctly but you still have a problem, it's time to start looking at your data
A. Run frequencies on all variables in the model to see if there is a data entry error or outliers that could be inflating the variance of one or more variables (see the sketch below)
1. Side note: sometimes SEM models have trouble with variables whose variances are very different (larger or smaller) from the rest of the variables in your model (e.g., income in dollars)
2. If this is the case, you will want to rescale or transform these variables so the variances are of similar magnitude
3. Also, in the transfer of data from one program to another, sometimes columns of data are shifted or otherwise corrupted
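
A minimal sketch of that data screening with pandas; the variables and values are simulated placeholders, not from any real dataset.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "depress": rng.normal(2.5, 0.6, 200),
    "support": rng.normal(3.0, 0.5, 200),
    "income":  rng.normal(55000, 18000, 200),   # dollars: variance dwarfs the other columns
})

print(df.describe())    # summary/frequency check: spot entry errors and extreme outliers
print(df.var())         # income's variance is orders of magnitude larger than the others

# Rescale so the variables are on a roughly similar metric before fitting the SEM.
df["income_10k"] = df["income"] / 10_000
```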
How can I fix these problems?
6. Do you have any categorical or non-normally distributed dependent variables that are specified as continuous?
A. Amos doesn't handle dichotomous, count, or zero-inflated outcomes
B. Mplus does handle them well, but you have to specify in the syntax that you are working with such distributions
C. You may have specified non-normal variable distributions, but you have small cell sizes (e.g., an ordered categorical variable with only 1 or 2 cases on one end of the distribution); see the sketch below
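
A minimal sketch of the cell-size check in item 6C; the variable name and category counts are hypothetical.

```python
import pandas as pd

# Hypothetical ordered categorical indicator with a sparse lowest category.
satisfaction = pd.Series([1] * 2 + [2] * 40 + [3] * 80 + [4] * 60 + [5] * 18)

counts = satisfaction.value_counts().sort_index()
print(counts)               # only 2 cases in category 1
print(counts[counts < 5])   # flag sparse cells that can destabilize categorical estimation
```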
How can I fix these problems?
• 7. If your model does not “converge” it means that the
program went through X number of iterations, but could
not find a suitable solution. You can increase iterations
from the default number to try to estimate your model. If
this doesn’t work, you probably need to change your
model or you have a data problem.
Atypical Solutions: Start Values and Iterations
8. A start value is a number assigned to each estimated parameter when “iterations” begin for a model. Amos and Mplus automatically create start values for each parameter to be estimated, yet it is possible to assign start values yourself if the program-assigned ones don't work. Researchers can provide start values for a model, which are essentially any known parameter estimates (e.g., a regression weight or coefficient). You can get these by running a simple linear regression with the variables in your model and then plugging in the coefficients from the simpler model (see the sketch below).
A. How in the world would I know if I have bad start values???
B. How would I know which variable to look at that might be non-normally distributed, or be categorical with small cell sizes?
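
A minimal sketch of getting candidate start values from an ordinary regression, using simulated placeholder data; the Mplus line in the comment is just one way such values could be supplied.

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.normal(size=200)
y = 0.5 * x + rng.normal(scale=1.0, size=200)

# Ordinary least squares with an intercept, via numpy.
X = np.column_stack([np.ones_like(x), x])
intercept, slope = np.linalg.lstsq(X, y, rcond=None)[0]
print(intercept, slope)   # plug these in as user-supplied start values,
                          # e.g. in Mplus:  y ON x*0.5;
```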
Greek Alphabet and Mplus Output
• Nu (Ν/ν) = intercepts or means of observed variables
• Lambda (Λ/λ) = factor loadings
• Theta (Θ/θ) = error variances and covariances
• Alpha (Α/α) = means and intercepts of latent variables
• Beta (Β/β) and Gamma (Γ/γ) = regression coefficients
• Psi (Ψ/ψ) = residual variances and covariances of continuous latent variables
• Tau (Τ/τ) = thresholds of categorical observed variables
• Delta (Δ/δ) = scaling information for observed dependent variables
• Etc. – see the Mplus manual

• Ask for TECH1 in the output. Then, when Mplus says there is a problem with, for example, parameter #16, go find that parameter, see which matrix it is in, identify the variable involved, and then look at the model/data to see where the problem is. If no variable is identified, you need to go back to model specification.
• CAUTION: Specific parameter warnings are usually a DECOY! They generally just let you know the model is not correctly specified, and no matter what you do to the identified variable, it will not make your model work.
Examples: “Message of Death!”
• From a class assignment with a model involving 56 cases.
• Mplus error:
THE MODEL ESTIMATION TERMINATED NORMALLY

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS 0.383D-13. PROBLEM INVOLVING PARAMETER 31.

THIS IS MOST LIKELY DUE TO HAVING MORE PARAMETERS THAN THE SAMPLE SIZE IN ONE OF THE GROUPS.

WARNING: THE RESIDUAL COVARIANCE MATRIX (THETA) IN GROUP GRAD IS NOT POSITIVE DEFINITE. THIS COULD INDICATE A NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR AN OBSERVED VARIABLE, A CORRELATION GREATER OR EQUAL TO ONE BETWEEN TWO OBSERVED VARIABLES, OR A LINEAR DEPENDENCY AMONG MORE THAN TWO OBSERVED VARIABLES. CHECK THE RESULTS SECTION FOR MORE INFORMATION. PROBLEM INVOLVING VARIABLE DE4.
Atypical Solutions: Sampling Fluctuations
• The model is specified correctly
• You don't have outliers in your data, and you have a large enough sample to estimate the model at hand
• Yet the model is flagged as not identified, even though you have positive degrees of freedom

9. Possible tests to confirm you have sampling fluctuations and not some other problem (the first three are sketched below):
• Confidence interval from the standard error includes zero
• Calculate a “z” by taking the ratio estimate/standard error, and then compare it to a z distribution
• Wald test: take the ratio (estimate/standard error)², then compare it to a chi-square distribution with 1 df
• Likelihood ratio test statistic
• Lagrange multiplier test (modification indices when the variance is constrained to 0)
• Bootstrap resampling method (esp. with non-normal data)
• Scaled chi-square difference test
• Signed root tests
• Empirical sandwich estimators
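
A minimal sketch of the first three checks (confidence interval, z ratio, Wald test); the estimate and standard error are hypothetical numbers read off the output for a suspect variance.

```python
from scipy import stats

estimate, se = -0.04, 0.03          # hypothetical negative variance and its standard error

ci = (estimate - 1.96 * se, estimate + 1.96 * se)
print(ci)                                        # does the 95% CI include zero?

z = estimate / se
print(z, 2 * (1 - stats.norm.cdf(abs(z))))       # compare the ratio to a z distribution

wald = (estimate / se) ** 2
print(wald, 1 - stats.chi2.cdf(wald, df=1))      # compare to a chi-square with 1 df
```

If the interval spans zero and the tests are nonsignificant, the negative estimate is plausibly a sampling fluctuation rather than misspecification.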
Atypical Solutions: Sampling Fluctuations
• If the model is specified correctly, the data are free of outliers, the sample is adequate, and the tests above point to sampling fluctuation:
• Fix the negative variance to 0 or to a small positive number
• Be aware that this constraint can affect other model parameters
Handout
• Decision tree suggested by Chen et al. (2001):
• 1. Is your model identified?
• 2. If so, do you have any negative error variances?
• 3. If so, do you have any outliers that are a problem?
• 4. If not, is the model empirically under-identified?
• 5. If not, do you have sampling fluctuations?
• 6. If so, constrain the negative variance to be 0, a small positive number, or the population variance
• Newsom (2012) prevention tips:
• Careful specification
• Use larger samples
• Model factors with 3 or more indicators
• Use reliable measures (high loadings)
• Well-conditioned data
Working Example
• See Amos program

• Depending on time, manipulate an example to show what errors commonly occur, what the program tells you, and how to fix the problems
Conclusion
• Either…
• Work with perfect data and perfect models
• OR
• Learn to interpret SEM error messages and how to fix common problems
References
• Chen, F., Bollen, K. A., Paxton, P., Curran, P., & Kirby, J. (2001). Improper solutions in structural equation models: Causes, consequences, and strategies. Sociological Methods and Research, 29, 468-508.
• Kline, R. B. (2011). Principles and practice of structural equation modeling (3rd ed.). New York, NY: Guilford Press.
• Kolenikov, S., & Bollen, K. A. (2012). Testing negative error variances: Is a Heywood case a symptom of misspecification? Sociological Methods and Research, 41, 124-167.
