Essentials of Structural Equation Modeling PDF
Essentials of Structural Equation Modeling PDF
3-12-2018
Recommended Citation
Civelek, Mustafa Emre, "Essentials of Structural Equation Modeling" (2018). Zea E-Books. 64.
https://digitalcommons.unl.edu/zeabook/64
This Book is brought to you for free and open access by the Zea E-Books at DigitalCommons@University of Nebraska - Lincoln. It has been accepted
for inclusion in Zea E-Books by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln.
Essentials of
Structural
Equation
Modeling
This book aims to guide the researcher who wants to use this method in
a way that is free from math expressions. It teaches the steps of a research
program using structured equality modeling practically. For students
writing theses and scholars preparing academic articles, this book aims
to analyze systematically the methodology of scientific studies conducted
using structural equation modeling methods in the social sciences.
ISBN: 978-1-60962-129-2
doi:10.13014/K2SJ1HR5
Zea Books
Lincoln, Nebraska 2018
1
ISBN: 978-1-60962-129-2
doi: 10.13014/K2SJ1HR5
1
CONTENT
PREFACE .......................................................................................................................... 1
CONTENT ........................................................................................................................ 2
1. INTRODUCTION ....................................................................................................... 4
I. CHAPTER...................................................................................................... 6
CONCEPTS AND METHODOLOGY ...................................................... 6
2. BASIC CONCEPTS OF STRUCTURAL EQUATION
MODELING ..................................................................................................................... 6
2.1. Definitions and Features .................................................................. 6
2.2. Latent and Observed Variables .....................................................10
2.3. Endogeneous and Exogeneous Variables ...................................13
2.4. Parameters ........................................................................................16
2.5. Fit Indices.........................................................................................17
2.6. Assumptions of Structural Equation Modeling ..........................22
2.7. Types of Structural Equation Models ..........................................26
2.7.1. Path Analysis Models ..............................................................26
2.7.2. Confirmatory Factor Analysis Models .................................27
2.7.3. Structural Regression Models ................................................28
2.7.4. Latent Change Models ............................................................28
3. VALIDITY AND RELIABILITY ANALYSIS ..................................................... 31
3.1. Determination of Convergent Validity ........................................32
3.2. Determination of Discriminant Validity ......................................41
3.3. Determination of Reliability ..........................................................42
4. STRUCTURAL REGRESSION MODEL ............................................................. 44
5. PATH ANALYSIS ...................................................................................................... 48
6. MODEL MODIFICATION .................................................................................... 50
7. MEDIATOR VARIABLE ANALYSIS METHODOLOGY ............................. 52
II. CHAPTER ..................................................................................................61
APPLICATIONS ............................................................................................61
2
8. USE OF AMOS Program .......................................................................................... 61
9. SAMPLE OF MEDIATOR VARIABLE ANALYSIS ......................................... 73
9.1. Title of the Research.......................................................................73
9.2. Purpose .............................................................................................74
9.3. Conceptual Model and Scales........................................................74
9.4. Determination of Validity and Reliability ....................................75
9.5. Analysis Results ...............................................................................77
9.6. Conclusion .......................................................................................79
10. MULTITRAIT-MULTIMETHOD MODEL ...................................................... 80
11. METHODS TO BE APPLIED IN CASE OF DATA
INADEQUACY ............................................................................................................ 100
11.1. Bootstrap Method ...................................................................... 100
11.2. Partial Least Square Structural Equation Modeling ( PLS-
SEM) ..................................................................................................... 102
Bibliography .................................................................................................................... 111
3
1. INTRODUCTION
There are various computer programs that are used when the
structural equation modeling method is applied. The most
common ones are LISREL (Linear Structural Relations),
AMOS (Analysis of Moment Structures), MPlus, EQS
(Equation Modeling Software) (Taşkın & Akat, 2010). The
samples and illustrations in this book were made according to
the AMOS program. The AMOS program is a visual program
that is easier to use than other programs. This is why it is
preferred in this book. There will also be plenty of videos
related to AMOS on YouTube and similar sites. That will
make the learning process easier for readers. There are
different versions of the AMOS program. AMOS 22 version
was used for the analysis examples in the book. All the
narrations in the book have been tried to be supported by
visuals. In this book, it is assumed that the readers have basic
statistical information. The main aim is to provide a guide to
the readers on structural equation modeling.
4
2. Possible mistakes in the measurements of the observed
variables are taken into consideration. The classic regression
approach assumes no measurement error.
3. It is a very useful method to analyze highly complex
multiple variable models and to reveal direct and indirect
relationships between variables.
5
I. CHAPTER
CONCEPTS AND METHODOLOGY
6
variables. In this way measurement errors can be minimized.
In traditional regression analysis, potential measurement errors
are neglected. Another difference from the regression models
of structural equality models is that they are based on the
covariance matrix. For this reason, in some sources, it is
named as covariance structure modeling or analysis of
covariance structure (Bayram, 2013). On the other hand, , the
correlation matrix is the basis of the regression. Covariance is
a nonstandardized measure of the relationship between two
variables, so it can take values between - ∞ and + ∞.
Correlation, however, can take values between -1 and +1,
since it is standardized (Gujarati, 1999). The covariance which
is the basic statistic of the structural equation model can be
shown for two observed and continuous variables as follows:
7
Covxy = rxy SDx SDy
Explanatory Confirmatory
Analysis
x Partial Least Squares
Structural Equation x Covariance Based
Modeling (PLS-SEM) Structural Equation
Modeling (CB-SEM)
Source: Hair, J., Hult, G., Ringle, C., & Sarstedt, M. (2017). A primer on partial least
squares structural equation modeling PLS-SEM. Los Angeles: SAGE.
8
However, structural equation modeling confirms the
correspondence of the data of the relations in the theoretical
model. For this reason, it can be said that structural equation
modeling is more suitable for testing the hypothesis than other
methods (Karagöz, 2016). Structural equation modeling
consists of a system of linear equations. The key in the
regression analysis is to determine how much of the change in
the dependent variable is explained by the independent
variable or variables. Although multiple regression analysis can
only be applied to observed variables, the basic principles can
be applied to structural equation modeling (Kline, 2011).
Differently from the regression, structural equation modeling,
as a new statistical analysis technique, allows to test research
hypotheses in a single process by modeling complex
relationships among many observed and latent variables. In
traditional regression analysis, only direct effects can be
detected. However, in the method of structural equation
modeling, direct and indirect effects are put together.
9
model and the structural model are interwoven. But the
structural equation modeling is based on the confirmatory
approach. It is based on the statistical confirmation of the
theoretical model. For this reason, the measurement model is
confirmatory factor analysis.
10
Latent variables are always continuous variables due to they
are linked to more than one indicator.
11
connecting the observed variables and the hidden variables is
important.
12
2.3. Endogeneous and Exogeneous Variables
Variables in the structural equation modeling, except for the
distinction between latent and observed, are dealt with in two
groups: endogeneous and exogeneous. In structural equation
modeling, the Endogeneous Exogeneous distinction is used as
a more accurate distinction because a variable can assume the
role of both the dependent variable and the independent
variable at the same time. Endogeneous variables are
dependent variables explained by other variables. In Figure 2,
the variables Z, W and T are endogeneous variables.
Exogeneous variables are independent variables that are not
explained by any variables. In Figure 2, the variables X and Y
are external variables. If there are more than one exogeneous
variable, covariance between these variables is required. As
shown in Figure 2, a bi-directional arrow is placed between the
X and Y latent variables. It should not be forgotten to add an
error term to the endogeneous variables. The terms Res1,
Res2 and Res3, which appear in Figure 2, represent the
residuals of each endogeneous variable. These residuals are
also called as error terms of the structural model. The error
terms in the measurement model are shown in Figure 2 by the
notation 'e'. The error terms in both groups are usually marked
with a separate notation in this way. Unlike regression models,
structural equality models are based on the covariance matrix.
But it mainly consists of the system of linear equations.
Therefore, the linear equations of the model in Figure 2 can
be written as:
(1. Equation) Z= β1.X+β2.Y+1.Res1
13
As shown in Figure 2, there are 3 endogeneous (dependent)
variables, and the above three equations are written. The
residual term of the first equation named as Res1, the residual
term of the second and third equation named as Res2 and
Res3 respectively. The coefficients of residual terms are fixed
as '1'. β coefficients are free parameters. The variables to the
left of the above equations are dependent variables, while the
ones to the right are independent variables. There is another
point of interest in equations. W and Z variables are
dependent variables in the first and second equations while
they play an independent role in the third equation. In the
group of these equations only the variables X and Y are always
independent. This is because X and Y variables are
exogeneous variables as seen in Figure 2. And the variables T,
W and Z are endogeneous variables. Again as seen in Figure
2., residual error of each latent variable correspond to the
residual term of each regression equation in which the latent
variable has the dependent variable role. Residual terms mainly
represent variance that can not be explained by factor (Kline,
2011). The square root of the variance equals to the standard
deviation. This value represents how far or near the
distribution of the values in a serial from the mean. If the
standard deviation is small, the values are scattered close to
the serial average.
14
regression equations described above are equations for the
structural model. However, as already mentioned, the
structural equation model consists of two parts, the
measurement model and the structural model. In the example
in Figure 2, there are 15 more regression equations in the
measurement model as seen below:
15
In these above equations, each observed variable is a
dependent variable. Latent variables are independent variables.
This is due to the fact that the observed variables and the
latent variables are connected in a reflective way in the
measurement model.
2.4. Parameters
16
2.5. Fit Indices
In the method of structural equation modeling, the measures
that assess the compliance of the models with the data are
called fit indices or fit statistics. There are many fit indices in
the literature. Below there are definitions of the most
commonly used of these fit indices. The size of the sample
should be considered in the analyzes to be done by the
structural equation modeling. Because many of the fit indices
are affected by sample size. The minimum sample size that
must be used in the structural equation modeling method is at
least 10 times the number of parameters that can be estimated
in the model (Jayaram, Kannan, & Tan, 2004). In addition,
the minimum sample size for structural equation modeling is
suggested as 150 (Bentler & Chou, 1987). Some researchers
suggest that the sample size for Structural Equation Models
should be 200-500, at least 200 (Çelik & Yılmaz, 2013).
17
p(p+1)/2. In this case, the degree of freedom is found as
follows (Raykov & Marcoulides, 2006):
CMIN/DF 0 <CMIN/DF< 2
18
dimensions that form the research model. CFI values can
range from 0 to 1, values above 0,90 and close to 1 show good
fit (Schermelleh-Engel, Moosbrugger, & Müller, 2003). CFI is
in the group of fit indices based on independent models.
Table 2 summarizes the goodness of fit values according to
the literature. Table 3 summarizes acceptable fit values
according to the literature. The AGFI fit indice is calculated
using the degree of freedom. It is affected by sample size.
When the sample size increases, the value of the AGFI indice
also increases. AGFI takes a value between 0 and 1. Values
over 0.90 indicate that the fit is good (Bayram, 2013).
CMIN/DF 2 <CMIN/DF< 3
19
value ranges from 0 to 1. Values above 0.90 are considered
acceptable model indices. Values above 0.90 indicate that
covariance is calculated among the observed variables. GFI
and AGFI fit indices are based on the residuals (Bayram,
2013).
20
possible model. This line contains the worst possible indice
values. The CMIN (chi-square likelihood ratio) value appears
to be significant when the P value is 0,000 in the example
below. However, this test is required to be insignificant since it
is a difference test. However, in most cases this value is
significant. This may be due to the neglect of some of the
assumptions of structural equation modeling described in the
next section as general practice during analyzes.
Default
111 1529,801 555 ,000 2,756
model
Saturated
666 ,000 0
model
Independence
36 9511,261 630 ,000 15,097
model
21
NFI RFI IFI TLI
Model CFI
Delta1 rho1 Delta2 rho2
22
and independent variables is linear, normal distribution of
error terms (normality), no multicollinearity which means
independent variables are not related to each other, the
variance of error terms is fixed (homoscedasticity) or in other
words there is no relationship between independent variables
and error terms ve no authocorrelation that means that there
is no relationship between error terms (Wooldridge, 2003). If
these assumptions are met, it should be considered whether
the assumptions required for the structural equation models
are also met. These assumptions can be summarized as
follows (Bayram, 2013).
23
becomes farther away from the normal distribution, it is
necessary to increase the number of data (Kline, 2011).
In cases where the data set does not fit the normal
distribution, the outliers (extreme values) should be cleared
first. In AMOS program in analysis properties window in
output tab, normality and outliers can be tested. It is sufficient
to mark the “test for normality” and “outliers” options so that
these test values can be obtained in a tabular form.
Additionally in SPSS, outliers can be determined by examining
the Mahalonobis distance value. If the dataset is not normally
distributed, what can be done is covered in the last section of
this book.
24
x Linearity:
As stated at the beginning of the book, structural equation
modeling is a component of factor and regression analysis.
Therefore, linearity, which is the most important assumption
of regression analysis, also applies to structural equation
modeling. In the structural equation model, it is assumed that
there are linear relationships between latent variables and also
between observed and latent variables.
x Absence of outliers:
The outlier affects the significance of the existence model
negatively.
x Multiple measurements:
In the structural equation model, three or more observed
variables must be used to measure each latent variable.
x No multicollinearity:
It is assumed that there is no relation between the
independent variables in the structural equation model.
x Sample size:
In the structural equation modeling, many of the fit indices are
influenced by sample size. In some sources, a minimum
25
sample size of 150 is recommended for structural equation
models (Bentler & Chou, 1987). The minimum sample size
that should be used in the structural equation modeling
method is at least 10 times the number of parameters that can
be estimated in the model. (Jayaram, Kannan, & Tan, 2004).
According to some researchers, the sample size required for
structural equation modeling should be at least 200 and 200-
500 (Çelik & Yılmaz, 2013).
26
dependent variable in the multiple regression. However, there
may be more than one dependent variable in the path analysis,
and a variable can be both a dependent variable and an
independent variable. In path analyzes, more than one
regression model can be analyzed at the same time, and
indirect and direct effects can be measured at the same time.
Direct effect is the effect of one variable on another variable
without any mediation. However, the indirect effect arises
from the intervention of a variable which is playing mediator
role between independent and dependent variables. This
variable is named as the mediator variable. The sum of the
direct effect and the indirect effect of a variable on another
variable is called the total effect (Raykov & Marcoulides,
2006). Since path analyzes do not contain latent variables, they
can not be saved from measurement errors (Meydan & Şen,
2011). For this reason, structural regression models generated
by latent variables give more accurate results. Because
structural regression models include measurement model. Path
analysis models and examples will be discussed in more detail
in chapter 5.
27
from the observed variables. However, in the confirmatory
factor analysis, previously discovered scales are confirmed
again with the collected data. In Section 3, examples of first-
level one-factor, first-level multi-factor and second-level multi-
factor confirmatory factor analysis are included.
28
model can be used for the following purposes. (Baltes &
Nesselroade, 1979):
29
(2) Characterize the development of individuals and groups.
(3) To predict individual and group differences in
developmental forms.
(4) To examine the dynamic determinants among variables in
time.
(5) To reveal the group differences of the dynamic
determinants between variables in time.
30
3. VALIDITY AND RELIABILITY ANALYSIS
Scale is the method used to find the numerical values of the
dimensions that constitute a concept. Since concepts can not
be directly measured in social sciences, questionnaires are
formed to define these concepts. Reliability means that a scale
is always measure the same value under the same conditions
consistently. For example, a questionnaire form is reliable if
the same group is given the same result when applied two
different times. So if we ask the same questions about the
same people, if the conditions are not changed, they are
expected to give the same answers. Otherwise, this means that
the persons in the sample either they did not understand the
questions on the questionnaire or they did not read them.
Validity is a measure of what we really want to measure. For
example, if a questionnaire actually measures a different
concept than the dimension we want to measure, it is not
valid. If the questions we ask about the concept A are
confused with the questions about the concept B, then it
means that the concepts we consider to measure are not
perceived or perceived as different from those in the sample.
In this case, the scale we use is not a valid measurement tool
for this sample. For this reason, it is necessary to test the
validity and reliability of the scale before any analysis is started.
As a result of these tests, verification of unidimensionality is
generally provided. Unidimensionality means that the
observed variables used to measure each dimension must
measure only one dimension (Avcılar & Varinli, 2013).
Construct validity and reliability must be determined in order
to confirm unidimensionality. In a theoretically determined
model, construct validity refers to convergence of observed
31
variables that are connected to the same latent variable
(convergent validity) and dissociation of observed variables
from other observed variables that are connected to other
latent variables (discriminant validity). The construct validity
indicates that the observed variables do not measure any latent
variable other than they connected in the conceptual model.
But in this case it would not be correct to say that the validity
of the construct is fully realized without confirming the
reliability of the scale (Gerbing & Anderson, 1988).
32
adversely affected. The fact that the factor loads are above 0,5
is evidence of convergent validity. If the critical rate value of a
question in CFA results is greater than 2 as an absolute value
this means that this item is loaded to the factor it is connected.
33
Figure 5. Single Factor CFA Model
34
set analyzable. It fulfills the following functions for this (Aksu,
Eser, & Güzeller, 2017):
Qestion1 0,818 1
Qestion4 0,825 1
Qestion7 0,757 1
Qestion10 0,676 1
35
After conducting explanatory factor analysis and purification
by principle component analysis, the remaining indicators
(questions) are linked under the structures they belong to and
the CFA model is created in the AMOS program as shown in
Figure 5 (The principle component analysis performed in the
SPSS program is not explained because it is out of scope of
this book). The number of factors is freed in EFA. In CFA,
the number of factors and which indicators are connected is
determined in advance.
36
Figure 8, covariance is not placed when there is only one
exogeneous variable. However Figure 9 shows covariance
between two exogeneous variables.
37
Figure 7. Multi-Factor First-Order CFA Model
38
Figure 8. Second-Order CFA Model
39
Figure 9. Multi-Factor Second-Order CFA Model
40
3.2. Determination of Discriminant Validity
Discriminant validity is the measure of the level at which a
structure in a measurement model differs from other
structures. It is an indicator of a low correlation between the
questions that form a construct and other questions that form
other construct. To find the discriminant validity for each
dimension, we first need to calculate the Average Variance
Extracted (AVE) value for each dimension. The acceptable
AVE value must be greater than 0.50 or 0.50. However, as
noted in the previous section, this value confirms convergent
validity when examined alone (Fornell & Larcker, 1981). In
order to determine discriminant validity, it is also desirable
that the values of the AVE for each construct in the data set
are larger than the correlation coefficients of that construct
with the other constructs. In this case, it can be determined
that the scales used have discriminant validity for each
dimension. AVE value alone does not indicate discriminant
validity but the square root of the AVE value of each
construct is larger than the inter-dimensional correlation value
it can be said that there is discriminant validity (Fornell &
Larcker, 1981). In Table 5, the values shown in parentheses as
crosswise is the square root of the AVE values. For the 9
dimensions in the table, the square root of the AVE values in
each column is higher than the correlation coefficients in that
column. In addition, the AVE values are above the 0.50
threshold. In this case, it can be said that the scales used for
this example have discriminant validity. The AVE value is not
calculated by the AMOS package program. However, it is easy
to find ready-made excel files that provide this value
calculation on the internet.
41
3.3. Determination of Reliability
After determination of the validity of the scales by means of
CFA reliability analysis must be conducted for each construct.
First of all, Cronbach's α value is calculated for each
dimension separately. Values greater than 0.7 threshold
indicate that the internal reliability of the scale used is
sufficient. Cronbach’s α is a measure based on correlations
between items in a consruct. It is obtained by dividing the sum
of the variances of the items constituting a scale by the general
variance. It takes a value between 0 and 1. Values beyond 0.7
threshold indicate that the scale is reliable. If it is below 0.6,
the reliability of the scale is low. (Karagöz, 2016).
42
Table 5. Descriptive Statistics, Correlation Coefficient, Reliability
Results and Discriminant Validity
Std.
Avr. 1 2 3 4 5 6 7 8 9
Dev.
7. Construct 3,23 0,87 ,585* ,166* ,452* ,174* ,449* ,479* (0,800)
8. Construct 3,68 0,67 ,394* ,496* ,672* ,508* ,350* ,508* ,358* (0,722)
9. Construct 3,02 0,77 ,340* ,374* ,353* ,335* ,209* ,302* ,219* ,410* (0,754)
Average Variance Extracted (AVE) 0,710 0,506 0,706 0,516 0,592 0,539 0,640 0,522 0,570
* P<0,05, Note: the values written in brackets indicate the square root of the AVE values.
43
4. STRUCTURAL REGRESSION MODEL
As a result of the processes described above, convergence
validity and discriminant validity are determined. Then the
phase for forming the structural model begins. The structural
model is based on measurement model. In this phase it should
be noted that the structure and order of CFA model is
preserved in the model that will be established with latent
variables. Depending on the causal relationships between
latent variables, the directions of the arrows are determined in
accordance with the developed hypotheses and a structural
model is constructed. Two-way arrows indicate the covariance
between two variables without specifying the direction of
causality. While constructing the structural model, the
conceptual model leads the way. The conceptual model is
mainly based on the relations found in the literature. Finally,
the conceptual model is tested using real data. Before the
hypothesis tests, the fit indices of the model are examined.
If the fit indices of the model are not within the limits
recommended in the literature, the modifications are made
and the fit indices are improved provided that they are
compatible with the literature. If the fit indices are at an
adequate level, the predicted values of the model parameters
are checked first. Then the hypothesis test results of the
research are given in a table. Figure 10 shows the properties
to be marked in the analysis properties menu in the AMOS
program when starting the path analysis. Table 6 shows an
example of the hypothesis test results. The values in this table
are in the estimates section of the output screen of the AMOS
44
program. The notation *** in AMOS output means that P is
equal to zero.
Unstandar
Standard d
Relations
Coefficients Coefficien
ts
*p < 0.05
45
Figure 10. Analysis Properties Window for the Structural
Model in the AMOS program.
46
Careful attention should be paid to these rules when
constructing structural models. Otherwise the model will not
work.
47
5. PATH ANALYSIS
Structural model can be established by directly observed
variables in cases factor analysis is done before and the
average of the questions that make up the constructs or when
working with the secondary data. Such models are called as
path analysis. Figures 13 and 14 show the models constructed
in Figures 11 and 12, respectively, with the observed variables.
As seen in the figures, covariance and residuals remained the
same in the models. This is because the rules are the same in
the path analyzes made with the observed variables. But the
measurement models are gone. Therefore, it is assumed that
there are no measurement errors in the path analysis.
Predictive error terms are included in dependent variables.
Before running the path analysis in the AMOS program, the
options in Figure 10 should be checked in the analysis
properties window.
48
Figure 14. Example of Path Analysis
49
6. MODEL MODIFICATION
If the fit indices of the structural model do not come to an
adequate level, the AMOS program suggests modifications to
the user. These modifications improve fit indices. For
example, Figure 15 shows the modifications suggested under
the Modification Indices menu in the AMOS program. In the
case of placing covariance among those with the highest value
among the proposed changes in Figure 13, the model's fit
indices will improve. Because the modification index value
corresponds to the decrease in the chi-square value of the
model. However, these covariances are not to be considered
as unrelated concepts in the literature that should be taken
into account when setting. Because each modification changes
the conceptual model which is first introduced. It should be
noted that the changes made by this reason do not contradict
the purpose of the research and the relations in the literature.
To reach the modification indices, the modification indices
box in the analysis properties window in the AMOS program
must be marked as shown in Figure 10. In the example shown
in Figure 15, first the model is run again by adding covariance
between the error terms e24 and e22 and compared with the
previous situation. Because the highest modification value in
the table is 17.457, which is between e24 and e22. After the
modifications are made, the model is retested, and if sufficient
compliance values can not be obtained, the proposed
modifications can be repeated. Again, modifications made
should be consistent with the literature. Unrepeatable and
inappropriate modifications made are scientifically
unacceptable and result in inconsistent situations with the
population. For this reason, it is necessary to apply it very
50
carefully. Just in order to raise the fit indices, covariance
should not be placed between concepts that are not related to
each other.
51
7. MEDIATOR VARIABLE ANALYSIS
METHODOLOGY
The variable starting the causality relation between the
independent and the dependent variable is called as mediator
variable (Wu & Zumbo , 2008). It is also called as intervening
variable (MacKinnon, Lockwood, Hoffman, West, & Sheets,
2002). It can also be defined as the variable that transfers the
indirect effect of an independent variable to the dependent
variable. Analysis of mediator variable is based on the
hierarchical regression method introduced by Baron and
Kenny in 1986. In order to apply this method, the following
conditions must first be met (Baron & Kenny, 1986) :
52
model meets the first two preconditions put forward by Baron
and Kenny. Table 7 shows an example of correlation table.
Constructs A B C
A - - -
B 0,492* - -
C 0,575* 0,672* -
* P< 0.01
53
When an mediator analysis is performed, three different
models are run and the coefficients of the models are
compared with one another. Models and hypotheses tested are
as follows:
54
Table 8. Example of Regression Coefficients Table
Relations Model 1 Model 2 Model 3
A→B - 0,575* -
B→C - - 0,582*
* P<0.001
55
comparing the β coefficients found after running models in
AMOS.
56
For scientific validation of mediator variable roles, these fit
indices must also be within acceptable limits. This method can
be applied to more variables. The basic logic of this analysis is
the comparison of the coefficients of the model to which the
mediator variables are not included and the coefficients of the
next model to which the mediator variables are included.
Variables 1 2 3 4
A –
B 0.883* –
C 0.861* 0.928* –
D 0.430* 0.653* 0.703* –
*p < 0.01.
57
If the statistically significant coefficients become insignificant,
the role of intermediate variable is confirmed. If the p value of
the coefficient remains significant but there is a serious
decrease in the coefficient, it can be said that there is semi-
mediator effect. In the conceptual model in Figure 17, the
mediator roles of B and C can be tested together. But if
needed, these variables can be included to analyses one by
one.
58
Tablo 10. Analysis Results Sample Table
C → D (H4) 0.532*
B → D (H5) 0.935*
χ2/df=2.554 χ2/df=2.921 χ2/df=2.856
CFI=0.947
Model Fit CFI=0.985 CFI=0.954
Indices IFI=0.985 IFI=0.947
RMSEA=0.14 IFI=0.954 RMSEA=0.14
RMSEA=0.14
*p<0.01
59
Figure 20. Model 3
60
II. CHAPTER
APPLICATIONS
61
The AMOS program is a program that works together with
the SPSS program. For this reason, it is first necessary to
contact the SPSS file containing the data to be worked on
before starting the model drawing. For this, File → Data Files
menus are followed and the window that appears in Figure 23
opens. In this screen, click the File Name menu to select the
relevant file and click OK to close the window. In this way,
the AMOS file is linked to the data to be run.
When starting to draw the model after this step, firstly the
indicator symbol is pressed to draw the hidden variable
and the three observed variables (indicator) which are shown
in Figure 24.
62
Each click creates an indicator. Therefore, it is necessary to
click three times to draw three indicators. Then change
direction of the indicators by clicking on the direction
change icon. One more latent variable is drawn by following
the same sequence and the model shown in Figure 25 is
obtained. In this model, relations between two latent variables
are drawn by using the regression symbol and covariance
is drawn between two external variables using the covariance
symbol .
63
Figure 22. Drawing Tools in the AMOS Program
64
Figure 22. Continued
65
Figure 23. Data Files Window
66
Figure 25. Sample Model Drawing
67
26 opens. In this window in text tab, variable names can be
assigned.
68
click the icon to view the results on the model. Click on
the view text icon to view the results in tabular form.
69
Figure 27. Analysis Properties Window Estimation Tab
70
Figure 27. Analysis Properties Window Output Tab
(Continued)
71
Figure 28. Text View Window
72
9. SAMPLE OF MEDIATOR VARIABLE
ANALYSIS
73
9.2. Purpose
The aim of this research is to determine the role of users'
attitudes towards the site and satisfaction levels in system
quality which was found to have a positive impact on net
benefit in previous research.
74
dimensions (Chen, Rungruengsamrit, Rajkumar, & Yen, 2013).
The scales were measured according to the five-point Likert
scale. The conceptual model of the research is shown in
Figure 30.
75
Table 11. Confirmatory Factor Analysis Results
Standardized Unstandardize
Constructs Items
Factor Loads d Factor Loads
76
9.5. Analysis Results
Three separate models have been analyzed as described in
Chapter 7. The analysis results of three different models are
compared in Table 13. In Figure 31, the results of the analysis
of model 3 are given as an image.
(0.767)
1. System Quality
*p < 0.01
Note: The values in brackets indicate the square root of the AVE
values.
77
Table 13. Test Results
Relations Model 1 Model 2 Model 3
System Quality
0.701* -0.03**
→Net Benefit
System Quality →
0.733* 0.839*
User Satisfaction
System Quality →
0.487* 0.880*
Attitude Toward Site
User Satisfaction
0.331*
→Net Benefit
78
Note: χ2/DF = 3.750, CFI = 0.944, IFI = 0.944, RMSEA=
0.082
9.6. Conclusion
79
10. MULTITRAIT-MULTIMETHOD MODEL
In some cases, nested structures can be found. In other words,
some scale questions can be designed to measure nested
concepts at the same time. As an example, assume that a scale
to measure teachers' competence in a school has been
developed. On this scale, there are questions that measure the
social, academic and English competence of the teachers. The
questions were asked to the same number of groups of
students and colleagues. In this example, there is a structure
consisting of three features and two separate methods as
shown in Figure 32, the best method that can be used to
determine the construct validity of such models is Multitrait-
Multimethod Model. As shown in Figure 32, there are two
separate questions asked to two different groups. In the model
shown in Figure 32, social, academic and English competences
are the traits and students and peers are the methods.
80
Figure 32. Multitrait-Multimethod Model
Source: Byrne, B. M. (2010). Structural Equation Modeling with AMOS. New York:
Routledge Taylor & Francis Group.
81
In Figure 33, there is a larger example that will provide a
better understanding of Multitrait-Multimethod Model. In the
figure, there is an assumption model of this example. In
Figure 33, there are 7 traits as X, Y, V, W, Z, T, U and 5
methods as A, B, C, D, E. Therefore, a scale consisting of 35
questions was used. The notation of “,1” on the latent
variables forming traits and methods indicates that the factor
variance is fixed at 1 (The another notation “1,” indicates that
the factor averages are fixed at 1). This is a symbolic notation,
and comma notation does not appear in the AMOS program,
it looks as seen in Figure 35. To fix the variance of a hidden
variable in the AMOS program, enter the value in the variance
box under the "Parameter" tab in the "Object Properties"
dialog box which opened by right mouse button on selected
variable. When we look at the parameter summaries in Table
14, it is seen that the variances of 12 variables are kept
constant.
82
Figure 33. Hypothesized Multitrait-Multimethod Model
83
Table 14. Parameter Summary
Weights Covariances Variances Total
Fixed 35 0 12 47
Labeled 0 0 0 0
Unlabeled 70 31 35 136
84
Figure 35. Model 1 (freely correlated traits; freely
correlated methods)
85
Figure 36. AMOS Warnings Window
86
Table 15. Summary of Fit Indices
Fit Indices
1. Freely
correlated
traits; freely 86.62 78 .897 .015 .000 .048 .897
correlated
methods
2. No traits;
freely 459.1
98 .693 .204 .122 .157 .000
correlated 2
methods
3. Perfectly
correlated
317.1
traits; freely 85 .795 .086 .081 .110 .000
2
correlated
methods
4. Freely
correlated
123.3
traits; 81 .964 .058 .037 .065 .000
9
uncorrelated
methods
87
Figure 37. Post Hoc Model 1
88
Figure 38. Model 2 (no traits; freely correlated methods)
89
Figure 39. Model 3 (perfectly correlated traits; freely
correlated methods)
90
Figure 40. Model 4 (freely correlated traits; uncorrelated
methods)
91
In some cases, the error terms get negative values when the
model is run. In this case, a post hoc model is created as
shown in Figure 37.
Difference
Models x2 df CFI
Test of Convergent
Validity
92
methods are released. In Model 3, the features and methods
are included in the same model but this time the covariance
parameters between the latent variables forming the traits are
fixed to 1.
X Y V W Z T U A B C D E
X .920 .008
Y .901 .601
V .898 .007
W .794 .405
Z .854 .506
T .426 .522
U .324 .714
X .401 .302
Y .306 .852
V .384 .701
W .399 .628
Z .424 .701
T .789 .574
U .698 .358
X .640 .406
Y .501 .720
93
V .654 .506
W .701 .525
Z .689 .603
T .745
U .754 .289
.597 .374
X .256 .356
Y .406 .902
V .250 .604
W .274 .712
Z .372 .586
T .755 .457
U .743 .771
X .595 .411
Y .489 .398
V .525 .549
W .424 .375
Z .445 .601
T .379 .832
U .408 .566
94
where the correlations between traits and methods are free,
calculated correlation values are shown in Table 18. In the
Multitrait-Multimethod models approach, analyzes are
performed at the matrix level and the parameter level to test
the construct validity (both convergent and discriminant
validity).
95
Table 18. Traits and Methods Correlations in Model 1
Traits Methods
Construct X Y V W Z T U A B C D E
X 1
Y .345 1
V .302 .789 1
A 1
B .197 1
C .201 .421 1
96
Figure 41. Model 5 (Correlated Uniqueness Model)
97
Table 19. Factor Loading in Model 5
X Y V W Z T U
X .810
Y .801
V .797
W .644
Z .751
T .800
U .721
X .501
Y .416
V .353
W .489
Z .518
T .468
U .499
X .741
Y .428
V .555
W .721
98
Z .489
T .706
U .742
X .368
Y .517
V .398
W .277
Z .365
T .479
U .398
X .498
Y .489
V .520
W .328
Z .531
T .544
U .298
Note: Estimates are standardized
99
11. METHODS TO BE APPLIED IN CASE OF
DATA INADEQUACY
Sometimes, there may be cases where the assumptions of the
estimation methods used are not met by the existing data set.
In this case, there are methods that can be applied if it is
necessary to be satisfied with the dataset available. Leading
methods among them are bootstrap partial least sequare.
100
There are advantages and limitations of the bootstrap process.
The main advantage of the bootstrap technique is the ability to
evaluate the accuracy of the predicted parameters. The idea
underlying the bootstrap technique is to create sub-samples of
the current data and look at the distribution of the parameters
computed from each sub-sample.
101
There are various criticisms in the literature about the
correctness of the results obtained with this technique. (Kline,
2011) (Ichikawa, 1995) (Yung & Bentler, 1994) (Hancock &
Nevitt, 1999). It's wrong to see this technique as a magical
method. This method should not be used especially in small
samples with low representation ability and in extreme non-
normal distribution (Kline, 2011). Because especially in small
samples, there is the posibility to further enhance its
properties that do not match the population (Rodgers, 1999).
The AMOS program includes bootstrap analysis. Figure 42
shows the bootstrap tab under the analysis properties window.
In this window it is initially marked that how much the
bootstrap technique can be applied in the sample. The
window is closed after the confidence interval and prediction
method are selected.
102
Table 20. Classification of Multivariate Methods
Exploratory Confirmatory
x Confirmatory
Factor Analysis
103
According to some sources in the literature, covariance-based
structural equation modeling is a more powerful and reliable
method. For this reason, the partial least square structural
equation modeling method is generally preferred in cases
where the conditions listed below are found:
• If there is a multicollinearity.
• There is missing value.
104
However, the PLS-SEM method is a non-parametric method
because it does not have any distributional assumption (Hair,
Hult, Ringle, & Sarstedt, 2017). It is also an explanatory
approach, which is why it is preferred in exploratory research.
In other words, when the theory is underdeveloped, it can be
said that researchers prefer to use partial least squares
structural equation modeling. This judgment is partially
correct in cases where the structure need to be predicted and
relations need to be explained (Rigdon, 2012).
105
solution alternatives are exhausted if the partial least squares
method is used in a research according to some sources. But
there are also resources that assess them as blind and
misleading criticisms, and that reveal the advantages of the
partial least squares method (Henseler, Dijkstra, Sarstedt,
Ringle, Diamantopoulos, & Straub, 2014). Despite all this
criticism and hesitation, the PLS-SEM method has become an
increasingly used method in scientific studies (Hair, Hult,
Ringle, & Sarstedt, 2017).
106
When the command is clicked, the "Create Project" window
shown in Figure 44 opens. Enter the project name in the
Name field of this window and press the OK button to close
the window. As a result, the project name is displayed in the
Project Explorer section of the main screen. Double click on
"Double-click to import data" on the project name and the
data set to be worked on is connected to the project. The
SmartPLS program is not a SPSS-compatible program like
AMOS, so it only accepts Excel files. If the data set to be used
is in the SPSS program, data can be easily transferred from the
SPSS to the Excel file.
After the data set has been connected, in the indicator section,
indicators are displayed as listed. Once you click on the project
name in the Project Explorer section, the white screen opens
again. When the desired indications in this segment are
selected in groups and are dragged in to the middle of screen,
107
it can be seen that the latent variable and its indicators are
automatically drawn as shown in Figure 45.
108
Figure 46. Coefficient Estimation Results
109
Figure 47. Estimation Results
110
BIBLIOGRAPHY
Aksu, G., Eser, M. T., & Güzeller, C. O. (2017). Açımlayıcı ve
Doğrulayıcı Faktör Analizi ile Yapısal Eşitlik Modeli
Uygulamaları. Ankara: Detay Yayıncılık.
111
Bayram, N. (2013). Yapısal Eşitlik Modellemesine Giriş. Bursa:
Ezgi Kitapevi.
Bentler, P. M., & Chou, C.-P. (1987). Practical Issues in
Structural Modeling. Sociological Methods Research, 16(1),
78-117.
Byrne, B. M. (2010). Structural Equation Modeling with AMOS.
New York: Routledge Taylor & Francis Group.
112
Efron, B. (1979). Bootstrap method: Another look at the
jackknife. Annals of Statistics, 7, 1-26.
Fornell, C., & Larcker, D. (1981). Evaluating Structural
Equation Models with Unobservable Variables and
Measurement Error. Journal of Marketing Research, 18(1),
39-50.
Gerbing, D., & Anderson, J. (1988). An Updated Paradigm for
Scale Development Incorporating Unidimensionality
and Its. Joournal of Marketing Research, 25(2), 186-192.
Gujarati, D. (1999). Essentials of Econometrics (2 b.). Singapore:
McGRAW-HILL.
Hair, J., Hult, G., Ringle, C., & Sarstedt, M. (2017). A primer on
partial least squares structural equation modeling PLS-SEM.
Los Angeles: SAGE.
Hancock, G., & Nevitt, J. (1999). Bootstrapping and the
identifcation of exogenous latent variables within
structural equation models. Structural Equation Modeling,
6, 394-399.
Henseler, J., Dijkstra, T., Sarstedt, M., Ringle, C.,
Diamantopoulos, A., & Straub, D. (2014). Common
beliefs and reality about partial least squares: Comments
on Rönkkö & Evermann (2013). Organizational Research
Methods, 17, 182-209.
Ichikawa, M. (1995). Application of the bootstrap methods in
factor analysis. Psychometrika, 60, 77-93.
Jayaram, J., Kannan, V., & Tan, K. (2004). Influence of
initiators on supply chain value creation. International
Journal of Production Research, 42(20), 4377-4399.
113
Karagöz, Y. (2016). SPSS ve AMOS 23 Uygulamalı İstatistiksel
Analizler. Ankara: Nobel.
Kline, R. (2011). Principles and practice of structural equation
modeling (3nd Ed. b.). New York: Guilford.
MacKinnon, D., Lockwood, C., Hoffman, J., West, S., &
Sheets, V. (2002). A comparison of methods to test
mediation and other intervening variable effects.
Psychological Methods, 7, 83-104.
114
Schermelleh-Engel, K., Moosbrugger, H., & Müller, H. (2003).
Evaluating the Fit of Structural Equation Models: Tests
of Significance and Descriptive Goodness-of-Fit
Measures. Methods of Psychological Research Online, 8(2), 23-
74.
Sipahi, B., Yurtkoru, E., & Çinko, M. (2010). Sosyal bilimlerde
SPSS'le Veri Analizi. İstanbul: Beta Basım A.Ş.
Tabachnick, B., & Fidell, L. (2001). Using multivariate statistics.
Boston: Ally and Bacon.
Taşkın, Ç., & Akat, Ö. (2010). Araştırma Yöntemlerinde Yapısal
Eşitlik Modelleme. Bursa: Ekin Basım Yayın.
115