Efa Vs Cfa
Efa Vs Cfa
10
Exploratory Factor Analysis versus
Confirmatory Factor Analysis
Siti Aishah Hassan and Kaseh Abu Bakar
INTRODUCTION
Nowadays, most empirical researchers read, hear, and use the various available
multivariate data analysis techniques, such as Multiple Regression Analysis (MRA),
Multiple Discriminant Analysis (MDA), Multiple Analysis of Variance
(MANOVA), Exploratory Factor Analysis/ Principle Component Analysis (PCA),
Confirmatory Factor Analysis (CFA), and Structural Equation Modelling (SEM).
Some of these data analyses are frequently used by education researchers. However,
other data analyses still remain novel to the researchers. Particularly, exploratory
factor analysis (EFA) is widely taught in statistics at the level of postgraduate
studies but confirmatory factor analysis (CFA) still remains exclusive to advanced
researchers.
Nevertheless, with the advancement in computer technology, postgraduate
students and researchers are expected not to rely on EFA only. More and more
user friendly software are available in the market to handle the needs of the more
advanced data analysis techniques. CFA, is predominantly beyond the capacity
provided by Statistical Package for Social Science (SPSS). Therefore, a number of
software programs such as AMOS, SAS, EQS, LISREL, SEPATH, CALIS along
with others are available to serve the needs of CFA. Yet, before researchers embark
on the more advanced technique of data analysis, they need to conceptually
understand the relationships between the two techniques. They also need to compare
the similarities and the differences of these two data analyses. Accordingly, they
will understand the link between the two techniques and able to use the techniques
appropriately.
Therefore, this chapter seeks to answer four germane questions: What is factor
analysis? Why conduct factor analysis? What are the similarities and differences
between exploratory and confirmatory factor analysis? What kind of research
questions may employ exploratory or confirmatory factor analysis? Finally, this
chapter provides examples of research report writing that employs both exploratory
and confirmatory factor analyses.
Exploratory Factor Analysis versus Confirmatory Factor Analysis 129
DEFINITION OF TERMS
In order to better comprehend this chapter, definitions of relevant terms are provided
below.
‘Factor Loadings’ are multiple correlations between factors and variables (items).
High loading indicates the factor significantly explained the variables. A
cutting point of .40 for loading is recommended.
‘Factor matrix’ is the table that displays the factor loadings of all variables on each
factor of orthogonal rotation.
‘Factor pattern matrix’ is the table that displays the factor loadings of all variables
on each factor of oblique rotation.
‘Orthogonal rotation’ is an extraction method of rotation for uncorrelated or very
low correlation among the factors.
‘Oblique rotations’ is an extraction method of rotation for highly correlated factors.
‘Squared multiple correlations’ (SMC) is the proportion variance explained of the
item by the factor. High SMC indicates a high reliability of the item.
‘Eigenvalue’ is the value that indicates the amount of variance accounted for by
each factor. For eigenvalues greater than 1, the total variance explained by
the emerging factors is at the optimum point. It is a value that normally
represents a sharp “elbow” or break point in the scree plot.
‘Scree plot’ is a graphical representation of the relationships between the number
of extracted factors and eigenvalues.
‘Bartlett test of sphericity’ is a test for overall significant correlation matrix. A
significant result indicates that the variables are highly correlated and hence
suitable for factor analysis study. In other words, some degree of
multicollinearity (intercorrelation) is desired.
‘Manifest variable’ is a directly measured variable. It is the item of the instruments.
‘Latent construct’ is the unobserved variable. It is the extraction of the hypothesized
factor. It is also known as dimension, factor, component, or underlying
structure.
‘Exogenous variable’ is a variable that may roughly be equated with an independent
or predictor variable, an arrow pointing outward.
‘Endogenous variable’ is a variable which may roughly be equated with a dependent
or criterion variable, an arrow pointing towards it.
identify separate dimensions of the structure, which later can be used for two
purposes - summarization and data reduction. Summarization is to describe the
data using a much smaller number of concepts than the original individual data.
Data reduction is to have a composite measure for each dimension and to substitute
them in place of individual data. This is particularly important in order to further
analyze data by using other statistical techniques such as multiple regressions or
path analysis (Abdul Hair et al., 1998).
EFA CFA
Similarity To condense a large number of variables into a smaller
number of factors
PURPOSE
Difference To explore the underlying To test the construct
construct validity (to “confirm”
the construct)
Similarity Application on new population
APPLICABILITY Newly developed Has conducted EFA
instrument Replication of the
Difference Newly adapted instrument construct validity test
onto a different sample
THEORY ON Similarity Factors conceptually define variables that are highly
FACTOR correlated to each other
“Grounded” theory - Theory grounded- test
generate theory theory
Difference No predetermined number The number of factors
of factors is fixed
Items free to load on any Items are forced to load
factor only on specific factors.
INTER FACTOR Similarity A concept or theory explains whether the factors are
CORRELATION correlated or uncorrelated
Difference Orthogonal rotation for Single order construct
uncorrelated factors for uncorrelated /low
inter factor correlation
Parallel Oblique rotation for Second order construct
correlated factors for highly inter-
correlated factor
SOFTWARE Similarity Both use SPSS
Difference SPSS only AMOS embedded in SPSS
132 Teachers’ Learning, Curriculum Innovations and Knowledge Applications
CONCLUSION
This chapter provides examples from our own research that have been published
in journals and the unpublished dissertation. It is worthy to note that due to the
limited number of pages that are allowed in journals, the details have been reduced.
For dissertation writing however, lengthy report writing is preferred. Hence, we
provide both styles of writing. The first example is a Principle Component Analysis
(PCA) with orthogonal rotation. The second example is a PCA with an oblique
rotation. The third example is a Confirmatory Factor Analysis (CFA) with a single
order construct. The fourth example is a CFA with a second order construct.
Extensive report writing is provided for each type of analysis.
In summary, exploratory factor analysis is conducted based on three focal
principles: to determine how many factors are present to label the factors, and to
allow the items the freedom to load on any factors. On the other hand, confirmatory
factor analysis is conducted to test the construct validity; the researcher’s a priori
sets the number of factors and forces the items to load only on a specified factor.
Both analyses are conducted to condense data and increase reliability and validity
of the instruments. Hence, prudent use of both techniques of factor analyses is
ensured when their relationships are understood.
Exploratory Factor Analysis versus Confirmatory Factor Analysis 133
Preliminary Analysis
In order to justify the use of principle component analysis, one of the important
assumptions to be assessed is the intercorrelations among the items. Barlett
Sphericity Test was statistically significant, χ2 (36) = 219.28, p = .001 the
variables were higly correlated to one another. Besides that, Kaiser-Meyer-
Olkin Measure of Sampling Adequacy was .650. Thus, there was evidence
for the overall measurement of sampling adequacy fulfilling the requirement
of at least .50. The individual measure of sampling adequacy is listed in
Table 1. The individual measure of sampling adequacy revealed that item 10
was low, MSA = .492. Thus, this item was considered as a candidate to be
deleted. Next, communality was assessed. It revealed that there were three
items needed to be deleted – items 12, 13, and 14. As a result, four items
were deleted from the original list of maternal involvement. Hence, parental
involvement inventory left with ten items.
Table
Table 1 Preliminary
1: Prelim analysis
inary analysis and and descriptive
descriptive statistics
statistics ofitem
of all all items
s
Item 1 2 3 4 5 6 7 8 9 10 11 12 13 14
1 .597
2 .360 .667
3 .464 .373 .507
4 .191 .116 .131 .641
5 .096 .131 .064 .553 .550
6 -.054 .198 .035 .208 .293 .682
7 -.109 .102 -.060 .172 .207 .271 .637
8 -.030 .226 .026 .158 .115 .163 .629 .544
9 .092 .290 .186 .132 .146 .059 .264 .327 .738
10 -.050 .158 .214 -.064 -.176 .147 .082 .025 .231 .492
11 -.081 .211 .008 .023 .084 .196 .303 .228 .314 .531 .630
12 .010 .017 -.029 .321 .237 .107 .281 .218 .141 -.061 .197 .684
13 .005 .129 .045 -.029 -.105 .050 .208 .089 .206 .197 .224 .250 .493
14 .060 .198 -.126 .134 .065 .256 .111 -.082 .219 .184 .196 .030 .014 .387
Commu .643 .557 .655 .650 .644 .416 .736 .812 .469 .689 .628 .363 .393 .232
M ean 3.03 3.65 3.33 .639 1.05 .98 .62 .69 1.97 2.99 2.63 2.20 2.53 2.38
SD .81 .83 .85 .92 .62 .74 .80 .93 1.12 1.2 1.13 1.3 0 1.01 1.06
Note: The diagonal ent ry is M SA
total variance explained. Factor 1 consisted of items 8, 9, 10, and 12. Whereas
factor 2 consisted of items 1, 2, and 3 and finally factor 3 consisted of items
4, 5, and 6. Analyzing the items of each factor, we noticed that factor 1 could
not be logically labeled, which consisted of item 11. This item is related to
Islamic dimension and does not logically related to the existing factor.
Thus, we finally conducted the third varimax rotation for the 9 items left
after discarding item 11. The results of the analysis showed there were three
latent variables emerged. This indicated that the three underlying dimensions
accounted for 61.82% of the total variance explained of nine items and found
to be higher as compared to the first (56.32%) and the second (58.17%) times
varimax rotations were conducted with fourteen and ten items respectively.
Thus, the solution of three latent structures of maternal involvement seemed
to be the best. The dimensions can be logically labelled as reading
encouragement for factor 1, homework monitoring for factor 2, and school
relationship for factor 3. The variance of the first dimension, the largest
eigenvalue was 2.47, while the other subsequence eigenvalues were 1.76 and
1.34, respectively. Inspection on the scree plot also pointed out the 9-items
measured three factors, with the sharp break point after the three factors. All
estimated factor loadings were large enough to be of practical significance at
p = .05; even the weakest loading extracted was .549. In addition, all of the
directions of loadings were positive and free from factorial complexity. The
results of the final factor solution are illustrated in Table 2.
In summary, the study suggests that there are three underlying dimensions of
maternal involvement construct among mothers of a selected Integrated
Primary Islamic Schools. The dimensions are maternal reading
encouragement, maternal homework monitoring, and maternal school
relationship.
Exploratory Factor Analysis versus Confirmatory Factor Analysis 135
REFERENCE
Arbuckle, J.L. (2007). AmosTM 16.0 User’s Guide. Bethelhem Pike, PA: Amos
Development Corporation.
Arbuckle, J.L. and Wothke, W. (1999). AMOS 4.0 User’s Guide. Chicago, IL:
Small Waters Corporation.
Armsden, G.C. and Greenberg, M.T. (1987). The inventory of parent and peer
attachment: Individual differences and their relationship to psychological well-
being in adolescence. Journal of Youth and Adolescence, 16, 427-425.
138 Teachers’ Learning, Curriculum Innovations and Knowledge Applications