CH 03
CH 03
CH 03
Chapter 3:
Exploratory Factor Analysis
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
LEARNING OBJECTIVES
Overview
4
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
5
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
6
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
7
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Research Plan: nine store image elements, including measures of the product offering, store personnel,
price levels, and in-store service and experiences
V1 V2 V3 V4 V5 V6 V7 V8 V9
V1 Price Level 1.00
V2 Store Personnel .427 1.00
V3 Return Policy .302 .771 1.00
V4 Product Availability .470 .497 .427 1.00
V5 Product Quality .765 .406 .307 .472 1.00
V6 Assortment Depth .281 .445 .423 .713 .325 1.00
V7 Assortment Width .354 .490 .471 .719 .378 .724 1.00
V8 In-Store Service .242 .719 .733 .428 .240 .311 .435 1.00
V9 Store Atmosphere .372 .737 .774 .479 .326 .429 .466 .710 1.00 8
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
V3 V8 V9 V2 V6 V7 V4 V1 V5
V3 Return Policy 1.00
V8 In-store Service .733 1.00
V9 Store Atmosphere .774 .710 1.00
V2 Store Personnel .741 .719 .787 1.00
V6 Assortment Depth .423 .311 .429 .445 1.00
V7 Assortment Width .471 .435 .468 .490 .724 1.00
V4 Product Availability .427 .428 .479 .497 .713 .719 1.00
V1 Price Level .302 .242 .372 .427 .281 .354 .470 1. 00
V5 Product Quality .307 .240 .326 .406 .325 .378 .472 .765 1.00
9
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
V8 In-store Service
In-store Experience
V9 Store Atmosphere
V2 store Personnel
V6 Assortment Depth
V4 Product Availability
V1 Price Level
Value
V5 Product Quality 10
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
STAGE 1: OBJECTIVES OF
EXPLORATORY FACTOR ANALYSIS
Specifying the Unit of Analysis
Achieving Data Summarization Versus Data Reduction
Variable Selection
Using Factor Analysis with Other Multivariate Techniques
12
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
13
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Unit of Analysis
• Variables – most common application, termed R-factor analysis.
• Cases – grouping cases, similar to cluster analysis, termed Q-factor analysis.
STAGE 2: DESIGNING AN
EXPLORATORY FACTOR ANALYSIS
Variable Selection and Measurement Issues
Sample Size
Correlations among Variables or Respondents
16
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Conceptual Issues
Statistical Issues
18
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Assumptions of EFA
Conceptual
• Before the factor analysis is performed, there must be a strong conceptual
foundation to support the assumption that a structure does exist.
• Assume a homogeneity of sub-sample factor solutions
Statistical Issues Addressing Multicollinearity
• A statistically significant Bartlett’s test of sphericity (sig. < .05) indicates that sufficient
correlations exist among the variables to proceed with an exploratory factor analysis.
• More diagnostic test is the MSA (measure of sampling adequacy).
• Measured by the Kaiser-Meyer-Olkin (KMO) statistic that indicates if a set of variables is likely
identify a good factor solution based on correlations/partial correlations.
• KMO statistic exists for each individual variable and an overall value that can vary from 0 to 1.0.
• KMO value must exceed .50 for both the overall test and each individual variable. Variables with
values less than .50 should be omitted from the factor analysis one at a time, with the smallest one
being omitted each time, until both overall and all individual values are above .50
19
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
20
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
22
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
A Priori Criterion
• researcher knows how many factors to extract and specifies for factor extraction.
Latent Root Criterion (aka Kaiser rule)
• most commonly used technique.
• factors having latent roots or eigenvalues greater than 1 are considered significant.
• most applicable to principal components analysis where the diagonal value
representing the amount of variance for each variable is 1.0.
• less accurate with a small number of variables or lower communalities.
Percentage of Variance
• Extract enough components to achieve a specified cumulative percentage of total
variance extracted.
24
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Parallel Analysis
• Empirical measure based on the specific characteristics (i.e., number of variables and
sample size) of the dataset being analyzed.
Procedure
1. Generate a large number (e.g., 500 or 1000) simulated datasets with random values
for the same number of variables and sample size.
2. Apply either principal components or common factor analysis to each dataset.
3. Calculate average eigenvalue across datasets for each factor(i.e., first, second, . . . )
4. Compare to eigenvalues for original data and retain all factors that are greater than
average eigenvalues from random datasets
• Variants of this rule suggest taking the upper bound (95th percentile) of the simulated eigenvalues.
the principal component analysis model is most appropriate when data reduction is paramount.
the common factor model is best in well-specified theoretical applications.
• A decision on the number of retained factors should be based on several considerations:
Use several stopping criteria to determine the initial number of factors to retain.
Factors With Eigenvalues greater than 1.0.
A pre-determined number based on research objectives and/or prior research.
Enough factors for a specified percentage of variance explained (e.g., usually 60%).
Factors shown by the scree test to have substantial amounts of common variance (i.e, before inflection point).
Factors which have eigenvalues greater than factors from randomly-generated data.
Factors above the threshold established by parallel analysis.
More factors when there is heterogeneity among sample subgroups.
• Consideration of several alternative solutions (one more and one less factor than the initial
solution) to ensure the best structure is identified. 27
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Alternatives to Principal Components and Common Factor
Analysis
Factor Analysis of Categorical Data
• Optimal scaling is a process to derive interval measurement properties for
variables which were originally nominal or ordinal measures.
• Two issues
• optimal scaling works within the variable set specified (e.g., maximizes the correlations among
the variables in the analysis), but this scaling may change in other analyses.
• the researcher sets a series of allowable transformations (e.g., retain ordinality or not).
Variable Clustering
• Variant of PCA popularized in the PROC VARCLUS of SAS.
• Form of “sequential” factor extraction where factors are split to form new factors
• Benefits include:
• not impacted by the number of variables included like PCA.
• based on a hierarchical process of component formation.
• introduces additional diagnostics for selecting the most representative variable(s).
28
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
29
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Rotation of Factors
Procedure
• The reference axes of the factors are turned about the origin until some other
position has been reached. Loadings of each variable remain fixed relative to other
loadings.
Impact
• The ultimate effect of rotating the factor matrix is to redistribute the variance
from earlier factors to later ones to achieve a simpler, theoretically more
meaningful factor pattern.
Alternative Methods
• Orthogonal rotation – simplest approach which maintains orthogonality of factors.
• Most common method – VARIMAX.
• Oblique rotation – allows for correlation among rotated factors.
31
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
32
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
33
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Starting Point:
• Factor matrix has loadings of every variable on every factor extracted.
• Examine each variable for the highest loading/significant loading on a factor.
• Cross-loading – when a variable has a significant loading on more than one factor.
36
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
38
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Communality
• Represents the amount of variance accounted for by the factor solution for each
variable.
• Calculated as the sum of the squared loadings across all retained factors.
40
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Objective:
• Assign some meaning to the pattern of factor loadings.
• Variables with higher loadings are considered more important and have greater
influence on the name or label selected to represent a factor.
Considerations
• The signs are interpreted just as with any other correlation coefficients. For each
factor, like signs mean the variables are positively related, and opposite signs
mean the variables are negatively related.
• The label is intuitively developed by the researcher based on its appropriateness
for representing the underlying dimensions of a particular factor.
41
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
• An optimal structure exists when all variables have high loadings only on a
single factor.
• Variables that cross-load (load highly on two or more factors) are usually deleted
unless theoretically justified or the objective is strictly data reduction.
• Variables should generally have communalities of greater than .50 to be
retained in the analysis.
• Respecification of a factor analysis can include options such as:
• deleting a variable(s),
• changing rotation methods, and/or
• increasing or decreasing the number of factors.
42
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
STAGE 6: VALIDATION OF
EXPLORATORY FACTOR ANALYSIS
43
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Approaches to Validation
45
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Advantages:
• simple to administer and interpret.
Disadvantages:
• does not represent all “facets” of a factor.
• prone to measurement error.
46
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Unidimensionality
• Never create a summated scale without first assessing its unidimensionality with
exploratory or confirmatory factor analysis.
Reliability
• With unidimensionality, its reliability score, as measured by Cronbach’s alpha:
• should exceed a threshold of .70, although a .60 level can be used in exploratory research.
• the threshold should be raised as the number of items increases, especially as the number of items
approaches 10 or more.
Validity
• With reliability established, validity should be assessed in terms of:
• convergent validity = scale correlates with other similar scales.
• discriminant validity = scale is sufficiently different from other related scales.
• nomological validity = scale “predicts” as theoretically suggested.
48
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Advantages:
• represents all variables loading on the factor.
• best method for complete data reduction.
• by default, factors (and factor scores) are orthogonal and can avoid complications
caused by multicollinearity.
Disadvantages:
• interpretation more difficult since all variables contribute through loadings.
• difficult to replicate across studies.
49
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
50
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
Research Design
Research Objective:
• Assess dimensional structure among 13 Performance Perception Variables
Variables used Variable Description Variable Type
Analysis
52
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈
53
For use with Hair, Black, Babin and Anderson, Multivariate Data Analysis 8e © 2018 Cengage Learning EMEA
© 2019 Cengage 版權所有,為課本著作之延伸教材,亦受著作權法之規範保護,僅作為授課教學使用,禁止列印、影印、未經授權重製和公開散佈