Factor Analysis: Nazia Qayyum SAP ID 48541
Factor Analysis: Nazia Qayyum SAP ID 48541
Nazia Qayyum
SAP ID 48541
What is Factor Analysis
• Factor analysis is used as a data reduction technique.
• Factor analysis takes a large number of variables and reduces or summarizes it to
represent them in different smaller factors, those factors are made up of the initial
set of variables.
• It is a method for investigating whether a number of variables of interest are related
to a smaller number of unobservable factors. This is done by grouping variables
based on inter-correlations among a set of variables.
• Those initial variables are the manifest variables/observed variables while the
factors that are extracted in the process are the latent variables.
What is Factor Analysis
• A common usage of factor analysis is in developing scale/questionnaires for
measuring constructs that are not directly observable in real life.
• The factor Analysis technique primarily examines the systematic interdependence
among a set of observed variables (through correlation), and those variables, that
have higher correlation are grouped together.
• Factor analysis helps the scholars answer the question that “How well do the items
go well together? In case we are building a new Scale”
EFA Vs CFA
• used to either confirm a priori established theories or identify data patterns and
relationships
• confirmatory when testing the hypotheses of existing theories and concepts and
exploratory when they search for latent patterns in the data in case there is no or only
little prior knowledge on how the variables are related.
• When EFA is applied to a data set, the method searches for relationships (variables
with high correlation are grouped together) between the variables in an effort to
reduce a large number of variables to a smaller set of composite factors (i.e.,
combinations of variables).
EFA Vs CFA
The final set of composite factors is a result of exploring relationships in the data and
reporting the relationships that are found (if any).
In simple words, EFA is an exploratory technique that is utilized in research to group
a large number of variables (observed variables) into smaller representative factors
(latent factors) whereas CFA is utilized to test a particular set of relationships based
on some theory and to ascertain the data fits the proposed model in an adequate
manner.
Assumptions of Factor Analysis
The Problem:
Investigate if there is any smaller number of unobservable factors in the 19 variables
that measure University Social Responsibility on which the data is available. The
example is based on scale development. Initial items identified to measure University
Social Responsibility were 19, the researcher would like to assess if there are any
underlying dimensions.
Steps to run Factor Analysis
Choose Analyze → Dimension
Reduction → Factor
The resulting dialog box is shown
in Figure
Select the variables from the
left-hand side box and
transfer them to the box
labeled Varibles
Click on the Descriptives button which
brings up a dialog box as shown in the
figure. In the Statistics section, make sure
that Initial Solution is ticked. In the
section marked Correlation Matrix, select
the options Coefficients and KMO and
Bartlett’s test of sphericity. Click
on Continue.
Click on the button
labeled Extraction which
brings up a dialog box as shown
in Figure.
•There are many extraction methods listed, which can be obtained by clicking on the
drop-down arrow in the box against Method. Two commonly used extraction methods
are Principal Components and Principal Axis Factoring. I have selected Principal
Axis Factoring in this case. Also, check the Scree plot check box.
•Next select whether we want to analyze the correlation matrix or the covariance matrix
for FA. The recommended option for beginners is to use the correlation matrix,
advanced users may, however, choose the covariance matrix for special cases.
•Click against Unrotated factor solution and Scree plot to display the two in the
output.
•SPSS allows specifying the number of factors we want to extract. Default setting is to
choose factors with eigenvalues greater than 1 as factors with eigenvalues less than 1 do
not carry enough information. We can also specify the number of factors if we have a
specific requirement to extract a certain number of factors.
Click on Continue to return to
the main dialog box.
Next, click on the button labeled Rotation, to
specify the specific rotation strategy you want
to adopt. This brings up a dialog box as shown
in Figure
The SPSS program gives five options for
rotations. Select Varimax from this box. Click
on Continue to return to the main dialog box.
Finally click on the button labeled Options,
which will bring up a dialog box as shown in
Figure. It is advisable to suppress values
below 0.40 as this is a standard criterion used
by researchers to identify important factor
loadings. We have not done this in order to
present the full output.
Click on Continue to return to the main
dialog box and click on OK to run the
analysis.
Step 7: Interpretation and Reporting
An EFA was performed using a principal component analysis and varimax rotation. The minimum
factor loading criteria was set to 0.50. The communality of the scale, which indicates the amount
of variance in each dimension, was also assessed to ensure acceptable levels of explanation. The
results show that all communalities were over 0.50.
An important step involved weighing the overall significance of the correlation matrix through
Bartlett’s Test of Sphericity, which provides a measure of the statistical probability that the
correlation matrix has significant correlations among some of its components. The results were
significant, x2(n = 215) = 2013.292 (p < 0.001), which indicates its suitability for factor analysis.
The Kaiser–Meyer–Olkin measure of sampling adequacy (MSA), which indicates the
appropriateness of the data for factor analysis, was 0.931. In this regard, data with MSA values
above 0.800 are considered appropriate for factor analysis. Finally, the factor solution derived
from this analysis yielded four factors for the scale, which accounted for 57.753 per cent of the
variation in the data.
Nonetheless, in this initial EFA, two items (i.e. “RDR1: The university is involved in
funding ‘relevant’ research.”, “PR1: The university is performing in a manner
consistent with the philanthropic and charitable expectations of society.”) failed to
load on any dimension significantly. “RDR2: Students are educated regarding their
social responsibility in their area of specialization.” loaded onto a factor other than its
underlying factor. Hence, the three items were removed from further analysis.
The authors repeated the EFA without including these items. The results of this new
analysis confirmed the five-dimensional structure theoretically defined in the
research (see Table). The Kaiser–Meyer–Olkin MSA was 0.917. The three
dimensions explained a total of 60.798 per cent of the variance among the items in
the study. The Bartlett’s Test of sphericity proved to be significant and all
communalities were over the required value of 0.500. The four factors identified as
part of this EFA aligned with the theoretical proposition in this research. Factor 1
includes items ER1 to ER7, referring to Ethical Responsibilities (ER). Factor 2
gathers items RDR2 to RDR6, which represents Research and Development
Responsibilities (RDR). Finally, Factor 3 includes items PR2 to PR6, referring to
Philanthropic Responsibilities (PR). Factor Loadings are presented in table.
Reporting
Articles as examples