Factor Analysis
Factor Analysis
FACTOR ANALYSIS
Abstract
In the world of biostatistics, factor analysis is like a trusted guide helping us uncover hidden
structures within complex datasets. It's our go-to tool for peeling back layers and understanding
the factors that influence the things we observe. Think of it as a special pair of glasses that lets us
see beneath the surface. We use two main approaches: Exploratory Factor Analysis (EFA) and
Confirmatory Factor Analysis (CFA). EFA is like an explorer, helping us discover factors without
preconceived notions. On the other hand, CFA is like a validator, testing a predefined model to
see if it fits our data. Choosing the right number of factors can be tricky, but we have tools like
scree plots, Kaiser's rule, and parallel analysis to help us navigate this decision-making process.
Factor rotation is like adjusting the lens on a camera, making our view clearer. It helps us better
understand which factors really matter. Comparing factor analysis with principal component
analysis (PCA) is like understanding two different tools in a toolbox. They might seem similar,
but they serve distinct purposes, and factor analysis tends to be more about understanding
underlying influences. In a real-world example—let's say, in genetic research—factor analysis is
the hero. It's what allows biostatisticians to sift through complex genetic data, revealing hidden
structures and turning them into meaningful insights. It's like a secret codebreaker, helping us
make sense of the biological mysteries that lie within the data
2.1.2 Process:
EFA involves an iterative process where the algorithm identifies the most significant factors that
explain the observed variance in the data. The researcher explores factor loadings,
communalities, and eigenvalues to interpret the underlying structure.
2.1.3 Applications:
EFA is commonly used in biostatistics for discovering hidden patterns in biological data, such as
identifying common genetic factors contributing to disease susceptibility across a population.
2.2.2 Process:
In CFA, the researcher specifies a model based on prior knowledge or theory. The algorithm then
assesses how well the observed data fit the hypothesized model, allowing for the validation or
refinement of the initial assumptions.
2.2.3 Applications:
CFA is applied in biostatistics when researchers want to confirm or reject a theoretical model of
factor structure, such as validating a measurement instrument for assessing patient-reported
outcomes.
These two types of factor analysis cater to different research scenarios, offering flexibility in
exploring unknown structures or confirming existing theories within the intricate realm of
biostatistics. In the subsequent sections, we will delve into the methods for determining the
optimal number of factors, a critical aspect of the factor analysis process.
Factor loading tells you how much each observed variable is associated with each latent factor.
Communalities indicate the proportion of variability in an observed variable explained by the
latent factors.
Eigenvalues measure the amount of variance explained by each factor, helping to determine the
significance of each factor in explaining the observed data
3.1.1 Explanation:
A scree plot is a graphical representation of eigenvalues plotted against the number of factors.
Eigenvalues represent the amount of variance explained by each factor. In a scree plot, a "knee"
or point where eigenvalues level off indicates the optimal number of factors.
3.1.2 Interpretation:
Researchers should inspect the scree plot for the point where eigenvalues sharply decrease,
suggesting the number of factors that adequately represent the variance in the data. This visual
inspection helps avoid over-extraction or under-extraction of factors.
3.2.2 Limitations:
While Kaiser's rule is a widely used guideline, it has limitations and may overestimate or
underestimate the number of factors. Researchers should consider other criteria alongside
Kaiser's rule for a more robust determination.
3.3.2 Advantages:
Parallel analysis is considered more accurate than Kaiser's rule and is particularly useful when
dealing with complex datasets. It provides a statistical foundation for determining the appropriate
number of factors.
In practice, a combination of these methods is often recommended to ensure a robust
determination of the number of factors. As we transition to the next section, it's important to note
the role of factor rotation in refining the interpretation of the identified factors.
Factor rotation is a technique used in factor analysis to simplify the interpretation of the factor
solution. After the initial extraction of factors, rotation is applied to adjust the factor loadings and
make the factors more interpretable. The goal is to achieve a simpler and more meaningful factor
structure.
5.1.2 Interpretability:
FA: Factors are interpreted as latent constructs influencing the observed variables. It
provides insight into the underlying structure of the data.
PCA: Principal components are linear combinations of the original variables. While they
capture maximum variance, their interpretability may be challenging.
6.3 Interpretation
In this example, the factor loadings will indicate how much each aspect (Sweetness, Bitterness,
Aroma, Temperature) is influenced by the hidden factors. The communalities will tell you how
much of each variable is explained by these factors. The scree plot will help you visually inspect
the eigenvalues.
7. Conclusion
In this chapter, we embarked on a journey through the intricacies of factor analysis in
biostatistics, uncovering its fundamental concepts, methodologies, and practical applications.
Let's recap the key takeaways:
But, just like any detective work, using factor analysis involves making careful choices. Think of
it as deciding which clues to follow, determining how many factors to investigate, and then
piecing together the puzzle of results. It's a bit like choosing the right path in a dense forest—
there are different routes, and each decision matters
.References
https://www.statisticssolutions.com/free-resources/directory-of-statistical-analyses/factor-
analysis/#:~:text=Factor%20analysis%20is%20a%20technique,them%20into%20a
%20common%20score.
https://www.analytixlabs.co.in/blog/factor-analysis-vs-pca/#What_is_Factor_Analysis
https://docs.tibco.com/pub/stat/14.0.0/doc/html/UsersGuide/GUID-6087F7F5-F407-
45FA-B6DA-24674BCDC07C.html
https://www.youtube.com/watch?v=TeIx7dRedkg
https://www.youtube.com/watch?v=WV_jcaDBZ2I&t=332s