Using Latent Class Analysis To Model Prescription Medications in The Measurement of Falling Among A Community Elderly Population
Using Latent Class Analysis To Model Prescription Medications in The Measurement of Falling Among A Community Elderly Population
Abstract
Background: Falls among the elderly are a major public health concern. Therefore, the possibility of a modeling
technique which could better estimate fall probability is both timely and needed. Using biomedical,
pharmacological and demographic variables as predictors, latent class analysis (LCA) is demonstrated as a tool for
the prediction of falls among community dwelling elderly.
Methods: Using a retrospective data-set a two-step LCA modeling approach was employed. First, we looked for
the optimal number of latent classes for the seven medical indicators, along with the patients prescription
medication and three covariates (age, gender, and number of medications). Second, the appropriate latent class
structure, with the covariates, were modeled on the distal outcome (fall/no fall). The default estimator was
maximum likelihood with robust standard errors. The Pearson chi-square, likelihood ratio chi-square, BIC, Lo-
Mendell-Rubin Adjusted Likelihood Ratio test and the bootstrap likelihood ratio test were used for model
comparisons.
Results: A review of the model fit indices with covariates shows that a six-class solution was preferred. The
predictive probability for latent classes ranged from 84% to 97%. Entropy, a measure of classification accuracy, was
good at 90%. Specific prescription medications were found to strongly influence group membership.
Conclusions: In conclusion the LCA method was effective at finding relevant subgroups within a heterogenous at-
risk population for falling. This study demonstrated that LCA offers researchers a valuable tool to model medical
data.
2013 Hardigan et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative
Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and
reproduction in any medium, provided the original work is properly cited.
Hardigan et al. BMC Medical Informatics and Decision Making 2013, 13:60 Page 2 of 7
http://www.biomedcentral.com/1472-6947/13/60
and the refinement of the measurement criteria are typ- or no fall) and second what covariates increase or decrease
ically inter-related research goals. Appropriate statistical the likelihood of this occurrence. The four specific aims of
applications, such as latent class analysis, have become the study are to identify items that indicate classes,
available for researchers to model the complex hetero- estimate class probabilities, relate the class probabilities to
genous measurements. covariates, and predict a distal outcome (fall/no-fall) based
Latent class models are used to cluster participants. on class membership. We model this process through the
This type of model is adequate if the sample consists of application of latent class analysis (Figure 1).
different subtypes and it is not known before-hand
which participant belongs to which of the subtypes [2].
The latent categorical variable is used to model hetero- Methods
geneity. In the classic form of the latent class model, A convenient retrospective database consisting of a ran-
observed variables within each latent class are assumed dom sample of 3,293 elderly patients was used to develop
to be independent, and no structure for the covariances a model to predict the likelihood of falling among individ-
of observed variables is specified [2]. uals aged 65 years or older. Due to the retrospective
LCA is one of the most widely used latent structure nature of this study, this study was granted an exemption
models for categorical data [12]. LCA differs from more in writing by the Nova Southeastern Universitys IRB.
well-known methods such as K-means clustering which This is a poof-of-concept analysis so it should be noted
apply arbitrary distance metrics to group individuals that this data set was not designed for an LCA, therefore,
based on their similarity [13-15]. LCA derives clusters additional medical variables which may predict falling
based on conditional independence assumptions applied were not available. For this study an elderly person was
to multivariate categorical data distributed as binomial defined as someone aged 65 years or older. Descriptive
or multinomial variables [16,17]. Using statistical distri- data were as follows (Table 1): The average age of patients
butions rather than distance metrics to define clusters was 77 years old; 32 percent of the subjects had fallen in
helps in evaluating whether a model with a particular the last 30 days; falling patients were taking an average of
number of clusters is able to fit the data, since tests can five prescription medications while non-fallers were con-
be performed to observed (ni) versus model expected suming two; and 75 percent of the subjects were female.
values (mi), using exact methods as recommended [18,19]. Research demonstrates that about 22% of community-
This comparison gives rise to a 2 test of global model fit, dwelling elderly persons fall each year; 10% of these
in which significant values indicate lack of fit [20]. Here "fallers" have multiple episodes [21]. This research was ap-
lack of fit means deviation of (model) predicted (m) proved by Nova Southeastern University's Institutional
frequencies from observed frequencies (n) [16]. Review Board for human subjects research.
Latent class analysis assumes that each observation is a
member of one and only one latent class (unobservable)
and that the indicator (manifest) variables are mutually in-
dependent of each other [20]. The models are expressed
in probabilities of belonging to each latent class. For
example, seven manifest variables can be expressed as:
AjX BjX CjX DjX EjX FjX GjX
ijklmnot Xt it jt kt lt mt nt ot
Table 1 Descriptive statistics cerebral ischemia. Data from both principal and
Variable Statistic No-Fall Fall secondary diagnosis fields within a patient record.
N = 1906 N = 908
Age Mean SD 77.47 6.91 77.98 7.41 Pharmacological variables
Number of Medications Mean SD 2.30 5.57 5.10 10.10
Type of prescription medicationtype of
Gender Male 27% 22%
prescription medication was taken from patient
Female 73% 78% records.
Number of prescription medicationswas taken
The data set was taken from the State of Floridas Elder from patient records.
Affairs Office. All variables were physician diagnosed and
recorded in an electronic dataset using appropriate ICD-9 Demographic variables
codes. Variables included in the database were:
Agewas taken from patient records.
Biomedical GenderSelf reported male or female taken from
patients record.
Arthritisdefined as a person diagnosed with
osteoarthritis (OA) and/or rheumatoid arthritis Outcome
(RA). Presence or absence of arthritis was based on
responses to questions on the basis of ICD-9 714.0, Fallingwas defined as an event which results in
715. -716., from both principal and secondary the person coming to rest inadvertently on the
diagnosis fields within a patient record. ground or other lower level, and other than as a
High Blood Pressure (HBP)defined as a person consequence of sustaining a violent blow. Falling
diagnosed with hypertension. HBP was identified on was taken from both principal and secondary
the basis of ICD-9 codes 401405, from both diagnosis fields within a patient record.
principal and secondary diagnosis fields within a
patient record. A two-step modeling approach was employed. First, it
Diabetesdefined as a person diagnosed with was necessary to reduce the number of different medica-
diabetes mellitus. Diabetes was identified on the tions (N = 121). Initially, a licensed geriatric pharmacist
basis of ICD-9 codes of 250.0250.5 and 250.7 (PharmD) reviewed the medication list for accuracy and
250.9 from both principal and secondary diagnosis to remove medications that have not been shown to im-
fields within a patient record. pact the probability of falling. Using correspondence
Heart Disease (HD)defined as a person diagnosed analysis (CA) the medications were converted to con-
with coronary artery disease. HD was identified on tinuous scores. CA is an exploratory technique related
the basis of ICD-9 codes 414.0x, from both principal to principal components analysis which finds a multidi-
and secondary diagnosis fields within a patient mensional representation of the association between the
record. row and column categories of a multi-way contingency
Foot Disorders (FD)defined as a person diagnosed table [22]. This technique finds scores for the row and
with peripheral neuropathy, foot wounds, peripheral column categories on a small number of dimensions
vascular disease, or Charcot arthropa. FD was which account for the greatest proportion of the chi2 for
identified on the basis of ICD-9 codes 356.9, 892.0- association between the row and column categories, just
892.2, 443.9, and 713.5 from both principal and as principal components account for maximum variance
secondary diagnosis fields within a patient record. [22]. These scores were then used in the latent class ana-
Parkinsons Disease (PD)defined as a person lysis. Similar to other data reduction techniques, CA can
diagnosed with Parkinsons Disease. PD was be used to transform data [23].
identified on the basis of ICD-9 code 332.0 from Second, we looked for the optimal number of latent clas-
both principal and secondary diagnosis fields within ses for the seven binary indicators: (1) arthritis, (2) high
a patient record. blood pressure, (3) diabetes, (4) heart disease, (5) foot disor-
Strokedefined as a person diagnosed with ders, (6) Parkinsons disease, and (7) stroke; along with the
occlusion and stenosis of precerebral arteries patients medication score and three covariates (age, gen-
including basilar artery, carotid artery, and vertebral der, and number of medications). The appropriate latent
artery, etc.; occlusion of cerebral arteries including class structure, with the covariates, were modeled on the
cerebral thrombosis and Cerebral embolism; distal outcome (fall/no fall). The default estimator was max-
unspecified cerebral artery occlusion; and transient imum likelihood with robust standard errors. The Pearson
Hardigan et al. BMC Medical Informatics and Decision Making 2013, 13:60 Page 4 of 7
http://www.biomedcentral.com/1472-6947/13/60
chi-square, likelihood ratio chi-square, (BIC), Lo-Mendell- Figure 2 Proposed fall model for the latent class analysis. This
is a plot of the probability of falling by correspondence analysis
Rubin Adjusted Likelihood Ratio test and the bootstrap
derived drug score.
likelihood ratio test were used for model comparisons.
Hardigan et al. BMC Medical Informatics and Decision Making 2013, 13:60 Page 5 of 7
http://www.biomedcentral.com/1472-6947/13/60
FD
BP
PD
ke
D
rit
ro
H
th
St
Ar
number of medications at their means, females with a 6. Cummings RG: Epidemiology of medication-related falls and fractures in
drug falling measure of 1.50 [i.e., Thioridazine & the elderly. Drugs Aging 1998, 12:4353.
7. Kannus SR, Sherrington C, Menz H: Falls in older people: risk factors and
Amoxapine] have a 80% greater chance of falling than the strategies for prevention. Cambridge: Cambridge University Press; 2001.
same subjects with a drug falling measure of -0.50 [i.e., 8. Weir E, Culmer L: Fall prevention in the elderly population. Can Med Assoc
Imipramine & Methadone] (p < 0.05). We stress that the J 2004, 171:724.
9. Tinetti ME: Preventing falls in elderly persons. N Engl J Med 2003,
latent classes are composite variables, so one should not 348:4249.
look at medications in isolation. As one would expect, the 10. Kannus P, Sievnen H, Palvanen M, et al: Prevention of falls and consequent
two latent classes with the highest probability of falling injuries in elderly people. Lancet 2005, 366:18851893.
11. Kannus P, Palvanen M, Niemi S: Time trends in severe head injuries
also possess the highest drug falling measure and the among elderly Finns. J Am Med Assoc 2001, 286:673674.
worst medical conditions. 12. Croudace TJ, Jarvelin MR, Wadsworth ME, Jones PB: Developmental
typology of trajectories to nighttime bladder control: Epidemiologic
application of longitudinal latent class analysis. Am J Epidemiol 2003,
Conclusion 157:834842.
As was demonstrated in past research correspondence 13. Everitt BS: The analysis of contingency tables. London: Chapman & Hall; 1992.
analysis is a useful tool for researchers examining prescrip- 14. Everitt BS, Hand DJ: Finite mixture distributions. London: Chapman & Hall;
1991.
tion medication data [25]. Combining LCA with CA pro- 15. Magidson J, Vermunt JK: Latent class models for clustering: a comparison
vides researchers a powerful tool for data reduction with K-means. Canadian Journal of Marketing Research 2002, 20:3744.
analysis. We demonstrated that this approach was effect- 16. Ploubidis GB, Abbott RA, Huppert FA, Kuh D, Wadsworth EJ, Croudace TJ:
Improvements in social functioning reported by a birth cohort in mid-
ive for finding relevant subgroups with a heterogenous at- adult life: A person-centred analysis of GHQ-28 social dysfunction items
risk population for falling. Nevertheless, the results may using latent class analysis. Personal Individ Differ 2007, 42:305316.
not be relevant to other countries, with different lifestyles 17. In CCC, Arminger G, Clogg CC, Sobel ME: Handbook of statistical modeling
for the social and behavioral sciences. New York: Plenum; 1995.
and different socio-economic status. 18. Read TR, Cressie N: Goodness-of-fit statistics for discrete multivariate data.
LCA and CA possess limitations which make its applica- New York: Springer; 1988.
tion to this type of modeling dependent on replication 19. Langeheine R, Pannekoek J, van de Pol F: Bootstrapping goodness-of-fit
measures in categorical data analysis. Sociol Methods Res 1996,
studies. The specific limitations include (1) Classes not 24:492516.
known prior to analysis, and (2) Class characteristics not 20. Magidson J, Vermunt J: Latent class models. [http://www.
know until after analysis. Both of these problems are re- statisticalinnovations.com/articles/sage11.pdf]
21. Scott V, Donaldson M, Gallagher E: A review of the literature on best
lated to LCA being an exploratory procedure for under- practices in falls prevention of long-term care facilities. Long Term Care
standing data. Furthermore, the items were not designed Falls Review 2003. September.
for a LCA approach. A latent class study designed a-priori 22. Ezekiel J, Mordecai L, Fox K: Methods of Correlation and Regression Analysis.
3rd edition. New York: WIley and Sones; 1959.
may offer better solutions. We would also suggest that 23. Friendly M: Categorical Data Analysis with Graphics: Part 5 Correspondence
additional items (medical) be used which have demon- Analysis. [http://www.statisticalinnovations.com/articles/sage11.pdf]
strated to impact falling among elderly community 24. Lewis-Beck MS, Bryman A, Liao TF: The Sage encyclopedia of social science
research methods. New York: Sage Publications Inc.; 2004.
dwellers--such as eye disease and pain. 25. Inciardi JF, Stijnen T, McMahon K: Using correspondence analysis in
pharmacy practice. Am J Health Syst Pharm 2002, 59:968972.
Competing interests
The authors declare that they have no competing interests.
doi:10.1186/1472-6947-13-60
Cite this article as: Hardigan et al.: Using latent class analysis to model
Authors contributions prescription medications in the measurement of falling among a
PH, DS and WH participated in the design, coordination, project planning community elderly population. BMC Medical Informatics and Decision
and data collection. PH performed the statistical analysis and drafted the Making 2013 13:60.
manuscript. All authors read and approved the final manuscript.
Author details
1
Department of Public Health, Nova Southeastern University, 3200 South
University Dr., Health Professions Division, Ft. Lauderdale, FL 33328, USA. 2The
ElderCare Companies, Inc, 2517 State Rt. 35, Bldg. J Ste. 203, Manasquan, NJ
08736, USA. 3College of Pharmacy, Nova Southeastern University, 3200 South
University Dr., Health Professions Division, Ft. Lauderdale, FL 33328, USA. Submit your next manuscript to BioMed Central
and take full advantage of:
Received: 8 August 2012 Accepted: 13 May 2013
Published: 25 May 2013
Convenient online submission