Survival Data Research
European Journal of Medical Research (2024) 29:452
https://doi.org/10.1186/s40001-024-02026-9
Abstract
Background and purpose A stroke or a cerebrovascular accident is a common cause of death and a leading cause
of long-term, severe disability in both developed and developing countries. The most recent global burden of dis-
ease report states that there were 11.9 million new cases of stroke worldwide; stroke accounts for nearly 1 in 8 deaths
globally (12%, 6.5 million deaths) and claims a life every 5 s, making it the second most common cause of death
worldwide. The goal of the study was to identify the most important factors influencing stroke patients’ time to death
at Gambella General Hospital.
Methods Data was gathered from patient files in a hospital using a retrospective study methodology, spanning
the period from September 2018 to September 2020. R 3.4.0 statistical software and STATA version 14.2 were used
for data entry and analysis. The survival time was compared using the log-rank tests and the Kaplan–Meier survival
curve. The fitness of the Cox proportional hazard model was examined.
Results The final model that was fitted was the log-logistic AFT model. Statistical significance was defined as a p value of less than 0.05, and the acceleration factor (γ) with its 95% confidence interval was reported. The overall median time to death was eight days (95% CI 6–10). Significant predictors of shortened time to death were age (γ = 0.94; 95% CI 0.920–0.980), hypertension (γ = 0.63; 95% CI 0.605–0.660), and baseline complications (γ = 0.24; 95% CI 0.223–0.256).
Conclusions The shortened timing of death was significantly predicted by age, hypertension, and baseline compli-
cations. In light of the study’s findings, health administrators and caregivers should work to improve society’s overall
health.
Keywords Stroke, Survival, Time to death, Survival analysis
*Correspondence:
Chekol Alemu
[email protected]
1 Department of Statistics, College of Natural and Computational Sciences, Gambella University, Gambella, Ethiopia
2 Monitoring, Evaluation, Accountability and Learning (MEAL) Officer, Doctors with Africa-CUAMM, Gambella, Ethiopia
3 Department of Rural Development and Agricultural Extension, College of Agriculture and Natural Resource, Gambella University, Gambella, Ethiopia
4 Department of Statistics, College of Natural Sciences, Jimma University, Jimma, Ethiopia
© The Author(s) 2024. Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0
International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long
as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if
you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or
parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated
otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not
permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To
view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
Alemu et al. European Journal of Medical Research (2024) 29:452 Page 2 of 10
Introduction
A stroke is caused by disruption of the brain’s blood flow, most often when a blood vessel bursts or becomes blocked by a clot. Damage to the brain tissue results from cutting off the delivery of oxygen and nutrients. In both industrialized and developing nations, a stroke or cerebrovascular accident is a frequent cause of death and a major factor in severe, long-term disability [1].
According to the most recent report on the global burden of disease, there were 11.9 million new cases of
stroke worldwide. Nearly 1 in 8 deaths (12%, 6.5 million deaths) were attributed to stroke, making it the second most common cause of death worldwide [2]. Every 2 s, a stroke occurs somewhere in the world due to the increasing burden of the disease [3]. By the end of 2030, it is predicted that stroke will have increased to 23 million new cases and 7.8 million deaths per year in the absence of a strong global public health response [4].
There are no methodologically sound stroke studies in Sub-Saharan Africa, including Ethiopia [5]. Additionally, earlier studies on stroke in Ethiopia and the rest of Africa were primarily descriptive summaries of stroke types, subtypes, patient risk profiles, and risk factor magnitude. In Ethiopia, however, hospital admissions due to stroke are increasing over time. According to the latest data published in 2017, stroke deaths in Ethiopia reached 39,571 or 6.23% of total deaths [6]. The age-adjusted stroke death rate is 89.82 per 100,000 people [6]. In high-income nations, the estimated age-adjusted incidence rate in 2010 was 138.9 per 100,000 person-years, while in low- and middle-income countries, it was 182.6 per 100,000 person-years [7].
Studies from 61 low-income individuals found an increase in hemorrhagic and ischemic strokes of 22% (from 5 to 30%) and 6% (from −7 to 18%), respectively [8]. Even though the exact emergency burden of stroke in Ethiopia is not known, it is estimated to be increasing, and stroke accounts for 2.5% of all hospital admissions and 13.7% of medical admissions [9]. Stroke is a common and serious condition, and treatments for it have a small effect on overall health. As of 2008, the incidence of stroke in emerging nations has reportedly overtaken that in industrialized nations [10].
In the twenty-first century, there has been a 42% decrease in stroke incidence in the high-income countries, while stroke incidence in the low- to middle-income countries has increased by more than 100% [11]. The trend is generalized because studies suggest that the geographical variations in stroke incidence and prevalence are small. While the geographical variation of stroke incidence is small worldwide, the burden of stroke shows larger geographical variation. Unfortunately, most of the stroke burden is carried by the low- to middle-income countries [12]. Despite the alarming threat of stroke as a major public health problem in Ethiopia, stroke epidemiology is not well studied there.
In Ethiopia, stroke is a frequent cause of mortality and morbidity from non-communicable diseases. It has been shown to be the most common neurological condition seen in Ethiopia. As in other developing countries, resources for stroke care and rehabilitation are deficient in Ethiopia. Patients with stroke are often poorly managed and discharged from hospital without receiving adequate rehabilitation services. This has serious implications for saving the lives of patients, especially in poorly developed societies where hemorrhagic strokes, which are characterized by severe neurologic presentation, are very prevalent [13]. One study conducted in Addis Ababa examined the types and associated factors of stroke at selected public referral hospitals [10], and another examined the prevalence, nursing management, and patient outcomes among stroke patients admitted to Tikur Anbessa Specialized Hospital, Addis Ababa, Ethiopia, using a logistic regression model [14].
Logistic regression has been used extensively in studies on the prevalence and risk factors for stroke, and the Cox proportional hazards model has also been used in several studies using mortality as the endpoint [10, 14]. However, logistic regression does not account for censored observations. Even though a semi-parametric estimate provides more flexibility, a parametric estimate is more powerful provided the form of the baseline hazard is known in advance. There is limited evidence regarding the determinants of time to death from stroke in the current study area.
Objectives
The objectives of this study were: (1) to estimate median survival time; (2) to identify determinant factors associated with stroke-related death; and (3) to compare the survival probability of stroke patients among different levels of determinant factors.
Methods
Study design
A retrospective study design was used, with data gathered for patients in the medical ward [15].
Study setting
The study was conducted using retrospective records at Gambella General Hospital, Gambella Peoples National Regional State, in the southwestern part of Ethiopia, from the 1st of January 2018 to the 1st of January 2022, among patients who were admitted with stroke.
Participants
Stroke patients who were registered and admitted to the intensive care unit ward of Gambella General Hospital during the study period, and for whom data on the variables of interest were complete, were included. Patients admitted outside the study period, and patients with insufficient information about one of the factor variables in either the registration book or the patient card, were not included.
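The first objective (median survival time) and the Kaplan–Meier curves named in the Methods rest on the product-limit estimator. The following is a minimal sketch of that estimator, not the authors' R or STATA code. The function names and the toy follow-up times are hypothetical and are not taken from the study records.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct death time.

    times  -- follow-up time in days for each patient
    events -- 1 if the patient died at that time, 0 if censored
    """
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    exits = Counter(times)  # everyone (dead or censored) leaving at time t
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d > 0:
            # Product-limit step: multiply by the conditional survival at t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Patients who died or were censored at t leave the risk set after t.
        at_risk -= exits[t]
    return curve


def median_survival(curve):
    """Smallest time at which S(t) falls to 0.5 or below, else None."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None


# Hypothetical toy data, NOT the study's records.
toy_times = [2, 3, 3, 5, 6, 8, 8, 9, 10, 12]
toy_events = [1, 0, 1, 1, 0, 1, 1, 0, 1, 0]

curve = kaplan_meier(toy_times, toy_events)
for t, s in curve:
    print(f"day {t:>2}: S(t) = {s:.3f}")
print("median time to death:", median_survival(curve))
```

Censored patients stay in the risk set up to and including their last observed day. This is how censoring enters the estimate, and it is the information a logistic regression on the final outcome would discard.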
Fig. 2 K–M plots of the survival function of stroke patients at GGH from 2018 to 2022
categories of predictors. For example, the median death time of patients who had cardiac disease was 6 days, and that of those who had no cardiac disease was 10 days. The median times to death of patients with past baseline complication and without baseline complication were 10 and 7 days, respectively, as shown in Table 2.
Table 2 Median time to death and log-rank test by predictors of stroke patients
Variable  Category  Median death time (95% CI)  Log-rank χ² value (df)  p value
The univariable and multivariable analysis result
The 1st step in the model-building process is univariable analysis. Predictors which had an association at a p value of 0.25 in univariable Cox regression were included in multivariable Cox regression.
Sex, age, diabetes mellitus, and hypertension were significantly related to patient survival at a 25% level of significance and were selected as candidate variables. In the 2nd step, all predictors selected in the 1st step were fitted in the proportional hazard model, and candidate predictors at a 10% level of significance were chosen using the backward selection method; the variables duration, history of ARTI, insurance status, and clinical presentation during admission were selected as candidate variables.
All selected predictors were fitted in the proportional hazard model, and candidate predictors at a 10% level of significance were chosen using the backward selection method. The variables sex, age, and hypertension were selected as candidate variables.
All variables selected at a 10% level of significance in the second step and the variables that were non-significant in the univariate analysis at a 25% level of significance were modeled together using the forward selection method, and a set of predictors was selected at a 10% level of significance.
Table 3 AIC, BIC and log-likelihood of the candidate parametric models
Distribution   AIC     BIC     Log-likelihood
Exponential    578.10  607.54  −281.049
Weibull        231.58  264.70  −106.79
Log-normal     204.42  237.54  −93.21
Log-logistic   143.58  160.70  −68.79
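The values in Table 3 can be ranked with a few lines of code. The sketch below copies the reported AIC, BIC and log-likelihood values and picks the distribution with the smallest criterion. The helper names are hypothetical. The `aic` and `bic` functions show the standard definitions only; this section does not report the number of parameters k for each model, so the example does not attempt to recompute Table 3.

```python
import math

# Values copied from Table 3 as reported.
table3 = {
    "Exponential": {"aic": 578.10, "bic": 607.54, "loglik": -281.049},
    "Weibull": {"aic": 231.58, "bic": 264.70, "loglik": -106.79},
    "Log-normal": {"aic": 204.42, "bic": 237.54, "loglik": -93.21},
    "Log-logistic": {"aic": 143.58, "bic": 160.70, "loglik": -68.79},
}


def aic(loglik, k):
    """Akaike information criterion: 2k - 2 * log-likelihood."""
    return 2 * k - 2 * loglik


def bic(loglik, k, n):
    """Bayesian information criterion: k * ln(n) - 2 * log-likelihood."""
    return k * math.log(n) - 2 * loglik


def best_by(criterion, models):
    """Name of the model with the smallest value of the given criterion."""
    return min(models, key=lambda name: models[name][criterion])


best_aic = best_by("aic", table3)
best_bic = best_by("bic", table3)
print("smallest AIC:", best_aic)
print("smallest BIC:", best_bic)

# Differences from the best model show how far behind the others are.
for name, row in sorted(table3.items(), key=lambda item: item[1]["aic"]):
    delta = row["aic"] - table3[best_aic]["aic"]
    print(f"{name:<13} delta AIC = {delta:.2f}")
```

Both criteria penalize the log-likelihood for model size, so a lower value indicates a better trade-off between fit and complexity.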
Age, presence of baseline complication, hypertension, and diabetes mellitus were statistically significant at a 5% significance level, and those predictors were selected for the final model. It is the best model compared with those from the forward and backward selection methods since it has the smallest value of AIC.
Using different methods, the predictors age and hypertension were found to violate the proportional hazard assumption. Thus, we doubt the accuracy of the PH assumption and consider the AFT model for this data set.
Accelerated failure time (AFT) model
When the PH assumption is not satisfied, the parametric AFT model should be used instead of the Cox model [25].
Multivariable analysis of exponential, Weibull, log-normal and log-logistic parametric models was done using all significant predictors in the final multivariable Cox PH model at a 5% level of significance. To compare the efficiency of the different models, AIC and BIC were used. The model having the minimum AIC and BIC values was selected as a good model. Accordingly, from Table 3, the log-logistic AFT model (AIC = 143.58, BIC = 160.70) was selected as fitting the survival time data of stroke patients better than the other accelerated failure time models, with exponential, Weibull and log-normal baseline distributions.
The final model results are shown in Table 4 under the log-logistic AFT model. Hypertension, baseline complication, and age of stroke patients were significant at a 5% significance level. An acceleration factor greater than one (positive coefficient) indicates an extended time to death, while an acceleration factor less than one (negative coefficient) indicates a shortened time to death. The output of the final log-logistic AFT model is presented in Table 4. This output showed that stroke patients with hypertension, patients with baseline complications, and older patients had significantly shortened survival times. The estimated acceleration factor for patients with hypertension is 0.63 (95% CI 0.605–0.660). The confidence interval for the acceleration factor did not include one, and the p value is small (p = 0.003). This indicates that hypertensive patients have shorter survival times than patients who are not hypertensive. Similarly, the acceleration factor for patients with baseline complication was 0.24 (95% CI 0.223–0.256); the confidence interval for γ did not include one, and the p value is small (p = 0.0023). This implies that the expected survival time of stroke patients decreased by 76% for patients with baseline complication as compared to patients who have no baseline complication (reference), holding other factors constant in the model. Finally, holding other factors constant in the model, for a 1-year increase in the age of the patients the log of survival time decreased by 0.06.
Table 4 Summary result of the final Log-logistic AFT model of stroke patients
Variable  Category  β  SE  Sig  γ  95% CI (γ)
Model diagnostic
To check whether the fitted model adequately describes the data, the likelihood ratio test and two graphical methods were used: the adequacy of parametric baselines plot and the Cox–Snell residual plot. From Figs. 4 and 5,
Fig. 4 Log-logistic baseline distributions plot of stroke patients at GGH from 2018 to 2022
Discussion
Stroke, also known as a cerebrovascular accident, is a prominent cause of severe, long-term impairment in both industrialized and developing nations. For the effective management of stroke patients and the development of a stroke prevention strategy, the time to death and the factors that determine it are crucial. The goal of this study was to identify the variables that affected the time to death of stroke patients at Gambella General Hospital. A total of 203 patients were enrolled in the study to determine the factors associated with time to death for stroke patients; of those patients, 74.9% were censored or did not experience the event, and 25.1% died. This agrees with the study conducted by [26], in which 27.2% died, while 72.8% were censored or did not experience the event. People with hypertension, baseline complications, and older ages were at greater risk for stroke than persons without hypertension, without baseline complications, and of younger ages. The average time for all patients was 6 days with a standard deviation of 3.2; this agrees with the study conducted by [26].
Survival models that were parametric, semi-parametric, and nonparametric were all used in this investigation. The non-parametric Kaplan–Meier estimate was used to compare the differences between levels of each categorical covariate. The Cox PH model was used to fit a semi-parametric survival analysis.
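The acceleration factors reported for the final log-logistic AFT model, and the likelihood-ratio statistic of Table 5, can be checked with simple arithmetic. In an AFT model the acceleration factor is γ = exp(β), so γ − 1 is the proportional change in expected survival time. The likelihood-ratio statistic is twice the gain in log-likelihood over the intercept-only model. The sketch below uses only numbers quoted in the text and in Table 5. The function names are hypothetical, and 11.07 is the standard chi-square critical value for 5 degrees of freedom at the 5% level.

```python
import math


def acceleration_factor(beta):
    """Acceleration factor (time ratio) gamma = exp(beta) in an AFT model."""
    return math.exp(beta)


def percent_change(gamma):
    """Percent change in expected survival time versus the reference group."""
    return (gamma - 1.0) * 100.0


def likelihood_ratio_statistic(loglik_null, loglik_model):
    """2 * (log-likelihood of fitted model - log-likelihood of null model)."""
    return 2.0 * (loglik_model - loglik_null)


# Acceleration factors quoted in the text for the final model.
reported_gamma = {"hypertension": 0.63, "baseline complication": 0.24}
changes = {name: percent_change(g) for name, g in reported_gamma.items()}
for name, change in changes.items():
    print(f"{name}: {change:.0f}% change in expected survival time")

# Age: the log of survival time decreases by 0.06 per additional year.
age_gamma = acceleration_factor(-0.06)
print(f"age: gamma = {age_gamma:.2f} per year")

# Log-likelihoods quoted in Table 5.
lr_stat = likelihood_ratio_statistic(-273.12, -68.79)
CHI2_CRITICAL_DF5_ALPHA_05 = 11.07  # chi-square critical value, df = 5, 5% level
print(f"LR statistic = {lr_stat:.2f}; exceeds critical value: "
      f"{lr_stat > CHI2_CRITICAL_DF5_ALPHA_05}")
```

The result for baseline complication matches the 76% reduction stated in the text, and exp(−0.06) rounds to the acceleration factor of 0.94 reported for age.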
Table 5 Likelihood ratio and significance of the Log-logistic AFT model
Loglik(intercept only)  Loglik(model)  Chisq   DF  Sig
−273.12                 −68.79         408.66  5   0.000
Schoenfeld residuals and the Cox PH model’s assumptions were tested graphically, and the assumptions were found to be violated. The researcher then proposed a parametric AFT survival model as a substitute for the Cox PH model to fit the stroke data from Gambella General Hospital. For the stroke patient dataset at Gambella General Hospital, the researcher fit AFT models using several baseline distributions. The baseline distributions used in this study were exponential, Weibull, log-normal, and log-logistic. The log-logistic AFT model was selected as a better AFT model than the Weibull, exponential, and log-normal models because it had smaller AIC and BIC values. The overall median time to death of stroke patients was 8 days (mean = 6 days; standard deviation = 3 days). This is broadly consistent with the research conducted by [26].
Age, baseline complications, and hypertension were statistically significant predictors of the survival status of stroke patients in this study. This is consistent with the study conducted by [26]. Hypertension is a strong predictor of death in stroke patients. This is consistent with the literature: multiple studies have identified hypertension as the leading risk factor for stroke in SSA [27]. Other studies have found that hypertension is an important modifiable risk factor for stroke and that hypertension was the most frequent co-morbidity, occurring in 50.6% of all stroke patients [28].
Age was found to be a key determinant of the time until a stroke patient died; as patients aged, their chances of survival reduced. This result is consistent with another investigation in the literature that found that becoming older is the primary, non-modifiable driver of stroke risk [28]. In this study, there were no significant differences in time to death between rural and urban patients. In contrast, another study conducted in Tanzania showed significant differences in time to death between rural and urban populations [29].
Conclusion
This study used the survival time dataset of stroke patients who started their stroke treatment between 1st January 2018 and 1st January 2022 to determine the determinant factors of time to death of stroke patients in Gambella General Hospital. Out of the total 203 stroke patients who started stroke treatment, about 25.1% had died by the end of the study. The estimated median survival time of stroke patients was 8 days.
To determine the factors associated with the survival time of stroke patients, the Cox PH model was used, and the PH assumption was checked by graphical methods, the Schoenfeld residual plot and the global test. Then, AFT models were fitted because the assumption of the Cox proportional hazards model was violated. Different AFT models using different baseline distributions were applied. Among them, using AIC
and BIC, the log-logistic AFT model fitted the survival time dataset of stroke patients better than the AFT models with other baseline distributions.
The log-logistic AFT model was the best model to explain the survival time of the stroke patient dataset in Gambella General Hospital, as revealed by the graphical technique and Cox–Snell residual plots.
In Gambella General Hospital, the results of the log-logistic AFT model revealed that age, hypertension and baseline complication were determinant factors of the survival status of stroke patients. Patients without hypertension and baseline complications had considerably longer survival times (higher survival experience), while each 1-year increase in age shortened the survival time by a factor of 0.94. Health care providers should plan and give the community awareness about the risk factors of stroke and the benefit of regular medical checkups and treatment follow-up. Strategies for screening and management of hypertension, age and baseline complication should be given priority, as they are the most prevalent determinant factors identified. Identifying and managing early stroke complications is important for the prevention of early stroke-related mortality. To prevent strokes, we should focus on reducing vascular risk factors such as high blood pressure in stroke patients.
Based on this study, the following recommendations are forwarded for policy makers and the responsible bodies: age, baseline complication and hypertension were significant factors and need to be considered when planning and developing policies against stroke to increase patients’ survival time. Additionally, special attention should be given to older patients in order to prolong time to death.
Based on the findings of the study, the following recommendations were made for the ministry of health, the community at large, Gambella General Hospital and researchers. A community outreach program has to be planned; awareness about the risk factors of stroke and the benefit of regular medical checkups and treatment follow-up should be given to the community; and periodic follow-up and adherence to the treatment of hypertension and baseline complications can minimize the chance of getting stroke.
Limitation of study
The following were some of the study’s limitations. There is a lack of published literature in the country regarding the survival time of stroke disease, with references to the outcomes of other countries. This study used data from a single hospital, Gambella General Hospital, which does not represent the prevalence at the national level, but the patients came from different regions of the country. As the data were gathered from the treatment cards of patients, the study has a limited number of variables considered as risk factors for the survival time of stroke patients.
Abbreviations
AIC Akaike information criterion
AFT Accelerated failure time
BIC Bayesian information criterion
CVA Cerebrovascular accident
CI Confidence interval
DALYs Disability-adjusted life-years
GGH Gambella General Hospital
GBD Global burden of disease
HR Hazard ratio
PH Proportional hazards
SNNPR Southern Nations Nationalities and Peoples region
SSA Sub-Saharan Africa
S.D Standard deviation
S.E Standard error
TPAs Tissue plasminogen activators
TR Time ratio
WHO World Health Organization
Acknowledgements
We would like to acknowledge the Gambella University office of research directorate for their sponsorship and financial support for this study, and the Gambella General Hospital health staff in Gambella for their cooperation and permission to use the data.
Author contributions
Chekol Alemu was involved in this study from the data acquisition, inception to design, data cleaning, data analysis, and interpretation and drafting and revising of the manuscript. Habitamu Wudu, Bizuayehu Bogale, Zerihun Getachew and Abebe Nega were involved in principal supervision, interpretation, data analysis, and revising the final manuscript. All authors read and approved the final manuscript.
Funding
The only funder for the study was Gambella University. The funding body did not have any role in study design, data collection, data analysis, interpretation of data, or in writing the manuscript.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Declarations
Ethics approval and consent to participate
All methods were performed according to the relevant regulations and guidelines of the journal. The ethical clearance approval letter was obtained from the Gambella University Institutional Review Board research directorate ethical approval committee (with reference number GURPGC/201/2015). The structured questionnaire was developed by the researcher, and the secondary data from patients’ charts or log-books were collected by well-experienced health workers from Gambella General Hospital, Gambella, Ethiopia. The study was conducted without obtaining individual informed consent from subjects or their literate legal guardians because of the secondary nature of the data. All methods were performed per the Declaration of Helsinki.
Consent for publication
Not applicable. No person’s details, images, or videos are being used in this study.
Competing interests
The authors declare that they have no competing interests.