Stroke-Prediction-Using-Linear-Regression

This study investigates the use of linear regression models to predict stroke risk based on clinical and demographic factors, utilizing a publicly available dataset. The research aims to identify significant predictors and establish a reliable model for early stroke risk identification, emphasizing the importance of timely interventions. While linear regression shows potential, the study suggests that combining it with more complex algorithms could enhance predictive accuracy.

International Journal of Scientific Research in Engineering and Management (IJSREM)

Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

Stroke Prediction Using Linear Regression

Nahala M A1, Sooraj Subhash2, Kishore Xavier3, Rahul Manoj4, Sreehari V V5

1 Asst. Prof., Dept. of CSE, Sree Narayana Gurukulam College of Engineering, Kochi, India, [email protected]
2 Student, Dept. of CSE, Sree Narayana Gurukulam College of Engineering, Kochi, India, [email protected]
3 Student, Dept. of CSE, Sree Narayana Gurukulam College of Engineering, Kochi, India, [email protected]
4 Student, Dept. of CSE, Sree Narayana Gurukulam College of Engineering, Kochi, India, [email protected]
5 Student, Dept. of CSE, Sree Narayana Gurukulam College of Engineering, Kochi, India, [email protected]
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Stroke is one of the leading causes of death and disability worldwide, and early prediction can significantly improve patient outcomes through timely interventions. This study explores the potential of linear regression models to predict the likelihood of stroke in individuals from a set of clinical and demographic factors. The data come from a publicly available stroke dataset; the features include age, gender, hypertension, heart disease, marital status, type of work, and smoking habits, among others. The goal of the study is to identify important predictors and to establish a linear regression model capable of approximating stroke risk with reasonable accuracy. Feature selection and preprocessing guided the choice of relevant variables for building the predictive model. The data are split into training and testing subsets, and performance is assessed with metrics such as the mean squared error and R-squared. The results show that linear regression with relevant features can be used to predict stroke risk, yielding a simple but interpretable model for early risk identification. However, the accuracy achieved suggests that other algorithms and additional data could increase reliability in this field. The paper concludes that linear regression offers a viable foundation for stroke prediction, while further refinement and more sophisticated models would be necessary for clinical applications.

Key Words: Stroke prediction, linear regression, feature selection, data preprocessing, machine learning.

1. INTRODUCTION

Stroke is one of the leading causes of death and disability, and early prediction can significantly improve outcomes through timely interventions. The aim of this study is to determine whether a linear regression model can be used to predict the probability of stroke in a person given a set of clinical and demographic factors. The study uses a publicly available stroke dataset whose variables include age, gender, high blood pressure, heart problems, marital status, employment status, and smoking habits, among others. It attempts to identify the strong predictors and to build a linear regression model that gives reasonable stroke prediction accuracy.

Stroke is a medical condition characterized by the sudden interruption of blood flow to the brain, resulting in loss of brain function. It is one of the leading causes of death and long-term disability worldwide, affecting millions of people annually. The ability to predict stroke risk is important for early intervention, prevention, and personalized treatment strategies that may reduce the burden of this debilitating disease. As healthcare systems shift towards data-driven decision making, predictive modeling has emerged as a promising tool for predicting medical conditions, including stroke.

Traditional stroke risk assessment is based on clinical guidelines and risk factors such as age, hypertension, diabetes, heart disease, smoking, and family history. These risk factors are all well known, but the interactions between them make it challenging to quantify and predict stroke risk in individual patients with a reasonable degree of accuracy. Recent advances in machine learning and statistical modeling offer new opportunities for improving predictive accuracy. Linear regression is a widely used and interpretable statistical method that offers a straightforward approach to modeling the relationship between stroke risk and various demographic, medical, and behavioral factors.
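
To make the approach concrete, the following is a minimal sketch of fitting a linear regression and evaluating it with the mean squared error and R-squared on a held-out test set. It is not the authors' implementation: the data are synthetic and made up for illustration, a single predictor (age) stands in for the full feature set, and the function names, the 80/20 split, and the noise level are all assumptions.

```python
import random


def fit_simple_linear_regression(x, y):
    """Ordinary least squares for y ~ intercept + slope * x (one predictor)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def mean_squared_error(y_true, y_pred):
    """Average squared difference between observed and predicted values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Share of the variance in y_true explained by the predictions."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Synthetic data (assumption): a risk score that rises linearly with age, plus noise.
rng = random.Random(0)
ages = [rng.uniform(20, 90) for _ in range(200)]
risk = [0.01 * a - 0.1 + rng.gauss(0, 0.05) for a in ages]

# Hold out the last 20% of the (already randomly ordered) samples for testing.
split = int(0.8 * len(ages))
x_train, x_test = ages[:split], ages[split:]
y_train, y_test = risk[:split], risk[split:]

intercept, slope = fit_simple_linear_regression(x_train, y_train)
y_pred = [intercept + slope * a for a in x_test]

mse = mean_squared_error(y_test, y_pred)
r2 = r_squared(y_test, y_pred)
print(f"slope={slope:.4f} intercept={intercept:.4f} MSE={mse:.4f} R^2={r2:.3f}")
```

With several predictors, including encoded categorical ones such as work type or smoking status, the same idea extends to multiple linear regression, where one coefficient is estimated per feature; the evaluation metrics are unchanged.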