Medhun Final 1
1. Problem Statement:
The model will rely only on the data that is available (e.g., public
datasets or data obtained through partnerships with healthcare providers).
The model will not be deployed in real time; the work will focus on
developing the algorithm and generating predictions from historical
data.
The scope may exclude certain rare diseases or datasets because of
limited data availability.
4. Data Sources:
5. High-Level Methodology:
Data Collection:
Obtain the dataset from public sources like Kaggle or UCI Machine
Learning Repository.
If needed, retrieve additional relevant data from healthcare APIs (e.g.,
those provided by hospitals or other healthcare institutions).
Data Cleaning:
Feature Engineering:
Derive new features, such as BMI (Body Mass Index) from weight and
height data, or age groups based on age.
Transform features (e.g., scaling, normalization) so that they lie on
comparable ranges, which can improve model accuracy.
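The feature engineering steps above can be sketched in plain Python. This is a minimal illustration, not the project's actual pipeline: the field names (age, weight_kg, height_cm), the age-band cut-offs, and the sample records are all assumptions made for the example, and the units are assumed to be kilograms and centimetres.

```python
from statistics import mean, pstdev

def bmi(weight_kg, height_cm):
    """Body Mass Index: weight in kilograms divided by height in metres squared."""
    height_m = height_cm / 100.0
    return weight_kg / (height_m ** 2)

# Hypothetical age bands (lower bound inclusive, upper bound exclusive).
# The cut-offs are an illustrative assumption, not taken from the project.
AGE_BANDS = [
    (0, 18, "child"),
    (18, 40, "young_adult"),
    (40, 65, "middle_aged"),
    (65, float("inf"), "senior"),
]

def age_group(age):
    """Map a numeric age to the label of the band that contains it."""
    for low, high, label in AGE_BANDS:
        if low <= age < high:
            return label
    raise ValueError(f"age out of range: {age}")

def min_max_scale(values):
    """Rescale values linearly to the interval [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # constant column: nothing to scale
    return [(v - lo) / (hi - lo) for v in values]

def standardize(values):
    """Shift to zero mean and scale to unit (population) standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]

# Made-up sample records, used only to exercise the functions.
patients = [
    {"age": 25, "weight_kg": 70.0, "height_cm": 175.0},
    {"age": 52, "weight_kg": 85.0, "height_cm": 168.0},
    {"age": 71, "weight_kg": 60.0, "height_cm": 160.0},
]

# Derive the new features for every record.
for p in patients:
    p["bmi"] = round(bmi(p["weight_kg"], p["height_cm"]), 1)
    p["age_group"] = age_group(p["age"])

# Scale the derived BMI column to [0, 1].
scaled = min_max_scale([p["bmi"] for p in patients])
for p, s in zip(patients, scaled):
    p["bmi_scaled"] = s
```

In a real pipeline, the scaling parameters (minimum, maximum, mean, standard deviation) would normally be computed on the training split only and then reused on validation and test data, so that no information leaks from held-out records.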
Model Building:
Model Evaluation:
Deployment:
Libraries: