0% found this document useful (0 votes)

8 views4 pages

Prediction of Obesity Level Based On Lifestyle and Eating Habits Data

The project aims to predict obesity levels using machine learning algorithms based on lifestyle and eating habits data. By analyzing features such as diet and physical activity, the project seeks to develop accurate classification models to identify individuals at risk of obesity. The methodology includes data collection, preprocessing, model training, and evaluation, with the goal of providing a tool for early obesity risk assessment and personalized health recommendations.

Uploaded by

red.priyansh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views4 pages

Prediction of Obesity Level Based On Lifestyle and Eating Habits Data

Uploaded by

red.priyansh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Prediction of Obesity Level Based on Lifestyle and

Eating Habits Data

Amity University, Lucknow Department of Computer
Science & Engineering B.Tech (CSE), Batch 2023–
2027

Name: Priyansh Bansal

Enrollment No.: A7605223097
Branch: Computer Science and Engineering
Batch: 2023 – 2027
Project Title: Prediction of Obesity Level Based on Lifestyle and Eating Habits Data

Table of Contents
1. Title Page
2. Table of Contents
3. Introduction
4. Rationale
5. Objectives
6. Literature Review
7. Feasibility Study
8. Methodology/Planning of Work
9. Facilities Required
10.Expected Outcomes
11.References

Introduction
Obesity is a growing global health epidemic, imposing serious risks for chronic diseases and
increasing healthcare1 burdens
2 . The World Health Organization reports that in 2022 roughly one
in eight people worldwide was living with obesity, with 43% of adults classified as overweight and
16% as obese
2 . By standard WHO definitions, adults with body mass index (BMI) ≥25 kg/m² are overweight
and those with BMI ≥30 kg/m² are obese.3 Accurate prediction of an individual’s obesity level
from modifiable factors can enable early intervention. Recent research has demonstrated that
machine learning (ML) can effectively predict obesity risk from lifestyle and dietary data. For
example, using a public UCI dataset of people’s physical and eating-habit attributes, a Gradient
Boosting classifier achieved 98.11% accuracy in classifying obesity level . This project (a
Machine Learning specialization project) will use Python and libraries like scikit-learn and pandas
4 5
to build classification models that predict obesity level (non-obese vs overweight vs obese)
based on features such as diet, activity, and lifestyle. We will employ supervised learning
algorithms (e.g. decision trees, random forests, SVM, XGBoost) and evaluate performance on a
suitable dataset.

1
Rationale
Overweight and obesity have become a serious public health issue with rising prevalence
6 1

. Traditional broad prevention strategies (general diet and exercise advice) have had limited
impact, suggesting the need for personalized approaches. Recent work highlights AI and ML as
powerful tools to capture complex, nonlinear relationships among
7 risk factors. By analyzing
individual lifestyle, dietary, and demographic factors, a predictive model can identify high-risk
individuals before chronic conditions develop. Such a system could aid doctors and patients in
making targeted lifestyle modifications. Given the increasing availability of health data and the
success of ML-based risk models, this project meets an important need for data-driven obesity
management.

Objectives
•Collect or obtain a dataset containing obesity-related features (lifestyle, diet, physical
activity, demographics).
•Preprocess the data (cleaning, normalization, feature selection) for machine learning.
•Develop and train multiple ML classification models (e.g. Decision Tree, Random Forest,
SVM, XGBoost).
•Evaluate and compare model performance (accuracy, precision, recall) to select
the best predictor.
•Analyze feature importance (e.g. using SHAP or tree feature importances) to identify key
factors influencing obesity risk.

Literature Review
Several recent studies demonstrate the use of ML for obesity prediction. Kumar et al. (2022)
collected a UCI dataset of personal and eating-habit attributes and applied various ML algorithms
(Gradient Boosting, Random Forest, SVM, etc.) to predict obesity. They reported that Gradient
Boosting achieved the highest accuracy (98.11%) , underscoring the value of diet and lifestyle
features in prediction. Carabantes-Alarcón
5 et al. (2024) designed an ensemble cascade model
combining Gradient Boosting, Random Forest, and Logistic Regression. Their hybrid model
significantly outperformed individual algorithms, reaching about 79% accuracy in
overweight/obesity risk classification . Helforoush and Sayyad (2024) proposed a novel ANN-
PSO hybrid neural model and achieved 92% accuracy in predicting obesity risk 8 . They also used
SHAP analysis to interpret feature contributions. Du et al. (2024) built a visualized risk prediction
system using 9
XGBoost on a health checkup dataset (including lifestyle and lab factors) and
demonstrated high predictive performance with interpretability, aiding personalized
management . Another study by Sun et al. (2024) used decision trees, random forest, and
gradient-boosting on large survey data (CHNS/NHANES) to predict weight status from lifestyle
factors. They applied
10 11 interpretable ML (SHAP) and identified physical activity, diet, tobacco and
alcohol use as important predictors . These works collectively show that ML models,
especially tree-based and ensemble methods, can accurately classify obesity levels from lifestyle
and dietary data, justifying this project’s approach.
12 13

Feasibility Study
The project is highly feasible with available resources. Relevant data is publicly accessible: for
instance, the UCI Machine Learning Repository hosts an obesity dataset with demographic,
activity, and eating- habit features. We will implement the solution in Python using standard
4
packages ( Jupyter Notebook, Pandas, NumPy, scikit-learn, XGBoost, etc.), which are freely
available. No specialized hardware is needed beyond a typical personal computer. The
significance of the project is clear given

2
the obesity epidemic: a predictive tool could guide timely lifestyle interventions. The cost and
effort are moderate (mostly student effort), while potential benefits (improved health outcomes,
preventive care) are high. Thus, the proposed study is both practical and valuable.

Methodology/Planning of Work
We will follow a CRISP-DM style methodology. First, we will perform data collection by sourcing
an appropriate obesity-related dataset (features like age, diet, exercise, habits) and
understanding its structure. Next, data preprocessing will include cleaning missing values,
encoding categorical factors, and normalizing numeric attributes. In the modeling phase, we will
split the data into training and test sets (e.g. 80:20) and train multiple supervised classifiers
(Decision Tree, Random Forest, SVM, XGBoost, etc.). We will tune hyperparameters (via grid
search or cross-validation) to optimize performance. Following [14], we will also explore
ensemble techniques such as a stacked or cascade classifier (e.g. combining boosting, random
forest, and logistic regression) . Model evaluation will use metrics like accuracy, precision,
recall, and ROC-AUC on the test set. Finally, we will analyze
8 feature importance (using built-in
importance or SHAP) to determine which lifestyle factors most strongly influence obesity
predictions. The workflow steps are Data Acquisition → Preprocessing → Model Training & Validation
→ Evaluation & Interpretation → Documentation of results.

Facilities Required
This project requires a standard software development environment. We will use Python 3.x with
Jupyter Notebook or similar IDE. Key libraries include pandas (data handling), NumPy, scikit-learn
(ML models), XGBoost or LightGBM, and Matplotlib/Seaborn (visualization). For interpretability, we
may use the SHAP library. The dataset can be downloaded from the UCI repository or Kaggle.
Hardware requirements are minimal: a personal computer with at least 8 GB RAM will suffice. No
specialized equipment is needed.

Expected Outcomes
The expected outcome is a validated predictive model and insights from it. We anticipate
developing one or more classification models that can accurately estimate an individual’s obesity
level (e.g. normal, overweight, obese) from lifestyle and eating data. We will produce
performance reports (accuracy, confusion matrices) demonstrating model effectiveness.
Additionally, the project will highlight key factors (such as diet type, exercise frequency, etc.) that
contribute most to obesity risk. As a result, the work can contribute to health awareness by
providing a tool or recommendation system for early obesity risk assessment. Ideally, a
prototype interface (such as a web form) could be developed to let users input their lifestyle
data and receive a risk prediction, fostering personalized preventive strategies.

References
[1]J. Du et al., “Visualization obesity risk prediction system based on machine learning,” Sci. Rep.,
vol. 14, art. 22424, 2024.

[2] D. Carabantes-Alarcón et al., “Combination of Machine Learning Techniques to Predict

Overweight/ Obesity in Adults,” J. Pers. Med., vol. 14, no. 8, p. 816, 2024.

3
[3] Z. Helforoush and H. Sayyad, “Prediction and classification of obesity risk based on a hybrid
metaheuristic machine learning approach,” Front. Big Data, vol. 7, art. 1469981, 2024.

[4] Z. Sun et al., “Using interpretable machine learning methods to identify the relative
importance of lifestyle factors for overweight and obesity in adults: pooled evidence from CHNS
and NHANES,” BMC Public Health, vol. 24, art. 3034, 2024.

[5]R. Kaur, R. Kumar, and M. Gupta, “Predicting risk of obesity and meal planning to reduce the
obese in adulthood using artificial intelligence,” Endocrine, vol. 78, no. 3, pp. 458–469, 2022.

[6] A. C. Genc and E. Arıcan, “Obesity classification: a comparative study of machine learning
models excluding weight and height data,” Rev. Assoc. Med. Bras., vol. 71, no. 1, e20241282, 2025.

1 9 Frontiers | Prediction and classification of obesity risk based on a hybrid

metaheuristic machine learning approach
https://www.frontiersin.org/journals/big-data/articles/10.3389/fdata.2024.1469981/full

2 6 12 13 Using interpretable machine learning methods to identify the relative

importance of lifestyle factors for overweight and obesity in adults: pooled evidence from
CHNS and NHANES | BMC Public Health | Full Text
https://bmcpublichealth.biomedcentral.com/articles/10.1186/s12889-024-20510-z

3 7 Combination
8 of Machine Learning Techniques to Predict Overweight/Obesity in Adults
https://www.mdpi.com/2075-4426/14/8/816

4 5 Predicting risk of obesity and meal planning to reduce the obese in adulthood using
artificial intelligence - PMC
https://pmc.ncbi.nlm.nih.gov/articles/PMC9555702/

10 11 Visualization obesity risk prediction system based on machine learning | Scientific

Reports
https://www.nature.com/articles/s41598-024-73826-6

Power Transformer Fundamentals: Design and Manufacturing
100% (3)
Power Transformer Fundamentals: Design and Manufacturing
52 pages
D' Mallows Income Statement For The Year Ended 2018-2022 Schedule 2018 2019
No ratings yet
D' Mallows Income Statement For The Year Ended 2018-2022 Schedule 2018 2019
23 pages
Top Engineering School
No ratings yet
Top Engineering School
6 pages
Supplementary KYC
No ratings yet
Supplementary KYC
1 page
G.R. No. 189655 April 13, 2011 Aowa Electronic Philippines, Inc., Petitioner, DEPARTMENT OF TRADE AND INDUSTRY, National Capital Region, Respondent
No ratings yet
G.R. No. 189655 April 13, 2011 Aowa Electronic Philippines, Inc., Petitioner, DEPARTMENT OF TRADE AND INDUSTRY, National Capital Region, Respondent
27 pages
Prueba
No ratings yet
Prueba
22 pages
Airtel Vodafone
100% (2)
Airtel Vodafone
27 pages
Workflow Manager
No ratings yet
Workflow Manager
64 pages
Identification of Malnutrition and Prediction of BMI From Facial Images Using Machine Learning
No ratings yet
Identification of Malnutrition and Prediction of BMI From Facial Images Using Machine Learning
51 pages
GAD Resolution
No ratings yet
GAD Resolution
5 pages
Bloom's Taxonomy Domain Verbs
No ratings yet
Bloom's Taxonomy Domain Verbs
3 pages
Balance of Payment - FOREX
No ratings yet
Balance of Payment - FOREX
18 pages
Mbasic English For Academic Purposes
No ratings yet
Mbasic English For Academic Purposes
4 pages
SHEELA
No ratings yet
SHEELA
55 pages
Insurtech - Innovation in The Insurance Industry
No ratings yet
Insurtech - Innovation in The Insurance Industry
8 pages
Five Machine Learning Supervised Algorithms For The Analysis and The Prediction of Obesity
No ratings yet
Five Machine Learning Supervised Algorithms For The Analysis and The Prediction of Obesity
9 pages
A Machine Learning Approach For Predicting Weight Gain Risks in Young Adults
No ratings yet
A Machine Learning Approach For Predicting Weight Gain Risks in Young Adults
4 pages
Environmental Management - Plate Tectonics
No ratings yet
Environmental Management - Plate Tectonics
17 pages
CBMS Brochure
No ratings yet
CBMS Brochure
2 pages
Ijerph 20 06263
No ratings yet
Ijerph 20 06263
14 pages
Dataplatform
No ratings yet
Dataplatform
2 pages
Body Fitness Prediction
No ratings yet
Body Fitness Prediction
16 pages
Exxsol D80
No ratings yet
Exxsol D80
2 pages
RK20BTA40 Online Assignment 1 INT213 12013583
No ratings yet
RK20BTA40 Online Assignment 1 INT213 12013583
16 pages
【Organization Chart of Mitsubishi UFJ Financial Group】
No ratings yet
【Organization Chart of Mitsubishi UFJ Financial Group】
1 page
Minuet in G - Petzold
No ratings yet
Minuet in G - Petzold
2 pages
SCO 2080R Datasheet
No ratings yet
SCO 2080R Datasheet
2 pages
Predictive Modeling of Obesity and Cardiovascular Disease Risk
No ratings yet
Predictive Modeling of Obesity and Cardiovascular Disease Risk
13 pages
153 PTQ q3 2024 Issue
No ratings yet
153 PTQ q3 2024 Issue
102 pages
Smartphysics Homework Solutions
100% (1)
Smartphysics Homework Solutions
5 pages
Foreclosure Letter
No ratings yet
Foreclosure Letter
2 pages
Paper1 2
No ratings yet
Paper1 2
18 pages
Binod ML Project-052
No ratings yet
Binod ML Project-052
14 pages
Devloping A Dietary and Fitness App 2.0
No ratings yet
Devloping A Dietary and Fitness App 2.0
11 pages
Body Fat Precantage Prediction-2021-2
No ratings yet
Body Fat Precantage Prediction-2021-2
21 pages
QUESTIONNAIRE
No ratings yet
QUESTIONNAIRE
7 pages
Chatbot For Prediction of Weight and BMI
No ratings yet
Chatbot For Prediction of Weight and BMI
3 pages
E Health and Fitness Recommendation System Using Machine Learning
No ratings yet
E Health and Fitness Recommendation System Using Machine Learning
8 pages
Sub Paper-Pang2019
No ratings yet
Sub Paper-Pang2019
6 pages
Classification of Obesity Among South African Female Adolescents Comparative Analysis of Logistic Regression and Random Forest Algorithms
No ratings yet
Classification of Obesity Among South African Female Adolescents Comparative Analysis of Logistic Regression and Random Forest Algorithms
15 pages
Document
No ratings yet
Document
18 pages
B13 Poster (Final)
No ratings yet
B13 Poster (Final)
1 page
Aws Data Engineer
No ratings yet
Aws Data Engineer
66 pages
Age-Specific Risk Factors For The Prediction of Obesity Using A Machine Learning Approach
No ratings yet
Age-Specific Risk Factors For The Prediction of Obesity Using A Machine Learning Approach
12 pages
Als Project
No ratings yet
Als Project
18 pages
Predictive Equations For Fat Mass in Older Hispanic Adults With Excess Adiposity Using The 4 Compartment Model As A Reference Method
No ratings yet
Predictive Equations For Fat Mass in Older Hispanic Adults With Excess Adiposity Using The 4 Compartment Model As A Reference Method
10 pages
Ford Figo b517 2010 25 Ewd65
No ratings yet
Ford Figo b517 2010 25 Ewd65
1 page
Major Repair and Alteration (Airframe, Powerplant, Propeller, or Appliance)
No ratings yet
Major Repair and Alteration (Airframe, Powerplant, Propeller, or Appliance)
3 pages
Baitap 5
No ratings yet
Baitap 5
1 page
BA - Group02 - SecB-final Final
No ratings yet
BA - Group02 - SecB-final Final
14 pages
10 33484-Sinopfbd 1445215-3764172
No ratings yet
10 33484-Sinopfbd 1445215-3764172
23 pages
BA - Presentation GRP 2
No ratings yet
BA - Presentation GRP 2
7 pages
Predicting Diabetes Onset Using Machine Learning
No ratings yet
Predicting Diabetes Onset Using Machine Learning
4 pages
Camera Ready Paper Jason
No ratings yet
Camera Ready Paper Jason
6 pages
Obesity RRR
No ratings yet
Obesity RRR
19 pages
Ai Datascience Project Grade 10
No ratings yet
Ai Datascience Project Grade 10
14 pages
Conclusion Body Fat Model
No ratings yet
Conclusion Body Fat Model
1 page
Algorithme Used and Evaluation: Visualisation
No ratings yet
Algorithme Used and Evaluation: Visualisation
1 page
Obesity Disease Risk Prediction Using Machine Learning
No ratings yet
Obesity Disease Risk Prediction Using Machine Learning
10 pages
MasterTop 314ULv3
No ratings yet
MasterTop 314ULv3
2 pages
Estimation of Obesity Levels Based On Computational Intelligence
No ratings yet
Estimation of Obesity Levels Based On Computational Intelligence
5 pages
Prioritization of Multi-Level Risk Factors For
No ratings yet
Prioritization of Multi-Level Risk Factors For
8 pages
OBESITY
No ratings yet
OBESITY
17 pages
A Machine Learning Approach To Predict The Trend of Obesity Prevalence at A Global Level
No ratings yet
A Machine Learning Approach To Predict The Trend of Obesity Prevalence at A Global Level
6 pages
JETIR2303465
No ratings yet
JETIR2303465
9 pages
Personalized BMI Calculator and Healthy Food Recommendation System Using Machine Learning
No ratings yet
Personalized BMI Calculator and Healthy Food Recommendation System Using Machine Learning
2 pages
MLPPT 11 45
No ratings yet
MLPPT 11 45
31 pages
Diabetes - Test Report
No ratings yet
Diabetes - Test Report
62 pages
Jupyter Notebook On Obesity Prediction
No ratings yet
Jupyter Notebook On Obesity Prediction
15 pages
Title of Project
No ratings yet
Title of Project
7 pages
1 - 13 - 76 - e Proportional Valve Wandfluh
No ratings yet
1 - 13 - 76 - e Proportional Valve Wandfluh
10 pages
Base Paper
No ratings yet
Base Paper
5 pages
1 s2.0 S1386505625000218 Main
No ratings yet
1 s2.0 S1386505625000218 Main
13 pages
Explainable AI (XAI) For Obesity Prediction: An Optimized MLP Approach With SHAP Interpretability On Lifestyle and Behavioral Data
No ratings yet
Explainable AI (XAI) For Obesity Prediction: An Optimized MLP Approach With SHAP Interpretability On Lifestyle and Behavioral Data
9 pages
23UCC554
No ratings yet
23UCC554
9 pages
Prediction of Obesity Level Based On Lifestyle and Eating Habits Data
No ratings yet
Prediction of Obesity Level Based On Lifestyle and Eating Habits Data
4 pages
Varnika Resume Final
No ratings yet
Varnika Resume Final
2 pages
Black Book DipeshThali
No ratings yet
Black Book DipeshThali
158 pages
ML Report098
No ratings yet
ML Report098
20 pages
ML Report098
No ratings yet
ML Report098
23 pages
Santhosh Minor
No ratings yet
Santhosh Minor
18 pages
Age-Specific Risk Factors For The Prediction of Obesity Using A Machine Learning Approach
No ratings yet
Age-Specific Risk Factors For The Prediction of Obesity Using A Machine Learning Approach
2 pages
Caloric Expenditure
No ratings yet
Caloric Expenditure
15 pages
Obesity in The US Exploring The Paradox of Increasi 2024 Obesity Research
No ratings yet
Obesity in The US Exploring The Paradox of Increasi 2024 Obesity Research
4 pages
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
From Everand
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
César Pérez López
No ratings yet
Clinical Decision Support System: Fundamentals and Applications
From Everand
Clinical Decision Support System: Fundamentals and Applications
Fouad Sabry
5/5 (1)
Applications of Multi-Omics: Fundamentals of Integrating Biological Data for Precision Medicine and Research
From Everand
Applications of Multi-Omics: Fundamentals of Integrating Biological Data for Precision Medicine and Research
Richard Skiba
No ratings yet
Health Data Analytics And Informatics
From Everand
Health Data Analytics And Informatics
Mbuso Mabuza
No ratings yet
Comprehensive Guide to Statistics
From Everand
Comprehensive Guide to Statistics
Mohit Chatterjee
No ratings yet
Clinical Trial Management – an Overview
From Everand
Clinical Trial Management – an Overview
Editor IJSMI
No ratings yet

Prediction of Obesity Level Based On Lifestyle and Eating Habits Data

Uploaded by

Prediction of Obesity Level Based On Lifestyle and Eating Habits Data

Uploaded by

Prediction of Obesity Level Based on Lifestyle and

Eating Habits Data

Name: Priyansh Bansal

[2] D. Carabantes-Alarcón et al., “Combination of Machine Learning Techniques to Predict

1 9 Frontiers | Prediction and classification of obesity risk based on a hybrid

2 6 12 13 Using interpretable machine learning methods to identify the relative

10 11 Visualization obesity risk prediction system based on machine learning | Scientific

You might also like