0% found this document useful (0 votes)

9 views13 pages

PDS Report 2024-25

The document presents a crop recommendation system utilizing machine learning to help farmers select optimal crops based on soil and environmental conditions. Various models, including Random Forest, Decision Tree, and SVM, were evaluated, with Random Forest achieving the highest accuracy of 99.15%. The project emphasizes the importance of data-driven decision-making in agriculture, while also acknowledging limitations such as dataset size and class imbalance.

Uploaded by

Shav Aggrawal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views13 pages

PDS Report 2024-25

Uploaded by

Shav Aggrawal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

Bhartiya Vidya Bhavan’s

Sardar Patel Institute of Technology

(Autonomous Institute Affiliated to University of Mumbai)
Department of Computer Engineering

Crop Recommendation System

By
Aaryan Mantri(2023300001)
Agarwal Vedant Rakesh(2023300002)
Balla Mahadev Shrikrishna(2023300010)

Guided by
Sunil Ghane

Course Project
Python for Data Science (S.Y.)
Abstract

This project presents a crop recommendation system using machine learning to

assist farmers and agronomists in selecting optimal crops based on soil and
environmental conditions. By analyzing a comprehensive dataset of soil properties
and climate variables, such as nitrogen, phosphorus, potassium content,
temperature, humidity, pH, and rainfall, the project leverages machine learning to
classify the suitability of crops under varying conditions. Key objectives include
developing and comparing multiple machine learning models to identify the best-
performing algorithms. The methodology involves data preprocessing, feature
engineering, model training and evaluation, and accuracy comparison. We trained
several models, including Decision Tree, Naive Bayes, SVM, Logistic Regression,
and Random Forest. Experimental results highlight that the Random Forest model
achieved the highest accuracy, making it ideal for crop recommendation in real-
world applications. This system, backed by scientific data, is poised to support
sustainable agricultural practices by promoting informed crop selection.
Introduction

● Problem Statement: Agriculture plays a crucial role in supporting the

global food supply. However, deciding on the right crop to plant under
varying environmental conditions can be challenging and often depends on
expertise or limited local data. This project addresses the task of
recommending suitable crops based on specific soil and environmental data,
empowering farmers to make data-driven decisions that maximize yield and
resource efficiency.
● Objective: The primary goal of this project is to develop a machine learning
model that can predict the most suitable crop for a given combination of soil
and environmental factors. The system aims to classify and recommend
crops by analyzing multiple features, including soil nutrients and weather
conditions.
● Motivation: Accurate crop selection can lead to significant improvements in
agricultural productivity, sustainability, and profitability. A data-driven
approach not only saves resources but also helps to meet increasing food
demands. This project combines data science with agriculture, making a
practical impact in a critical field.
● Outline: The report begins with a description of the dataset, followed by
preprocessing steps and exploratory analysis. Next, we describe the
methodology, including model selection and implementation. Finally, we
present a comparison of results, discuss findings, and conclude with future
directions.
Dataset
● Description of Dataset:

 Source: Kaggle’s Crop Recommendation Dataset.

 Size: The dataset contains over 2200 entries with 8 key attributes.
 Type of Data: Each entry includes both numeric and categorical data.
 Features:
o N, P, K: Levels of nitrogen, phosphorus, and potassium in the soil.
o Temperature: Ambient temperature in degrees Celsius.
o Humidity: Relative humidity percentage.
o pH: Soil pH level indicating acidity or alkalinity.
o Rainfall: Annual rainfall in mm.
o Label: Target variable representing the recommended crop type for
the given conditions.

This dataset provides a comprehensive basis for training machine learning models
to predict suitable crops based on environmental and soil parameters.

● Preprocessing:

 Handling Missing Data - Ensured no missing values for cleaner and more
effective model training.
 Normalization - For SVM, all feature values were scaled to a 0-1 range
using MinMaxScaler to improve model stability and convergence.
 Label Encoding - Categorical crop labels were encoded into numeric values
to ensure compatibility with machine learning models.

● Data Exploration:

 Summary Statistics: Calculated mean, median, standard deviation, etc., for

each feature.
 Visualization: Histograms and pair plots to analyze feature distributions and
relationships.
 Correlation Analysis: A correlation heat-map was used to identify
relationships between features, highlighting significant correlations.
Methodology
1. Machine Learning Models

● Models Used: The models tested include Decision Tree, Naive Bayes, SVM,
Logistic Regression, and Random Forest.
● Justification: Models were selected to capture a range of learning methods,
from simple decision boundaries (Decision Tree) to ensemble learning
(Random Forest) for complex patterns. SVM was chosen for its robustness
with normalized data, while Naive Bayes and Logistic Regression provided
baseline comparisons.

2. Model Implementation

● Training and Testing: Data was split into 80% for training and 20% for
testing to assess generalizability.
● Hyperparameter Tuning: Optimized parameters such as the maximum
depth for Decision Tree and kernel type for SVM. Random Forest was
evaluated with different tree counts.

3. Feature Selection and Extraction

 All features—nitrogen, phosphorus, potassium, temperature, humidity, pH,

and rainfall—were retained because each uniquely contributes to predicting
crop suitability. These attributes represent essential agricultural factors, like
soil nutrients and environmental conditions, crucial for crop health.

 Techniques like PCA or LDA were not applied, as reducing dimensions

could obscure the influence of individual features. Keeping all features
ensured comprehensive analysis, maximizing model interpretability and
accuracy, especially in the model like Random Forest that leverages feature
importance effectively.
Experimental Setup
● Tools Used: Python 3.8, Scikit-learn, Seaborn, Matplotlib, and Pandas.
● Evaluation Metrics:
 Accuracy: Accuracy is the most straightforward metric, calculated as
the proportion of correct predictions out of the total predictions made.

 Precision: Precision measures the proportion of true positive predictions

out of all positive predictions made by the model.

 Recall: Recall, also known as sensitivity or true positive rate, measures

the proportion of true positives correctly identified by the model.

 F1-Score: The F1-score is the harmonic mean of precision and recall,

providing a balanced metric when both false positives and false
negatives need to be minimized.
Results and Discussion
● Performance Comparison:
Each model was trained and tested on the crop recommendation dataset, and
their performance was measured using accuracy and classification metrics:
○ Decision Tree: This model achieved an accuracy of 97.18%. It
constructed clear and interpretable decision boundaries, making it
effective for classification. However, due to its depth (maximum
depth = 5), the model occasionally exhibited overfitting tendencies. It
performed well for common crops but struggled to generalize for rarer
ones, especially when feature distributions overlapped. Despite these
challenges, its simplicity and explainability remain advantageous in
real-world applications.
○ Naive Bayes: As a probabilistic model, Naive Bayes achieved an
accuracy of 98.59%. It worked well for crops with distinct feature
distributions due to its assumption of feature independence. However,
this same assumption limited its ability to handle interdependencies
among features, causing lower precision and recall for certain crops.
The model performed better for crops with clearly separated clusters
in the feature space but struggled in regions with complex interactions.
○ Support Vector Machine (SVM): With normalized data and a
polynomial kernel, SVM achieved an accuracy of 98.02%. This model
effectively captured non-linear relationships, demonstrating its
capability in handling overlapping classes. However, the kernelized
approach required significant computational resources, making it less
efficient for large datasets. Its ability to define precise decision
boundaries made it suitable for predicting crops where subtle
variations in features mattered.
○ Logistic Regression: Logistic Regression achieved an accuracy of
95.76%. It was particularly effective for crops with linearly separable
data but struggled with those requiring non-linear decision boundaries.
As a baseline model, it provided a reference point for evaluating more
complex algorithms. While it lacked the sophistication to capture
intricate patterns, its simplicity and computational efficiency made it a
viable option for straightforward problems.
○ Random Forest: This ensemble model delivered the highest accuracy
of 99.15%. By combining predictions from multiple decision trees, it
captured complex patterns while minimizing overfitting. Its bagging
approach ensured robustness, and feature importance analysis
highlighted attributes like temperature, pH, and rainfall as critical for
predictions. Random Forest provided consistent performance across
all classes, including crops with overlapping features, making it ideal
for practical deployment.

● Confusion Matrix/Classification Report:

○ Random Forest Confusion Matrix: The Random Forest model
displayed strong performance with minimal misclassifications across
all crop categories. The classification report highlighted high
precision, recall, and F1-scores for most crops, though a few crops
with overlapping feature distributions exhibited slightly lower recall.
○ Support Vector Machine (SVM) Confusion Matrix: The SVM
model demonstrated commendable accuracy, particularly for crops
with non-linear boundaries. Its confusion matrix reflected balanced
classification, although its performance slightly dropped for crops
with subtle variations in feature space.
○ Naive Bayes Confusion Matrix: Naive Bayes, while effective for
crops with distinct feature distributions, showed increased
misclassifications for crops requiring a nuanced understanding of
feature interactions. Its classification report reflected lower F1-scores
for certain classes due to its assumption of feature independence.
○ Classification Report Insights: The Random Forest model emerged
as the most balanced performer, achieving high F1-scores even for
crops with limited samples in the test set. SVM followed closely,
performing well for non-linearly separable crops. In contrast, Naive
Bayes struggled with interdependent features, leading to reduced
precision and recall for certain crops.
● Model Interpretation:
○ Feature Importance:
■ For tree-based models like Random Forest, an analysis of
feature importance revealed the most influential attributes for
crop predictions. Features such as temperature, pH, and rainfall
had the highest importance scores, underscoring the critical role
of environmental factors in crop suitability. This insight
reinforces the need for comprehensive environmental data in
agricultural decision-making.
○ Error Analysis:
■ Underfitting: Naive Bayes and Logistic Regression struggled
with crops requiring non-linear decision boundaries due to their
inherent model assumptions. Their simplifying assumptions
limited their ability to capture intricate patterns, leading to
reduced accuracy for such cases.

■ Overfitting: The Decision Tree model, despite its constrained

depth, occasionally memorized training patterns, resulting in
reduced generalization during testing. This issue was mitigated
in Random Forest due to its ensemble learning approach.

■ Class Imbalance: All models exhibited slightly lower recall for

less-represented crops, a common issue in datasets with uneven
class distributions. Techniques like oversampling or synthetic
data generation could help mitigate this problem in future
iterations.
Conclusion
● Summary: This project explored the application of machine learning to
recommend crops based on soil and environmental factors. Using a variety
of models, we demonstrated the effectiveness of data-driven decision-
making in agriculture.
● Findings: The Random Forest model emerged as the top performer,
achieving the highest accuracy of 99.15%. Its ensemble learning approach
provided robust predictions, making it particularly suitable for complex
agricultural datasets. Other models like Decision Tree and SVM also showed
promise, albeit with limitations in generalization and computational
efficiency.
● Limitations: While the models performed well on the dataset, their real-
world applicability is constrained by the limited size and scope of the dataset.
Factors like extreme environmental conditions or additional variables (e.g.,
micronutrient levels) were not accounted for. Additionally, class imbalance
in the dataset affected recall for less-represented crops, highlighting the need
for balanced data or advanced techniques like oversampling.
● Future Work:
■ Future efforts can focus on the following enhancements:
● Incorporating advanced models like XGBoost, which
leverage gradient boosting to capture subtle patterns and
relationships in complex datasets. XGBoost has the
potential to outperform Random Forest by providing
higher accuracy and better generalization.
● Expanding the dataset to include more crop types, soil
compositions, and environmental conditions to improve
model robustness and applicability across diverse
agricultural contexts.
● Experimenting with deep learning or transfer learning
approaches to explore their suitability for crop
recommendation tasks. Neural networks, especially
convolutional or recurrent architectures, might provide
further improvements in accuracy.
Timesheet
● Aaryan Mantri: Initial model setup, Hyperparameter tuning

● Agarwal Vedant Rakesh: Data preprocessing, EDA, Analysis

● Balla Mahadev Shrikrishna: Model training, Report Preparation,

Documentation
References
● Introduction to Machine Learning for Everyone

○ Machine Learning for Everybody by Kylie Ying: FreeCodeCamp

Tutorial, FreeCodeCamp, 2022. Link: https://youtu.be/i_LwzRVP7bg

● Pandas Documentation

○ Pandas: Python Data Analysis Library, 2024. Link:

https://pandas.pydata.org

● NumPy Documentation

○ NumPy: The fundamental package for scientific computing with

Python, 2024. Link: https://numpy.org

● Scikit-learn: Machine Learning in Python

○ Link: https://scikit-learn.org/stable/

● Seaborn Documentation

○ M. Waskom, Seaborn: Statistical Data Visualization, 2024. [Online].

Link: https://seaborn.pydata.org

● Matplotlib Documentation

○ Link: https://matplotlib.org
Appendices (optional)
● Additional Figures or Tables: Include any figures or tables that do not fit
into the main body.
● Code Snippets: Provide any relevant code sections, especially if you want
to highlight a specific method or function.

Classification Model For Discovering The Type of Crop To Plant Us
No ratings yet
Classification Model For Discovering The Type of Crop To Plant Us
39 pages
SK Mapa Linear Algebra
100% (2)
SK Mapa Linear Algebra
163 pages
Comprehensive Analysis of Crop Recommendation System Using Machine Learning and IoT
No ratings yet
Comprehensive Analysis of Crop Recommendation System Using Machine Learning and IoT
21 pages
Mini Project PP T
No ratings yet
Mini Project PP T
20 pages
Project
No ratings yet
Project
14 pages
Soil Nutrient Analysis
No ratings yet
Soil Nutrient Analysis
9 pages
Final Project
100% (2)
Final Project
28 pages
Deep Learning Report
No ratings yet
Deep Learning Report
18 pages
Smart Farm Data Driven Crop Recommendation System
No ratings yet
Smart Farm Data Driven Crop Recommendation System
14 pages
Batch-4 Idp
No ratings yet
Batch-4 Idp
52 pages
Review 2 Capstone
No ratings yet
Review 2 Capstone
15 pages
Agricultural ML
No ratings yet
Agricultural ML
11 pages
Yield Prediction Using Machine Learning
0% (1)
Yield Prediction Using Machine Learning
8 pages
GPTGeniuses Crop Recommendation System
No ratings yet
GPTGeniuses Crop Recommendation System
13 pages
Acer Aspire v5-572p Quanta ZQK DAOZQKMB8E0 Rev1A Schematic
100% (1)
Acer Aspire v5-572p Quanta ZQK DAOZQKMB8E0 Rev1A Schematic
46 pages
Crop Recommendation System and Plant Disease Classification Using Machine Learning For Precision Agriculture
No ratings yet
Crop Recommendation System and Plant Disease Classification Using Machine Learning For Precision Agriculture
11 pages
Crop Recommendation
No ratings yet
Crop Recommendation
12 pages
Sample Template File For Project
No ratings yet
Sample Template File For Project
13 pages
Crop Recommendation System Using ML
No ratings yet
Crop Recommendation System Using ML
11 pages
Project Report Crop Recommendations System Using Data Science
No ratings yet
Project Report Crop Recommendations System Using Data Science
14 pages
Agri Crop
No ratings yet
Agri Crop
13 pages
Paper 2
No ratings yet
Paper 2
4 pages
Paper Id - 167
No ratings yet
Paper Id - 167
7 pages
ENSEMBLED CROPIFY Crop Amp Fertilizer Recommender System With Leaf Disease Prediction
No ratings yet
ENSEMBLED CROPIFY Crop Amp Fertilizer Recommender System With Leaf Disease Prediction
5 pages
T-3 A Comprehensive Crop Recommendation System Integrating Machine Learning and Deep Learning Models
No ratings yet
T-3 A Comprehensive Crop Recommendation System Integrating Machine Learning and Deep Learning Models
8 pages
Slides
No ratings yet
Slides
21 pages
Agriculture Crop Recommendation - Batch-13
No ratings yet
Agriculture Crop Recommendation - Batch-13
25 pages
Draft Version: Precision Agriculture: A Machine Learning Approach To Crop Recommendation
No ratings yet
Draft Version: Precision Agriculture: A Machine Learning Approach To Crop Recommendation
6 pages
Engproc 67 07073
No ratings yet
Engproc 67 07073
11 pages
IEEE Conference Template
No ratings yet
IEEE Conference Template
5 pages
Enabled System For Crop Recommendation
No ratings yet
Enabled System For Crop Recommendation
11 pages
Smart Crop Recommendation System: Internship Report
No ratings yet
Smart Crop Recommendation System: Internship Report
22 pages
Research Paper
No ratings yet
Research Paper
6 pages
Krishi Mitra - Intelligent Crop Recommender System: Department of Computer Engineering
No ratings yet
Krishi Mitra - Intelligent Crop Recommender System: Department of Computer Engineering
22 pages
Crop Recommendation Based On Geographical Factors Using Machine Learning Approach
No ratings yet
Crop Recommendation Based On Geographical Factors Using Machine Learning Approach
3 pages
Project
No ratings yet
Project
30 pages
Project Report PDF
No ratings yet
Project Report PDF
23 pages
Agriai Manuscript
No ratings yet
Agriai Manuscript
9 pages
MINI Project
No ratings yet
MINI Project
23 pages
Tellicorp An Ensemble Model To Predict Crop Using Machine Learning Algorithms
No ratings yet
Tellicorp An Ensemble Model To Predict Crop Using Machine Learning Algorithms
11 pages
Agricultural Crop Recommendation System
No ratings yet
Agricultural Crop Recommendation System
5 pages
Ai Driven Soil Monitoring and Crop Recommendation Using Machine Learning Algorithm
No ratings yet
Ai Driven Soil Monitoring and Crop Recommendation Using Machine Learning Algorithm
8 pages
Crop 7
No ratings yet
Crop 7
5 pages
Synopsis CROPRECOMMENDATION
No ratings yet
Synopsis CROPRECOMMENDATION
13 pages
Krishi Sahayak Research Paper
No ratings yet
Krishi Sahayak Research Paper
9 pages
CRS Research Paper-1
No ratings yet
CRS Research Paper-1
6 pages
Crop Prediction Based On Characteristics of Agricultural Environment
No ratings yet
Crop Prediction Based On Characteristics of Agricultural Environment
9 pages
Crop and Fertilizer Recommendation Using AI Ijariie19386
No ratings yet
Crop and Fertilizer Recommendation Using AI Ijariie19386
8 pages
CAPSTONE THESIS Format
No ratings yet
CAPSTONE THESIS Format
29 pages
Mini Project
No ratings yet
Mini Project
17 pages
Research Pepar
No ratings yet
Research Pepar
4 pages
Paper 4
No ratings yet
Paper 4
7 pages
Paper 6
No ratings yet
Paper 6
5 pages
Crop&Fertilizer Synopsis
No ratings yet
Crop&Fertilizer Synopsis
7 pages
Agrobot in Field of Machine Learning
No ratings yet
Agrobot in Field of Machine Learning
7 pages
Crop and Nutrient Recommendation System Using Machine Learning For Precision Agriculture
No ratings yet
Crop and Nutrient Recommendation System Using Machine Learning For Precision Agriculture
6 pages
Sample Poster
No ratings yet
Sample Poster
1 page
Agricultural Crop Recommendation System
No ratings yet
Agricultural Crop Recommendation System
10 pages
Sample Concept Note
No ratings yet
Sample Concept Note
2 pages
Crop Prediction Using Machine Learning
No ratings yet
Crop Prediction Using Machine Learning
6 pages
Halogen Derivatives
No ratings yet
Halogen Derivatives
20 pages
Tableau Tutorial For Beginners
No ratings yet
Tableau Tutorial For Beginners
8 pages
Modifications For The Kenwood TS-940
No ratings yet
Modifications For The Kenwood TS-940
10 pages
Sma 306 - Complex Analysis 1 - April 2017
No ratings yet
Sma 306 - Complex Analysis 1 - April 2017
4 pages
J.E. Maintenance Manual 2011 07
No ratings yet
J.E. Maintenance Manual 2011 07
8 pages
Dbms Mini Project Spms
No ratings yet
Dbms Mini Project Spms
15 pages
Spesifikasi Barang Listrik
No ratings yet
Spesifikasi Barang Listrik
2 pages
MHT-CET - PCM - Lesson Plan
No ratings yet
MHT-CET - PCM - Lesson Plan
1 page
CBSE Class12 PYQs Electric Charges and Fields-1
No ratings yet
CBSE Class12 PYQs Electric Charges and Fields-1
2 pages
Emerson Digital Compressor Controller
No ratings yet
Emerson Digital Compressor Controller
17 pages
General Mathematics 11-Module 1
No ratings yet
General Mathematics 11-Module 1
6 pages
Quant Checklist Module 16 by Aashish Arora
No ratings yet
Quant Checklist Module 16 by Aashish Arora
60 pages
Wear3 PDF
No ratings yet
Wear3 PDF
8 pages
Vogelsang ETEP-Journal Detection of Electrical Tree Propagation by Partial Discharge Measurements
No ratings yet
Vogelsang ETEP-Journal Detection of Electrical Tree Propagation by Partial Discharge Measurements
7 pages
(Question) Mat 491 Final Assessment 6aug2021 3PM-5PM
No ratings yet
(Question) Mat 491 Final Assessment 6aug2021 3PM-5PM
4 pages
Linked List: Unit Ii
No ratings yet
Linked List: Unit Ii
22 pages
Quat 6221 WB
No ratings yet
Quat 6221 WB
148 pages
Physics 2020 QP Set 1 English
No ratings yet
Physics 2020 QP Set 1 English
10 pages
Laser Maser
No ratings yet
Laser Maser
4 pages
Unit 1 Exam Qs and MS
No ratings yet
Unit 1 Exam Qs and MS
17 pages
Wa0000.
No ratings yet
Wa0000.
5 pages
Edit (Transformation Rules)
No ratings yet
Edit (Transformation Rules)
5 pages
Classification of Reservoirs and Reservoir Fluid Properties: Dr. Farqad Hadi
No ratings yet
Classification of Reservoirs and Reservoir Fluid Properties: Dr. Farqad Hadi
7 pages
TG63 DS en
No ratings yet
TG63 DS en
4 pages
Virtual Density Lab 2018 PDF
No ratings yet
Virtual Density Lab 2018 PDF
2 pages
2019 10 04 Metrycom MS4000 A PDF
No ratings yet
2019 10 04 Metrycom MS4000 A PDF
18 pages
MEC132 F3 1819 Solutions
No ratings yet
MEC132 F3 1819 Solutions
7 pages
Kinematics of Motion: Motion Along A Straight Line
No ratings yet
Kinematics of Motion: Motion Along A Straight Line
26 pages
A Few TEQC Tips For Getting Started: Beth Pratt-Sitaula (UNAVCO)
No ratings yet
A Few TEQC Tips For Getting Started: Beth Pratt-Sitaula (UNAVCO)
2 pages
Bar 2
No ratings yet
Bar 2
3 pages
C++ Programs
No ratings yet
C++ Programs
7 pages
CHE 3800 - Mass Transfer and Separation Process (Winter 2017)
No ratings yet
CHE 3800 - Mass Transfer and Separation Process (Winter 2017)
3 pages
Math 102 Midterms Reviewer (With Mock Tests)
No ratings yet
Math 102 Midterms Reviewer (With Mock Tests)
3 pages
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
From Everand
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
César Pérez López
No ratings yet
Use Cases of AI and ML in Agriculture: Smart Project Ideas
From Everand
Use Cases of AI and ML in Agriculture: Smart Project Ideas
Zemelak Goraga
No ratings yet

PDS Report 2024-25

Uploaded by

PDS Report 2024-25

Uploaded by

Bhartiya Vidya Bhavan’s

Sardar Patel Institute of Technology

Crop Recommendation System

This project presents a crop recommendation system using machine learning to

● Problem Statement: Agriculture plays a crucial role in supporting the

 Source: Kaggle’s Crop Recommendation Dataset.

 Summary Statistics: Calculated mean, median, standard deviation, etc., for

3. Feature Selection and Extraction

 All features—nitrogen, phosphorus, potassium, temperature, humidity, pH,

 Techniques like PCA or LDA were not applied, as reducing dimensions

 Precision: Precision measures the proportion of true positive predictions

 Recall: Recall, also known as sensitivity or true positive rate, measures

 F1-Score: The F1-score is the harmonic mean of precision and recall,

● Confusion Matrix/Classification Report:

■ Overfitting: The Decision Tree model, despite its constrained

■ Class Imbalance: All models exhibited slightly lower recall for

● Agarwal Vedant Rakesh: Data preprocessing, EDA, Analysis

● Balla Mahadev Shrikrishna: Model training, Report Preparation,

○ Machine Learning for Everybody by Kylie Ying: FreeCodeCamp

○ Pandas: Python Data Analysis Library, 2024. Link:

○ NumPy: The fundamental package for scientific computing with

● Scikit-learn: Machine Learning in Python

○ M. Waskom, Seaborn: Statistical Data Visualization, 2024. [Online].

You might also like