ML Project (1) Final
ML Project (1) Final
selling price of
used cars in India
USING REGRESSION AND DECISION
TREE MODELS
MACHINE LEARNING – 21CSC305P
D I V YA M T YA G I – RA2211003030399
A N S H S A C H D E VA – RA2211003030378
P RAT H A M G U L AT I – RA2211003030389
Index
2.Introduction
and 3. Dataset
1. Abstract
Background Description
Research
6.
4. 5.Architecture
Experimental
Methodology of System
Setup
7.Result 8.Conclusion
The used car market in India
is growing rapidly, with a Accurately predicting the
wide range of factors price is crucial for both
influencing the selling price buyers and sellers.
of vehicles.
Abstract
Abstract – Problem
Statement
• Challenge: The used car market in
India is characterized by a wide range
of variables that affect the selling
price. Factors such as brand reputation,
car condition, mileage, and fuel type
contribute to price variability.
Traditional pricing methods often fail to
account for all these variables
effectively.
• Objective: To address this issue, we
developed machine learning models to
predict car prices based on a
comprehensive dataset. Our goal was
to provide an objective, data-driven
approach to pricing used cars.
Develop Models: Create and train
machine learning models using Linear
Regression and Decision Tree
algorithms to predict the selling price
of used cars.
Algorithms Covered :
Decision Tree: A non-linear model that splits data into subsets based on
feature values to make predictions.
Objective: To understand how each algorithm works and how they are
applied to the problem of predicting car prices.
Introduction to Algorithms
Linear Regression Algorithm - Steps Encode Categorical
Handle Missing Features: Convert
Values: Use techniques categorical data (e.g.,
Steps: Data Preparation: like mean imputation or brand, fuel type) into
remove records with numerical formats using
excessive missing data. One-Hot Encoding or
Label Encoding.
Scale Numerical
Objective Function:
Features: Apply scaling Fit the Model: Train the
Minimize the Mean
methods (e.g., Standard Linear Regression model
Model Training: Squared Error (MSE) to
Scaling) to ensure on the prepared training
find the best-fitting line
features contribute data.
through the data.
equally to the model.
Metrics: Assess
performance using R- Predict Prices: Use the
squared (explained trained model to forecast
Model Evaluation: Prediction:
variance), Mean Squared the selling prices of new
Error (MSE), and Mean or unseen cars.
Absolute Error (MAE).
Decision Tree Algorithm - Steps
Handle Missing Encode Categorical
Values: Similar to Linear Features: Convert
Steps: Data Preparation: Regression, preprocess categorical data to
data to manage missing numerical values to be
values. usable by the algorithm.
Scope :
Objectives The project focuses on cars sold in India, utilizing data from
sources such as online car marketplaces, dealerships, and