0% found this document useful (0 votes)

18 views15 pages

ML Project (1) Final

The document discusses a project aimed at predicting the selling price of used cars in India using machine learning models, specifically Linear Regression and Decision Trees. It highlights the challenges in the used car market, such as varying factors affecting prices and the limitations of traditional pricing methods. The project utilizes a comprehensive dataset to develop and evaluate models for accurate price predictions, emphasizing the importance of a data-driven approach.

Uploaded by

rs9938

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views15 pages

ML Project (1) Final

Uploaded by

rs9938

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Prediction of

selling price of
used cars in India
USING REGRESSION AND DECISION
TREE MODELS
MACHINE LEARNING – 21CSC305P

D I V YA M T YA G I – RA2211003030399
A N S H S A C H D E VA – RA2211003030378
P RAT H A M G U L AT I – RA2211003030389
Index
2.Introduction
and 3. Dataset
1. Abstract
Background Description
Research

6.
4. 5.Architecture
Experimental
Methodology of System
Setup

7.Result 8.Conclusion
The used car market in India
is growing rapidly, with a Accurately predicting the
wide range of factors price is crucial for both
influencing the selling price buyers and sellers.
of vehicles.

In this project, we utilize

Our approach involves
machine learning models,
feature engineering, data
specifically Linear
preprocessing, and model
Regression and Decision
evaluation to determine the
Tree algorithms, to predict
most accurate and efficient
the selling price of used
model for this task.
cars in India.

Abstract
Abstract – Problem
Statement
• Challenge: The used car market in
India is characterized by a wide range
of variables that affect the selling
price. Factors such as brand reputation,
car condition, mileage, and fuel type
contribute to price variability.
Traditional pricing methods often fail to
account for all these variables
effectively.
• Objective: To address this issue, we
developed machine learning models to
predict car prices based on a
comprehensive dataset. Our goal was
to provide an objective, data-driven
approach to pricing used cars.
Develop Models: Create and train
machine learning models using Linear
Regression and Decision Tree
algorithms to predict the selling price
of used cars.

Evaluate Performance: Assess the

performance of these models using
metrics like Mean Squared Error (MSE)
and R-squared to determine their
Abstract – accuracy and effectiveness.

Objectives of Compare Models: Compare the

results of the Linear Regression and
the Project Decision Tree models to identify which
approach provides better predictions
for used car prices.
Introduction
• The used car market in India is one of
the fastest-growing sectors, driven by
increasing demand for affordable
vehicles. Pricing used cars accurately is
critical to ensure fair deals for both
buyers and sellers. Traditional valuation
methods, often based on experience or
subjective judgment, may not capture
the full range of factors that influence
the price.
• Significance: Machine learning offers a
data-driven approach to this problem,
allowing for more accurate and objective
price predictions. By utilizing models like
Linear Regression and Decision Trees,
we can predict prices based on a range
of factors, making the process more
efficient and reliable.
Purpose: This presentation explores the machine learning algorithms used
to predict the selling price of used cars in India.

Algorithms Covered :

Linear Regression: A statistical method for predicting a continuous target

variable based on linear relationships.

Decision Tree: A non-linear model that splits data into subsets based on
feature values to make predictions.

Objective: To understand how each algorithm works and how they are
applied to the problem of predicting car prices.

Introduction to Algorithms
Linear Regression Algorithm - Steps Encode Categorical
Handle Missing Features: Convert
Values: Use techniques categorical data (e.g.,
Steps: Data Preparation: like mean imputation or brand, fuel type) into
remove records with numerical formats using
excessive missing data. One-Hot Encoding or
Label Encoding.

Scale Numerical
Objective Function:
Features: Apply scaling Fit the Model: Train the
Minimize the Mean
methods (e.g., Standard Linear Regression model
Model Training: Squared Error (MSE) to
Scaling) to ensure on the prepared training
find the best-fitting line
features contribute data.
through the data.
equally to the model.

Metrics: Assess
performance using R- Predict Prices: Use the
squared (explained trained model to forecast
Model Evaluation: Prediction:
variance), Mean Squared the selling prices of new
Error (MSE), and Mean or unseen cars.
Absolute Error (MAE).
Decision Tree Algorithm - Steps
Handle Missing Encode Categorical
Values: Similar to Linear Features: Convert
Steps: Data Preparation: Regression, preprocess categorical data to
data to manage missing numerical values to be
values. usable by the algorithm.

Build the Tree:

Recursively split the
Splitting Criteria: Use
data based on the most
metrics like Gini Impurity
significant features until
Model Training: or Information Gain to Model Evaluation:
the stopping criterion is
evaluate and choose the
met (e.g., maximum
best splits.
depth, minimum
samples per leaf).

Metrics: Evaluate using Predict Prices: Use the

Accuracy, Precision, constructed tree to
Recall, and F1-Score. predict the selling price
Prediction:
Visualize the tree to by traversing the tree
understand decision based on the feature
paths. values of new cars.
Background Study

Challenges in the Indian

Market: The Indian used car
market is unique due to the
Traditional Methods: Typically,
diversity of car brands, varying
car dealerships and individual
conditions of roads, climate
sellers rely on their experience or
effects, and different buyer
basic online tools to estimate car
preferences. Factors like the
prices. However, these methods
brand, age, mileage, and fuel
often fail to account for complex
type play a significant role in
relationships between various
determining the selling price. In
factors affecting price.
addition, unstructured data and
inconsistent records make pricing
prediction a difficult task.
Problem Statement
• Inconsistency in Price Predictions:
Due to the wide range of factors affecting
the price, predictions can vary
significantly from one evaluator to
another. This inconsistency leads to
mistrust among buyers and sellers.
• Need for Objectivity: A more objective,
data-driven approach can reduce errors in
price estimation, making transactions
smoother and more transparent.
Objectives:

To analyze the factors influencing the selling price of used

cars in India.

To develop machine learning models that provide accurate

price predictions.

To evaluate and compare the performance of Linear

Regression and Decision Tree models for this task.

Scope :

Objectives The project focuses on cars sold in India, utilizing data from
sources such as online car marketplaces, dealerships, and

and Scope other available datasets.

The models will predict prices based on key features such as
brand, model, year, mileage, fuel type, and ownership history.
The project is limited to Linear Regression and Decision Tree
models, though other models may be considered in future
work.
Dataset Description
- Overview
• The dataset was obtained from
https://www.kaggle.com/ , https://www.cardekho.com/
. It reflects real-world transactions of used cars in
India, providing a comprehensive view of various
factors affecting car prices.
• The dataset includes approximately Mileage records ,
Fuel Type features and Accident History, covering a
broad spectrum of car models, brands, and conditions.
It encompasses data from different regions across
India, allowing for diverse market representation.
• The primary aim of using this dataset is to develop
and evaluate machine learning models that predict
used car prices based on various features.
Dataset Description
Key Features - Part 1
• 1. Brand : Indicates the car's manufacturer (e.g., Maruti,
Hyundai, Honda). The brand significantly impacts the car's
resale value due to factors like brand reputation, customer
loyalty, and perceived quality .
• Example: Maruti Suzuki, being a dominant player in the
Indian market, often results in higher resale values for its
vehicles.
• 2. Model : Specifies the car model (e.g., Swift, i20, City).
Different models within a brand may have varying levels of
demand and resale value .
• Example: The Hyundai i20 is generally more sought after
compared to other models due to its popularity and features.
• 3. Year of Manufacture : The year the car was produced.
Newer cars usually have a higher resale value due to less
depreciation compared to older models .
• Example: A car from 2018 will typically be priced higher
than a car from 2012, given similar conditions and mileage.
Dataset Description
Key Features – Part 2
• 4. Mileage : The total distance traveled by the car in
kilometers. Higher mileage often correlates with increased
wear and tear, impacting the car’s resale price negatively.
• Example: A car with 40,000 km will usually be valued
higher than a similar car with 80,000 km.
• 5. Fuel Type : The type of fuel used by the car (e.g., Petrol,
Diesel, CNG, Electric). Fuel type influences not only the car’s
running cost but also its resale value, with diesel cars
traditionally fetching higher prices .
• Example: Diesel cars may have higher resale values in
regions where diesel is more economical compared to
petrol.
• 6. Transmission : The type of transmission system
(Manual or Automatic). Automatic transmissions are
increasingly preferred in urban settings, potentially affecting
resale values .
• Example: An automatic transmission may add to the car's
resale value in busy metropolitan areas due to ease of
driving.

Car Price Prediction
67% (3)
Car Price Prediction
54 pages
Car Resale Value
No ratings yet
Car Resale Value
20 pages
Pre-Owned Car Price and Life Prediction Using Machine Learning
No ratings yet
Pre-Owned Car Price and Life Prediction Using Machine Learning
26 pages
Bulldozer Price Prediction Using Regression Model (Research Ethics)
No ratings yet
Bulldozer Price Prediction Using Regression Model (Research Ethics)
19 pages
Ai Pera
No ratings yet
Ai Pera
10 pages
Final Print
No ratings yet
Final Print
39 pages
Used Cars Price Prediction and Valuation Using Data Mining Techni
No ratings yet
Used Cars Price Prediction and Valuation Using Data Mining Techni
37 pages
Car Price Prediction Project Chapters
No ratings yet
Car Price Prediction Project Chapters
30 pages
Updated Used Cars Price Prediction Using Machine Learning
No ratings yet
Updated Used Cars Price Prediction Using Machine Learning
24 pages
Presentation 1
No ratings yet
Presentation 1
13 pages
ITS307 Group 4 Report
No ratings yet
ITS307 Group 4 Report
14 pages
Used Car Price Prediction Using Different Machine Learning Algorithms
No ratings yet
Used Car Price Prediction Using Different Machine Learning Algorithms
8 pages
Analyzing Selling Price of Used Cars Using Machine Learning
No ratings yet
Analyzing Selling Price of Used Cars Using Machine Learning
41 pages
PDF Sample of Film Production Management 101-2nd Edition
81% (16)
PDF Sample of Film Production Management 101-2nd Edition
26 pages
Used Cars Price Prediction and Valuation Using Data Mining Techni
100% (1)
Used Cars Price Prediction and Valuation Using Data Mining Techni
37 pages
Report Car Price Prediction
No ratings yet
Report Car Price Prediction
8 pages
Final Project - Merged
No ratings yet
Final Project - Merged
17 pages
Project Soft
No ratings yet
Project Soft
28 pages
Price Prediction
No ratings yet
Price Prediction
14 pages
Project
No ratings yet
Project
24 pages
Car Price Prediction
No ratings yet
Car Price Prediction
18 pages
Minor Project RRR
No ratings yet
Minor Project RRR
24 pages
Ajay and Saurabh
No ratings yet
Ajay and Saurabh
16 pages
A13 Nandan and Ghosh 167-184
No ratings yet
A13 Nandan and Ghosh 167-184
18 pages
Mini Project New
No ratings yet
Mini Project New
25 pages
Prediction of The Price of Used Cars Based On Mach
No ratings yet
Prediction of The Price of Used Cars Based On Mach
7 pages
Car Price Prediction Using Ai
No ratings yet
Car Price Prediction Using Ai
6 pages
Car Price Prediction Using Various Algorithms
100% (1)
Car Price Prediction Using Various Algorithms
19 pages
Sample
No ratings yet
Sample
15 pages
Used Car Price Prediction Using Machine Learning: Veluru Ranjith (Urk18Cs020)
No ratings yet
Used Car Price Prediction Using Machine Learning: Veluru Ranjith (Urk18Cs020)
26 pages
Car Price Prediction Leveraging Machine Learning
No ratings yet
Car Price Prediction Leveraging Machine Learning
11 pages
Predicting Pre-Owned Car Prices Using Machine Learning
No ratings yet
Predicting Pre-Owned Car Prices Using Machine Learning
17 pages
74 Ijcse2018 19
No ratings yet
74 Ijcse2018 19
7 pages
IRJMETS60300008997
No ratings yet
IRJMETS60300008997
6 pages
Car Dekho-Used Car Price Prediction
No ratings yet
Car Dekho-Used Car Price Prediction
10 pages
Pre-Owned Car Price Prediction Using Machine Learning Techniques
No ratings yet
Pre-Owned Car Price Prediction Using Machine Learning Techniques
5 pages
Duplichecker Plagiarism Report
No ratings yet
Duplichecker Plagiarism Report
3 pages
Sample Paper 6
No ratings yet
Sample Paper 6
10 pages
Car Evaluation
No ratings yet
Car Evaluation
62 pages
PPSD 1743674861
No ratings yet
PPSD 1743674861
3 pages
Sanke 2024 Ijca 923900
No ratings yet
Sanke 2024 Ijca 923900
6 pages
Car Price Prediction Project
No ratings yet
Car Price Prediction Project
34 pages
IOMP1
No ratings yet
IOMP1
21 pages
Data Mining Report
No ratings yet
Data Mining Report
25 pages
Paper 10479
No ratings yet
Paper 10479
4 pages
33 Submission
No ratings yet
33 Submission
8 pages
Price Prediction For Pre-Owned Cars Using Ensemble
No ratings yet
Price Prediction For Pre-Owned Cars Using Ensemble
10 pages
Ai and Machine Learning For Predicting
No ratings yet
Ai and Machine Learning For Predicting
9 pages
Project Poster A17
No ratings yet
Project Poster A17
1 page
1st Review
No ratings yet
1st Review
9 pages
Demo Abstract
No ratings yet
Demo Abstract
1 page
Scania Diagnos & Programmer 3 2.28
80% (5)
Scania Diagnos & Programmer 3 2.28
13 pages
ML Case Study
No ratings yet
ML Case Study
11 pages
Used Car Price Prediction
No ratings yet
Used Car Price Prediction
20 pages
Machine Learning-Based Models For Accurate Car Pri
No ratings yet
Machine Learning-Based Models For Accurate Car Pri
6 pages
Prediction of Car Price Using Linear Regression
No ratings yet
Prediction of Car Price Using Linear Regression
4 pages
Activity No 03
No ratings yet
Activity No 03
4 pages
Duplichecker Plagiarism Report
No ratings yet
Duplichecker Plagiarism Report
1 page
Car Price Prediction
No ratings yet
Car Price Prediction
12 pages
Predicting Used Car Prices With Data Analytics
No ratings yet
Predicting Used Car Prices With Data Analytics
10 pages
Research Paper
No ratings yet
Research Paper
3 pages
UN40D5500 Trobleshooting PDF
100% (1)
UN40D5500 Trobleshooting PDF
49 pages
Differences Between Precision and Comfort Cooling
No ratings yet
Differences Between Precision and Comfort Cooling
5 pages
UNIT 2 Design of Shafts, Keys and Couplings
No ratings yet
UNIT 2 Design of Shafts, Keys and Couplings
6 pages
Implementation of Robot Systems An Introduction To Robotics Automation and Successful Systems Integration in Manufacturing 1st Edition by MIKE WILSON 9780124047495 0124047491 Download
No ratings yet
Implementation of Robot Systems An Introduction To Robotics Automation and Successful Systems Integration in Manufacturing 1st Edition by MIKE WILSON 9780124047495 0124047491 Download
65 pages
Lesson Plan in TLE IC 8: Teaching The Common Competencies in ICT
No ratings yet
Lesson Plan in TLE IC 8: Teaching The Common Competencies in ICT
2 pages
P-WPS 135 - MAG (GR 316)
No ratings yet
P-WPS 135 - MAG (GR 316)
9 pages
(GF54.15-P-1256-06TB) Fuse Assignment in (N10-2) RearSAM
No ratings yet
(GF54.15-P-1256-06TB) Fuse Assignment in (N10-2) RearSAM
3 pages
Kingspan Quadcore ks1000rw Roof Panel Data Sheet en GB Ie
No ratings yet
Kingspan Quadcore ks1000rw Roof Panel Data Sheet en GB Ie
9 pages
ERTAF - ERTAF-2 - Product Report
No ratings yet
ERTAF - ERTAF-2 - Product Report
2 pages
MMA0041 Merged
No ratings yet
MMA0041 Merged
382 pages
Processes and Threads
No ratings yet
Processes and Threads
14 pages
ASM450 FC44 FB240 e
No ratings yet
ASM450 FC44 FB240 e
102 pages
FactSheet - QoS v1
No ratings yet
FactSheet - QoS v1
4 pages
Building and Installing The USRP Open-Source Toolchain (UHD and GNU Radio) On Linux PDF
No ratings yet
Building and Installing The USRP Open-Source Toolchain (UHD and GNU Radio) On Linux PDF
5 pages
Specs Adafruit Feather
No ratings yet
Specs Adafruit Feather
113 pages
PD80-01 SpecSheet
No ratings yet
PD80-01 SpecSheet
4 pages
PhpMyAdmin SQL Dump
No ratings yet
PhpMyAdmin SQL Dump
11 pages
Package Aware R
No ratings yet
Package Aware R
98 pages
Ratesem 2000 Handouts Int2
No ratings yet
Ratesem 2000 Handouts Int2
31 pages
Shear Check As Per Codes
No ratings yet
Shear Check As Per Codes
10 pages
Flip-Flops - Conversions
No ratings yet
Flip-Flops - Conversions
18 pages
AIAA Space 2016 Proceedings LFRE
No ratings yet
AIAA Space 2016 Proceedings LFRE
8 pages
Email Spoofing Detection Using Volatile Memory
No ratings yet
Email Spoofing Detection Using Volatile Memory
7 pages
Land Forces Academy Review) Collision Avoidance System Using Ultrasonic Sensor
No ratings yet
Land Forces Academy Review) Collision Avoidance System Using Ultrasonic Sensor
8 pages
Course Outline - BBA - Bus Stat Fall 14-15
No ratings yet
Course Outline - BBA - Bus Stat Fall 14-15
3 pages
Session - 9: Advanced Microprocessor Features - Study of Intel 80286 Processor
No ratings yet
Session - 9: Advanced Microprocessor Features - Study of Intel 80286 Processor
10 pages
HiperLAN - Wikipedia
No ratings yet
HiperLAN - Wikipedia
3 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet

ML Project (1) Final

Uploaded by

ML Project (1) Final

Uploaded by

Prediction of

In this project, we utilize

Evaluate Performance: Assess the

Objectives of Compare Models: Compare the

Linear Regression: A statistical method for predicting a continuous target

Build the Tree:

Metrics: Evaluate using Predict Prices: Use the

Challenges in the Indian

To analyze the factors influencing the selling price of used

To develop machine learning models that provide accurate

To evaluate and compare the performance of Linear

and Scope other available datasets.

You might also like