Predicting True Value of Cars Using Ml-1

This document describes a mini project that aims to build a machine learning model to predict the true value of used cars using attributes like make, model, year, mileage. The project uses a dataset of used car prices to train a linear regression model in Python with libraries like Scikit-learn, Pandas and Matplotlib. The model could help determine the actual price of a used car based on its features.

Uploaded by

mounikayelisetti706

A MINI PROJECT REPORT

PREDICTING TRUE VALUE OF USED CAR USING MACHINE LEARNING TECHNIQUE

Submitted in partial fulfilment of the requirements for the award of the degree of

Bachelor of Technology
in
Computer Science and Engineering

By
P.A.D.Prasanna O180792
K.Naga Sandhya O180807
P.Sai Nageswari O180812
C.V.V.Vineeth O180817

Under the Guidance of

Mr. Krishna M, Assistant Professor
Computer Science and Engineering

Department of Computer Science and Engineering

RAJIV GANDHI UNIVERSITY OF KNOWLEDGE TECHNOLOGIES
(Established through Government of A.P Act of 18 of 2008)
Ongole, Prakasam (Dt.) AP-523225

CERTIFICATE

This is to certify that the project entitled “Predicting True Value of Used Car Using Machine Learning Technique”, being submitted by K Naga Sandhya bearing ID Number O180807, P.A.D.Prasanna bearing ID Number O180792, P Sai Nageswari bearing ID Number O180812 and C.V.V.Vineeth bearing ID Number O180817 in partial fulfillment of the requirements for the award of the degree of Bachelor of Technology in Computer Science and Engineering in Dr. APJ Abdul Kalam, RGUKT-AP, IIIT Ongole, is a record of bonafide work carried out by them under my guidance and supervision from February 2023 to June 2023.

The results presented in this project have been verified and found to be satisfactory.
The results embodied in this project report have not been submitted to any other University for the
award of any other degree or diploma.

Mr. Krishna M
Assistant Professor,
Department of CSE,
RGUKT, Ongole.

Mr. B Sampath Babu
Head of the Department,
Department of CSE,
RGUKT, Ongole.

APPROVAL SHEET

This report entitled “Predicting True Value of Used Car Using Machine Learning Technique”, being submitted by K Naga Sandhya bearing ID Number O180807, P.A.D.Prasanna bearing ID Number O180792, P Sai Nageswari bearing ID Number O180812 and C.V.V.Vineeth bearing ID Number O180817, guided by Mr. Krishna M, is approved for the degree of Bachelor of Technology in Computer Science and Engineering.

Examiners

Supervisors(s)

Date:

Place:

ACKNOWLEDGEMENT

It is our privilege to express a profound sense of respect, gratitude and indebtedness to our guide Mr. Krishna M, Assistant Professor, Dept. of Computer Science and Engineering, Dr APJ Abdul Kalam, RGUKT-AP, IIIT Ongole, for his indefatigable inspiration, guidance, cogent discussion, constructive criticism and encouragement throughout the dissertation work.

We express our sincere gratitude to B. Sampath Babu, Asst. Professor & Head, Department of Computer Science and Engineering, Dr APJ Abdul Kalam, RGUKT-AP, IIIT Ongole, for his suggestions, motivation and co-operation for the successful completion of the work.

We extend our sincere thanks to Dr Rupas Kumar, Dean Research and Development, Dr APJ Abdul Kalam, RGUKT-AP, IIIT Ongole, for his encouragement and constant help.

We extend our sincere thanks to Dr B Jaya Rami Reddy, Director, Dr APJ Abdul Kalam, RGUKT-AP, IIIT Ongole, for his encouragement.

P.A.D.Prasanna - O180792

K.Naga Sandhya - O180807

P.Sai Nageswari - O180812

C.V.V.Vineeth - O180817

DECLARATION

We hereby declare that the project work entitled “Predicting True Value of Used Car Using Machine Learning Technique”, submitted to Dr APJ Abdul Kalam, RGUKT-AP, IIIT Ongole in partial fulfilment of the requirements for the award of the degree of Bachelor of Technology (B Tech) in Computer Science and Engineering, is a record of original work done by us under the guidance of Mr. Krishna M, Assistant Professor, and that this project work has not been submitted to any university for the award of any other degree or diploma.

P.A.D.Prasanna - O180792

K.Naga Sandhya - O180807

P.Sai Nageswari - O180812

C.V.V.Vineeth - O180817

Date:

ABSTRACT

Car price prediction is of high interest, since such a system is handy for many people. To build a model for predicting the price of used cars, we applied machine learning techniques to the attributes that must be examined for a reliable and accurate prediction. The model provides the approximate selling price of a car based on the car company, model of the car, fuel type, years of service, kilometres driven, etc.

The model is given a dataset of used cars to learn from, so that it can then predict the price of a car from its attributes. Such a model can be built using machine learning algorithms.

Here we have chosen the Linear Regression model and trained it on our dataset. With a manual system, a lot of time is needed to analyse the complete dataset. Here almost all of the work is computerized, so accuracy is also maintained.

CONTENTS
1. Introduction
1.1 Motivation
1.2 Problem Definition
1.3 Objective of the project
2. Literature Review
3. Analysis
3.1 Existing System
3.2 Proposed System
3.3 Software requirement specification
3.3.1 Functional requirements
3.3.2 Non-Functional requirements
4. Diagrams
4.1 UML Diagrams
5. Implementation
5.1 Software Environment
5.2 Software Technologies
5.3 Dataset
5.4 Sample code
6. Test cases
7. Screenshots
8. Conclusion
9. Future Scope
10. Bibliography

1.INTRODUCTION

Owing to the increased prices of new cars and the financial inability of many customers to buy them, used car sales are on a global increase. Therefore, there is an urgent need for a system for predicting the true value of used cars using machine learning techniques, which effectively determines the worth of a car using a variety of features. Determining whether the listed price of a used car is fair is a challenging task, due to the many factors that drive a used vehicle’s price on the market. The focus of this project is developing a model that can accurately predict the price of a used car based on its features. Machine learning is a subfield of Artificial Intelligence: while Artificial Intelligence aims to build intelligent systems that perform tasks like a human, the goal of machine learning is to allow machines to learn from data so that they can give accurate output. Therefore, we use machine learning to teach machines with data to perform a particular task and give an accurate result. In this project we implement and evaluate machine learning methods on a dataset consisting of the car prices of different makes and models. We used Python as the base language to implement the model, because Python code is concise and readable even to new developers, which is beneficial to machine learning projects. Machine learning requires continuous data processing, and Python libraries provide base-level items that allow you to access, process, and transform your data. These are some of the libraries we used in this project:

Scikit-learn: for handling basic ML algorithms like clustering, linear and logistic regression, classification, and others.

Pandas: for high-level data structures and analysis. It allows merging and filtering of data.

Matplotlib: for creating 2D plots, histograms, charts, and other forms of visualization.

We will evaluate the performance of a machine learning model built using Linear Regression. Depending on various parameters, we will determine the price of the car. Regression algorithms are used because they provide a continuous value as output rather than a category, which makes it possible to predict the actual price of a car rather than a price range. As a result, we offer a machine learning based methodology for predicting the prices of second-hand cars based on their characteristics.
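
To illustrate why a regression model yields a continuous price rather than a price band, here is a minimal sketch of ordinary least squares with a single feature. The toy data (car age against price in lakh rupees) and the function name fit_line are assumptions made for illustration only; they are not taken from the project's dataset or code.

```python
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

# Hypothetical toy data: car age in years vs. price in lakh rupees.
ages = [1, 2, 3, 4, 5]
prices = [9.0, 8.0, 7.0, 6.0, 5.0]

slope, intercept = fit_line(ages, prices)
# The prediction is a single continuous number, not a category.
predicted = intercept + slope * 6
print(round(predicted, 2))
```

In practice the project uses scikit-learn's LinearRegression with several features; the closed-form fit above only shows the idea for one feature.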

1.1 MOTIVATION:

Almost everyone wants their own car these days, but because of factors like affordability or economic conditions, many prefer to opt for pre-owned cars. Accurately predicting used car prices requires expert knowledge, because prices depend on a variety of factors and features. Since used car prices are not constant in the market, both buyers and sellers need an intelligent system that allows them to predict the correct price efficiently. In such an intelligent system, the most difficult problem is the collection of a dataset that contains all important elements, like the manufacturing year of the car, its fuel type, its condition, miles driven, horsepower, number of doors, number of times the car has been painted, customer reviews, the weight of the car, etc. The price is affected by many factors, but unfortunately, information about these features is not always readily available. Since this project primarily focuses on a specific dataset, a benchmark dataset containing all key features was scraped. It is necessary to pre-process and transform the collected data into the proper format before feeding it to the machine learning model. As a first step, the dataset was statistically analysed and plotted. Missing, duplicated, and null values were identified and dealt with. Features were chosen and extracted using 2 correlation matrices. To build an efficient model, the most correlated features were retained, and the others were discarded. This prediction problem belongs to the supervised learning domain and can be considered a regression problem, since the target (the price) is a continuous value. Here we used the Linear Regression algorithm for the prediction.
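
The cleaning and correlation steps described above could look roughly like the following sketch. It uses plain Python rather than pandas so that it is self-contained; the rows, the column name price and the helper names clean and pearson are illustrative assumptions (year and kms_driven do appear as column names in the project's web code).

```python
from math import sqrt

def clean(rows):
    """Drop rows with missing values and exact duplicates, keeping first occurrences."""
    seen, kept = set(), []
    for row in rows:
        key = tuple(sorted(row.items()))
        if any(v is None for v in row.values()) or key in seen:
            continue
        seen.add(key)
        kept.append(row)
    return kept

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Hypothetical records, not taken from the real dataset.
rows = [
    {"year": 2015, "kms_driven": 40000, "price": 400000},
    {"year": 2015, "kms_driven": 40000, "price": 400000},  # duplicate
    {"year": 2018, "kms_driven": None, "price": 600000},   # missing value
    {"year": 2012, "kms_driven": 90000, "price": 250000},
    {"year": 2019, "kms_driven": 15000, "price": 650000},
]
cleaned = clean(rows)
prices = [r["price"] for r in cleaned]
# Correlation of each candidate feature with the target price.
corr = {f: pearson([r[f] for r in cleaned], prices) for f in ("year", "kms_driven")}
print(corr)
```

Features whose correlation with the price is close to zero would be candidates for removal; with pandas the same steps are usually written with dropna, drop_duplicates and corr.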

1.2 PROBLEM DEFINITION

The purpose of this study is to understand and evaluate used car prices, and to develop a model or a
strategy that utilizes Machine learning techniques to predict used car prices.

1.3 OBJECTIVE OF THE PROJECT:

Deciding whether a used car is worth the posted price when you see listings online can be difficult.
Several factors, including mileage, make, model, year, etc. can influence the actual worth of a car.
From the perspective of a seller, it is also a dilemma to price a used car appropriately. Based on
existing data, the aim is to use machine learning algorithms to develop models for predicting used car
prices.

2.LITERATURE REVIEW

We have reviewed several papers and articles related to our project, “Predicting True Value of Used Cars Using Machine Learning Technique”.

The first paper is “Predicting the price of Used Car Using Machine Learning Techniques”. In this paper, the authors investigate the application of supervised machine learning techniques to predict the price of used cars in Mauritius. The predictions are based on historical data collected from daily newspapers. Different techniques like multiple linear regression analysis, k-nearest neighbours, naïve Bayes and decision trees have been used to make the predictions.

The second paper is “Car Price Prediction Using Machine Learning Techniques”. A considerable number of distinct attributes are examined for reliable and accurate prediction. To build a model for predicting the price of used cars in Bosnia and Herzegovina, the authors applied three machine learning techniques (Artificial Neural Network, Support Vector Machine and Random Forest).

The Third paper is “Price Evaluation model in second hand car system”. In this paper, the price
evaluation model based on big data analysis is proposed, which takes advantage of widely circulated
vehicle data and a large number of vehicle transaction data to analyze the price data for each type of
vehicles by using the optimized BP neural network algorithm. It aims to establish a second-hand car
price evaluation model to get the price that best matches the car.

3.ANALYSIS

3.1 EXISTING SYSTEM

In the existing systems, many machine learning algorithms have been widely used to predict the price of a four-wheeler. The major drawback of these systems is that they need more attributes in order to predict the vehicle price. It is highly complicated to get sufficient datasets, since the data is spread widely all over the world. The datasets can be collected only online and not offline. The datasets will not contain information about vehicles that have not been used for a long time, and traditional model vehicles may or may not be included. Another major drawback of the existing systems is that they are very slow, because most of the work on keyword queries just analyses individual points, which is inappropriate for the many applications that call for analysis of groups of different car points.

3.2 PROPOSED SYSTEM

Based on the varying features and factors, and with the help of expert knowledge, the vehicle price can be predicted accurately. The most necessary inputs for prediction are the brand and model, the period of usage of the vehicle and its mileage. The fuel type used in the vehicle as well as the fuel consumption per mile highly affect the price of a vehicle due to frequent changes in the price of fuel. Different features like exterior color, number of doors, type of transmission, dimensions, safety, air conditioning, interior, and whether it has navigation or not will also influence the vehicle price. In this project, we applied different methods and techniques in order to achieve higher precision in used vehicle price prediction.
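
Several of the features named above (brand, fuel type, transmission) are categorical, while linear regression needs numeric inputs. A common way to bridge this is one-hot encoding, sketched below in plain Python. The helper one_hot and the sample rows are assumptions for illustration; the project's web code imports scikit-learn's OneHotEncoder, which suggests a similar step, but the training notebook is not reproduced in this report.

```python
def one_hot(rows, column):
    """Replace a categorical column with 0/1 indicator columns, one per category."""
    categories = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        new_row = {k: v for k, v in row.items() if k != column}
        for cat in categories:
            new_row[f"{column}_{cat}"] = 1 if row[column] == cat else 0
        encoded.append(new_row)
    return encoded

# Hypothetical rows with one categorical and one numeric feature.
cars = [
    {"fuel_type": "Petrol", "year": 2016},
    {"fuel_type": "Diesel", "year": 2014},
]
encoded = one_hot(cars, "fuel_type")
print(encoded)
```

Each category becomes its own column, so the regression can learn a separate price adjustment for each fuel type.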
Advantages
• The system is more effective since it measures the vehicle combinations by their prices.
• The system predicts prices accurately using Linear Regression.

3.3 SOFTWARE REQUIREMENT SPECIFICATION

The system requirements or software requirements are a listing of the software programs or hardware devices that are required to operate the program properly. A system requirement is a statement that identifies the functionality needed by a system in order to satisfy the user's requirements. Requirements are the first and foremost part of any project, because if the system requirements are not fulfilled, the project is not complete.
A software requirement can be of 2 types:
1. Functional Requirements
2. Non-functional Requirements

3.3.1 Functional Requirements:

The functional requirements for a system describe what the system should do. These requirements depend on the type of software being developed and the expected users of the software. They are statements of the services the system should provide, how the system should react to particular inputs and how the system should behave in particular situations.

3.3.2. Non-Functional requirements:

Hardware requirements:

The hardware requirements describe the hardware device on which the system runs.

1. Intel i3 processor or above
2. RAM 4GB or above
3. Hard disk 50GB

Software requirements:

The software requirements list the software needed to run the system.

1. Python: 3.8.5
2. NumPy: 1.19.5
3. pandas: 1.1.5
4. matplotlib: 3.2.2
5. Visual Studio Code
6. Jupyter Notebook (in Chrome)
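
To compare an installation against the versions listed above, a small check such as the following sketch can be run. The helper name installed_versions is hypothetical; importlib.metadata is part of the Python standard library from version 3.8.

```python
import sys
from importlib import metadata

def installed_versions(packages):
    """Map each package name to its installed version, or None if it is absent."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions

print("Python", sys.version.split()[0])
report = installed_versions(["numpy", "pandas", "matplotlib"])
for name, version in report.items():
    print(name, version or "not installed")
```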

4.DIAGRAMS

4.1 UML DIAGRAMS:

UML is the short form of Unified Modeling Language. UML is a standardized general-purpose modeling language in the field of object-oriented software engineering. The standard is managed, and was created by, the Object Management Group.
The important goal of UML is to create a common modeling language for object-oriented software engineering.

The primary goals in the design of the UML are as follows:

1. Provide users a ready-to-use, expressive visual modeling language so that they can develop and exchange meaningful models.
2. Provide extendibility and specialization mechanisms to extend the core concepts.
3. Be independent of particular programming languages and development processes.
4. Provide a formal basis for understanding the modeling language.
5. Encourage the growth of the OO tools market.
6. Support higher level development concepts such as collaborations, frameworks, patterns and components.
7. Integrate best practices.

CLASS DIAGRAM

Class diagrams are among the most widely used diagrams and are the backbone of object-oriented software systems. A class diagram depicts the static structure of the system. It displays the system's classes, attributes, and methods. It is helpful in recognizing the relations between different objects as well as classes.

SEQUENCE DIAGRAM

The sequence diagram represents the flow of messages in the system and is also termed an event diagram. It helps in envisioning several dynamic scenarios. It portrays the communication between any two lifelines as a time-ordered sequence of events, as these lifelines take part at run time. In UML, a lifeline is represented by a vertical dashed line that extends down the page, whereas messages are represented by horizontal arrows drawn between lifelines. The diagram can also show iterations as well as branching.

DEPLOYMENT DIAGRAM

It presents the system's software and its hardware by telling what the existing physical
components are and what software components are running on them. It produces information about
system software. It is incorporated whenever software is used, distributed, or deployed across multiple
machines with dissimilar configurations.

STATE MACHINE DIAGRAM

The state machine diagram is also called the State chart or State Transition diagram, and it shows the order of states an object goes through within the system. It captures the software system's behavior. It can model the behavior of a class, a subsystem, a package, or a complete system. It turns out to be an efficient way of modeling the interactions and collaborations between external entities and the system. It models event-based systems to handle the state of an object. It also defines several distinct states of a component within the system. Each object/component has a specific state.

USE CASE DIAGRAM

A Use Case Diagram is a visual representation in UML (Unified Modeling Language) that depicts the
interactions between actors (users or external systems) and a system to showcase the system's
functionality from a user's perspective. It provides a high-level view of the system's behavior, focusing
on what the system does rather than how it is implemented.

5.IMPLEMENTATION

5.1 Software Environment

A software development environment (SDE) is the collection of hardware and software tools a system developer uses to build software systems. When you are developing software, you probably don't want your users to see every messy part of your application creation process. The software technology used in this project is Python. Python is one of the fastest growing programming languages. It supports multiple programming paradigms, including structured, object-oriented and functional programming. It is dynamically typed and garbage collected. It consistently ranks as one of the most popular programming languages. It can also be used on a server to create web applications. It has a huge number of libraries and frameworks, which are collections of modules and packages. These frameworks automate common processes and implementations, so developers can focus on application logic rather than dealing with routine processes.
The python libraries used are:
• numpy
• pandas
• matplotlib
• sklearn
• seaborn

Numpy-
The name “NumPy” stands for “Numerical Python”. It is a commonly used library for numerical computing that supports large matrices and multi-dimensional data, and it is widely used in machine learning. It consists of built-in mathematical functions for easy computations. Even libraries like TensorFlow use NumPy internally to perform several operations on tensors. The array interface is one of the key features of this library.

Pandas-
Pandas is a software library written for the Python programming language for data manipulation and analysis. When we have to work on tabular data, we prefer the pandas module. The powerful tools of pandas are the DataFrame and Series. Pandas is reported to perform well when the number of rows is 500K or more.

Matplotlib-
Matplotlib is a library responsible for plotting numerical data, which is why it is used in data analysis. It is an open-source library and plots high-quality figures like pie charts, histograms, scatterplots, graphs, etc.

Seaborn-
Seaborn is a library that uses Matplotlib underneath to plot graphs. It will be used to visualize random
distributions. It is used for data visualization and exploratory data analysis. Seaborn works easily with
data frames and the Pandas library. The graphs created can also be customized easily.

5.2 SOFTWARE TECHNOLOGIES

Python: Python is a widely-used programming language in the field of machine learning. It offers
various libraries and frameworks, such as scikit-learn, TensorFlow, and Keras, which provide tools for
data preprocessing, model development, and evaluation.

Jupyter Notebook: Jupyter Notebook is an open-source web application that allows you to create and
share documents containing code, visualizations, and explanatory text. It's commonly used for data
exploration, model development, and collaboration in machine learning projects.

Pandas: Pandas is a powerful data manipulation library in Python. It provides data structures and
functions for efficiently handling and analyzing structured data, such as CSV files or databases, which
are commonly used in used car price prediction projects.

Scikit-learn: Scikit-learn is a popular machine learning library in Python. It provides a wide range of
algorithms and tools for regression, classification, and other machine learning tasks. You can use it to
implement regression models for predicting used car prices.

XGBoost/LightGBM: XGBoost and LightGBM are gradient boosting frameworks that are highly
effective for regression problems. They provide optimized implementations of gradient boosting
algorithms, which can be used to build accurate used car price prediction models.

5.3 DATASET:
https://www.kaggle.com/datasets/sidharth178/car-prices-dataset

5.4 SAMPLE CODE

Code in Jupyter Notebook:

Code for website:

from flask import Flask, render_template, request, redirect
from flask_cors import CORS, cross_origin
import pickle
import pandas as pd
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import OneHotEncoder
from sklearn.compose import ColumnTransformer

app = Flask(__name__)
cors = CORS(app)

# Load the trained model and the cleaned dataset once at start-up.
with open('LinearRegressionModel.pkl', 'rb') as model_file:
    model = pickle.load(model_file)
car = pd.read_csv('Cleaned_Car_data.csv')


@app.route('/', methods=['GET', 'POST'])
def index():
    # Build the option lists shown in the drop-downs of the form.
    companies = sorted(car['company'].unique())
    car_models = sorted(car['name'].unique())
    year = sorted(car['year'].unique(), reverse=True)
    fuel_type = car['fuel_type'].unique()

    companies.insert(0, 'Select Company')
    return render_template('index.html', companies=companies, car_models=car_models,
                           years=year, fuel_types=fuel_type)


@app.route('/predict', methods=['POST'])
@cross_origin()
def predict():
    # Read the fields submitted by the form.
    company = request.form.get('company')
    car_model = request.form.get('car_models')
    year = request.form.get('year')
    fuel_type = request.form.get('fuel_type')
    km_driven = request.form.get('kilo_driven')

    # Arrange the inputs as a single-row DataFrame with the expected column names.
    dat = np.array([car_model, company, year, km_driven, fuel_type])
    pred = model.predict(pd.DataFrame(columns=['name', 'company', 'year', 'kms_driven', 'fuel_type'],
                                      data=dat.reshape(1, 5)))
    return str(np.round(pred, 0)[0])


if __name__ == '__main__':
    app.run(debug=True)

<!DOCTYPE html>
<html lang="en">
<head xmlns="http://www.w3.org/1999/xhtml">
<meta charset="UTF-8">
<title>Car Price Predictor</title>
<link rel="stylesheet" href="static/css/style.css">
<link rel="stylesheet" type="text/css"
href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.11.2/css/all.css">
<script src="https://ajax.googleapis.com/ajax/libs/jquery/3.4.1/jquery.min.js"></script>
<script src="https://cdn.jsdelivr.net/npm/[email protected]/dist/umd/popper.min.js"
integrity="sha384-Q6E9RHvbIyZFJoft+2mJbHaEWldlvI9IOYy5n3zV9zzTtmI3UksdQRVvoxMfooAo"
crossorigin="anonymous"></script>

</head>
<body class="bg-dark">

<div class="container">
<div class="row">
<div class="card mt-50" style="width: 100%; height: 100%">
<div class="card-header" style="text-align: center">
<h1>Welcome to Car Price Predictor</h1>
</div>
<div class="card-body">
<div class="col-12" style="text-align: center">
<h5>This app predicts the price of a car you want to sell. Try filling the details below:
</h5>
</div>
<br>
<form method="post" accept-charset="utf-8" name="Modelform">
<div class="col-md-10 form-group" style="text-align: center">
<label><b>Select the company:</b> </label><br>
<select class="selectpicker form-control" id="company" name="company" required="1"
onchange="load_car_models(this.id,'car_models')">
{% for company in companies %}
<option value="{{ company }}">{{ company }}</option>
{% endfor %}
</select>
</div>
<div class="col-md-10 form-group" style="text-align: center">
<label><b>Select the model:</b> </label><br>
<select class="selectpicker form-control" id="car_models" name="car_models"
required="1">
</select>
</div>
<div class="col-md-10 form-group" style="text-align: center">
<label><b>Select Year of Purchase:</b> </label><br>
<select class="selectpicker form-control" id="year" name="year" required="1">
{% for year in years %}
<option value="{{ year }}">{{ year }}</option>
{% endfor %}
</select>
</div>
<div class="col-md-10 form-group" style="text-align: center">
<label><b>Select the Fuel Type:</b> </label><br>
<select class="selectpicker form-control" id="fuel_type" name="fuel_type"
required="1">
{% for fuel in fuel_types %}
<option value="{{ fuel }}">{{ fuel }}</option>
{% endfor %}
</select>
</div>
<div class="col-md-10 form-group" style="text-align: center">
<label><b>Enter the Number of Kilometres that the car has travelled:</b> </label><br>
<input type="text" class="form-control" id="kilo_driven" name="kilo_driven"
placeholder="Enter the kilometres driven ">
</div>
<div class="col-md-10 form-group" style="text-align: center">
<button class="btn btn-primary form-control" onclick="send_data()">Predict
Price</button>
</div>
</form>
<br>
<div class="row">
<div class="col-12" style="text-align: center">
<h4><span id="prediction"></span></h4>
</div>
</div>
</div>
</div>
</div>
</div>

var newOption= document.createElement("option");

newOption.value="{{ model }}";
newOption.innerHTML="{{ model }}";
car_model.options.add(newOption);
{% endif %}
{% endfor %}
}
{% endfor %}
}

function form_handler(event) {
event.preventDefault(); // Don't submit the form normally
}
function send_data()
{
document.querySelector('form').addEventListener("submit",form_handler);

var fd=new FormData(document.querySelector('form'));

var xhr= new XMLHttpRequest({mozSystem: true});

xhr.open('POST','/predict',true);
document.getElementById('prediction').innerHTML="Wait! Predicting Price.....";
xhr.onreadystatechange = function(){
if(xhr.readyState == XMLHttpRequest.DONE){
document.getElementById('prediction').innerHTML="Prediction: ₹"+xhr.responseText;
}
};

xhr.onload= function(){};

xhr.send(fd);
}
</script>

6.TEST CASES

Input Data Validation:

Test case: Provide invalid or missing input data (e.g., missing car model, negative mileage, unrealistic
year) and verify that the system handles and reports the errors appropriately.
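
One possible way to make this test case checkable is a small validation function that returns a list of error messages, sketched below. The field names match those read in the web code (company, car_models, year, fuel_type, kilo_driven); the function name validate_car_input and the lower bound of 1950 for the year are assumptions for illustration.

```python
from datetime import date

def validate_car_input(form):
    """Return a list of error messages; an empty list means the input looks usable."""
    errors = []
    for field in ("company", "car_models", "fuel_type"):
        if not (form.get(field) or "").strip():
            errors.append(f"{field} is required")
    try:
        year = int(form.get("year", ""))
        if not 1950 <= year <= date.today().year:   # assumed plausible range
            errors.append("year is out of range")
    except (TypeError, ValueError):
        errors.append("year must be a whole number")
    try:
        if int(form.get("kilo_driven", "")) < 0:
            errors.append("kilo_driven must not be negative")
    except (TypeError, ValueError):
        errors.append("kilo_driven must be a whole number")
    return errors

# Hypothetical submission with a missing model, an unrealistic year and negative mileage.
bad_form = {"company": "Maruti", "car_models": "", "year": "3020",
            "fuel_type": "Petrol", "kilo_driven": "-5"}
print(validate_car_input(bad_form))
```

Such a function could be called at the top of the predict route so that invalid requests are reported before the model is invoked.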

Training and Testing Data Split:

Test case: Split your dataset into training and testing sets, ensuring that the proportion of data in each
set is appropriate (e.g., 80% training, 20% testing).
Test case: Verify that the training and testing datasets are mutually exclusive and do not contain
overlapping records.
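
The two split test cases above can be expressed as simple assertions. The sketch below uses a plain-Python splitter (the name split_rows and the fixed seed are illustrative assumptions); scikit-learn provides train_test_split for the same purpose.

```python
import random

def split_rows(rows, test_fraction=0.2, seed=42):
    """Shuffle a copy of the rows and cut it into disjoint train and test lists."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

records = list(range(100))            # stand-ins for 100 dataset rows
train, test = split_rows(records)

# Proportion check (80% / 20%) and mutual exclusivity check.
assert len(train) == 80 and len(test) == 20
assert set(train).isdisjoint(test)
```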

Model Training and Evaluation:

Test case: Train your machine learning model using the training dataset and verify that it converges
without any errors.
Test case: Evaluate the trained model on the testing dataset and measure relevant metrics such as mean
squared error (MSE), root mean squared error (RMSE), or R-squared value.
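
For reference, the metrics named in this test case can be computed as in the following sketch. The numbers are made-up values chosen so that the arithmetic is easy to follow; they are not results from the project.

```python
from math import sqrt

def regression_metrics(actual, predicted):
    """Compute MSE, RMSE and R-squared for paired actual and predicted values."""
    n = len(actual)
    sse = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    mean_actual = sum(actual) / n
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    mse = sse / n
    return {"mse": mse, "rmse": sqrt(mse), "r2": 1 - sse / ss_tot}

# Hypothetical actual and predicted prices (arbitrary units).
actual = [3.0, 5.0, 7.0, 9.0]
predicted = [2.0, 5.0, 8.0, 9.0]
metrics = regression_metrics(actual, predicted)
print(metrics)
```

Here the squared errors sum to 2, so the MSE is 0.5, and the R-squared value is 1 - 2/20 = 0.9.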

Feature Selection and Importance:

Test case: Perform feature selection techniques (e.g., correlation analysis, feature importance) and
validate that the selected features are meaningful and contribute to the predictive performance of the
model.

Model Performance:
Test case: Provide a set of test samples with known prices and compare the predicted prices from your
model against the actual prices. Calculate and report the accuracy of the model predictions.

Handling Outliers and Anomalies:

Test case: Introduce outliers or anomalies in the input data and verify that the model can handle and
produce reasonable predictions in such cases.
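
One common way to deal with outliers before training is the interquartile range (IQR) rule, sketched below. This is an illustrative technique, not necessarily the one used in the project; the mileage values and the helper name iqr_bounds are assumptions.

```python
from statistics import quantiles

def iqr_bounds(values, k=1.5):
    """Return (low, high) limits; values outside them are treated as outliers."""
    q1, _, q3 = quantiles(values, n=4)
    spread = q3 - q1
    return q1 - k * spread, q3 + k * spread

# Hypothetical kilometres-driven values with one injected anomaly.
kms = [20000, 25000, 30000, 35000, 40000, 45000, 50000, 900000]
low, high = iqr_bounds(kms)
filtered = [v for v in kms if low <= v <= high]
print(filtered)
```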

Model Persistence and Loading:
Test case: Save the trained model to disk and verify that it can be successfully loaded and used for
making predictions.
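
A round-trip check for this test case might look like the sketch below. To stay self-contained it pickles a dictionary of coefficients as a stand-in for the fitted model (an assumption for illustration); the web code loads a pickled model from LinearRegressionModel.pkl in the same way.

```python
import os
import pickle
import tempfile

model = {"slope": -1.0, "intercept": 10.0}   # stand-in for a fitted model

def predict(params, x):
    """Evaluate the stored line at x."""
    return params["intercept"] + params["slope"] * x

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "model.pkl")
    with open(path, "wb") as fh:
        pickle.dump(model, fh)               # save to disk
    with open(path, "rb") as fh:
        restored = pickle.load(fh)           # load it back

# The restored model should give the same prediction as the original.
same = predict(restored, 6) == predict(model, 6)
print(same)
```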

Real-Time Predictions:
Test case: Implement a mechanism to accept real-time input data (e.g., car features) and ensure that the model can make accurate predictions in real-time scenarios.

7.SCREENSHOTS

8.CONCLUSION

In conclusion, the used car price prediction project involves building a machine learning model to
estimate the prices of used cars. By leveraging various software technologies and techniques, such as
Python, Jupyter Notebook, Pandas, scikit-learn, XGBoost/LightGBM, and Flask/Django, you can
develop a robust and accurate solution.

Throughout the project, it is essential to validate and preprocess the input data, split it into training and
testing sets, and train the machine learning model. Evaluating the model's performance using
appropriate metrics and test cases will help assess its accuracy and effectiveness.

Additionally, feature selection and handling outliers/anomalies are crucial steps to improve the model's
predictive power. It's important to consider the practicality and usefulness of the selected features and
ensure the model can handle unexpected data points.

Once the model is trained and evaluated, it can be persisted and loaded for real-time predictions.
Implementing a user-friendly interface, such as a web application using Flask or Django, allows users
to interact with the model and obtain price estimates based on input features.

Overall, the project aims to provide a valuable tool for estimating used car prices, facilitating decision-
making for buyers, sellers, and car enthusiasts. Continuous improvement and refinement of the model,
along with user feedback, can enhance its performance and make it more reliable over time.

9.FUTURE SCOPE

The future scope of a used car price prediction project can involve several potential areas of
improvement and expansion. Here are some possibilities to consider:

Advanced Machine Learning Techniques: Experiment with advanced machine learning algorithms and
techniques like ensemble methods, deep learning, or time-series analysis to potentially improve the
model's performance and ability to capture complex patterns in the data.

Expand to Other Vehicle Types: Consider extending the project to include price prediction for other
types of vehicles, such as motorcycles, trucks, or recreational vehicles. This expansion can broaden the
scope of the application and cater to a wider range of users.

10. BIBLIOGRAPHY

PAPERS REFERRED:
"Predicting Used Car Prices with Machine Learning Techniques" by A. Sharma
"Car Price Prediction Based on Machine Learning Techniques" by R. Wang et al.

WEBSITES:
1. https://www.ijcaonline.org/archives/volume167/number9/noor-2017-ijca-914373.pdf
2. http://cs229.stanford.edu/proj2019aut/data/assignment_308832_raw/26612934.pdf
3. https://en.wikipedia.org/wiki/Linear_regression
4. https://www.ibm.com/in-en/topics/linear-regression#:~:text=Resources-,What%20is%20linear%20regression%3F,is%20called%20the%20independent%20variable.
