Machine Learning Based Car Price Prediction System
Machine Learning Based Car Price Prediction System
Machine Learning Based Car Price Prediction System
Submitted to
Abhishek Raj,BK3571
Adhikarla Shravani,Bk3506
Ananya Mishra,BK3513
ii
iii
DECLARATION
I the undersigned solemnly declare that the report of the Project work entitled “Machine
learning based car price prediction system”, is based on my own work carried out during my study
under the supervision of Prof. Dolly Gautam, Department of Electronics and Telecommunication
Engineering, Bhilai Institute of Technology, Durg, Chhattisgarh.
I assert that the statements made, and conclusions drawn are an outcome of the project work. I
further declare that to the best of my knowledge and belief that the report does not contain any part of any
work which has been submitted for the award of any other degree/diploma/certificate in this University/
deemed University of India or any other country.All help received and citations used for the preparation of
the Report have been duly acknowledged.
iv
CERTIFICATE BY THE SUPERVISOR
This is to certify that the report of the Project submitted is an outcome of the project work
entitled “Machine learning based car price prediction system”, carried out by
Abhishek Raj bearing Univ. Roll No.: 300102820070 & Enrollment No.: BK3571
Adhikarla Shravani bearing Univ. Roll No.: 300102820090 & Enrollment No.: BK3506
Aman Kumar Sahu bearing Univ.Roll No.: 300102820009&Enrollment No.: BK3510
Ananya Mishra bearing Univ. Roll No.: 300102820012 & Enrollment No.: BK3513
Carried out under my guidance and supervision for completion of the Minor Project-II at
Department of Electronics and Telecommunication Engineering of Bhilai Institute of Technology,
Durg.
To the best of my knowledge and the Report
i. Embodies the work of the candidate him/herself,
ii. Has duly been completed and
iii. Is up to the desired standard for the purpose of which is submitted.
The project work as mentioned above is hereby being recommended and forwarded for
examination and evaluation.
v
CERTIFICATE BY THE EXAMINERS
has been examined by the undersigned as a part of the Minor-Project-II examination, B.Tech
6rd semester, Department of Electronics & Telecommunication Engineering, Bhilai Institute of
Technology, Durg (C.G.).
vi
ACKNOWLEDGEMENT
With deep regards and profound respect, I avail this opportunity to express my deep
sense of gratitude and indebtedness to Prof. Dolly Gautam, Department of Electronics and
Telecommunication Engineering, BIT Durg for his valuable guidance and support. I amdeeply
indebted to the valuable discussions at each phase of the project. I consider it my good fortune
to have got an opportunity to work with such a wonderful person.
I express my sincere gratitude to Dr. Arun Arora, Director, Bhilai Institute of
Technology, Durg, for providing adequate infrastructure to carry out present investigations and
also motivating for research work, which has been a constant source of inspiration in
completing this work.
I take immense pleasure in thanking Dr. Mohan Kumar Gupta, Principal, Bhilai
Institute of Technology, Durg, for providing adequate academic facilities to work in my
research area.
I take immense pleasure to thank Dr. Manisha Sharma, Vice Principal, Bhilai Institute
of Technology, Durg, for motivating us to work in research direction and providing
opportunities to connect with global research.
I take immense pleasure in thanking Dr. Arun Kumar, HOD (ETC), Bhilai Institute of
Technology, Durg, for constant feedback and encouragements and endless support and help
throughout this project work.
Lastly, I feel immensely moved in expressing my indebtedness to my revered parents
whose sacrifice, guidance and blessings helped me to complete my work.
Abhishek Raj
Adhikarla Shravani
Ananya Mishra
vii
Table of Contents
I Introduction 01
II Methodology 04
III Result 08
3.1 Jupyter Notebook Code
3.2 Pycharm Code
3.3 Output
IV Conclusion and Future Scope 23
4.1 Conclusion
4.2 Future Scope
V References 25
viii
Chapter 01: Introduction
1.1 What is Machine Learning ?
Figure-1.1 Machine
Learning
1
1.2 Types Of Machine Learning
A machine is said to be learning from past experiences (data feed-in) with
respect to some class of tasks if its Performance in a given Task improves with
the Experience. For example, assume that a machine has to predict whether a
customer will buy a specific product let’s say ―Antivirus‖ this year or not. The
machine will do it by looking at the previous knowledge/past experiences i.e
the data of products that the customer had bought every year and if he buys
Antivirus every year, then there is a high probability that the customer is going
to buy an antivirus this year as well. This is how machine learning works at the
basic conceptual level.
2
Problem Statement - It is easy for any company to price their new cars based on
the manufacturing and marketing cost it involves. But when it comes to a used car
it is quite difficult to define a price because it involves it is influenced by various
parameters like car brand, manufactured year and etc. The goal of our project is to
predict the best price for a pre-owned car in the Indian market based on the
previous data related to sold cars using Linear Regression.
3
Chapter 02: Methodology
Data Gathering: The source of the data is the web portal of quikr.com where
vehicle dataset is provided for selling and buying of cars. The dataset gave the
following set of features:
Car Name, Year, Price, Kilometers driven, Fuel Type: Petrol, Diesel or LPG
(Liquid Petroleum Gas) etc.
Data Reading: The csv file is imported and read for the study which is the
primary step. The dataset is thoroughly read on various aspects like null values,
shape, columns, numerical and categorical features, dataset columns, unique
values of each feature, data info etc.
Data Pre-processing: Some of the features in the data were renamed for better
4
understanding and some other features that were not useful for analysis were also
dropped. Exploratory Data Analysis of data is done in which we use statistical
graphics and other visualization methods to summarize the main characteristics
of data. After completing EDA, One Hot Encoding technique is employed for
dealing with the categorical features of the dataset. Thereafter, the correlation
features of the dataset are produced and analyzed thoroughly by visualizing some
plots. Then the features allocation of data is done where the dependent feature
and independent features are allocated for further procedure.
Model Building: After the Train-Test split, modelling of data is done where the
process of building the model begins. The model along with a few parameters is
defined for further implementation. After the model is ready, various algorithms
are then applied to obtain the final results generated by them. The following
algorithms are employed for the predictive analysis after model building.
6
individual models (base learners) to generate a single prediction.
Gradient Boosting Regression: It is a technique in machine learning for regression
and classification problems to generate a prediction model. The prediction model
produce is an ensemble of weak prediction models which typically are the decision
trees. This technique generally outperforms the random forest method.
7
Chapter 03: Result
8
9
10
11
12
13
14
15
16
17
3.2 Pycharm Code
18
3.3 Output
19
20
21
22
Chapter 04: Conclusion and Future Scope
4.1 Conclusion
Since India’s used-car market is booming as buyers have a wide range of options,
easy financing, convenient digital sales channels, and a growing preference for
personal mobility in the COVID-19 era, car prediction can be a challenging task
due to the high number of attributes that should be considered for accurate
prediction. The main weakness of Gradient boosting is that it sacrifices
intelligibility and interpretability. The main limitation of this study
is the low number of records that have been used. In future work, we intend to
collect more data related to electric vehicles and combustion vehicles and to use
more advanced techniques.
24
References
25