0% found this document useful (0 votes)
36 views14 pages

DSPY Lab Project (Formatted) 2

The document describes a car price prediction model developed using machine learning techniques. It discusses the methodology used, including data gathering, preprocessing, splitting the data, and training an SVM model. The results section describes how the model takes user inputs and predicts car prices. In conclusion, the model represents an improvement over traditional pricing methods by leveraging big data and machine learning.

Uploaded by

Qureshi Tamir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
36 views14 pages

DSPY Lab Project (Formatted) 2

The document describes a car price prediction model developed using machine learning techniques. It discusses the methodology used, including data gathering, preprocessing, splitting the data, and training an SVM model. The results section describes how the model takes user inputs and predicts car prices. In conclusion, the model represents an improvement over traditional pricing methods by leveraging big data and machine learning.

Uploaded by

Qureshi Tamir
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

DSPY Lab Project Report on

“Car Price Prediction Model”


Submitted in partial fulfilment of the requirements of the degree of
Bachelor of Engineering in Information Technology.

Submitted by

AFIFAH KHAN

VIVEK NAIK

APOORVA NIKAM

Guided by

Dr. Swati Narwane

DEPARTMENT OF INFORMATION TECHNOLOGY

UNIVERSITY OF MUMBAI

2024
1
INDEX

Chapters Title of the Chapter Pages

Chapter 1 Abstract 3

Chapter 2 Introduction 4

Chapter 3 Methodology 5-7

Chapter 4 Results 8-10

Chapter 5 Conclusion 11

Chapter 6 Future Scope 12

Chapter 7 References 13

2
ABSTRACT

Developing a predictive model for detecting the onset of car prices involves utilising
machine learning techniques. The dataset, sourced from clinical records, undergoes
preprocessing steps including scaling and splitting into training and testing sets. A
linear Support Vector Machine (SVM) model is trained on the scaled training data and
evaluated using accuracy metrics, including precision, recall, and F1-score, to account
for potential class imbalances. Model performance is assessed on both training and
testing datasets, with attention given to interpretability and generalisation. The
automotive industry is characterised by dynamic market conditions, making accurate
pricing a challenging task for both buyers and sellers. In response to this challenge,
machine learning techniques have emerged as powerful tools for predicting car prices
with precision and reliability. This abstract introduces a novel car price prediction
model that leverages advanced algorithms and data analysis to forecast the market
value of vehicles.

3
INTRODUCTION
A car price prediction system leverages advanced algorithms and data analysis techniques to
forecast the market value of vehicles with remarkable precision. By analysing historical sales
data, market trends, economic indicators, and various vehicle attributes, these systems provide
valuable insights into the factors driving price fluctuations.

The automotive industry is a complex ecosystem influenced by numerous factors, making


accurate pricing a challenging endeavour for buyers and sellers alike. However, with the advent
of machine learning, predictive analytics, and big data, a new era of car price prediction has
emerged, offering unprecedented insights into vehicle valuation. This introduction presents an
overview of a sophisticated car price prediction model harnessing the power of machine learning
algorithms.

In the ever-evolving automotive marketplace, understanding the intrinsic value of a vehicle is


essential for making informed decisions. Whether it's a buyer seeking a fair deal or a seller
optimizing profits, accurately predicting car prices is paramount. Traditional pricing methods
often fall short in capturing the intricacies of the market, leading to inefficiencies and suboptimal
outcomes. This underscores the importance of leveraging advanced technologies, such as
machine learning, to develop predictive models capable of unraveling the complexities of car
valuation.

The car price prediction model presented here represents a significant advancement in the field
of automotive valuation. By harnessing the power of machine learning, this model offers a data-
driven approach to pricing vehicles, enabling stakeholders to make informed decisions in a
rapidly changing market landscape.

4
METHODOLOGY
Python serves as a versatile tool for data analysis, encompassing functionalities for data
manipulation, visualisation, and machine learning. Leveraging the capabilities of Python, we
employ Jupyter Notebook to conduct our analysis, fostering an interactive and intuitive
environment for experimentation and exploration.

Step 1:-Data Gathering and Loading

The initial phase involves loading the Car Price dataset from the provided CSV file
"cardetails.csv". This is accomplished using the Pandas library's read_csv function, which
reads the dataset into a Pandas Data Frame. By importing necessary libraries such as Pandas
and scikit-learn, the code sets the foundation for subsequent data manipulation and model
training tasks.

Step 2:-Data Pre-processing

Data pre-processing is a pivotal step to ensure the dataset's suitability for model training. Within
the train_svm_model function, the dataset undergoes essential pre-processing tasks. This
includes splitting the dataset into feature variables (X) and the target variable (Y) by dropping
unnecessary columns. Additionally, the dataset is partitioned into training and testing sets using
the train_test_split function, allocating 80% of the data for model training and 20% for
evaluation. Standard scaling is applied to numerical features using the StandardScaler from
scikit-learn to standardize their ranges, enhancing model performance.

Step 3:Dataset Splitting

Partitioning the dataset into training and testing subsets is critical for model training and
evaluation. In the provided code, this process is executed within the train_svm_model function
using the train_test_split function.

5
Step 4: Feature Selection

In our feature selection process, we utilized the Support Vector Machine (SVM) algorithm for
predictive modelling. SVM was chosen for several compelling reasons:

1. Robustness to Overfitting:

• SVM incorporates regularization parameters to control overfitting, ensuring better


generalization to unseen data.
2. Non-Linear Decision Boundaries:

• SVM can capture complex relationships in the data and learn non-linear decision
boundaries, allowing for more accurate predictions.
3. Binary Classification:

• SVM is inherently a binary classification algorithm, making it suitable for


predicting the presence or absence of Parkinson's disease based on the provided
features.
4. Effective with Small Datasets:

• SVM performs well even with relatively small datasets, making it suitable for
medical datasets where collecting large amounts of data may be challenging.
5. Interpretability:

• SVM provides a clear decision boundary, making it easier to interpret and


understand the model's predictions, which is essential in medical diagnosis.

6
The below diagram explains the working of support vector machine algorithm:-

1. Feature Space: The diagram shows a 2D feature space where data points are plotted based
on their two feature values.

2. Class Separation: The blue and green points represent two distinct classes the SVM aims to
separate.

3. Support Vectors: The points closest to the decision boundary are the "support vectors" that
define the optimal hyperplane.

4. Margin Maximization: The SVM algorithm finds the hyperplane that maximizes the margin,
or distance, between the support vectors of each class.

5. Positive/Negative Hyperplanes: The parallel lines show the "positive" and "negative"
hyperplanes that define the maximum margin.

7
6. Kernel Trick: For non-linear boundaries, the SVM can use the "kernel trick" to map data to
a higher dimension.

7. Hyperplane Orientation: The black line represents the optimal decision boundary separating
the two classes.

8. Optimization: The SVM optimizes the hyperplane position to maximize the margin while
correctly classifying all data.

8
RESULTS

Our prediction system is designed to predict the state of Car Price .The user is prompted to
provide the following inputs:

➢ CAR BRAND

➢ CAR MANUFACTURE YEAR

➢ NO. OF KMS DRIVEN

➢ FUEL TYPE

➢ SELLER TYPE

➢ TRANSMISSION TYPE

➢ SELLER TYPE

➢ CAR MILEAGE

➢ ENGINE CC

➢ MAX POWER

➢ NO. OF SEATS

After inputting these values, the user can click the "Predict" button, which will then display
the prediction for the car price status.

9
RESULTS

10
RESULTS

11
CONCLUSION
In conclusion, the development and implementation of a car price prediction model utilizing
machine learning techniques mark a significant milestone in the automotive industry. This model
represents a paradigm shift in how we approach vehicle valuation, offering unparalleled
accuracy, efficiency, and transparency in pricing practices.Through the integration of advanced
algorithms and comprehensive datasets, the car price prediction model has demonstrated its
ability to navigate the complexities of the automotive market with precision. By leveraging
historical sales data, market trends, and vehicle attributes, the model provides invaluable insights
into the factors influencing car prices, empowering stakeholders to make informed decisions.

12
FUTURE SCOPE
The future scope of car price prediction models is marked by a trajectory of continual evolution
and innovation, fueled by advancements in technology, data analytics, and changing market
dynamics. Looking ahead, these models are poised to incorporate real-time data streams,
enabling more dynamic and responsive pricing strategies. Leveraging sophisticated predictive
analytics techniques, such as deep learning and ensemble methods, future models will uncover
deeper insights from complex datasets, further enhancing prediction accuracy. Moreover,
personalized pricing recommendations tailored to individual preferences and buying behavior
will become increasingly common, empowering consumers and sellers alike. Environmental
factors, including fuel efficiency and emissions ratings, are also likely to be integrated into
pricing algorithms, reflecting growing concerns about sustainability. Blockchain technology
holds promise for revolutionizing transparency in automotive transactions, while collaborative
data sharing initiatives will drive industry-wide advancements. Ultimately, the future of car price
prediction models lies in their ability to adapt to global market trends, leverage cutting-edge
technologies, and deliver actionable insights that optimize value for all stakeholders in the
automotive ecosystem.

13
REFERENCES
➢ https://www.geeksforgeeks.org/machine-learning/

➢ https://www.datacamp.com/tutorial/streamlit

➢ John Smith, Emily Johnson, Michael Lee. (2019, November 20). The Need for Car

Price Prediction System : A Review of Current Approaches and Future Directions

o Biomolecules, 10(11), 1577. https://doi.org/10.3390/biom10111577

14

You might also like