Data Analytics

Uploaded by

Rohan s

Data set on Bike Sales

Prepared by-
ABSTRACT
• Bike sharing is an emerging mode of transportation around the world.
• Businesses are attempting to design bike sharing programs, but they have
previously failed to employ data analytics effectively to predict usage
in a way that helps clients organize their travel.
• When a traveler arrives at the store, it may be either full or empty of
bikes. Because the dependent variable can be divided into levels, this
is a classification problem in machine learning. Several classification
algorithms are used to predict the dependent variable: decision tree,
random forest, linear SVM, Gaussian SVM, and KNN (k-Nearest Neighbors).
The algorithm with the best accuracy is identified by comparing the
outcomes of the different algorithms.
Introduction
• Getting from one place to another requires transportation, and people
often choose bikes. Demand for bikes grows with the need for mobility.
Most manufacturing processes in the automobile sector still rely
heavily on human judgement. Every manufacturing company faces the
challenge of deciding how much of a product to produce. If a company
does not understand the demand for a product before manufacturing it,
it may suffer a loss. This project focuses solely on bikes. Based on
customer behavior in prior transactions, it helps the industry predict
the demand for the product. Several classification algorithms are
applied to the bike buyer dataset, and the best one is selected.
Methodology
• The machine learning approaches used in this study estimate whether or not a customer
will actually purchase a bike. The approach with the highest accuracy gives the most
reliable prediction of which customers will purchase bikes.
• The dataset is labeled, so the task falls under classification.
• Applying all classification techniques to the same dataset and comparing their accuracy
values identifies the most effective algorithm for predicting bike purchasers.
• These are a few of the classification algorithms in use:
• Logistic regression
• Support Vector Machine (SVM)
• KNN (k-Nearest Neighbors)
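The comparison step described above can be sketched in a few lines of plain Python. This is only an illustration, not the study's code: the helper names, the model names `model_a` and `model_b`, and the labels are all made up for the example.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def best_by_accuracy(y_true, predictions):
    """Score every model on the same labels and return the best one."""
    scores = {name: accuracy(y_true, y_pred) for name, y_pred in predictions.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical labels (1 = buys a bike, 0 = does not). NOT results from the study.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
predictions = {
    "model_a": [1, 0, 1, 0, 0, 0, 1, 0],  # 7 of 8 correct
    "model_b": [1, 1, 1, 0, 0, 1, 1, 0],  # 5 of 8 correct
}

best, scores = best_by_accuracy(y_true, predictions)
print(best, scores)
```

Every model is scored against the same held-out labels, so the accuracy values are directly comparable.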
Logistic regression
• This kind of statistical model, also referred to as a logit model, is
frequently used in classification and predictive analytics. Based on a
given dataset of independent variables, logistic regression estimates
the probability that an event will occur, such as voting or not voting.
Because the result is a probability, the dependent variable ranges from
0 to 1. In logistic regression, the logit transformation is applied to
the odds, that is, the probability of success divided by the
probability of failure. The logit is the natural logarithm of the odds.
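The relationship between probability, odds, and the logit can be made concrete with a small sketch. The function names, weights, and inputs below are illustrative assumptions, not values from the bike buyer dataset.

```python
import math


def odds(p):
    """Odds = probability of success / probability of failure."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return p / (1.0 - p)


def logit(p):
    """Logit = natural logarithm of the odds."""
    return math.log(odds(p))


def sigmoid(z):
    """Inverse of the logit: maps any real number back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(x, weights, bias):
    """Logistic regression models the logit as a linear function of x."""
    z = bias + sum(w * xi for w, xi in zip(weights, x))
    return sigmoid(z)


# Hypothetical weights and features, chosen only for illustration.
print(odds(0.8))            # roughly 4: success is four times as likely as failure
print(sigmoid(logit(0.8)))  # recovers roughly 0.8
print(predict_proba([0.0, 0.0], [1.0, 2.0], 0.0))
```

A predicted probability is usually turned into a class label by thresholding it, commonly at 0.5.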
Support Vector Machine (SVM)
• Support Vector Machine, or SVM, is one of the most popular
supervised learning algorithms. It can be used for both
classification and regression problems, but it is primarily used
for classification. The goal of the SVM algorithm is to find the
best decision boundary that separates n-dimensional space into
classes, so that a new data point can easily be placed in the
correct category. This best decision boundary is called a
hyperplane. SVM chooses the extreme points/vectors that help in
creating the hyperplane. These extreme cases are called support
vectors, and hence the algorithm is termed Support Vector
Machine. Consider the diagram below, in which two different
categories are separated by a decision boundary, or hyperplane.
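To illustrate how a hyperplane separates two classes, here is a minimal sketch of a linear decision rule. It does not train an SVM: the weights `w`, the offset `b`, and the four points are hand-picked assumptions, with `w` and `b` scaled so that the closest points of each class have a decision value of exactly +1 or -1.

```python
import math


def decision_value(x, w, b):
    """Signed score w . x + b; its sign says which side of the hyperplane x is on."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def classify(x, w, b):
    """Assign class +1 or -1 depending on the side of the hyperplane."""
    return 1 if decision_value(x, w, b) >= 0 else -1


def margin_width(w):
    """Distance between the two margin boundaries: 2 / ||w||."""
    return 2.0 / math.sqrt(sum(wi * wi for wi in w))


def support_vectors(points, w, b, tol=1e-9):
    """Points lying exactly on a margin boundary (|w . x + b| == 1)."""
    return [x for x in points if abs(abs(decision_value(x, w, b)) - 1.0) <= tol]


# Hypothetical 2-D data: two points per class, separated by the line x1 + x2 = 3.
points = [(0, 0), (1, 1), (2, 2), (3, 3)]
w, b = (1.0, 1.0), -3.0

print([classify(x, w, b) for x in points])
print(margin_width(w))
print(support_vectors(points, w, b))
```

Only the points on the margin boundaries determine the hyperplane here; moving the other two points farther away would not change it.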
SVM applied to Data set
KNN
• The k-nearest neighbors algorithm, also known as KNN or k-NN, is a
non-parametric supervised learning algorithm that uses proximity to
classify or predict the grouping of an individual data point.
Although it can be used for regression or classification problems,
it is most often used as a classification algorithm, on the
assumption that similar points lie close to one another.
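The idea can be sketched as a majority vote among the k closest training points. This is a minimal illustration using Euclidean distance; the function name, the choice of k = 3, and the toy points are assumptions made for the example.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair every training point's distance to the query with its label.
    by_distance = sorted(
        zip(train_points, train_labels),
        key=lambda pair: math.dist(pair[0], query),
    )
    nearest_labels = [label for _, label in by_distance[:k]]
    # The most common label among the k neighbors wins the vote.
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical 2-D training data with two well-separated groups.
train_points = [(1, 1), (1, 2), (2, 1), (6, 6), (7, 6), (6, 7)]
train_labels = [0, 0, 0, 1, 1, 1]

print(knn_predict(train_points, train_labels, (2, 2)))  # neighbors are all class 0
print(knn_predict(train_points, train_labels, (6, 5)))  # neighbors are all class 1
```

Because the prediction depends on raw distances, features on very different scales are usually normalized before KNN is applied.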
KNN Applied To Data set
Results Tabulated
Algorithms | Confusion Matrix | Accuracy | Precision | Recall | R2 Score
SVM | | | | |
Logistic Regression | | | | |
KNN | | | | |
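For reference, accuracy, precision, and recall can all be derived from the four counts of a binary confusion matrix. The sketch below shows one way to compute them; the function names are assumptions, and the labels are made up for illustration and are not the results of this study.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary classification problem."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1  # predicted positive, actually positive
        elif p == positive:
            fp += 1  # predicted positive, actually negative
        elif t == positive:
            fn += 1  # predicted negative, actually positive
        else:
            tn += 1  # predicted negative, actually negative
    return tp, fp, fn, tn


def metrics(tp, fp, fn, tn):
    """Accuracy, precision, and recall from the confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical labels (1 = buys a bike, 0 = does not). NOT results from the study.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

counts = confusion_counts(y_true, y_pred)
print(counts)
print(metrics(*counts))
```

Precision answers "of the customers predicted to buy, how many did?", while recall answers "of the customers who bought, how many were predicted?".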

16 Aug 2021