Supervised Vs Unsupervised Learning
Supervised Vs Unsupervised Learning
Supervised Vs Unsupervised Learning
A supervised learning algorithm learns from labeled training data, helps you to predict outcomes for unforeseen data.
Successfully building, scaling, and deploying accurate supervised machine learning Data science model takes time and
technical expertise from a team of highly skilled data scientists. Moreover, Data scientist must rebuild models to make
sure the insights given remains true until its data changes.
Unsupervised learning algorithms allow you to perform more complex processing tasks compared to supervised
learning. Although, unsupervised learning can be more unpredictable compared with other natural learning deep
learning and reinforcement learning methods.
Why Supervised Learning?
Supervised learning allows you to collect data or produce a data output from the previous experience.
Helps you to optimize performance criteria using experience
Supervised machine learning helps you to solve various types of real-world computation problems.
Weather conditions
Time of the day
Holidays
All these details are your inputs. The output is the amount of time it took to drive back home on that specific day.
You instinctively know that if it's raining outside, then it will take you longer to drive home. But the machine needs data
and statistics.
Let's see now how you can develop a supervised learning model of this example which help the user to determine the
commute time. The first thing you requires to create is a training data set. This training set will contain the total commute
time and corresponding factors like weather, time, etc. Based on this training set, your machine might see there's a
direct relationship between the amount of rain and time you will take to get home.
So, it ascertains that the more it rains, the longer you will be driving to get back to your home. It might also see the
connection between the time you leave work and the time you'll be on the road.
The closer you're to 6 p.m. the longer time it takes for you to get home. Your machine may find some of the relationships
with your labeled data.
This is the start of your Data Model. It begins to impact how rain impacts the way people drive. It also starts to see that
more people travel during a particular time of day.
She knows and identifies this dog. A few weeks later a family friend brings along a dog and tries to play with the baby.
Baby has not seen this dog earlier. But it recognizes many features (2 ears, eyes, walking on 4 legs) are like her pet
dog. She identifies a new animal like a dog. This is unsupervised learning, where you are not taught but you learn from
the data (in this case data about a dog.) Had this been supervised learning, the family friend would have told the baby
that it's a dog.
Example: You can use regression to predict the house price from training data. The input variables will be locality, size
of a house, etc.
Classification:
Classification means to group the output inside a class. If the algorithm tries to label input into two distinct classes, it is
called binary classification. Selecting between more than two classes is referred to as multiclass classification.
Strengths: Outputs always have a probabilistic interpretation, and the algorithm can be regularized to avoid overfitting.
Weaknesses: Logistic regression may underperform when there are multiple or non-linear decision boundaries. This
method is not flexible, so it does not capture more complex relationships.
Types of Unsupervised Machine Learning Techniques
Unsupervised learning problems further grouped into clustering and association problems.
Clustering
Clustering is an important concept when it comes to unsupervised learning. It mainly deals with finding a structure or
pattern in a collection of uncategorized data. Clustering algorithms will process your data and find natural
clusters(groups) if they exist in the data. You can also modify how many clusters your algorithms should identify. It
allows you to adjust the granularity of these groups.
Association
Association rules allow you to establish associations amongst data objects inside large databases. This unsupervised
technique is about discovering exciting relationships between variables in large databases. For example, people that buy
a new home most likely to buy new furniture.
Other Examples:
Process In a supervised learning model, input and In unsupervised learning model, only input data will
output variables will be given. be given
Input Data Algorithms are trained using labeled data. Algorithms are used against data which is not labeled
Algorithms Used Support vector machine, Neural network, Unsupervised algorithms can be divided into different
Linear and logistics regression, random categories: like Cluster algorithms, K-means,
forest, and Classification trees. Hierarchical clustering, etc.
Use of Data Supervised learning model uses training Unsupervised learning does not use output data.
data to learn a link between the input and
the outputs.
Accuracy of Highly accurate and trustworthy method. Less accurate and trustworthy method.
Results
Real Time Learning method takes place offline. Learning method takes place in real time.
Learning
Main Drawback Classifying big data can be a real challenge You cannot get precise information regarding data
in Supervised Learning. sorting, and the output as data used in unsupervised
learning is labeled and not known.
Summary
In Supervised learning, you train the machine using data which is well "labeled."
Unsupervised learning is a machine learning technique, where you do not need to supervise the model.
Supervised learning allows you to collect data or produce a data output from the previous experience.
Unsupervised machine learning helps you to finds all kind of unknown patterns in data.
For example, you will able to determine the time taken to reach back come base on weather condition, Times of
the day and holiday.
For example, Baby can identify other dogs based on past supervised learning.
Regression and Classification are two types of supervised machine learning techniques.
Clustering and Association are two types of Unsupervised learning.
In a supervised learning model, input and output variables will be given while with unsupervised learning model,
only input data will be given