0% found this document useful (0 votes)
4 views3 pages

2GP ML Unsupervised Learning

Unsupervised learning is a machine learning technique that analyzes unlabelled data to discover patterns without supervision. It is useful for exploratory data analysis, customer segmentation, and anomaly detection, but can be complex and unpredictable. Key applications include clustering, association, and dimensionality reduction, which help in various fields such as recommendation engines and medical imaging.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views3 pages

2GP ML Unsupervised Learning

Unsupervised learning is a machine learning technique that analyzes unlabelled data to discover patterns without supervision. It is useful for exploratory data analysis, customer segmentation, and anomaly detection, but can be complex and unpredictable. Key applications include clustering, association, and dimensionality reduction, which help in various fields such as recommendation engines and medical imaging.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Unsupervised Learning

Definition:
As the name suggests, unsupervised learning is a machine learning technique in
which models are not supervised using a training dataset. Instead, it allows the
model to work on its own to discover patterns and information that was
previously undetected. It mainly deals with the unlabelled data.

Unsupervised Learning Algorithms allow users to perform more complex


processing tasks than supervised learning. Although, unsupervised learning can
be more unpredictable than other natural learning methods.

Its ability to discover similarities and differences in information makes it the


ideal solution for exploratory data analysis, cross-selling strategies, customer
segmentation, and image recognition.

Why Unsupervised Learning?


Reasons for using Unsupervised Learning in Machine Learning:

1
● Unsupervised machine learning finds all kinds of unknown patterns in
data.
● Unsupervised methods help you to find features that can be useful for
categorization.
● It is taken place in real-time, so all the input data is to be analyzed and
labeled in the presence of learners.
● It is easier to get unlabeled data from a computer than labeled data,
which requires manual intervention.

Problems with Unsupervised Learning:


● Unsupervised Learning is more complex as compared to Supervised
Learning tasks.
● It is difficult to guess if the results are correct, since there is no labeled
output for testing.
● External evaluation is required.

Applications:
The following are some of the most popular real-world uses of unsupervised
learning:
● Anomaly detection
Unsupervised learning methods can sift through enormous volumes of
data to find anomalous data points. These abnormalities might raise
awareness of malfunctioning equipment, human mistakes, or security
breaches.
● Recommendation engines
Unsupervised learning can aid in the discovery of data trends that can be
utilized to generate more successful cross-selling tactics by using
historical purchase behavior data.
● Medical Imaging
Unsupervised machine learning gives critical aspects to medical imaging
technologies, such as image identification, classification, and
segmentation, which are utilized in radiology and pathology to swiftly and
effectively diagnose patients.

2
Categories:
Unsupervised learning models are used primarily for three kinds of problems:
clustering, association, and dimensionality reduction.
● Clustering
Clustering can be considered the most important unsupervised learning
problem. Clustering is a technique that organizes unlabeled data into
groups based on similarities and differences. Clustering techniques are
used to arrange raw, unclassified data items into groups characterized by
information structures or patterns.
● Association
This is a rule-based approach for determining associations between
variables in a given dataset. These methodologies are commonly used in
market basket analysis, helping businesses better understand the linkages
between various items. Recommendation engines on multiple sites are
the best-known example of this.
● Dimensionality reduction
While more data typically gives more accurate findings, it can also
influence the performance of machine learning algorithms (overfitting)
and make the visualization of datasets challenging. Dimensionality
reduction is a strategy that is employed when the amount of
characteristics, or dimensions, in a given dataset is excessive. It minimizes
the amount of data inputs to a reasonable quantity while keeping the
dataset's integrity as much as feasible.

You might also like