Introduction-to-Unsupervised-Machine-Learning

The document provides an overview of unsupervised machine learning, highlighting its ability to identify patterns in unlabeled data and its common techniques such as clustering, association, and dimensionality reduction. It emphasizes the advantages of unsupervised learning, including discovery, efficiency, and flexibility, along with its applications in customer segmentation, fraud detection, and image recognition. Additionally, the document details the K-means clustering algorithm, its methodology, advantages, limitations, and real-world use cases.

Uploaded by

Subhajit Nandi


NETAJI SUBHASH ENGINEERING COLLEGE

TOPIC: UNSUPERVISED MACHINE LEARNING AND ITS APPLICATIONS AND K-MEANS CLUSTERING

NAME: SUBHAJIT NANDI
SEMESTER: 7th
CLASS ROLL: 83
SECTION: B
UNIVERSITY ROLL: 10900121089
STREAM: COMPUTER SCIENCE AND ENGINEERING
PAPER NAME: MACHINE LEARNING
PAPER CODE: PEC-CS701E

Introduction to Unsupervised Machine Learning
Unsupervised machine learning is a type of machine learning that allows computers
to learn without explicit labels or guidance. Instead of being fed labeled data,
unsupervised learning algorithms discover patterns and structures within the data
itself. This type of learning is advantageous when dealing with data that has not
been categorized or classified, enabling the identification of hidden patterns or
intrinsic structures.

Some common techniques used in unsupervised learning include clustering, association, and dimensionality reduction.
Clustering algorithms, such as K-means and hierarchical clustering, group data points based on similarity, helping to identify
natural groupings within the data. Association rule learning, like the Apriori algorithm, discovers interesting relationships
between variables in large databases. Dimensionality reduction techniques, such as Principal Component Analysis (PCA) and
t-distributed Stochastic Neighbor Embedding (t-SNE), reduce the number of random variables under consideration,
simplifying the dataset while preserving its essential features.
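
As a rough illustration of dimensionality reduction, the sketch below projects data onto its top principal components using NumPy's SVD. The helper name `pca_project` and the synthetic data are assumptions made for this example; they do not come from the slides.

```python
import numpy as np

def pca_project(X, n_components):
    """Project rows of X onto the top principal components (illustrative helper)."""
    X_centered = X - X.mean(axis=0)  # PCA works on mean-centered data
    # Rows of Vt are the principal directions, ordered by decreasing variance.
    _, singular_values, Vt = np.linalg.svd(X_centered, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = singular_values[:n_components] ** 2 / (X.shape[0] - 1)
    return X_centered @ components.T, explained_variance

rng = np.random.default_rng(0)
# Synthetic 3-D data that varies mostly along a single direction.
t = rng.normal(size=(200, 1))
X = t @ np.array([[2.0, 1.0, 0.5]]) + 0.05 * rng.normal(size=(200, 3))

Z, var = pca_project(X, n_components=2)
print(Z.shape)  # (200, 2)
print(var)      # variance captured by each retained component
```

Here three correlated features are reduced to two, and the first component should carry nearly all of the variance because the data was generated along one direction.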
Advantages of Unsupervised Learning
1 Discovery
Unsupervised learning excels at uncovering hidden patterns and insights that might
be overlooked by traditional methods. This ability to identify novel information is
invaluable for understanding complex datasets.

2 Efficiency
Unlike supervised learning, unsupervised learning doesn't require human-labeled data,
making it more efficient in situations where labeled data is scarce or expensive to
obtain.

3 Flexibility
Unsupervised learning algorithms can handle different data structures and uncover
various kinds of patterns, making them adaptable to a wide range of tasks.

4 Applications
Unsupervised learning has a wide range of applications in various fields, including
customer segmentation, fraud detection, and anomaly detection.
Applications of Unsupervised Learning
Customer Segmentation
Unsupervised learning can be used to identify distinct groups of customers based
on their purchasing behavior, demographics, or preferences, enabling businesses
to tailor marketing campaigns effectively.

Anomaly Detection
In the realm of fraud detection, unsupervised learning algorithms can analyze transaction data to
spot irregularities that may indicate fraudulent activities. By clustering similar transactions together,
these algorithms can highlight outliers that deviate from normal behavior. Techniques such as
Isolation Forests, One-Class SVMs, and autoencoders are often used to detect these anomalies,
enabling financial institutions to proactively identify and prevent fraud.

Image Recognition
Unsupervised learning algorithms play a crucial role in the field of image recognition by
clustering images based on their visual features. This capability enables various tasks, such as
image classification and object detection, without the need for extensive labeled datasets. By
automatically grouping similar images together, unsupervised learning can identify patterns
and structures that are not immediately apparent.
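
To make the anomaly-detection idea concrete, here is a deliberately simple distance-based sketch: each feature is standardized and points far from the centre of the data are flagged. It is a stand-in for illustration only, not an Isolation Forest, One-Class SVM, or autoencoder. The function name `flag_outliers`, the threshold of 4.0, and the synthetic "transactions" are all assumptions.

```python
import numpy as np

def flag_outliers(X, threshold=4.0):
    """Flag rows whose standardized distance from the data centre exceeds threshold."""
    z = (X - X.mean(axis=0)) / X.std(axis=0)  # put every feature on a common scale
    score = np.linalg.norm(z, axis=1)         # distance from the centre in z-units
    return score > threshold, score

rng = np.random.default_rng(1)
# Synthetic two-feature "transactions": 500 ordinary rows plus 2 extreme ones.
ordinary = rng.normal(loc=[50.0, 1.0], scale=[5.0, 0.2], size=(500, 2))
extreme = np.array([[400.0, 1.0], [55.0, 9.0]])
X = np.vstack([ordinary, extreme])

flags, score = flag_outliers(X)
print(np.flatnonzero(flags))  # indices of the rows flagged as outliers
```

Because the mean and standard deviation are themselves pulled by outliers, a more robust scale estimate would be preferable on real data.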
Clustering Algorithms: K-Means Clustering
K-Means Clustering

K-Means is a popular and widely used unsupervised learning algorithm for grouping
data points into clusters based on their similarity. It aims to minimize the sum of
squared distances between data points and the centroid of their cluster. For a fixed
dataset, this is equivalent to maximizing the variance between clusters.

Centroid-Based
K-Means works by iteratively assigning data
points to the nearest cluster centroid and
recalculating the centroids based on the
assigned points, aiming for an optimal cluster
arrangement.
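
The quantity that the assign-and-recalculate loop reduces is usually written as the within-cluster sum of squares. The symbols below ($x_i$ for a data point, $C_j$ for the set of points in cluster $j$, $\mu_j$ for its centroid) are notation introduced here, not taken from the slides:

```latex
J = \sum_{j=1}^{k} \sum_{x_i \in C_j} \lVert x_i - \mu_j \rVert^2,
\qquad
\mu_j = \frac{1}{|C_j|} \sum_{x_i \in C_j} x_i
```

Assigning each point to its nearest centroid cannot increase $J$, and neither can replacing each centroid with the mean of its assigned points, which is why the iteration settles down.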
K-Means Clustering: Methodology and Intuition
Initialization
Randomly select k initial centroids, representing the
center of each cluster.

Assignment
Assign each data point to the closest centroid based on a
distance metric, such as Euclidean distance.

Update
Recalculate the position of each centroid based on the
average of the data points assigned to that cluster.

Iteration
Repeat the assignment and update steps until the
centroids no longer change significantly, indicating that
the clusters have converged.
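
The four steps above can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the function name `kmeans`, its parameters (`max_iter`, `tol`, `seed`), the empty-cluster handling, and the synthetic two-blob data are all choices made for this example.

```python
import numpy as np

def kmeans(X, k, max_iter=100, tol=1e-6, seed=0):
    """Basic K-Means with Euclidean distance (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    # Initialization: pick k distinct data points as the starting centroids.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iter):
        # Assignment: index of the nearest centroid for every point.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update: mean of the assigned points; an empty cluster keeps its centroid.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        # Iteration: stop once the centroids barely move.
        shift = np.linalg.norm(new_centroids - centroids)
        centroids = new_centroids
        if shift < tol:
            break
    # Final assignment so the labels match the returned centroids.
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return centroids, dists.argmin(axis=1)

rng = np.random.default_rng(42)
# Two well-separated synthetic blobs centred near (0, 0) and (10, 10).
X = np.vstack([
    rng.normal(loc=0.0, scale=0.5, size=(50, 2)),
    rng.normal(loc=10.0, scale=0.5, size=(50, 2)),
])
centroids, labels = kmeans(X, k=2)
print(centroids)            # one centroid per blob
print(np.bincount(labels))  # number of points in each cluster
```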
Advantages and Limitations of K-Means Clustering
Advantages
Simplicity and efficiency
The algorithm is straightforward to implement and understand, which makes it a popular
choice for those new to machine learning.

Widely used and well-understood
It has been extensively studied and documented, making it well-understood in both
academic and industry settings.

Suitable for large datasets
The algorithm is computationally efficient: each iteration costs O(n · k · d) for n data
points, k clusters, and d features, so the running time grows linearly with n when k, d,
and the number of iterations are fixed. This efficiency allows K-Means to quickly process
large volumes of data, making it well suited to applications where real-time or
near-real-time analysis is required.

Limitations
Sensitivity to initial centroid selection
If the initial centroids are not well-chosen, the algorithm may converge to a local
minimum, resulting in suboptimal clustering.

Assumption of spherical clusters
K-Means implicitly assumes that clusters are roughly spherical. In real-world scenarios,
data may not conform to this assumption, leading to poor clustering performance.

Inability to handle noisy or outlier data effectively
Outliers can disproportionately affect the position of centroids, leading to skewed
clustering results.
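
One commonly used way to reduce the sensitivity to initial centroids is k-means++ seeding, which spreads the starting centroids out instead of drawing them uniformly. The slides do not mention it; the sketch below is offered as an assumption-laden illustration, and the function name `kmeans_pp_init` and the synthetic data are hypothetical.

```python
import numpy as np

def kmeans_pp_init(X, k, seed=0):
    """Choose k starting centroids with k-means++ style seeding (illustrative)."""
    rng = np.random.default_rng(seed)
    # First centroid: a data point chosen uniformly at random.
    centroids = [X[rng.integers(len(X))]]
    for _ in range(k - 1):
        chosen = np.array(centroids)
        # Squared distance from each point to its nearest already-chosen centroid.
        d2 = ((X[:, None, :] - chosen[None, :, :]) ** 2).sum(axis=2).min(axis=1)
        # Far-away points are proportionally more likely to be picked next.
        probs = d2 / d2.sum()
        centroids.append(X[rng.choice(len(X), p=probs)])
    return np.array(centroids)

rng = np.random.default_rng(7)
# Three synthetic blobs; good seeds would ideally land in different blobs.
X = np.vstack([
    rng.normal(loc=center, scale=0.3, size=(30, 2))
    for center in ([0.0, 0.0], [10.0, 0.0], [0.0, 10.0])
])
init = kmeans_pp_init(X, k=3)
print(init)  # three data points to use as starting centroids
```

An already-chosen point has zero distance to itself and therefore zero probability of being picked again. The sketch assumes the data contains at least k distinct points.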
Real-World Use Cases of K-Means Clustering
Customer Segmentation
K-Means is widely used for customer segmentation, grouping
customers with similar characteristics to personalize marketing
efforts.

Image Compression
K-Means can be used to compress images by clustering similar colors, reducing the overall data size.

Document Clustering
Clustering similar documents based on their content helps with information retrieval and organization.

Medical Diagnosis
K-Means can be used to identify groups of patients with similar
symptoms and medical histories, aiding in diagnosis and treatment
planning.
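
The image compression use case can be sketched as colour quantization: cluster the pixel colours, then replace every pixel with its cluster's centroid colour. The function name `quantize_colors`, the fixed iteration count, and the random test image are assumptions made for this example.

```python
import numpy as np

def quantize_colors(image, k, n_iter=10, seed=0):
    """Reduce an RGB image to at most k colours with a few K-Means iterations."""
    pixels = image.reshape(-1, 3).astype(float)
    rng = np.random.default_rng(seed)
    # Start the palette from k randomly chosen pixels.
    palette = pixels[rng.choice(len(pixels), size=k, replace=False)]
    for _ in range(n_iter):
        dists = np.linalg.norm(pixels[:, None, :] - palette[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):  # leave an empty cluster's colour unchanged
                palette[j] = pixels[labels == j].mean(axis=0)
    # Map every pixel to the nearest colour in the final palette.
    dists = np.linalg.norm(pixels[:, None, :] - palette[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    quantized = palette[labels].round().astype(np.uint8)
    return quantized.reshape(image.shape), palette

rng = np.random.default_rng(0)
# Synthetic 16x16 RGB image with random colours, standing in for a real photo.
image = rng.integers(0, 256, size=(16, 16, 3), dtype=np.uint8)
compressed, palette = quantize_colors(image, k=4)
n_colors = len(np.unique(compressed.reshape(-1, 3), axis=0))
print(compressed.shape, n_colors)  # same shape as the input, at most 4 colours
```

With a 4-colour palette, each pixel can be stored as a 2-bit palette index instead of a 24-bit RGB value, which is where the size reduction comes from.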
Conclusion and Future Developments
Unsupervised learning has made significant strides in recent years, with
advancements in algorithms and techniques continually pushing the boundaries of
what is possible. The rapid growth of data in today's digital age has underscored the
importance of these methods, as they are uniquely capable of uncovering hidden
patterns and structures within vast, unlabelled datasets.

This ability to analyze and interpret data without the need for manual labeling has
made unsupervised learning methods increasingly valuable. They are now pivotal in
a wide range of applications, from market segmentation and customer behavior
analysis to anomaly detection and bioinformatics. As the volume and complexity of
data continue to expand, the role of unsupervised learning in driving innovation
and extracting meaningful insights becomes ever more critical.

Furthermore, the continuous evolution of unsupervised learning techniques, such
as advanced clustering methods, dimensionality reduction, and association rule
learning, has enhanced their robustness and applicability. These advancements
have enabled more accurate and efficient data analysis, empowering organizations
to make better-informed decisions and uncover opportunities that were previously
hidden.
