0% found this document useful (0 votes)

82 views

Unsupervised Learning

Uploaded by

ayesha bashir

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

82 views

Unsupervised Learning

Uploaded by

ayesha bashir

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

Artificial Intelligence

Machine Learning
Unsupervised learning
2
Disclaimer Statement
In preparation of these slides, materials have been taken
from different online sources in the shape of books, websites,
research papers and presentations etc. However, the author
does not have any intention to take any benefit of these in
her/his own name. This lecture (audio, video, slides etc) is
prepared and delivered only for educational purposes and is
not intended to infringe upon the copyrighted material.
Sources have been acknowledged where applicable. The
views expressed are presenter’s alone and do not necessarily
represent actual author(s) or the institution.
Unsupervised Learning
Training data:“examples” x.

x1 , . . . , x n , x i ∈ X ⊂ Rn

• Clustering/segmentation:

f : R d − → {C 1 , . . . C k } (set of clusters).

Example: Find clusters in the population, fruits, species.

Unsupervised learning

Feature 2

Feature 1
Find clusters in the population feature 1 and feature 2.
Unsupervised learning

Feature 2

Feature 1
Methods: K-means, gaussian mixtures, hierarchical agglomerative clustering,
spectral clustering, DBScan, etc.
Clustering examples
• Clustering of the population by their demographics.

• Clustering of geographic objects (mineral deposits,

houses, etc.)
• Clustering of stars

• Audio signal separation. Example?

• Image segmentation. Example?

K-Means: example

Clustering of the population by their demographics.

K-Means: example

Clustering of geographic objects (mineral deposits, houses, etc.)

K-Means: example

Clustering of stars
K-Means: example

Clustering of stars
Clustering: K-Means
• Goal: Assign each example (x 1 , . . . , x n ) to one of the k clusters
{C1, . . . Ck}.
Clustering: K-Means
• Goal: Assign each example (x 1 , . . . , x n ) to one of the k clusters
{C1, . . . Ck}.

• µ j is the mean of all examples in the j t h cluster.

Clustering: K-Means
• Goal: Assign each example (x 1 , . . . , x n ) to one of the k clusters
{C1, . . . Ck}.

• µ j is the mean of all examples in the j t h cluster.

• Minimize: Σk Σ
2
J = ||x i − µ j ||
j=1 x i ∈C j
Clustering: K-Means
Algorithm K-Means:
Initialize randomly µ 1 , · · · µ k .
Clustering: K-Means
Algorithm K-Means: Initialize
randomly µ 1 , · · · µ k . Repeat
Assign each point x i to
the cluster with the closest µ j .
Clustering: K-Means
Algorithm K-Means: Initialize
randomly µ 1 , · · · µ k . Repeat
Assign each point x i to the cluster with the closest µ j .
Calculate the new mean for each cluster as follows:

1 Σ
µj = xi
|Cj |
x i ∈C j

Until convergence∗.
Clustering: K-Means
Algorithm K-Means: Initialize
randomly µ 1 , · · · µ k . Repeat
Assign each point x i to the cluster with the closest µ j .
Calculate the new mean for each cluster as follows:

1 Σ
µj = xi
|Cj |
x i ∈C j

Until convergence∗.

∗Convergence: Means no change in the clusters OR maximum

number of iterations reached.
K-Means: pros and cons
+Easy to implement

BUT...

-Need to know K
-Suffer from the curse of dimensionality
-No theoretical foundation
K-Means: questions

1. How to set k to optimally cluster the data?

2.How to evaluate your model? 3.How to cluster

non circular shapes?

K-Means: question 1
How to set k to optimally cluster the data?
G-means algorithm (Hamerly and Elkan, NIPS 2003) 1.Initialize
k to be a small number
2. Run k-means with those cluster centers, and store the resulting centers as C
3. Assign each point to its nearest cluster
4. Determine if the points in each cluster fit a Gaussian distribu- tion (Anderson-
Darling test).
5. For each cluster, if the points seem to be normally distributed, keep the cluster
center. Otherwise, replace it with two cluster centers.
6. Repeat this algorithm from step 2. until no more cluster centers are created.
K-Means: question 2
How to evaluate your model?
• Not trivial (as compared to counting the number of errors in classification).

• Internal evaluation: using same data. high intra-cluster sim- ilarity

(documents within a cluster are similar) and low inter- cluster similarity. E.g.,
Davies-Bouldin index that takes into account both the distance inside the
clusters and the distance between clusters. The lower the value of the index,
the wider is the separation between different clusters, and the more tightly the
points within each cluster are located together.

• External evaluation: use of ground truth of external data. E.g., mutual

information, entropy, adjusted random index, etc.
K-Means: question 3
How to cluster non circular shapes?

There are other methods: spectral clustering, DBSCAN, BIRCH, etc. that
handle other shapes.

CSI 2110 Summary PDF
No ratings yet
CSI 2110 Summary PDF
17 pages
Neural Networks and Their Application To Finance: Martin P. Wallace (P D)
No ratings yet
Neural Networks and Their Application To Finance: Martin P. Wallace (P D)
10 pages
SVM
No ratings yet
SVM
21 pages
Unsupervised Learning 2024-PPG
No ratings yet
Unsupervised Learning 2024-PPG
85 pages
ET4248E - Chap9 - K-Means and GMM
No ratings yet
ET4248E - Chap9 - K-Means and GMM
27 pages
Support Vector Machines
No ratings yet
Support Vector Machines
14 pages
Expectation Maximization
No ratings yet
Expectation Maximization
23 pages
Support Vector Machines PDF
100% (1)
Support Vector Machines PDF
37 pages
Ain Shams University Faculty of Engineering
No ratings yet
Ain Shams University Faculty of Engineering
2 pages
Intro SVM New Example PDF
100% (1)
Intro SVM New Example PDF
56 pages
03 - K Means Clustering On Iris Datasets
No ratings yet
03 - K Means Clustering On Iris Datasets
4 pages
Introduction To Tree Methods
No ratings yet
Introduction To Tree Methods
15 pages
K Means Clustering Lecture
No ratings yet
K Means Clustering Lecture
32 pages
Association Rules FP Growth
No ratings yet
Association Rules FP Growth
32 pages
ML Practice 1
No ratings yet
ML Practice 1
106 pages
K Means Clustering
100% (1)
K Means Clustering
13 pages
K Means
No ratings yet
K Means
22 pages
Chapter
100% (1)
Chapter
101 pages
Decision Tree
No ratings yet
Decision Tree
13 pages
IS328 Final Exam
No ratings yet
IS328 Final Exam
12 pages
Support Vector Machines: Dominik Wisniewski Wojciech Wawrzyniak
No ratings yet
Support Vector Machines: Dominik Wisniewski Wojciech Wawrzyniak
16 pages
KNN ALGORITHM IN MACHINELEARNING
No ratings yet
KNN ALGORITHM IN MACHINELEARNING
10 pages
Chapter 6 ML Classifications
No ratings yet
Chapter 6 ML Classifications
51 pages
Prims Algorithm
No ratings yet
Prims Algorithm
6 pages
DBSCAN
No ratings yet
DBSCAN
42 pages
Naive Bayes Classification
No ratings yet
Naive Bayes Classification
47 pages
Data Mining Final Exam
No ratings yet
Data Mining Final Exam
1 page
Literature Review On Feature Selection Methods For HighDimensional Data
No ratings yet
Literature Review On Feature Selection Methods For HighDimensional Data
9 pages
Market Basket Analysis and Advanced Data Mining: Professor Amit Basu
No ratings yet
Market Basket Analysis and Advanced Data Mining: Professor Amit Basu
24 pages
ML Kernel Methods
No ratings yet
ML Kernel Methods
51 pages
CH 6
No ratings yet
CH 6
72 pages
1.linear Regression PSP
No ratings yet
1.linear Regression PSP
92 pages
K-Means and PCA
No ratings yet
K-Means and PCA
69 pages
Matrix Chain Multiplication
No ratings yet
Matrix Chain Multiplication
13 pages
Non Parametric Methods 8
No ratings yet
Non Parametric Methods 8
23 pages
Pycryptodome Master
100% (1)
Pycryptodome Master
82 pages
ML Interview Questions and Answers
100% (1)
ML Interview Questions and Answers
25 pages
K Means Clustering Algorithm
No ratings yet
K Means Clustering Algorithm
12 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Les 3 DWM
No ratings yet
Les 3 DWM
21 pages
Data Preprocessing
No ratings yet
Data Preprocessing
38 pages
Lecture 3 Data Mining
No ratings yet
Lecture 3 Data Mining
30 pages
Data Preprocesing JavaPoint
No ratings yet
Data Preprocesing JavaPoint
19 pages
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
100% (1)
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
73 pages
Supervised and Deep Learning
No ratings yet
Supervised and Deep Learning
83 pages
An To An A That It It An: I. (L, The
No ratings yet
An To An A That It It An: I. (L, The
10 pages
DBSCAN Clustering Algorithm: Presented by
No ratings yet
DBSCAN Clustering Algorithm: Presented by
22 pages
ML Unit-3.-1
No ratings yet
ML Unit-3.-1
28 pages
Lec20 RidgeRegression
No ratings yet
Lec20 RidgeRegression
21 pages
Square Topology For NoCs
No ratings yet
Square Topology For NoCs
4 pages
K Mean Clustering
No ratings yet
K Mean Clustering
36 pages
Naïve Bayes Classifier (Week 8)
No ratings yet
Naïve Bayes Classifier (Week 8)
18 pages
Distributed Database Derived Horizontal Fragmentation
No ratings yet
Distributed Database Derived Horizontal Fragmentation
26 pages
Ensemble Methods Bagging Boosting and Stacking
100% (1)
Ensemble Methods Bagging Boosting and Stacking
19 pages
Examen Machine Learning A)
No ratings yet
Examen Machine Learning A)
4 pages
MLT Unit 3
100% (1)
MLT Unit 3
38 pages
ML UNIT 4 Sir
No ratings yet
ML UNIT 4 Sir
42 pages
P-3 1 2-Kmeans
No ratings yet
P-3 1 2-Kmeans
43 pages
04-FSSR_DS610_2024=2025T1_Kmeans
No ratings yet
04-FSSR_DS610_2024=2025T1_Kmeans
57 pages
Clustering Techniques - Hierarchical, K-Means Clustering
No ratings yet
Clustering Techniques - Hierarchical, K-Means Clustering
22 pages
Department of Computer Science Assignment#01: Course Title: Compiler Construction Submitted To
No ratings yet
Department of Computer Science Assignment#01: Course Title: Compiler Construction Submitted To
8 pages
Preventive Measures Taken by Government
No ratings yet
Preventive Measures Taken by Government
3 pages
Lecture 2
No ratings yet
Lecture 2
5 pages
History of Telnet
No ratings yet
History of Telnet
5 pages
Quiz DIP
No ratings yet
Quiz DIP
7 pages
Locust Attack
No ratings yet
Locust Attack
31 pages
Machine Learning: III B. Tech I Semester Regular/Supplementary Examinations, December - 2023
No ratings yet
Machine Learning: III B. Tech I Semester Regular/Supplementary Examinations, December - 2023
8 pages
Unit 3
No ratings yet
Unit 3
55 pages
A Comparison of Machine Learning Algorithms for Customer Churn Prediction
No ratings yet
A Comparison of Machine Learning Algorithms for Customer Churn Prediction
6 pages
5 2 Multilayer Perceptron
No ratings yet
5 2 Multilayer Perceptron
17 pages
Weka Sample
No ratings yet
Weka Sample
21 pages
Lec29 ImportanceSampling
No ratings yet
Lec29 ImportanceSampling
84 pages
Random Forest
No ratings yet
Random Forest
18 pages
Simplilearn Deep Learning
No ratings yet
Simplilearn Deep Learning
6 pages
Get Introduction to Data Mining Global Edition Pang Ning Tan Michael Steinbach Anuj Karpatne Vipin Kumar PDF ebook with Full Chapters Now
100% (3)
Get Introduction to Data Mining Global Edition Pang Ning Tan Michael Steinbach Anuj Karpatne Vipin Kumar PDF ebook with Full Chapters Now
65 pages
COMP-377 Lab Assignment 4 - F21
No ratings yet
COMP-377 Lab Assignment 4 - F21
2 pages
NPU MachineLearning
No ratings yet
NPU MachineLearning
28 pages
ML Unit-3 ppt
No ratings yet
ML Unit-3 ppt
92 pages
Introduction: Geometric Models: - Page 1 of 25
No ratings yet
Introduction: Geometric Models: - Page 1 of 25
25 pages
Final Unit 5 Questions
No ratings yet
Final Unit 5 Questions
6 pages
Lecture 1
No ratings yet
Lecture 1
30 pages
Int422 Project
No ratings yet
Int422 Project
8 pages
Experiments With A New Boosting Algorithm: Yoav Freund Robert E. Schapire
No ratings yet
Experiments With A New Boosting Algorithm: Yoav Freund Robert E. Schapire
9 pages
Synopsis PDF
No ratings yet
Synopsis PDF
2 pages
GANs
No ratings yet
GANs
13 pages
Automatic Fruit Classification Using Deep Learning for Industrial Applications
No ratings yet
Automatic Fruit Classification Using Deep Learning for Industrial Applications
8 pages
Comparative Analysis of Classification Algorithms On Diferrent Dataset Using Weka SW PDF
No ratings yet
Comparative Analysis of Classification Algorithms On Diferrent Dataset Using Weka SW PDF
5 pages
Stock Prediction RNN
No ratings yet
Stock Prediction RNN
7 pages
Convolutional Neural Network With An Optimized Backpropagation Technique
No ratings yet
Convolutional Neural Network With An Optimized Backpropagation Technique
5 pages
TSLSTM
No ratings yet
TSLSTM
4 pages
A Driving Decision
No ratings yet
A Driving Decision
39 pages
2085-Article Text-5597-1-10-20220804
No ratings yet
2085-Article Text-5597-1-10-20220804
12 pages
Neural-Network Questions
0% (1)
Neural-Network Questions
3 pages
Deep Learning: Prof:Naveen Ghorpade
No ratings yet
Deep Learning: Prof:Naveen Ghorpade
43 pages
PCA Problem Statement With Answer
No ratings yet
PCA Problem Statement With Answer
22 pages
LAB 6A:K-Means Clustering
No ratings yet
LAB 6A:K-Means Clustering
3 pages

Unsupervised Learning

Uploaded by

Unsupervised Learning

Uploaded by

Artificial Intelligence

Example: Find clusters in the population, fruits, species.

• Clustering of geographic objects (mineral deposits,

• Audio signal separation. Example?

• Image segmentation. Example?

Clustering of the population by their demographics.

Clustering of geographic objects (mineral deposits, houses, etc.)

• µ j is the mean of all examples in the j t h cluster.

• µ j is the mean of all examples in the j t h cluster.

∗Convergence: Means no change in the clusters OR maximum

1. How to set k to optimally cluster the data?

2.How to evaluate your model? 3.How to cluster

non circular shapes?

• Internal evaluation: using same data. high intra-cluster sim- ilarity

• External evaluation: use of ground truth of external data. E.g., mutual

You might also like