Machine Learning
Lesson 6: Unsupervised Learning
© Simplilearn. All rights reserved.
Concepts Covered
Unsupervised Learning
Hierarchical Clustering
Dendrogram
K-means Clustering
Learning Objectives
By the end of this lesson, you will be able to:
Explain the mechanism of unsupervised learning
Practice different clustering techniques in Python
Unsupervised Learning
Topic 1: Overview
Unsupervised Learning Process Flow
The data has no labels. The machine just looks for whatever patterns it can find.
[Figure: Unsupervised learning model. Training data (text, documents, images, etc.) is converted into feature vectors and fed to a machine learning algorithm, which builds a predictive model. New data (text, documents, images, etc.) is likewise converted into feature vectors and passed to the predictive model, which outputs a likelihood, a cluster ID, or a better representation.]
Unsupervised Learning vs. Supervised Learning
The only difference is the presence of labels in the training data: in supervised learning, labels are fed to the learning algorithm along with the feature vectors; in unsupervised learning, they are not.
[Figure: The two process flows side by side. Unsupervised: training data → feature vectors → machine learning algorithm → predictive model, which outputs a likelihood, a cluster ID, or a better representation for new data. Supervised: training data plus labels → feature vectors → machine learning algorithm → predictive model, which outputs an expected label for new data.]
Unsupervised Learning: Example
Clustering similar-looking birds and animals into groups based on their features
[Figure: Images of mixed birds and animals grouped into clusters by unsupervised learning.]
Application of Unsupervised Learning
Unsupervised learning can be used for anomaly detection as well as clustering.
[Figure: Two scatter plots. One illustrates anomaly detection, where points with very low likelihood scores (e.g., 0.0033, 0.0119) lie far from the main mass of data; the other illustrates identifying similarities in groups (clustering).]
Unsupervised Learning
Topic 2: Clustering
Clustering
Clustering groups objects based on the information found in the data that describes the objects or their relationships. The goal is for similar objects to be grouped into one cluster and to be different from the objects in other clusters.
[Figure: A scatter plot with points partitioned into five groups, labeled Cluster 0 through Cluster 4.]
The Need for Clustering
To determine the intrinsic grouping in a set of unlabeled data
To organize data into clusters that show the internal structure of the data
To partition data points into meaningful groups
To understand and extract value from large sets of structured and unstructured data
Types of Clustering
Clustering techniques fall into two broad families:
• Hierarchical clustering: agglomerative and divisive
• Partitional clustering: K-means and Fuzzy C-means
Unsupervised Learning
Topic 3: Hierarchical Clustering
Hierarchical Clustering
Outputs a hierarchy, a structure that is more informative than the unstructured set of clusters returned by flat clustering
[Figure: Four stages of agglomerative clustering on points A–F, each shown with its growing dendrogram and a dissimilarity axis. 1: A and B are combined based on similarity, as are D and E. 2: The A/B cluster is combined with C. 3: The D/E cluster is combined with F. 4: The final tree combines all clusters into a single cluster.]
Working: Hierarchical Clustering
Step 1: Assign each item to its own cluster, so that if you have N items, you start with N clusters.
Step 2: Find the closest (most similar) pair of clusters and merge them into a single cluster. You now have one less cluster.
Step 3: Compute the distances (similarities) between the new cluster and every old cluster.
Step 4: Repeat steps 2 and 3 until all items are clustered into a single cluster of size N.
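These four steps are exactly what scipy.cluster.hierarchy.linkage performs. A minimal sketch on four made-up points (the points and the two-cluster cut are illustrative assumptions, not part of the lesson's lab code):

Code
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Toy data: four 2-D points (illustrative only)
X = np.array([[1.0, 1.0], [1.5, 1.2], [5.0, 5.0], [5.2, 4.8]])

# linkage() carries out steps 1-4: each point starts in its own cluster,
# and the closest pair of clusters is merged until one cluster remains
Z = linkage(X, method='single')  # single linkage = minimum distance

# Cut the resulting tree into two flat clusters
labels = fcluster(Z, t=2, criterion='maxclust')
print(labels)  # e.g., [1 1 2 2]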
Distance Measures
Complete-Linkage Clustering
• Find the maximum possible distance between points belonging to two different clusters
Single-Linkage Clustering
• Find the minimum possible distance between points belonging to two different clusters
Mean-Linkage Clustering
• Find all possible pair-wise distances for points belonging to two different clusters and then calculate the average
Centroid-Linkage Clustering
• Find the centroids of each cluster and calculate the distance between them
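In SciPy, these four strategies map onto the method argument of scipy.cluster.hierarchy.linkage ('complete', 'single', 'average', and 'centroid', respectively). A small sketch comparing them on made-up points (the data is an illustrative assumption):

Code
import numpy as np
from scipy.cluster.hierarchy import linkage

# Illustrative 2-D points
X = np.array([[0.0, 0.0], [0.0, 1.0], [4.0, 0.0], [4.0, 1.5], [2.0, 6.0]])

# 'complete' = maximum distance, 'single' = minimum distance,
# 'average' = mean of all pair-wise distances,
# 'centroid' = distance between cluster centroids
for method in ['complete', 'single', 'average', 'centroid']:
    Z = linkage(X, method=method)
    # Each row of Z records one merge: [cluster_i, cluster_j, distance, size]
    print(method, '-> last merge distance:', Z[-1, 2])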
The Dendrogram
A dendrogram (in Greek, dendro means tree and gramma means drawing) is a tree diagram frequently used to illustrate the arrangement of the clusters produced by hierarchical clustering.
[Figure: A dendrogram, read bottom-up for agglomerative clustering and top-down for divisive clustering.]
Hierarchical Clustering: Example
A hierarchical clustering of distances between cities in kilometers
[Figure: A map of six Italian cities (BA = Bari, FI = Florence, MI = Milan, NA = Naples, RM = Rome, TO = Turin) and the dendrogram produced by clustering the distances between them.]
Hierarchical Clustering: Step 1
Create a distance matrix of the data (distances in kilometers):

      BA   FI   MI   NA   RM   TO
BA     0  662  877  255  412  996
FI   662    0  295  468  268  400
MI   877  295    0  754  564  138
NA   255  468  754    0  219  869
RM   412  268  564  219    0  669
TO   996  400  138  869  669    0

The smallest off-diagonal entry, 138, is the distance between TO and MI.
Hierarchical Clustering: Step 2
From the distance matrix, you can see that MI and TO have the smallest distance, so they form a cluster together. Merging them gives a new matrix in which the MI/TO row takes the smaller of the MI and TO distances to each remaining city:

        BA   FI  MI/TO   NA   RM
BA       0  662    877  255  412
FI     662    0    295  468  268
MI/TO  877  295      0  754  564
NA     255  468    754    0  219
RM     412  268    564  219    0

Because every value in the MI column is lower than the corresponding value in the TO column, the MI/TO row consists of the MI column values.
Hierarchical Clustering: Step 3
Repeat the clustering until a single cluster containing all the members is obtained. The closest pair is now NA and RM (219), so they are merged:

        BA   FI  MI/TO  NA/RM
BA       0  662    877    255
FI     662    0    295    268
MI/TO  877  295      0    564
NA/RM  255  268    564      0
Hierarchical Clustering: Step 3 (Contd.)
In the matrix above, BA and NA/RM are now the closest pair (255), so they are merged:

            BA/(NA/RM)   FI  MI/TO
BA/(NA/RM)           0  268    564
FI                 268    0    295
MI/TO              564  295      0
Hierarchical Clustering: Step 3 (Contd.)
FI is now closest to BA/(NA/RM) (268), so they are merged:

               BA/(NA/RM)/FI  MI/TO
BA/(NA/RM)/FI              0    295
MI/TO                    295      0
Hierarchical Clustering: Step 4
Derive the final dendrogram: the two remaining clusters, BA/(NA/RM)/FI and MI/TO, are merged at distance 295, which completes the tree.
[Figure: Final dendrogram with leaves BA, NA, RM, FI, TO, MI.]
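The whole worked example can be reproduced by handing SciPy the precomputed distance matrix in condensed form. Since each merge above used the minimum distance, single linkage reproduces the same tree:

Code
import numpy as np
import matplotlib.pyplot as plt
from scipy.spatial.distance import squareform
from scipy.cluster.hierarchy import linkage, dendrogram

labels = ['BA', 'FI', 'MI', 'NA', 'RM', 'TO']

# Distance matrix from Step 1 (kilometers)
D = np.array([
    [0, 662, 877, 255, 412, 996],
    [662, 0, 295, 468, 268, 400],
    [877, 295, 0, 754, 564, 138],
    [255, 468, 754, 0, 219, 869],
    [412, 268, 564, 219, 0, 669],
    [996, 400, 138, 869, 669, 0],
])

# linkage() expects a condensed (upper-triangular) distance vector
condensed = squareform(D)
Z = linkage(condensed, method='single')  # single linkage = minimum distance

dendrogram(Z, labels=labels)
plt.show()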
Assisted Practice
Hierarchical Clustering Duration: 15 mins.
Problem Statement: Consider the dataset “zoo.data” and look at the first five rows. The first column denotes the animal name, and the last one specifies a high-level class for the corresponding animal.
Find a solution to the following questions:
• Find the number of unique high-level classes
• Perform agglomerative clustering using the 16 intermediate features
[Hint: Refer to the agglomerative clustering (hierarchical clustering) module in scikit-learn and set the number of clusters appropriately.]
Refer to the link below for further documentation:
http://scikit-learn.org/stable/modules/generated/sklearn.cluster.AgglomerativeClustering.html
• Compute the mean squared error by comparing the actual and predicted high-level classes.
Objective: Perform agglomerative clustering and obtain an appropriate MSE value.
Access: Click on the Labs tab on the left side panel of the LMS. Copy or note the username and password
that are generated. Click on the Launch Lab button. On the page that appears, enter the username and
password in the respective fields, and click Login.
Unassisted Practice
Hierarchical Clustering Duration: 10 mins.
Problem Statement: An ecommerce company has prepared a rough dataset containing its customers' shopping details: CustomerID, Genre, Age, Annual Income (k$), and Spending Score (1-100). The company is unable to target a specific set of customers with a particular set of SKUs.
Objective: Segment customers into different groups based on their shopping trends.
Note: This practice is not graded. It is only intended for you to apply the knowledge you have gained to solve real-world
problems.
Access: Click on the Labs tab on the left side panel of the LMS. Copy or note the username and password that are
generated. Click on the Launch Lab button. On the page that appears, enter the username and password in the respective
fields, and click Login.
Step 1: Data Import
Code
import pandas as pd
import numpy as np

# Load the customer shopping dataset
customer_data = pd.read_csv('shopping_data.csv')
customer_data
Step 2: Filter Columns
Discard all columns except annual income (in thousands of dollars) and spending score (1-100)
Code
# Keep only the Annual Income and Spending Score columns (indices 3 and 4)
data = customer_data.iloc[:, 3:5].values
data
Step 3: Create Dendrograms
Code
import matplotlib.pyplot as plt
%matplotlib inline
import scipy.cluster.hierarchy as shc

# Plot a dendrogram using Ward linkage to decide how many clusters to keep
plt.figure(figsize=(10, 7))
plt.title('Customer Dendrograms')
dend = shc.dendrogram(shc.linkage(data, method='ward'))

The dendrogram suggests five clusters.
Step 4: Agglomerative Clustering
Since the dendrogram suggests five clusters, group the data points into five clusters
Code
from sklearn.cluster import AgglomerativeClustering

# Ward linkage with Euclidean distances; note that in scikit-learn >= 1.2
# the 'affinity' parameter is named 'metric'
cluster = AgglomerativeClustering(n_clusters=5, affinity='euclidean',
                                  linkage='ward')
cluster.fit_predict(data)
Step 5: Plotting the Clusters
Code
# Color each point by its assigned cluster label
plt.figure(figsize=(10, 7))
plt.scatter(data[:, 0], data[:, 1], c=cluster.labels_, cmap='rainbow')
Unsupervised Learning
Topic 4: K-means Clustering
K-means Algorithm: Steps
1. Randomly choose k data points as the initial centroids
2. Assign each data point to its closest centroid
3. Calculate new cluster centroids as the mean of the assigned points
4. Check whether the convergence criterion is met; if not, repeat steps 2 and 3
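A minimal NumPy sketch of these four steps (the toy data, the tolerance-based convergence test, and the assumption that no cluster ends up empty are all illustrative choices, not part of the lesson's lab code):

Code
import numpy as np

def kmeans(X, k, max_iter=100, tol=1e-4, seed=0):
    rng = np.random.default_rng(seed)
    # Step 1: randomly choose k data points as the initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iter):
        # Step 2: assign each data point to its closest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 3: recompute each centroid as the mean of its assigned points
        # (assumes no cluster becomes empty; restart with another seed if so)
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        # Step 4: converged when the centroids barely move
        if np.linalg.norm(new_centroids - centroids) < tol:
            break
        centroids = new_centroids
    return labels, centroids

# Two well-separated blobs; the two groups should be recovered
X = np.vstack([np.random.randn(50, 2), np.random.randn(50, 2) + 5])
labels, centroids = kmeans(X, k=2)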
K-means: Example
Consider the data points below
K-means: Example (Contd.)
Initialize centers randomly
K-means: Example (Contd.)
Assign points to the nearest center
K-means: Example (Contd.)
Readjust centers
K-means: Example (Contd.)
Assign points to the nearest center
K-means: Example (Contd.)
Readjust centers
K-means: Example (Contd.)
Assign points to the nearest center
K-means: Example (Contd.)
Readjust centers
K-means: Example (Contd.)
Assign points to the nearest center
Optimal Number of Clusters
If you plot k against the SSE (sum of squared errors), you will see that the error decreases as k increases: as the number of clusters grows, the clusters become smaller, so the distortion is also smaller. The goal of the elbow method is to choose the k at which the SSE stops decreasing abruptly.
[Figure: Elbow plot of the objective function value (i.e., distortion) against k.]
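One common way to draw the elbow plot is with scikit-learn, whose inertia_ attribute is the SSE (distortion) of a fitted model; the synthetic three-blob data here is an illustrative assumption:

Code
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

# Synthetic data with three natural groups (illustrative)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(100, 2))
               for c in [(0, 0), (5, 5), (0, 5)]])

sse = []
ks = range(1, 10)
for k in ks:
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    sse.append(model.inertia_)  # inertia_ = sum of squared distances (SSE)

plt.plot(ks, sse, marker='o')
plt.xlabel('k')
plt.ylabel('Distortion (SSE)')
plt.title('Elbow Plot')
plt.show()  # choose k at the "elbow", where the SSE stops dropping sharply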
Assisted Practice
K-means Clustering Duration: 15 mins.
Problem Statement: Lithionpower is the largest provider of electric vehicle (e-vehicle) batteries.
It provides batteries on a rental model to e-vehicle drivers. Drivers typically rent a battery for a day and then replace it with a charged battery from the company.
Lithionpower has a variable pricing model based on each driver's driving history. Battery life depends on factors such as overspeeding, distance driven per day, etc.
Objective:
• Create a cluster model where drivers can be grouped together based on the driving data.
• Group the data points so that drivers can be incentivized based on their cluster.
Access: Click on the Labs tab on the left side panel of the LMS. Copy or note the username and password
that are generated. Click on the Launch Lab button. On the page that appears, enter the username and
password in the respective fields, and click Login.
Unassisted Practice
K-means Clustering Duration: 10 mins.
Problem Statement: Consider the image “tiger.png”. Use k-means clustering with k set to 16 to cluster the image, which means keeping just 16 colors in the compressed image.
Objective: Open and display the image “tiger.png”. Convert the image into a NumPy array so that it can be used in further processing. Find the dimensions of the image and convert it into a two-dimensional array (use k-means clustering for image segmentation, reducing the image to 16 colors).
Note: This practice is not graded. It is only intended for you to apply the knowledge you have gained to solve real-world
problems.
Access: Click on the Labs tab on the left side panel of the LMS. Copy or note the username and password that are
generated. Click on the Launch Lab button. On the page that appears, enter the username and password in the respective
fields, and click Login.
Step 1: Import Libraries
Code
from sklearn.cluster import KMeans
import numpy as np
from PIL import Image
import matplotlib.pyplot as plt
import matplotlib.image as mpimg
import os
%matplotlib inline
Step 2: Get the Image and its Corresponding RGB Values
Code
# Open the image and convert it to a NumPy array of RGB values
img = Image.open('tiger.png')
img_np = np.asarray(img)
img_np[0:2]
Step 3: Get the Image Dimensions
Code
img_np.shape
To feed this data into the algorithm, you must reshape it into a dataset with 720 × 1280 = 921,600 rows and 3 columns: one row per pixel, one column per RGB channel
Step 4: Reshape the Data
Code
# Flatten the image: one row per pixel, one column per RGB channel
pixels = img_np.reshape(img_np.shape[0] * img_np.shape[1], img_np.shape[2])
pixels.shape
Step 5: Define the K-means Model
Code
# Cluster the pixels into 16 color groups
model = KMeans(n_clusters=16)
model.fit(pixels)
After the model is trained, model.labels_ gives the cluster number assigned to each data point, i.e., each pixel.
model.cluster_centers_ gives the coordinates, i.e., the RGB values, of the 16 cluster centers.
Step 6: Retrieve the Cluster Centers
Code
pixel_centroids = model.labels_           # cluster index for each pixel
cluster_centers = model.cluster_centers_  # RGB value of each cluster center
pixel_centroids
Code
cluster_centers
Step 7: Cluster Assignment
Code
# Replace each pixel with the RGB value of its cluster center
final = np.zeros((pixel_centroids.shape[0], 3))
for cluster_no in range(16):
    final[pixel_centroids == cluster_no] = cluster_centers[cluster_no]
final[0:5]
Step 8: Reshape to Original Dimensions
Code
# Restore the original height x width x 3 image shape
comp_image = final.reshape(img_np.shape[0], img_np.shape[1], 3)
comp_image.shape
Step 9: Convert the Pixel Values to Image
Code
# Convert the pixel array back to an image and save it
comp_image = Image.fromarray(np.uint8(comp_image))
comp_image.save('tiger_compressed.png')
img_1 = mpimg.imread('tiger.png')
img_2 = mpimg.imread('tiger_compressed.png')
Step 10: Original Plot vs. Compressed Image
Code
# Show the original and compressed images side by side
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(20, 20))
ax1.imshow(img_1)
ax1.set_title('Original Image')
ax2.imshow(img_2)
ax2.set_title('Compressed Image')
plt.show()
Key Takeaways
Now, you are able to:
Explain the mechanism of unsupervised learning
Practice different clustering techniques in Python
Knowledge Check
Knowledge Check
1. Can decision trees be used for performing clustering?
a. True
b. False
Knowledge Check
1. Can decision trees be used for performing clustering?
a. True
b. False
The correct answer is a. True
Decision trees can also be used to form clusters in the data, but they often generate natural clusters and are not dependent on any objective function.
Knowledge Check
2. Which of the following can act as a possible termination condition in K-means?
1. A fixed number of iterations.
2. Assigning observations to clusters such that they don’t change between iterations, except for cases with a bad local minimum.
3. Stationary centroids appear between successive iterations.
4. RSS falls below a threshold.
a. 1, 3, and 4
b. 1, 2, and 3
c. 1, 2, and 4
d. All of the above
Knowledge Check
2. Which of the following can act as a possible termination condition in K-means?
1. A fixed number of iterations.
2. Assigning observations to clusters such that they don’t change between iterations, except for cases with a bad local minimum.
3. Stationary centroids appear between successive iterations.
4. RSS falls below a threshold.
a. 1, 3, and 4
b. 1, 2, and 3
c. 1, 2, and 4
d. All of the above
The correct answer is d. All of the above
All of the above options are valid termination conditions for K-means.
Lesson-End Project Duration: 20 mins.
Problem Statement: Open and display the image “dog.jpeg”. The image has to be converted into a NumPy array so that it can be used in further processing. The major challenge is to identify the dominant colors in the image.
[Hint: Refer to the following URL for image processing documentation:
http://omz-software.com/pythonista/docs/ios/PIL.html]
Objective: Use k-means clustering for image segmentation, which will include the following steps:
• Find the dimensions of the image and convert it into a two-dimensional array.
• Use k-means clustering with k set to 3 and cluster the image.
[Hint: Refer to the k-means module of scikit-learn.]
• Predict the cluster label of every pixel in the image and plot it back as an image.
• Find the three dominant colors in the image.
[Hint: The cluster centers should correspond to the three dominant colors.]
Access: Click the Labs tab in the left side panel of the LMS. Copy or note the username and password that are
generated. Click the Launch Lab button. On the page that appears, enter the username and password in the
respective fields and click Login.
Thank You
© Simplilearn. All rights reserved.