L05 Unsupervised Learning - Overview
Introduction
• a machine learning technique in which the user does not need to supervise the model
• allows the model to work on its own to discover patterns and information that were previously undetected
• mainly deals with unlabeled data
Unsupervised Learning Overview
Types of Unsupervised Learning Algorithms
• Clustering
• finding a structure or pattern in a collection of uncategorized data
• processes data and finds natural clusters (groups) if they exist in the data
• types: Exclusive (partitioning), Agglomerative, Overlapping, Probabilistic
• Dimension reduction
• finds the variance-maximizing directions onto which to project the data
• uses structural characteristics to simplify data
• Association
• a rule-learning problem that establishes associations among data objects inside large databases
• For example, people who buy a new home are most likely to buy new furniture
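The "variance-maximizing directions" idea behind dimension reduction can be sketched with plain NumPy: center the data and take the top right-singular vector of the data matrix, which is the unit direction whose 1-D projection retains the most variance. The data here is synthetic, made up purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
# synthetic correlated 2-D data: most variance lies along one direction
X = rng.normal(size=(200, 2)) @ np.array([[3.0, 0.0], [1.0, 0.5]])

Xc = X - X.mean(axis=0)                      # center the data
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
direction = Vt[0]                            # variance-maximizing unit direction
projected = Xc @ direction                   # 1-D projection of the data
```

Projecting onto `direction` keeps at least as much variance as projecting onto either original coordinate axis, which is exactly what "simplify data using structural characteristics" means here.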
Some of the Unsupervised Learning Algorithms
• Clustering
• K-means - data points are assigned into K groups, where K represents the number of
clusters, based on the distance from each group’s centroid
• Hierarchical clustering, also known as hierarchical cluster analysis (HCA) - categorized in
two ways; it can be agglomerative or divisive
• Probabilistic models - help us solve density estimation or “soft” clustering problems, where
data points are clustered based on the likelihood that they belong to a particular
distribution
• Dimension reduction
• Principal component analysis (PCA) - to reduce redundancies and to compress datasets
through feature extraction
• Singular value decomposition (SVD) - factorizes a matrix, A, into three low-rank matrices
• Autoencoders leverage neural networks to compress data and then recreate a new
representation of the original data’s input
• Association
• Apriori algorithms – for market basket analyses, leading to different recommendation
engines for music platforms and online retailers
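A minimal sketch of the K-means and PCA algorithms listed above, using scikit-learn on two well-separated synthetic blobs (the data and parameter choices are illustrative assumptions, not from the slides):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
# two synthetic blobs: 50 points near (0, 0) and 50 points near (5, 5)
X = np.vstack([rng.normal(0, 0.5, size=(50, 2)),
               rng.normal(5, 0.5, size=(50, 2))])

# K-means: assign each point to one of K=2 groups by distance to centroids
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
labels = km.labels_                      # cluster assignment per point

# PCA: compress the 2-D data down to its single strongest direction
X1 = PCA(n_components=1).fit_transform(X)
```

With blobs this far apart, K-means recovers the two groups exactly; on real data the cluster count K usually has to be chosen by the analyst.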
Types of Unsupervised Learning
• Dimensionality Reduction - use structural characteristics to simplify data
• Clustering - finding distinct groups
[Figure: high-resolution images are passed through a model to produce compressed images used for prediction]
Applications of Unsupervised Machine
Learning
• Clustering automatically splits the dataset into groups based on
their similarities
• Anomaly detection can discover unusual data points in your
dataset. It is useful for finding fraudulent transactions
• Association mining identifies sets of items which often occur
together in your dataset
• Latent variable models are widely used for data
preprocessing, such as reducing the number of features in a
dataset or decomposing the dataset into multiple
components
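Association mining can be illustrated without any special library: count how often item pairs co-occur across baskets and turn the counts into support values. The baskets below are invented for the example; real Apriori implementations add candidate pruning on top of this counting step.

```python
from collections import Counter
from itertools import combinations

# hypothetical transaction data: each basket is a set of items
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]

# count co-occurrences of every item pair across all baskets
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

# support of a pair = fraction of baskets containing both items
support = {pair: count / len(baskets) for pair, count in pair_counts.items()}
```

Pairs whose support exceeds a chosen threshold become candidate association rules, e.g. "customers who buy bread often also buy milk".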
Disadvantages of Unsupervised
Learning
• cannot get precise information regarding data sorting; the output is
unknown because the data used in unsupervised learning is unlabeled
• results are less accurate because the input data is not labeled by
people in advance, meaning the machine is required to do this itself
• the spectral classes do not always correspond to informational
classes
• the user needs to spend time interpreting and labeling the classes that
follow the classification
• spectral properties of classes can also change over time, so you can't
have the same class information while moving from one image to
another
Pre-processing and Scaling
• data preprocessing and normalization become very
important when implementing different machine learning
algorithms
• since scaling can affect the outcome of the learning model
significantly, it is very important that all features are on the same scale
Types of pre-processing and scaling
• StandardScaler
• ensures that for each feature in the dataset the mean is 0 and the variance is 1, bringing
all features to the same magnitude
• doesn’t ensure any minimum and maximum values for the features
• RobustScaler
• similar to StandardScaler but uses the median and quartiles instead of mean and
variance
• makes the scaler ignore data points that are very different from the rest (e.g., measurement
errors)
• Normalizer
• scales each data point such that the feature vector has a Euclidean length of 1
• every data point is scaled by a different number (the inverse of its length)
• used when only the direction of the data matters, not the length of the feature
vector
• MinMaxScaler
• transforms all the input variables, so they’re all on the same scale between zero and
one
• computes the minimum and maximum values for each feature on the training data,
and then applies the min-max transformation to each feature
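The four scalers above are all available in scikit-learn's `sklearn.preprocessing` module and share the same `fit_transform` interface. A small sketch on made-up data with one deliberate outlier:

```python
import numpy as np
from sklearn.preprocessing import (StandardScaler, MinMaxScaler,
                                   RobustScaler, Normalizer)

# toy data: the value 100.0 is an outlier in the first feature
X = np.array([[1.0, -2.0],
              [2.0,  0.0],
              [3.0,  2.0],
              [100.0, 4.0]])

Xs = StandardScaler().fit_transform(X)   # per-feature mean 0, variance 1
Xm = MinMaxScaler().fit_transform(X)     # per-feature range [0, 1]
Xr = RobustScaler().fit_transform(X)     # median/quartiles, outlier-resistant
Xn = Normalizer().fit_transform(X)       # per-row Euclidean length 1
```

Note that StandardScaler and MinMaxScaler both let the outlier squeeze the other values together, while RobustScaler's quartile-based statistics are largely unaffected by it.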
Source: https://levelup.gitconnected.com/importance-of-data-preprocessing-and-scaling-in-machine-learning-21db1d4377ec
Types of pre-processing and scaling
Source: https://levelup.gitconnected.com/importance-of-data-preprocessing-and-scaling-in-machine-learning-21db1d4377ec
Reference:
• https://www.guru99.com/unsupervised-machine-learning.html
• https://stanford.edu/~shervine/teaching/cs-229/cheatsheet-unsupervised-learning#dimension-reduction
• https://www.ibm.com/cloud/learn/unsupervised-learning
• https://levelup.gitconnected.com/importance-of-data-preprocessing-and-scaling-in-machine-learning-21db1d4377ec
• https://www.analyticsvidhya.com/blog/2016/11/an-introduction-to-clustering-and-different-methods-of-clustering/
• https://www.digitalvidya.com/blog/the-top-5-clustering-algorithms-data-scientists-should-know/