1 Introduction

Pattern Recognition and Machine Learning
Dr Suresh Sundaram
[email protected]
Let's get started
• Person identification systems -> biometrics, Aadhaar
Human Perception
• How did we learn the letters of the English alphabet?
• We trained ourselves to recognize letters, so that given a new letter, we use our memory / intelligence to recognize it.
Machine Perception
• How about providing such capabilities to machines, so that they can recognize letters?
• The field of pattern recognition does exactly that.
Idea
• Build a machine that can recognize patterns:
– Speech recognition
– Fingerprint identification
– OCR (Optical Character Recognition)
– DNA sequence identification
A basic PR framework
• Training samples
• Testing samples
• An algorithm for recognizing an unknown test sample
• Samples are labeled (supervised learning)
Typical supervised PR problem
• Letters of the English alphabet – 26 in number (upper case)
• # of classes to recognize – 26
• Collect samples of each of the 26 letters and train using an algorithm.
• Once trained, test the system using an unknown test sample / letter.
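A minimal sketch of this train/test loop, assuming hypothetical flattened letter images as feature vectors; scikit-learn's k-nearest-neighbour classifier stands in for "an algorithm":

```python
# Minimal sketch of the supervised PR workflow described above.
# X_train holds hypothetical flattened letter images (one row per sample),
# y_train holds the 26 class labels 'A'..'Z'.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X_train = rng.random((260, 64))          # 10 samples per letter, 64 features each
y_train = np.repeat(list("ABCDEFGHIJKLMNOPQRSTUVWXYZ"), 10)

clf = KNeighborsClassifier(n_neighbors=3)
clf.fit(X_train, y_train)                # training phase

x_test = rng.random((1, 64))             # an unknown test sample
print(clf.predict(x_test))               # predicted letter class
```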
Basics
So what's a pattern?
A pattern is an entity, vaguely defined, that could be given a name, e.g.,
• fingerprint image,
• handwritten word,
• human face,
• speech signal,
• DNA sequence,
• letter of an alphabet.
Handwriting Recognition
[Figure: a machine-printed document vs. an input handwritten document]
Handwriting recognition
Face recognition
Fingerprint recognition
Other Applications
• Object classification
• Signature verification (genuine vs. forgery)
• Iris recognition
• Writer adaptation
• Speaker recognition
• Bioinformatics (gene classification)
• Communication system design
• Medical image processing
Pattern Recognition Algorithms
• A bag of algorithms that can be used to provide some intelligence to a machine.
• These algorithms have a solid probabilistic framework.
• Algorithms work on certain characteristics defining a class, referred to as 'features'.
What is a feature?
• Features across classes need to be discriminative for better classification performance.
Pattern 'l' vs. pattern 'i':
• The presence of a dot in 'i' can distinguish 'i' from 'l', and is thus a feature.
• Feature values can be discrete or continuous (floating-point) in nature.
• In practice, a single feature may not suffice for discrimination.
Feature selection
• In practice, a single feature may not suffice for discrimination.
• A possible solution is to look for many features and select a subset (possibly with feature selection algorithms). The goal is to improve the recognition performance on unseen test data.
• The selected features can be represented with a vector, called a 'feature vector'.
Dimension of a feature vector
• Suppose we select d features; we can represent them with a d-dimensional feature vector.
• The pixels of an image of size M × N can be represented with an MN × 1 feature vector.
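As a small illustration (assuming a hypothetical 28 × 28 grayscale image), the flattening can be done with NumPy:

```python
import numpy as np

M, N = 28, 28                      # hypothetical image size M x N
img = np.zeros((M, N))             # a grayscale image
x = img.reshape(M * N, 1)          # MN x 1 pixel feature vector
print(x.shape)                     # (784, 1)
```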
Feature selection
• Domain knowledge helps in extracting features.
• Feature discriminability measures, such as Fisher scores, are available to measure the effectiveness of features.
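One common form of the Fisher score for a single feature and two classes is (mean1 - mean2)^2 / (var1 + var2); a small sketch on hypothetical feature values:

```python
import numpy as np

def fisher_score(x1, x2):
    """Fisher discriminability of one feature for two classes:
    (mean1 - mean2)^2 / (var1 + var2). Larger => more discriminative."""
    return (x1.mean() - x2.mean()) ** 2 / (x1.var() + x2.var())

rng = np.random.default_rng(0)
f_class1 = rng.normal(0.0, 1.0, 100)   # feature values for class 1
f_class2 = rng.normal(3.0, 1.0, 100)   # feature values for class 2
print(fisher_score(f_class1, f_class2))
```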
List of features used in the literature
• Pixels in an image
• Edge-based features in an image
• Transformed coefficients:
– DFT (shape description)
– DCT (compression)
– Wavelets (palm-print recognition)
– KLT / PCA (face recognition)
– Gabor (texture classification, script identification)
– MFCCs (speech systems)
Features
• Features should be discriminative.
• Features are specific to applications; there is no universal feature for all pattern recognition problems (the Ugly Duckling Theorem).
• Features should be robust to translation, rotation, occlusion, and scaling.
Features
• Continuous, real-valued
• Discrete
• Binary
• Mixed
Features
Curse of dimensionality
• If limited data is available, too many features may degrade the performance. We need a large number of training samples for better generalization, i.e., to beat the 'curse of dimensionality'!
• The need thus arises for techniques such as PCA to pick the 'relevant features'.
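A hedged sketch of such dimensionality reduction with scikit-learn's PCA, on hypothetical data:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.random((100, 50))        # 100 samples, 50 raw features

pca = PCA(n_components=10)       # keep the 10 directions of largest variance
X_reduced = pca.fit_transform(X) # 100 x 10: fewer features to estimate from limited data
print(X_reduced.shape, pca.explained_variance_ratio_.sum())
```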
Basic Pattern Recognition
• "Sorting incoming fish on a conveyor according to species using optical sensing"
• Species: sea bass vs. salmon
• Problem Analysis
– Set up a camera and take some sample images to extract features:
• Length
• Lightness
• Width
• Number and shape of fins
• Position of the mouth, etc.
– This is the set of all suggested features to explore for use in our classifier!
• Preprocessing
– Use a segmentation operation to isolate the fish from one another and from the background.
– Information from a single fish is sent to a feature extractor, whose purpose is to reduce the data by measuring certain features.
– The features are then passed to a classifier.
• Classification
– Select the length of the fish as a possible feature for discrimination.
– The length alone is a poor feature!
– Select the lightness as a possible feature.
– Adopt the lightness and add the width of the fish as a second feature:
Fish xT = [x1, x2], where x1 = lightness and x2 = width.
• We might add other features that are not correlated with the ones we already have. A precaution should be taken not to reduce the performance by adding such "noisy features".
• Ideally, the best decision boundary should be the one which provides optimal performance, such as in the following figure.
• Prefer simple models over complicated ones: Occam's razor.
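A toy sketch of the fish classifier on hypothetical [lightness, width] feature vectors, using a simple linear model in the spirit of Occam's razor (the species means and spreads below are made up):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
# Hypothetical [lightness, width] measurements for the two species.
salmon   = rng.normal([3.0, 4.0], 0.5, size=(50, 2))
sea_bass = rng.normal([6.0, 6.0], 0.5, size=(50, 2))
X = np.vstack([salmon, sea_bass])
y = np.array(["salmon"] * 50 + ["sea bass"] * 50)

clf = LogisticRegression().fit(X, y)   # a simple (Occam-friendly) linear boundary
print(clf.predict([[4.0, 4.5]]))       # classify a new fish x = [x1, x2]
```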
• Sensing

– Use of a transducer (camera or microphone)

• Segmentation and grouping

– Patterns should be well separated and should not overlap
• Feature extraction
– Discriminative features
– Invariant features with respect to translation, rotation and
scale.

• Classification
– Use a feature vector provided by a feature extractor to
assign the object to a category

• Post Processing
– Exploit context (input-dependent information other than that from the target pattern itself) to improve performance
The Design Cycle

• Data collection
• Feature Choice
• Model Choice
• Training
• Evaluation
• Computational Complexity
• Data Collection
– How do we know when we have collected an adequately large and representative set of examples for training and testing the system?
• Feature Choice
– Depends on the characteristics of the problem domain. Features should be simple to extract, invariant to irrelevant transformations, and insensitive to noise.
• Model Choice
– Unsatisfied with the performance of our fish classifier, we may want to jump to another class of model.
• Training
– Use the data to determine the classifier. Many different procedures exist for training classifiers and choosing models.
• Evaluation
– Measure the error rate (or performance) and switch from one set of features to another.
• Computational Complexity
– What is the trade-off between computational ease and performance?
– (How does an algorithm scale as a function of the number of features, patterns, or categories?)
Learning paradigms
• Supervised learning
– A teacher provides a category label or cost for each pattern in the training set
• Unsupervised learning
– The system forms clusters or "natural groupings" of the input patterns
Unsupervised Learning
• The system forms clusters or "natural groupings" of the input patterns.
• Clustering is often called an unsupervised learning task, as no class values denoting an a priori grouping of the data instances are given.
[Figure: segmentation of an image into k clusters by a popular iterative algorithm, the k-Means algorithm; the original image vs. the segmented image using k-Means clustering (k = 3)]
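A minimal sketch of such k-Means segmentation (k = 3) on a hypothetical RGB image, treating each pixel's colour as one sample:

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical RGB image as an H x W x 3 array; each pixel is one sample.
rng = np.random.default_rng(0)
img = rng.random((60, 80, 3))
pixels = img.reshape(-1, 3)

km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(pixels)
# Replace every pixel by its cluster centre -> the k = 3 segmented image.
segmented = km.cluster_centers_[km.labels_].reshape(img.shape)
print(segmented.shape)
```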
Reinforcement learning
• Reinforcement learning is an area of machine
learning inspired by behaviorist psychology,
concerned with how software agents ought to
take actions in an environment so as to
maximize some notion of cumulative reward.
Semi-supervised learning
• Semi-supervised learning is a class of supervised
learning tasks and techniques that also make use
of unlabeled data for training - typically a small
amount of labeled data with a large amount of
unlabeled data.

• It falls between unsupervised learning (without any labeled training data) and supervised learning (with completely labeled training data).
Regression
• Similar to a curve-fitting problem: fitting a function to a set of points.

Classifier
• Decision surface: division of the feature space into distinct regions by decision surfaces.
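A small sketch of regression as curve fitting, using NumPy's least-squares polynomial fit on hypothetical noisy points:

```python
import numpy as np

# Fit a degree-2 curve to noisy points: regression as curve fitting.
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 30)
y = 2 * x**2 - x + 0.1 * rng.standard_normal(30)

coeffs = np.polyfit(x, y, deg=2)     # least-squares fit of the curve
y_hat = np.polyval(coeffs, 0.5)      # predict at a new input
print(coeffs, y_hat)
```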
Empirical Risk Minimization
• Every classifier / regressor performs what is called 'empirical risk minimization'.
• Learning pertains to coming up with an architecture that can minimize a risk / loss function defined on the training / empirical data.
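A bare-bones sketch of empirical risk minimization: gradient descent on the mean squared loss of a linear model over hypothetical training data:

```python
import numpy as np

# Minimise the empirical risk: the mean squared loss of a linear model
# w.x + b over the training (empirical) data.
rng = np.random.default_rng(0)
X = rng.random((100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.3

w, b, lr = np.zeros(3), 0.0, 0.1
for _ in range(500):
    err = X @ w + b - y                 # residuals on the training data
    w -= lr * 2 * X.T @ err / len(y)    # gradient of the empirical risk w.r.t. w
    b -= lr * 2 * err.mean()            # ... and w.r.t. b
print(w, b)
```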
No-free-lunch theorem
• "There ain't no such thing as a free lunch" -- it is impossible to get something for nothing!
• In view of the no-free-lunch theorem, it seems that one cannot hope for a classifier that would perform best on all possible problems that one could imagine.
Classifier taxonomy
• Generative classifiers
• Discriminative classifiers
• Types of generative classifiers:
[a] Parametric
[b] Non-parametric
Generative classifier
• Samples of training data of a class are assumed to come from a probability density function (the class-conditional pdf).
• If the form of the pdf is assumed, such as uniform, Gaussian, Rayleigh, etc., one can estimate the parameters of the distribution.
• => Parametric classifier
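A minimal parametric sketch: assuming the class-conditional pdf is Gaussian, estimate its mean and variance from (hypothetical) training samples of one class:

```python
import numpy as np

# Parametric generative sketch: assume the class-conditional pdf is
# Gaussian and estimate its parameters from the class's training samples.
rng = np.random.default_rng(0)
samples = rng.normal(5.0, 2.0, 1000)    # hypothetical samples of one class

mu_hat = samples.mean()                 # maximum-likelihood estimate of the mean
var_hat = samples.var()                 # maximum-likelihood estimate of the variance

def class_conditional_pdf(x):
    """Estimated Gaussian class-conditional density p(x | class)."""
    return np.exp(-(x - mu_hat) ** 2 / (2 * var_hat)) / np.sqrt(2 * np.pi * var_hat)

print(mu_hat, var_hat, class_conditional_pdf(5.0))
```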
Class-conditional density: a pdf built using infinite samples of a given pattern / class.
[Figure: two pdfs corresponding to the two classes w1 and w2; the feature x, 'brightness', is used to construct the pdfs.]
Generative classifier
• One can as well use the training data itself to build a pdf => non-parametric approach.
• Discriminative classifier => no such assumption of the data being drawn from an underlying pdf. Models the decision boundary directly, e.g., via adaptive gradient-descent techniques.
Discriminative Classifier
• Start with initial weights that define the decision surface.
• Update the weights based on some optimization criterion.
• No need to model the distribution of samples of a given class; the class-conditional density concept is not required!
• Neural nets (such as the MLP and the single-layer perceptron) and SVMs fall in the category of discriminative classifiers.
Discriminative classifier
Linearly separable data:
w1x1 + w2x2 + b > 0 on one side
w1x1 + w2x2 + b < 0 on the other
Separating line: w1x1 + w2x2 + b = 0
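A sketch of such weight updates, using the classic perceptron rule on hypothetical linearly separable 2-D data:

```python
import numpy as np

# Perceptron-style sketch: start with initial weights, then update them
# whenever a sample falls on the wrong side of w1*x1 + w2*x2 + b = 0.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 0.5, (20, 2)), rng.normal(2, 0.5, (20, 2))])
y = np.array([-1] * 20 + [1] * 20)      # hypothetical separable toy data

w, b = np.zeros(2), 0.0
for _ in range(100):                    # epochs
    for xi, yi in zip(X, y):
        if yi * (w @ xi + b) <= 0:      # misclassified?
            w += yi * xi                # move the decision line
            b += yi
print(w, b)
```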
Non-linearly separable data
Cover's Theorem
• The theorem states that given a set of training data that is not linearly separable, one can transform it into a training set that is linearly separable by mapping it into a possibly higher-dimensional space via some non-linear transformation.
Cover's Theorem
[Figure: the samples of the original data are in 2D; after a non-linear transformation, the data becomes linearly separable in three dimensions, as shown in (b).]
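A small numerical illustration of Cover's idea: points inside vs. outside a circle are not linearly separable in 2D, but lifting them with the non-linear map (x1, x2) -> (x1, x2, x1^2 + x2^2) makes a separating plane possible (the data here is synthetic):

```python
import numpy as np

# 2D points inside vs outside the unit circle: not linearly separable.
rng = np.random.default_rng(0)
X = rng.uniform(-2, 2, (200, 2))
y = (X[:, 0] ** 2 + X[:, 1] ** 2 > 1).astype(int)      # 1 = outside the circle

# Lift to 3D with a non-linear transformation.
Z = np.column_stack([X, X[:, 0] ** 2 + X[:, 1] ** 2])
# In the lifted space the plane z3 = 1 separates the classes exactly:
print(np.all((Z[:, 2] > 1) == (y == 1)))               # True
```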
Evaluation Metric
• Consider a scenario wherein a patient is screened for a disease.
Yes: Healthy
No: Diseased
TP: True Positive
FN: False Negative
TN: True Negative
FP: False Positive
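From these four counts, the usual screening metrics follow directly; a tiny sketch with hypothetical values:

```python
# Deriving common metrics from the four confusion-matrix counts
# (hypothetical values for a disease-screening test).
TP, FN, TN, FP = 90, 10, 85, 15

sensitivity = TP / (TP + FN)          # recall / true-positive rate
specificity = TN / (TN + FP)          # true-negative rate
accuracy = (TP + TN) / (TP + FN + TN + FP)
print(sensitivity, specificity, accuracy)
```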
