100% found this document useful (1 vote)
74 views41 pages

PR01

Pattern recognition involves extracting features from input data to classify it into categories, with applications including image recognition, speech recognition, and bioinformatics. Key aspects of pattern recognition systems include representation of patterns through feature extraction, choosing appropriate models and features for classification, training and evaluating classifiers, and addressing issues of computational complexity and generalization. The design of pattern recognition systems involves collecting data, selecting features, choosing models, training classifiers, evaluating performance, and addressing complexity.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
74 views41 pages

PR01

Pattern recognition involves extracting features from input data to classify it into categories, with applications including image recognition, speech recognition, and bioinformatics. Key aspects of pattern recognition systems include representation of patterns through feature extraction, choosing appropriate models and features for classification, training and evaluating classifiers, and addressing issues of computational complexity and generalization. The design of pattern recognition systems involves collecting data, selecting features, choosing models, training classifiers, evaluating performance, and addressing complexity.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 41

Pattern Recognition

Text Books
• Pattern Recognition
• S. Theodoridis & K. Koutrumbas

• Pattern Classification
• R. Duda et al.

• Pattern Recognition Statistical, Structural and Neural


Approaches

• R. Shalkoff
Pattern Recognition:
What is it?
The act of taking as input sensed
data(measurements)
and taking an action
based on the “category” or “class”
ofthe pattern.
What It Does
• Build a machine that can recognize patterns:

• The task: Assign unknown objects – patterns – into the


correct class. This is known as classification.
What It Does
• Areas:
– Machine vision  Image Data Base retrieval
– Character recognition (OCR)  Data mining
 Biometrics
– Computer aided diagnosis
 Fingerprint identification
– Speech recognition
 Iris Recognition
– Face recognition
 DNA sequence identification
– Bioinformatics
Representation of patterns
• Features:
• measurable quantities from the patterns
• determines the classification task
• Feature vectors: A number of features

x1 ,..., xl ,

constitute the feature vector


x   x1 ,..., xl   R l
T

Feature vectors are treated as random vectors.


Example 1:
Example 1:
Issues in Pattern Recognition
• How are features generated?
• What is the best number of features?
• How are they used to design a classifier?
• How good is the classifier?
Example 2

• “Sorting incoming Fish on a conveyor


according to species using optical sensing”

Sea bass
Species
Salmon
• Problem Analysis

– Set up a camera and take some sample images to extract


features

• Length
• Lightness
• Width
• Number and shape of fins
• Position of the mouth, etc…
• Preprocessing
– isolate fishes from one another and from the
background

• Feature Extraction
– send isolated fish image to feature extractor
– it reduces the data too

• Classification
– pass the features to a classifier
• Classification

– Select the length of the fish as a possible feature


for discrimination
x*
The length is a poor feature alone!

Select the lightness as a possible feature.


• Threshold decision boundary and cost relationship

– Move our decision boundary toward smaller values of


lightness in order to minimize the cost (reduce the
number of sea bass that are classified as salmon!)

Task of decision theory


• Adopt the lightness and add the width of the
fish

Fish xT = [x1, x2]

Lightness Width
• adding correlated feature does not improve
anything and is thus redundant

• too many features may lead to curse of


dimensionality
still there are some misclassifications
perhaps the best one, but too complex
decision boundary
• satisfaction is premature
– cause: aim of a classifier is to correctly classify unknown
input

Issue of generalization!
A compromise between training and testing
Pattern Recognition Systems

• Sensing

– Use of a transducer (camera or microphone)


– PR system depends of the bandwidth, the
resolution sensitivity distortion of the transducer

• Segmentation and grouping

– Patterns should be well separated and should not


overlap
• Feature extraction
– Discriminative features
– Invariant features with respect to translation, rotation and
scale.

• Classification
– Use a feature vector provided by a feature extractor to assign
the object to a category

• Post Processing
– error rate
– risk
– use context
The Design Cycle

• Data collection
• Feature Choice
• Model Choice
• Training
• Evaluation
• Computational Complexity
• Data Collection

– How do we know when we have collected an


adequately large and representative set of
examples for training and testing the system?
• Feature Choice

– Depends on the characteristics of the problem


domain.
– Requirement
• simple to extract
• invariant to irrelevant transformation
• insensitive to noise.
• Model Choice

– too many classification models?


– which one is best?
• Training

– Use data to determine the classifier. Many


different procedures for training classifiers and
choosing models
• Evaluation

– Measure the error rate (or performance and


switch from one set of features to another one
• Computational Complexity

– What is the trade-off between computational


ease and performance?
Learning and Adaptation
• Supervised learning

– A teacher provides a category label or cost for


each pattern in the training set

• Unsupervised learning

– The system forms clusters or “natural groupings”


of the input patterns
Unsupervised Learning
Unsupervised Learning
x1

x2
Unsupervised Learning
x1

x2
Unsupervised Learning
x1

x2

You might also like