Pattern Recognition Lecture Bayes Decision Theory: Prof. Dr. Marcin Grzegorzek
[Figure: the pattern recognition pipeline: patterns → sensor → feature generation → feature selection → classifier design → system evaluation]
Overview

1 Introduction
2 Bayes Decision Theory
3 Discriminant Functions and Decision Surfaces
4 Bayesian Classification for Normal Distributions
5 Estimation of Unknown Probability Density Functions
Statistical Classification - Problem Statement

Probability P
is a real number in the range [0, 1] describing the probability of an event.

Density p
is a value of a function¹ p(x) describing the distribution of the random variable x.

If the random variable takes only discrete values, the densities become probabilities!

¹ This function is often referred to as pdf - probability density function.
A Priori Probability vs. A Posteriori Probability

A priori probability - probability before classification
• How probable is a particular class ωi before the pattern x has been observed?
• Answer: P(ωi)

A posteriori probability - probability after classification
• How probable is a particular class ωi for a pattern x after applying a statistical classification algorithm?
• Answer: P(ωi|x)
Likelihood Density Function

• How are feature vectors x distributed in a class ωi?
• Answer: p(x|ωi)
• p(x|ωi) is the likelihood function of ωi with respect to x
• p(x|ωi) can be trained from examples
Bayes Decision Theory for a Two-Class Problem

Known
Classes: {ω1, ω2}
A priori probabilities: P(ω1) and P(ω2)
Likelihood density functions: p(x|ω1) and p(x|ω2)
Pattern to be classified: x = [x1, x2, ..., xl]^T

Assumption
The feature vectors can take any value in the l-dimensional feature space: x = [x1, x2, ..., xl]^T ∈ ℝ^l

Unknown
A posteriori probabilities: P(ω1|x) and P(ω2|x)
Computation of the A Posteriori Probability

Using the Bayes Rule

P(ωi|x) = p(x|ωi) P(ωi) / p(x),   i = 1, 2   (1)
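A minimal numeric sketch of Eq. (1), assuming two classes with 1-D Gaussian likelihoods; all parameter values below are illustrative assumptions, not from the lecture:

```python
import numpy as np

priors = np.array([0.5, 0.5])   # P(w1), P(w2)
means = np.array([0.0, 2.0])    # illustrative likelihood parameters
sigmas = np.array([1.0, 1.0])

def gauss(x, mu, sigma):
    """1-D Gaussian density p(x|wi)."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def posteriors(x):
    """Eq. (1): P(wi|x) = p(x|wi) P(wi) / p(x), where p(x) = sum_i p(x|wi) P(wi)."""
    joint = gauss(x, means, sigmas) * priors
    return joint / joint.sum()   # normalising by the evidence p(x)

print(posteriors(0.5))           # the two posteriors sum to 1
```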
Considering the Bayes Rule (Eq. 1):

If p(x|ω1)P(ω1)/p(x) > p(x|ω2)P(ω2)/p(x), x is classified to ω1

If p(x|ω1)P(ω1)/p(x) < p(x|ω2)P(ω2)/p(x), x is classified to ω2
Bayes Classification Rule (3)

• p(x) can be disregarded, because it is the same for all classes
• We are done, since the likelihood density functions p(x|ω1) and p(x|ω2) are assumed to have been trained from examples!
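The resulting two-class rule can be sketched as follows; the Gaussian likelihoods and all parameter values are illustrative assumptions, not from the lecture:

```python
import numpy as np

P = [0.5, 0.5]                       # priors P(w1), P(w2)
mu, sigma = [0.0, 2.0], [1.0, 1.0]   # per-class likelihood parameters

def lik(x, i):
    """Likelihood p(x|wi) for a 1-D Gaussian class model."""
    return np.exp(-0.5 * ((x - mu[i]) / sigma[i]) ** 2) / (sigma[i] * np.sqrt(2 * np.pi))

def classify(x):
    # p(x) cancels, so it suffices to compare p(x|wi) P(wi) directly
    return 1 if lik(x, 0) * P[0] >= lik(x, 1) * P[1] else 2

print(classify(0.1), classify(1.9))   # -> 1 2
```

With equal priors and equal variances, this rule simply assigns x to the class whose mean is nearer.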
Classification Error Probability

[Figure: likelihood densities p(x|ω1) and p(x|ω2) over x; the threshold x0 separates the decision regions R1 and R2]

Error Probability (for equal a priori probabilities):

Pe = (1/2) ∫_{-∞}^{x0} p(x|ω2) dx + (1/2) ∫_{x0}^{∞} p(x|ω1) dx
Classification Error Probability in General

• A priori probabilities are not equal: P(ω1) ≠ P(ω2)
• Feature vectors have more than one dimension: l > 1, x = [x1, x2, ..., xl]^T
• General form:

Pe = P(ω1) ∫_{R2} p(x|ω1) dx + P(ω2) ∫_{R1} p(x|ω2) dx
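The general form can be checked numerically for two 1-D Gaussian classes; the priors and class parameters below are illustrative assumptions:

```python
import numpy as np

P1, P2 = 0.6, 0.4            # unequal a priori probabilities
mu1, mu2, s = -1.0, 1.0, 1.0 # illustrative class means, shared std

x = np.linspace(-10.0, 10.0, 200001)
dx = x[1] - x[0]

def pdf(x, mu):
    """Gaussian likelihood p(x|wi)."""
    return np.exp(-0.5 * ((x - mu) / s) ** 2) / (s * np.sqrt(2 * np.pi))

# Bayes decision regions: R1 where P1 p(x|w1) >= P2 p(x|w2), R2 elsewhere
in_R1 = P1 * pdf(x, mu1) >= P2 * pdf(x, mu2)

# Pe = P1 * integral over R2 of p(x|w1) + P2 * integral over R1 of p(x|w2)
Pe = P1 * np.sum(pdf(x, mu1)[~in_R1]) * dx + P2 * np.sum(pdf(x, mu2)[in_R1]) * dx
print(round(Pe, 4))
```

For these parameters the analytic value is roughly 0.15; the grid integration reproduces it to a few decimals.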
If the a priori probabilities are equal: P(ω1) = P(ω2)

If p(x|ω2) > p(x|ω1) · λ12/λ21, x is classified to ω2

If p(x|ω2) < p(x|ω1) · λ12/λ21, x is classified to ω1
Discriminant Functions

gi(x) ≡ f(P(ωi|x))

• f(·) is a monotonically increasing function
• gi(x) is known as a discriminant function
• The decision test is now stated as: classify x to ωi if gi(x) > gj(x) for all j ≠ i
Assumption

The likelihood density functions are l-dimensional normal distributions:

p(x|ωi) = 1 / ((2π)^(l/2) |Σi|^(1/2)) · exp(-(1/2)(x - µi)^T Σi^(-1) (x - µi))

• This “monster” will be denoted by

p(x|ωi) = N(µi, Σi),   i = 1, 2, ..., M
Discriminant Function f(·) = ln(·)

gi(x) = ln(p(x|ωi)P(ωi)) = ln p(x|ωi) + ln P(ωi)

Considering the “monster”, this becomes

gi(x) = -(1/2)(x - µi)^T Σi^(-1) (x - µi) + ln P(ωi) + ci   (2)

• where ci = -(l/2) ln 2π - (1/2) ln |Σi|
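Eq. (2) translates directly into code; the parameter values in the check below are illustrative assumptions, not from the lecture:

```python
import numpy as np

def g(x, mu, Sigma, prior):
    """Discriminant of Eq. (2): quadratic term + ln P(wi) + ci."""
    l = len(mu)
    Sinv = np.linalg.inv(Sigma)
    d = x - mu
    ci = -0.5 * l * np.log(2 * np.pi) - 0.5 * np.log(np.linalg.det(Sigma))
    return -0.5 * d @ Sinv @ d + np.log(prior) + ci

# Illustrative check at x = mu: the quadratic term vanishes, leaving ln P(wi) + ci
val = g(np.zeros(2), np.zeros(2), np.eye(2), 0.5)
print(val)   # ln(0.5) - ln(2*pi) for l = 2 and Sigma = I
```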
Quadrics as Decision Curves

The decision curves gi(x) - gj(x) = 0 are quadrics (i.e., ellipsoids, parabolas, hyperbolas, pairs of lines).

[Figure: (a), (b) two examples of quadric decision curves in the (x1, x2) plane]
Decision Hyperplanes

If the covariance matrices are equal for all classes, Σi = Σ, the quadratic term x^T Σ^(-1) x is the same in all discriminant functions.

• Thus, the quadric term can be disregarded in the decision surface equations. The same is true for the constant ci
• The simplified version of the discriminant function is just a linear function:

gi(x) = wi^T x + wi0

where

wi = Σ^(-1) µi   and   wi0 = ln P(ωi) - (1/2) µi^T Σ^(-1) µi
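A sketch of the linear discriminant for a shared covariance matrix; Σ, the class means, and the priors below are illustrative assumptions:

```python
import numpy as np

Sigma = np.array([[1.0, 0.2], [0.2, 1.0]])   # shared covariance
Sinv = np.linalg.inv(Sigma)
mus = [np.array([0.0, 0.0]), np.array([2.0, 2.0])]
priors = [0.5, 0.5]

def g(x, i):
    w = Sinv @ mus[i]                                      # wi = Sigma^-1 mu_i
    w0 = np.log(priors[i]) - 0.5 * mus[i] @ Sinv @ mus[i]  # wi0
    return w @ x + w0

x = np.array([0.4, 0.3])
print(1 + int(g(x, 1) > g(x, 0)))   # -> 1 (x lies closer to the first mean)
```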
Minimum Distance Classifiers

If, in addition, the classes are equiprobable and Σ = σ²I, the linear discriminant reduces to a distance comparison.

• Thus, a feature vector x is assigned to a class bi according to its Euclidean distance to the respective mean points µi:

bi = argmax_i gi(x) = argmin_i ||x - µi||
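The minimum-distance rule in a few lines; the class means are illustrative assumptions:

```python
import numpy as np

# Illustrative class means mu_1, mu_2, mu_3 (equal priors, Sigma = sigma^2 I)
means = np.array([[0.0, 0.0], [3.0, 0.0], [0.0, 3.0]])

def classify(x):
    d = np.linalg.norm(means - x, axis=1)   # Euclidean distances ||x - mu_i||
    return int(np.argmin(d)) + 1            # 1-based class index b_i

print(classify(np.array([2.5, 0.4])))   # -> 2
```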
Problem Statement

• In practice, the likelihood density functions have to be estimated from the available training data.
• Here, two estimation methods will be considered, namely
  • Maximum Likelihood Parameter Estimation
  • Maximum a Posteriori Probability Estimation
Maximum Likelihood Parameter Estimation (1)

• The feature vectors are assumed to be distributed according to p(x|ωi), i = 1, 2, ..., M.
• The likelihood functions are assumed to be given in a parametric form. The statistical parameters for the classes ωi form vectors θi which are unknown:

p(x|ωi) = p(x|ωi; θi)
Maximum Likelihood Parameter Estimation (2)

• Given N statistically independent training feature vectors X = {x1, x2, ..., xN}, we can form the joint density function

p(X; θ) = p(x1, x2, ..., xN; θ) = ∏_{k=1}^{N} p(xk; θ)

• The ML method estimates θ so that the likelihood function takes its maximum value:

θ̂_ML = argmax_θ ∏_{k=1}^{N} p(xk; θ)
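For a 1-D Gaussian p(x; θ) with θ = (µ, σ²), the maximiser of ∏ p(xk; θ) has a closed form: the sample mean and the biased (1/N) sample variance. A sketch with synthetic, illustrative data:

```python
import numpy as np

# Draw illustrative training samples from a known Gaussian
rng = np.random.default_rng(0)
X = rng.normal(loc=2.0, scale=1.5, size=10000)

mu_ml = X.mean()                    # ML estimate of mu
var_ml = ((X - mu_ml) ** 2).mean()  # ML estimate of sigma^2 (1/N, not 1/(N-1))
print(mu_ml, var_ml)                # close to the true 2.0 and 1.5^2 = 2.25
```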
Maximum Likelihood Parameter Estimation (3)

• A necessary condition for a maximum is a vanishing gradient:

∂ ∏_{k=1}^{N} p(xk; θ) / ∂θ = 0

• Due to the monotonicity of the logarithmic function, we can also use the log-likelihood function:

L(θ) = ln ∏_{k=1}^{N} p(xk; θ)

• Looking for the maximum here, we have

∂L(θ)/∂θ = ∑_{k=1}^{N} ∂ ln p(xk; θ)/∂θ = ∑_{k=1}^{N} (1/p(xk; θ)) · ∂p(xk; θ)/∂θ = 0
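As a concrete instance of this condition, take a 1-D Gaussian with known variance: then ∂L/∂µ = ∑k (xk - µ)/σ², which vanishes exactly at the sample mean. The data values below are illustrative:

```python
import numpy as np

X = np.array([1.0, 2.0, 2.5, 4.0, 0.5])   # illustrative training samples
sigma2 = 1.0                               # known variance

def dL_dmu(mu):
    """Gradient of the Gaussian log-likelihood with respect to mu."""
    return np.sum((X - mu) / sigma2)

print(abs(dL_dmu(X.mean())) < 1e-12)   # -> True: gradient is zero at the ML estimate
```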
Maximum a Posteriori Probability Estimation