CS3491 - ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
CS3491-AI ML - Chapter 5

INTRODUCTION TO MACHINE LEARNING
CHAPTER 5: Multivariate Methods
Multivariate Data
• Multiple measurements (sensors)
• d inputs/features/attributes: d-variate
• N instances/observations/examples
• The N × d data matrix (rows are instances, columns are attributes):

\mathbf{X} = \begin{bmatrix} X_1^1 & X_2^1 & \cdots & X_d^1 \\ X_1^2 & X_2^2 & \cdots & X_d^2 \\ \vdots & \vdots & \ddots & \vdots \\ X_1^N & X_2^N & \cdots & X_d^N \end{bmatrix}

3
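A minimal numpy sketch of this layout (the data values and the use of numpy are assumptions, not part of the slides): rows are the N instances, columns are the d attributes.

import numpy as np

# Hypothetical data matrix: N = 4 instances (rows), d = 3 attributes (columns)
X = np.array([
    [5.1, 3.5, 1.4],
    [4.9, 3.0, 1.4],
    [6.3, 3.3, 6.0],
    [5.8, 2.7, 5.1],
])
N, d = X.shape    # N instances of d-variate data
print(N, d)       # 4 3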
Multivariate Parameters

Mean: E[\mathbf{x}] = \boldsymbol{\mu} = [\mu_1, \ldots, \mu_d]^T

Covariance: \sigma_{ij} \equiv \mathrm{Cov}(X_i, X_j)

Correlation: \mathrm{Corr}(X_i, X_j) \equiv \rho_{ij} = \frac{\sigma_{ij}}{\sigma_i \sigma_j}

\boldsymbol{\Sigma} \equiv \mathrm{Cov}(\mathbf{X}) = E\left[(\mathbf{X} - \boldsymbol{\mu})(\mathbf{X} - \boldsymbol{\mu})^T\right] = \begin{bmatrix} \sigma_1^2 & \sigma_{12} & \cdots & \sigma_{1d} \\ \sigma_{21} & \sigma_2^2 & \cdots & \sigma_{2d} \\ \vdots & \vdots & \ddots & \vdots \\ \sigma_{d1} & \sigma_{d2} & \cdots & \sigma_d^2 \end{bmatrix}

4
Parameter Estimation

Sample mean \mathbf{m}: \quad m_i = \frac{\sum_{t=1}^{N} x_i^t}{N}, \quad i = 1, \ldots, d

Covariance matrix \mathbf{S}: \quad s_{ij} = \frac{\sum_{t=1}^{N} (x_i^t - m_i)(x_j^t - m_j)}{N}

Correlation matrix \mathbf{R}: \quad r_{ij} = \frac{s_{ij}}{s_i s_j}

5
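A short sketch of these estimators with numpy (the synthetic sample is an assumption; the 1/N normalization matches the slide, whereas np.cov defaults to 1/(N-1)):

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))        # hypothetical sample: N = 100, d = 3

m = X.mean(axis=0)                   # sample mean: m_i = (1/N) sum_t x_i^t
Xc = X - m                           # centered data
S = (Xc.T @ Xc) / X.shape[0]         # covariance matrix S with 1/N normalization
s = np.sqrt(np.diag(S))              # standard deviations s_i
R = S / np.outer(s, s)               # correlation matrix R: r_ij = s_ij / (s_i s_j)
print(R)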
Estimation of Missing Values
• What to do if certain instances have missing attributes?
• Ignore those instances: not a good idea if the sample is small
• Use 'missing' as an attribute: may give information
• Imputation: fill in the missing value
  - Mean imputation: use the most likely value (e.g., the mean)
  - Imputation by regression: predict the missing value based on the other attributes

6
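A minimal sketch of mean imputation (numpy and the toy data are assumptions; NaN marks a missing attribute value):

import numpy as np

X = np.array([[1.0, 2.0],
              [np.nan, 4.0],
              [3.0, np.nan],
              [5.0, 6.0]])                        # hypothetical data with missing entries

col_means = np.nanmean(X, axis=0)                 # per-attribute mean over observed values
X_imputed = np.where(np.isnan(X), col_means, X)   # fill each gap with its column mean
print(X_imputed)

Imputation by regression would instead fit a predictor of the missing attribute on the other attributes and fill in its prediction.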
Multivariate Normal Distribution

\mathbf{x} \sim \mathcal{N}_d(\boldsymbol{\mu}, \boldsymbol{\Sigma})

p(\mathbf{x}) = \frac{1}{(2\pi)^{d/2} |\boldsymbol{\Sigma}|^{1/2}} \exp\left[ -\frac{1}{2} (\mathbf{x} - \boldsymbol{\mu})^T \boldsymbol{\Sigma}^{-1} (\mathbf{x} - \boldsymbol{\mu}) \right]

7
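A sketch that evaluates this density directly from the formula (the parameter values are assumptions; scipy.stats.multivariate_normal.pdf would give the same numbers):

import numpy as np

def mvn_pdf(x, mu, Sigma):
    # Density of x ~ N_d(mu, Sigma), computed from the formula above
    d = len(mu)
    diff = x - mu
    maha = diff @ np.linalg.solve(Sigma, diff)        # (x - mu)^T Sigma^{-1} (x - mu)
    norm = (2 * np.pi) ** (d / 2) * np.sqrt(np.linalg.det(Sigma))
    return np.exp(-0.5 * maha) / norm

mu = np.array([0.0, 0.0])                             # hypothetical mean
Sigma = np.array([[1.0, 0.5],
                  [0.5, 2.0]])                        # hypothetical covariance
print(mvn_pdf(np.array([0.2, -0.1]), mu, Sigma))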
Multivariate Normal Distribution
• Mahalanobis distance: (x - μ)^T Σ^{-1} (x - μ) measures the distance from x to μ in terms of Σ (it normalizes for differences in variances and correlations)
• Bivariate case, d = 2:

\boldsymbol{\Sigma} = \begin{bmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{bmatrix}

p(x_1, x_2) = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1-\rho^2}} \exp\left[ -\frac{1}{2(1-\rho^2)} \left( z_1^2 - 2\rho z_1 z_2 + z_2^2 \right) \right]

where z_i = (x_i - \mu_i)/\sigma_i.

8
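A sketch checking the bivariate identity numerically (all parameter values are assumptions): the matrix form of the Mahalanobis distance equals the z_1, z_2 form above.

import numpy as np

mu = np.array([1.0, 2.0])
s1, s2, rho = 1.0, 2.0, 0.6                    # hypothetical sigma_1, sigma_2, rho
Sigma = np.array([[s1**2,     rho*s1*s2],
                  [rho*s1*s2, s2**2]])

x = np.array([2.0, 1.0])
diff = x - mu
maha = diff @ np.linalg.solve(Sigma, diff)     # (x - mu)^T Sigma^{-1} (x - mu)

z1, z2 = diff[0] / s1, diff[1] / s2            # standardized coordinates
maha_z = (z1**2 - 2*rho*z1*z2 + z2**2) / (1 - rho**2)
print(np.isclose(maha, maha_z))                # True: the two forms agree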
Bivariate Normal
[Figures omitted: bivariate normal density examples]

9-10
Independent Inputs: Naive Bayes
• If the x_i are independent, the off-diagonal entries of Σ are 0, and the Mahalanobis distance reduces to a weighted (by 1/σ_i) Euclidean distance:

p(\mathbf{x}) = \prod_{i=1}^{d} p_i(x_i) = \frac{1}{(2\pi)^{d/2} \prod_{i=1}^{d} \sigma_i} \exp\left[ -\frac{1}{2} \sum_{i=1}^{d} \left( \frac{x_i - \mu_i}{\sigma_i} \right)^2 \right]

• If the variances are also equal, this reduces to the Euclidean distance.

11
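A sketch of the factorization (the parameter values are assumptions): with a diagonal Σ the joint density equals the product of the d univariate normal densities.

import numpy as np

mu = np.array([0.0, 1.0, -1.0])                # hypothetical per-feature means
sigma = np.array([1.0, 0.5, 2.0])              # hypothetical per-feature std. deviations
x = np.array([0.3, 0.8, -2.0])

z = (x - mu) / sigma
p_joint = np.exp(-0.5 * np.sum(z**2)) / ((2*np.pi)**(len(x)/2) * np.prod(sigma))
p_prod = np.prod(np.exp(-0.5 * z**2) / (np.sqrt(2*np.pi) * sigma))
print(np.isclose(p_joint, p_prod))             # True: joint density = product of marginals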
Parametric Classification
• If p(x | C_i) ~ N(μ_i, Σ_i):

p(\mathbf{x} \mid C_i) = \frac{1}{(2\pi)^{d/2} |\boldsymbol{\Sigma}_i|^{1/2}} \exp\left[ -\frac{1}{2} (\mathbf{x} - \boldsymbol{\mu}_i)^T \boldsymbol{\Sigma}_i^{-1} (\mathbf{x} - \boldsymbol{\mu}_i) \right]

• The discriminant functions are

g_i(\mathbf{x}) = \log p(\mathbf{x} \mid C_i) + \log P(C_i)
= -\frac{d}{2} \log 2\pi - \frac{1}{2} \log |\boldsymbol{\Sigma}_i| - \frac{1}{2} (\mathbf{x} - \boldsymbol{\mu}_i)^T \boldsymbol{\Sigma}_i^{-1} (\mathbf{x} - \boldsymbol{\mu}_i) + \log P(C_i)

12
Estimation of Parameters
Given class indicators r_i^t (r_i^t = 1 if x^t belongs to C_i, 0 otherwise):

\hat{P}(C_i) = \frac{\sum_t r_i^t}{N}

\mathbf{m}_i = \frac{\sum_t r_i^t \mathbf{x}^t}{\sum_t r_i^t}

\mathbf{S}_i = \frac{\sum_t r_i^t (\mathbf{x}^t - \mathbf{m}_i)(\mathbf{x}^t - \mathbf{m}_i)^T}{\sum_t r_i^t}

Plugging these into the discriminant (and dropping the constant term):

g_i(\mathbf{x}) = -\frac{1}{2} \log |\mathbf{S}_i| - \frac{1}{2} (\mathbf{x} - \mathbf{m}_i)^T \mathbf{S}_i^{-1} (\mathbf{x} - \mathbf{m}_i) + \log \hat{P}(C_i)

13
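A sketch of these plug-in estimates and the resulting discriminant (the two-class toy data and the numpy usage are assumptions):

import numpy as np

rng = np.random.default_rng(1)
# Hypothetical two-class training set in d = 2 dimensions
X = np.vstack([rng.normal([0.0, 0.0], 1.0, size=(50, 2)),
               rng.normal([3.0, 3.0], 1.5, size=(50, 2))])
y = np.array([0]*50 + [1]*50)

priors, means, covs = [], [], []
for c in np.unique(y):
    Xc = X[y == c]                           # instances with r_i^t = 1
    priors.append(len(Xc) / len(X))          # P_hat(C_i)
    means.append(Xc.mean(axis=0))            # m_i
    D = Xc - Xc.mean(axis=0)
    covs.append(D.T @ D / len(Xc))           # S_i

def g(x, i):
    diff = x - means[i]
    maha = diff @ np.linalg.solve(covs[i], diff)
    return -0.5*np.log(np.linalg.det(covs[i])) - 0.5*maha + np.log(priors[i])

x_new = np.array([1.0, 1.0])
print(np.argmax([g(x_new, i) for i in range(2)]))   # class with the largest g_i(x)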
Different S_i
• Quadratic discriminant:

g_i(\mathbf{x}) = -\frac{1}{2} \log |\mathbf{S}_i| - \frac{1}{2} \left( \mathbf{x}^T \mathbf{S}_i^{-1} \mathbf{x} - 2\mathbf{x}^T \mathbf{S}_i^{-1} \mathbf{m}_i + \mathbf{m}_i^T \mathbf{S}_i^{-1} \mathbf{m}_i \right) + \log \hat{P}(C_i)
= \mathbf{x}^T \mathbf{W}_i \mathbf{x} + \mathbf{w}_i^T \mathbf{x} + w_{i0}

where

\mathbf{W}_i = -\frac{1}{2} \mathbf{S}_i^{-1}

\mathbf{w}_i = \mathbf{S}_i^{-1} \mathbf{m}_i

w_{i0} = -\frac{1}{2} \mathbf{m}_i^T \mathbf{S}_i^{-1} \mathbf{m}_i - \frac{1}{2} \log |\mathbf{S}_i| + \log \hat{P}(C_i)

14
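A sketch that forms W_i, w_i, w_{i0} from assumed values of S_i, m_i, P_hat(C_i) and verifies the rewritten quadratic form against the direct discriminant:

import numpy as np

S = np.array([[2.0, 0.3],
              [0.3, 1.0]])                     # hypothetical S_i
m = np.array([1.0, -1.0])                      # hypothetical m_i
prior = 0.4                                    # hypothetical P_hat(C_i)

S_inv = np.linalg.inv(S)
W = -0.5 * S_inv                               # W_i
w = S_inv @ m                                  # w_i
w0 = -0.5 * m @ S_inv @ m - 0.5*np.log(np.linalg.det(S)) + np.log(prior)

x = np.array([0.5, 0.2])
g_quadratic = x @ W @ x + w @ x + w0
g_direct = (-0.5*np.log(np.linalg.det(S))
            - 0.5*(x - m) @ S_inv @ (x - m) + np.log(prior))
print(np.isclose(g_quadratic, g_direct))       # True: same discriminant value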
[Figure: class likelihoods p(x | C_i) and the posterior P(C_1 | x); the discriminant is where P(C_1 | x) = 0.5]

15
Common Covariance Matrix S
• Shared common sample covariance S:

\mathbf{S} = \sum_i \hat{P}(C_i) \mathbf{S}_i

• The discriminant reduces to

g_i(\mathbf{x}) = -\frac{1}{2} (\mathbf{x} - \mathbf{m}_i)^T \mathbf{S}^{-1} (\mathbf{x} - \mathbf{m}_i) + \log \hat{P}(C_i)

which is a linear discriminant:

g_i(\mathbf{x}) = \mathbf{w}_i^T \mathbf{x} + w_{i0}

where

\mathbf{w}_i = \mathbf{S}^{-1} \mathbf{m}_i, \qquad w_{i0} = -\frac{1}{2} \mathbf{m}_i^T \mathbf{S}^{-1} \mathbf{m}_i + \log \hat{P}(C_i)

16
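A sketch of pooling the class covariances into a shared S and classifying with the linear discriminant (the toy data are assumptions):

import numpy as np

rng = np.random.default_rng(2)
X = np.vstack([rng.normal([0.0, 0.0], 1.0, size=(40, 2)),
               rng.normal([2.0, 2.0], 1.0, size=(60, 2))])   # hypothetical data
y = np.array([0]*40 + [1]*60)

priors, means, covs = [], [], []
for c in np.unique(y):
    Xc = X[y == c]
    priors.append(len(Xc) / len(X))
    means.append(Xc.mean(axis=0))
    D = Xc - Xc.mean(axis=0)
    covs.append(D.T @ D / len(Xc))

S = sum(p * Si for p, Si in zip(priors, covs))   # shared S = sum_i P_hat(C_i) S_i
S_inv = np.linalg.inv(S)

def g(x, i):                                     # linear discriminant w_i^T x + w_i0
    w = S_inv @ means[i]
    w0 = -0.5 * means[i] @ S_inv @ means[i] + np.log(priors[i])
    return w @ x + w0

print(np.argmax([g(np.array([1.0, 1.0]), i) for i in range(2)]))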
Common Covariance Matrix S
[Figure omitted: example with a shared covariance matrix]

17
Diagonal S
• When the x_j, j = 1, ..., d, are independent, Σ is diagonal:

p(\mathbf{x} \mid C_i) = \prod_j p(x_j \mid C_i) \quad \text{(Naive Bayes' assumption)}

g_i(\mathbf{x}) = -\frac{1}{2} \sum_{j=1}^{d} \left( \frac{x_j^t - m_{ij}}{s_j} \right)^2 + \log \hat{P}(C_i)

• Classify based on the weighted Euclidean distance (in s_j units) to the nearest mean.

18
Diagonal S
[Figure omitted: diagonal covariance; the variances may be different]

19
Diagonal S, equal variances
• Nearest mean classifier: classify based on the Euclidean distance to the nearest mean:

g_i(\mathbf{x}) = -\frac{\| \mathbf{x} - \mathbf{m}_i \|^2}{2s^2} + \log \hat{P}(C_i)
= -\frac{1}{2s^2} \sum_{j=1}^{d} \left( x_j^t - m_{ij} \right)^2 + \log \hat{P}(C_i)

• Each mean can be considered a prototype or template, and this is template matching.

20
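A sketch of the nearest mean (template matching) rule; with equal priors and equal variances the class with the closest mean wins (the class means are assumptions):

import numpy as np

means = np.array([[0.0, 0.0],
                  [3.0, 3.0],
                  [0.0, 4.0]])                 # hypothetical class means (templates) m_i

def nearest_mean(x):
    d2 = np.sum((means - x)**2, axis=1)        # squared Euclidean distance ||x - m_i||^2
    return np.argmin(d2)                       # smallest distance = largest g_i(x)

print(nearest_mean(np.array([2.5, 2.0])))      # -> 1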
Diagonal S, equal variances
[Figure omitted]

21
Model Selection

Assumption                    Covariance matrix         Number of parameters
Shared, hyperspheric          S_i = S = s^2 I           1
Shared, axis-aligned          S_i = S, with s_ij = 0    d
Shared, hyperellipsoidal      S_i = S                   d(d+1)/2
Different, hyperellipsoidal   S_i                       K d(d+1)/2

• As we increase complexity (a less restricted S), bias decreases and variance increases.
• Assume simple models (allow some bias) to control variance (regularization).

22
Discrete Features
• Binary features: p_{ij} \equiv p(x_j = 1 \mid C_i)
• If the x_j are independent (Naive Bayes'):

p(\mathbf{x} \mid C_i) = \prod_{j=1}^{d} p_{ij}^{x_j} (1 - p_{ij})^{(1 - x_j)}

• The discriminant is linear:

g_i(\mathbf{x}) = \log p(\mathbf{x} \mid C_i) + \log P(C_i)
= \sum_j \left[ x_j \log p_{ij} + (1 - x_j) \log (1 - p_{ij}) \right] + \log P(C_i)

• Estimated parameters:

\hat{p}_{ij} = \frac{\sum_t x_j^t r_i^t}{\sum_t r_i^t}

23
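A sketch of the binary-feature Naive Bayes classifier (the toy data are assumptions; the +1/+2 smoothing that keeps the logarithms finite is an addition, not on the slide):

import numpy as np

X = np.array([[1, 0, 1],
              [1, 1, 0],
              [0, 0, 1],
              [0, 1, 1]])                      # hypothetical binary features
y = np.array([0, 0, 1, 1])

classes = np.unique(y)
priors = np.array([(y == c).mean() for c in classes])
# p_hat_ij = sum_t x_j^t r_i^t / sum_t r_i^t, here with Laplace smoothing (assumption)
p = np.array([(X[y == c].sum(axis=0) + 1) / ((y == c).sum() + 2) for c in classes])

def g(x, i):
    return np.sum(x*np.log(p[i]) + (1 - x)*np.log(1 - p[i])) + np.log(priors[i])

x_new = np.array([1, 0, 1])
print(classes[np.argmax([g(x_new, i) for i in range(len(classes))])])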
Discrete Features
• Multinomial (1-of-n_j) features: x_j \in \{v_1, v_2, \ldots, v_{n_j}\}

p_{ijk} \equiv p(z_{jk} = 1 \mid C_i) = p(x_j = v_k \mid C_i)

• If the x_j are independent:

p(\mathbf{x} \mid C_i) = \prod_{j=1}^{d} \prod_{k=1}^{n_j} p_{ijk}^{z_{jk}}

g_i(\mathbf{x}) = \sum_j \sum_k z_{jk} \log p_{ijk} + \log P(C_i)

\hat{p}_{ijk} = \frac{\sum_t z_{jk}^t r_i^t}{\sum_t r_i^t}

24
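A sketch of estimating p_hat_ijk for one multinomial feature (the categorical toy data and the +1 smoothing are assumptions): each value is one-hot encoded into the indicators z_jk.

import numpy as np

xj = np.array([0, 1, 1, 2, 0, 2])              # hypothetical feature with n_j = 3 values
y  = np.array([0, 0, 0, 1, 1, 1])
n_values = 3

Z = np.eye(n_values)[xj]                       # one-hot indicators z_jk per instance
classes = np.unique(y)
# p_hat_ijk = sum_t z_jk^t r_i^t / sum_t r_i^t (with +1 smoothing, an assumption)
p = np.array([(Z[y == c].sum(axis=0) + 1) / ((y == c).sum() + n_values) for c in classes])
print(p)                                       # row i, column k: estimate of p(x_j = v_k | C_i)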
Multivariate Regression

r^t = g(\mathbf{x}^t \mid w_0, w_1, \ldots, w_d) + \epsilon

• Multivariate linear model:

g(\mathbf{x}^t) = w_0 + w_1 x_1^t + w_2 x_2^t + \cdots + w_d x_d^t

E(w_0, w_1, \ldots, w_d \mid \mathcal{X}) = \frac{1}{2} \sum_t \left[ r^t - \left( w_0 + w_1 x_1^t + \cdots + w_d x_d^t \right) \right]^2

• Multivariate polynomial model: define new higher-order variables
  z_1 = x_1, z_2 = x_2, z_3 = x_1^2, z_4 = x_2^2, z_5 = x_1 x_2
  and use the linear model in this new z space (basis functions, kernel trick, SVM: Chapter 10).

25
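A sketch of fitting the multivariate linear model by least squares, i.e. minimizing the error function E above (the synthetic data are assumptions; np.linalg.lstsq returns the minimizer of the sum of squared errors):

import numpy as np

rng = np.random.default_rng(3)
X = rng.normal(size=(200, 2))                  # hypothetical inputs, d = 2
r = 1.0 + 2.0*X[:, 0] - 3.0*X[:, 1] + rng.normal(scale=0.1, size=200)

D = np.column_stack([np.ones(len(X)), X])      # augment with a column of 1s for w_0
w, *_ = np.linalg.lstsq(D, r, rcond=None)      # [w_0, w_1, w_2], approx [1, 2, -3]
print(w)

# Polynomial model: add higher-order z variables and reuse the same linear solver
Z = np.column_stack([X, X[:, 0]**2, X[:, 1]**2, X[:, 0]*X[:, 1]])
Dz = np.column_stack([np.ones(len(X)), Z])
wz, *_ = np.linalg.lstsq(Dz, r, rcond=None)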
