
CSCE 970 Lecture 3:
Linear Classifiers

Stephen D. Scott

January 21, 2003

1

Introduction

• Sometimes probabilistic information unavailable or mathematically intractable

• Many alternatives to Bayesian classification, but optimality guarantee may be compromised!

• Linear classifiers use a decision hyperplane to perform classification

• Simple and efficient to train and use

• Optimality requires linear separability of classes

[Figure: points of Class A and Class B separated by a decision line; an unclassified point is labeled by the side of the line it falls on]

2

Linear Discriminant Functions

• Let w = [w1, . . . , wℓ]^T be a weight vector and w0 (a.k.a. θ) be a threshold

• Decision surface is a hyperplane:

  w^T · x + w0 = 0

• E.g. predict ω2 if Σ_{i=1}^ℓ wi xi > w0, otherwise predict ω1

[Figure: a linear threshold unit with inputs 1, x1, . . . , xℓ, weights w0, w1, . . . , wℓ, and sum Σi wi xi; output y(t) = 1 (ω1) if sum > 0, y(t) = 0 (ω2) otherwise; may also use +1 and −1]

• Focus of this lecture: How to find the wi's

  – Perceptron algorithm

  – Winnow

  – Least squares methods (if classes not linearly separable)

3

The Perceptron Algorithm

• Assume linear separability, i.e. ∃ w* s.t.

  w*^T · x > 0 ∀ x ∈ ω1
  w*^T · x ≤ 0 ∀ x ∈ ω2

  (w0* is included in w*)

• So ∃ a deterministic function classifying vectors (contrary to Ch. 2 assumptions)

• Given actual label y(t) for trial t, update weights:

  w(t + 1) = w(t) + ρ (y(t) − ŷ(t)) x(t)

  – ρ > 0 is the learning rate

  – (y(t) − ŷ(t)) moves weights toward the correct prediction for x(t)

4
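As a concrete illustration of the update rule above, a minimal sketch of one perceptron trial (the function name `perceptron_step`, the 0/1 output convention, and the sample numbers are illustrative assumptions, not from the slides):

```python
import numpy as np

def perceptron_step(w, x, y, rho=0.1):
    """One online perceptron trial: w includes w0, x includes a leading 1."""
    y_hat = 1 if np.dot(w, x) > 0 else 0   # thresholded prediction (omega_1 vs omega_2)
    return w + rho * (y - y_hat) * x       # unchanged when the prediction is correct

# Example trial: two features plus the constant input for the threshold
w = np.zeros(3)
x = np.array([1.0, 0.5, -0.3])             # [1, x1, x2]
w = perceptron_step(w, x, y=1)
```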
The Perceptron Algorithm
Example

[Figure: points in the (x1, x2) plane with w0 = 0; examples with y(t) = 1 (ω1) and y(t) = 0 (ω2), our decision line defined by w(t), the optimal decision line defined by w*, and our new decision line defined by w(t + 1) after updating on the misclassified example x(t)]

5

The Perceptron Algorithm
Intuition

• Compromise between correctiveness and conservativeness

  – Correctiveness: Tendency to improve on x(t) if prediction error made

  – Conservativeness: Tendency to keep w(t + 1) close to w(t)

• Use cost function that measures both (first term conservative, second term corrective):

  U(w) = ‖w(t + 1) − w(t)‖₂² + η (y(t) − w(t + 1) · x(t))²

       = Σ_{i=1}^ℓ (wi(t + 1) − wi(t))² + η (y(t) − Σ_{i=1}^ℓ wi(t + 1) xi(t))²

6
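A small sketch of evaluating this cost for a candidate w(t + 1) (the function name and the choice η = 0.5 are assumptions):

```python
import numpy as np

def perceptron_cost(w_new, w_old, x, y, eta=0.5):
    """U(w) = ||w(t+1) - w(t)||^2 + eta * (y(t) - w(t+1) . x(t))^2."""
    conservative = np.sum((w_new - w_old) ** 2)      # stay close to w(t)
    corrective = eta * (y - np.dot(w_new, x)) ** 2   # improve on the current example x(t)
    return conservative + corrective
```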

The Perceptron Algorithm
Intuition (cont'd)

• Take gradient w.r.t. w(t + 1) and set to 0:

  0 = 2 (wi(t + 1) − wi(t)) − 2η (y(t) − Σ_{i=1}^ℓ wi(t + 1) xi(t)) xi(t)

• Approximate with

  0 = 2 (wi(t + 1) − wi(t)) − 2η (y(t) − Σ_{i=1}^ℓ wi(t) xi(t)) xi(t),

  which yields

  wi(t + 1) = wi(t) + η (y(t) − Σ_{i=1}^ℓ wi(t) xi(t)) xi(t)

• Applying threshold to summation yields

  wi(t + 1) = wi(t) + η (y(t) − ŷ(t)) xi(t)

7

The Perceptron Algorithm
Miscellany

• If classes linearly separable, then by cycling through vectors, guaranteed to converge in finite number of steps

• For real-valued output, can replace threshold function on sum with

  – Identity function: f(x) = x

  – Sigmoid function: e.g. f(x) = 1/(1 + exp(−ax))

  – Hyperbolic tangent: e.g. f(x) = c tanh(ax)

8
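A sketch of the three smooth output functions listed above applied to the weighted sum (the function names and default values of the free constants a and c are assumptions):

```python
import numpy as np

def identity(s):
    return s

def sigmoid(s, a=1.0):
    return 1.0 / (1.0 + np.exp(-a * s))   # output in (0, 1)

def hyperbolic_tangent(s, a=1.0, c=1.0):
    return c * np.tanh(a * s)             # output in (-c, c)
```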
Winnow/Exponentiated Gradient

[Figure: the same linear threshold unit as before — inputs 1, x1, . . . , xℓ, weights w0, w1, . . . , wℓ, sum Σi wi xi; output y(t) = 1 (ω1) if sum > 0, y(t) = 0 (ω2) otherwise; may also use +1 and −1]

• Same as Perceptron, but update weights:

  wi(t + 1) = wi(t) exp (−2η (ŷ(t) − y(t)) xi(t))

• If y(t), ŷ(t) ∈ {0, 1} ∀ t, then set η = (ln α)/2 (α > 1) and get Winnow:

  wi(t + 1) = wi(t)/α^{xi(t)}   if ŷ(t) = 1, y(t) = 0
              wi(t) α^{xi(t)}   if ŷ(t) = 0, y(t) = 1
              wi(t)             if ŷ(t) = y(t)

9

Winnow/Exponentiated Gradient
Intuition

• Measure distance in cost function with unnormalized relative entropy (first term conservative, second term corrective):

  U(w) = Σ_{i=1}^ℓ [wi(t) − wi(t + 1) + wi(t + 1) ln (wi(t + 1)/wi(t))] + η (y(t) − w(t + 1) · x(t))²

• Take gradient w.r.t. w(t + 1) and set to 0:

  0 = ln (wi(t + 1)/wi(t)) − 2η (y(t) − Σ_{i=1}^ℓ wi(t + 1) xi(t)) xi(t)

• Approximate with

  0 = ln (wi(t + 1)/wi(t)) − 2η (y(t) − Σ_{i=1}^ℓ wi(t) xi(t)) xi(t),

  which yields

  wi(t + 1) = wi(t) exp (−2η (ŷ(t) − y(t)) xi(t))

10
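A sketch of the multiplicative updates on slides 9–10, showing the general exponentiated-gradient form and the Winnow special case for 0/1 labels (function names and the threshold at 0 follow the earlier figure; they are assumptions where not explicit):

```python
import numpy as np

def eg_step(w, x, y, eta=0.5):
    """EG update: multiply each weight by exp(-2*eta*(y_hat - y)*x_i)."""
    y_hat = 1 if np.dot(w, x) > 0 else 0
    return w * np.exp(-2.0 * eta * (y_hat - y) * x)

def winnow_step(w, x, y, alpha=2.0):
    """Winnow: eta = ln(alpha)/2, so weights are multiplied or divided by alpha**x_i."""
    y_hat = 1 if np.dot(w, x) > 0 else 0
    if y_hat == 1 and y == 0:
        return w / alpha ** x     # demote on a false positive
    if y_hat == 0 and y == 1:
        return w * alpha ** x     # promote on a false negative
    return w                      # no change when the prediction is correct
```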

Winnow/Exponentiated Gradient
Negative Weights

• Winnow and EG update wts by multiplying by a pos const: impossible to change sign

  – Weight vectors restricted to one quadrant

• Solution: Maintain wt vectors w+(t) and w−(t)

  – Predict ŷ(t) = (w+(t) − w−(t)) · x(t)

  – Update:

    ri+(t) = exp (−2η (ŷ(t) − y(t)) xi(t) U)

    ri−(t) = 1/ri+(t)

    wi+(t + 1) = U · (wi+(t) ri+(t)) / Σ_{j=1}^ℓ (wj+(t) rj+(t) + wj−(t) rj−(t))

    U and denominator normalize wts for proof of error bound

• Kivinen & Warmuth, "Additive Versus Exponentiated Gradient Updates for Linear Prediction." Information and Computation, 132(1):1–64, Jan. 1997. [see web page]

11

Winnow/Exponentiated Gradient
Miscellany

• Winnow and EG are multiplicative weight update schemes versus additive weight update schemes, e.g. Perceptron

• Winnow and EG work well when most attributes (features) are irrelevant, i.e. optimal weight vector w* is sparse (many 0 entries)

• E.g. xi ∈ {0, 1}, x's are labelled by a monotone k-disjunction over ℓ attributes, k ≪ ℓ

  – Remaining ℓ − k attributes are irrelevant

  – E.g. x5 ∨ x9 ∨ x12, ℓ = 150, k = 3

  – For disjunctions, number of on-line prediction mistakes is O(k log ℓ) for Winnow and worst-case Ω(kℓ) for Perceptron

  – So in worst case, need exponentially fewer updates for training in Winnow than Perceptron

• Other bounds exist for real-valued inputs and outputs

12
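A sketch of the two-vector scheme on slide 11; the update for w−(t) is not written out there, so it is assumed here to mirror the w+(t) update via ri−(t) = 1/ri+(t), after Kivinen & Warmuth's EG± algorithm:

```python
import numpy as np

def eg_pm_step(w_pos, w_neg, x, y, eta=0.1, U=1.0):
    """EG with positive and negative weight vectors; total weight normalized to U."""
    y_hat = np.dot(w_pos - w_neg, x)                   # prediction uses w+ - w-
    r_pos = np.exp(-2.0 * eta * (y_hat - y) * x * U)   # per-coordinate factors
    r_neg = 1.0 / r_pos
    z = np.sum(w_pos * r_pos + w_neg * r_neg)          # shared normalizer
    return U * w_pos * r_pos / z, U * w_neg * r_neg / z
```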
Non-Linearly Separable Classes

• What if no hyperplane completely separates the classes?

[Figure: Class A and Class B examples that are not linearly separable; the optimal decision line is shown]

• Add extra inputs that are nonlinear combinations of original inputs (Section 4.14)

  – E.g. attribs. x1 and x2, so try

    x = [x1, x2, x1 x2, x1², x2², x1² x2, x1 x2², x1³, x2³]^T

  – Perhaps classes linearly separable in new feature space

  – Useful, especially with Winnow/EG logarithmic bounds

  – Kernel functions/SVMs

• Pocket algorithm (p. 63) guarantees convergence to a best hyperplane

• Winnow's & EG's agnostic results

• Least squares methods (Sec. 3.4)

• Networks of classifiers (Ch. 4)

13

Non-Linearly Separable Classes
Winnow's Agnostic Results

• Winnow's total number of prediction mistakes/loss (in on-line setting) provably not much worse than that of the best linear classifier

• Loss bound related to performance of best classifier and total distance under ‖·‖₁ that feature vectors must be moved to make best classifier perfect [Littlestone, COLT '91]

• Similar bounds for EG [Kivinen & Warmuth]

14
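A sketch of the cubic feature expansion suggested on slide 13 (the function name is an assumption):

```python
import numpy as np

def expand_features(x1, x2):
    """Map (x1, x2) to the nonlinear feature vector from slide 13."""
    return np.array([x1, x2, x1 * x2,
                     x1 ** 2, x2 ** 2,
                     x1 ** 2 * x2, x1 * x2 ** 2,
                     x1 ** 3, x2 ** 3])

# A linear classifier trained on expand_features(x1, x2) corresponds to a
# cubic decision surface in the original (x1, x2) space.
```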

Non-Linearly Separable Classes
Least Squares Methods

• Recall from Slide 7:

  wi(t + 1) = wi(t) + η (y(t) − Σ_{i=1}^ℓ wi(t) xi(t)) xi(t)
            = wi(t) + η (y(t) − w(t)^T · x(t)) xi(t)

• If we don't threshold dot product during training and allow η to vary each trial (i.e. substitute ηt), get* Eq. 3.38, p. 69:

  w(t + 1) = w(t) + ηt x(t) (y(t) − w(t)^T · x(t))

• This is the Least Mean Squares (LMS) Algorithm

• If e.g. ηt = 1/t, then

  lim_{t→∞} P[w(t) = w*] = 1,

  where

  w* = argmin_{w ∈ ℝ^ℓ} E[(y − w^T · x)²]

  is the vector minimizing mean square error (MSE)

* Note that here w(t) is the weight before trial t. In the book it is the weight after trial t.

15

Multiclass learning
Kessler's Construction

[Figure: three classes ω1, ω2, ω3 in the plane, each with its own decision line (ω1's line, ω2's line, ω3's line); the point [2, 2] belongs to class ω1]

• For* x = [2, 2, 1]^T of class ω1, want

  Σ_{i=1}^{ℓ+1} w1i xi > Σ_{i=1}^{ℓ+1} w2i xi  AND  Σ_{i=1}^{ℓ+1} w1i xi > Σ_{i=1}^{ℓ+1} w3i xi

* The extra 1 is added so the threshold can be placed in w.

16
Multiclass learning
Kessler's Construction (cont'd)

• So map x to

  x1 = [2, 2, 1, −2, −2, −1, 0, 0, 0]^T   (original block | negated block | zero padding)
  x2 = [2, 2, 1, 0, 0, 0, −2, −2, −1]^T

  (all labels = +1) and let

  w = [w11, w12, w10, w21, w22, w20, w31, w32, w30]^T   (w1 block | w2 block | w3 block)

• Now if w*^T · x1 > 0 and w*^T · x2 > 0, then

  Σ_{i=1}^{ℓ+1} w*1i xi > Σ_{i=1}^{ℓ+1} w*2i xi  AND  Σ_{i=1}^{ℓ+1} w*1i xi > Σ_{i=1}^{ℓ+1} w*3i xi

• In general, map (ℓ + 1) × 1 feature vector x to x1, . . . , xM−1, each of size (ℓ + 1)M × 1

• x ∈ ωi ⇒ x in ith block and −x in jth block (rest are 0s). Repeat for all j ≠ i

• Now train to find weights for new vector space via perceptron, Winnow, etc.

17

Multiclass learning
Error-Correcting Output Codes (ECOC)

• Since Win. & Percep. learn binary functions, learn individual bits of binary encoding of classes

• E.g. M = 4, so use two linear classifiers, one per bit of the encoding:

  Class   Classifier 1 bit   Classifier 2 bit
  ω1      0                  0
  ω2      0                  1
  ω3      1                  0
  ω4      1                  1

  and train simultaneously

• Problem: Sensitive to individual classifier errors, so use a set of encodings per class to improve robustness

• Similar to principle of error-correcting output codes used in communication networks [Dietterich & Bakiri, 1995]

• General-purpose, independent of learner

18
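A sketch of the mapping described on slides 16–17 (function name and zero-based class indices are assumptions; the example reproduces the x = [2, 2, 1], class ω1 case):

```python
import numpy as np

def kessler_expand(x, i, M):
    """Map an (l+1)-vector x of class omega_(i+1) to M-1 vectors of size (l+1)*M.

    Each expanded vector has x in block i, -x in some block j != i, and zeros
    elsewhere; all expanded vectors get label +1.
    """
    d = len(x)
    expanded = []
    for j in range(M):
        if j == i:
            continue
        z = np.zeros(d * M)
        z[i * d:(i + 1) * d] = x      # +x in the i-th block
        z[j * d:(j + 1) * d] = -x     # -x in the j-th block
        expanded.append(z)
    return expanded

# Example from slide 16: x = [2, 2, 1] in class omega_1 (index 0), M = 3 classes
x1, x2 = kessler_expand(np.array([2.0, 2.0, 1.0]), i=0, M=3)
# x1 = [2, 2, 1, -2, -2, -1, 0, 0, 0], x2 = [2, 2, 1, 0, 0, 0, -2, -2, -1]
```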

Topic summary due in 1 week!

19
