01 Halfspaces Perceptron
PAGE 3
Feature extraction Function approximation
The Perceptron algorithm learns a classifier using linear combinations of features
PAGE 4
Aims
At the end of the lecture, we should be able to:
★ Identify the components of a dataset required for supervised learning.
★ Interpret the separating hyperplane hypothesis class geometrically.
★ Implement the Perceptron algorithm and list its properties.
★ Reproduce Novikoff’s proof of the Perceptron convergence theorem.
PAGE 5
Lecture Outline
I. What is needed in order to learn a classifier?
The structure of observations and hypotheses
II. How can we learn a hypothesis from data?
The Perceptron Algorithm
III. Why does this work?
Convergence analysis and other properties
IV. Summary + Housekeeping
PAGE 7
A motivating example: predicting whether you’ll pass a class
PAGE 8
A motivating example: predicting whether you’ll pass a class
PAGE 9
Divination effort dataset
PAGE 10
The Binary Classification Problem
PAGE 11
Exploring the “divination effort” dataset
PAGE 12
The slope-intercept form for a line is inconvenient
PAGE 13
A line defines a hyperplane, or affine set, in ℝ²
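Only the slide title survives extraction; as a reminder, the standard way to write this set (using the weight vector w and offset b that appear on the following slides) is
\[
H \;=\; \{\, \mathbf{x} \in \mathbb{R}^{2} \;:\; \langle \mathbf{w}, \mathbf{x} \rangle + b = 0 \,\}, \qquad \mathbf{w} \neq \mathbf{0},
\]
an affine set: a line which, unlike the slope-intercept form, can also represent vertical lines.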
PAGE 14
The notations and meaning of inner product
\[
\begin{bmatrix} x_1 & x_2 & x_3 & x_4 \end{bmatrix}
\begin{bmatrix} w_1 \\ w_2 \\ w_3 \\ w_4 \end{bmatrix}
\;=\; \langle \mathbf{w}, \mathbf{x} \rangle \;=\; \mathbf{w}^{\top}\mathbf{x} \;=\; \sum_{i} w_i x_i \;=\; -b
\]
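As a small illustration (not from the slides; the vector values are arbitrary), the common notations for the inner product all compute the same number:

```python
import numpy as np

# Example weight and feature vectors (arbitrary illustrative values).
w = np.array([0.5, -1.0, 2.0, 0.25])
x = np.array([1.0, 3.0, 0.5, 4.0])

# Three equivalent ways to compute the inner product <w, x> = w^T x.
dot_call = np.dot(w, x)                            # library call
matmul   = w @ x                                   # matrix-multiplication operator
by_hand  = sum(wi * xi for wi, xi in zip(w, x))    # explicit sum_i w_i x_i

assert np.isclose(dot_call, matmul) and np.isclose(matmul, by_hand)
print(dot_call)  # a point x lies on the hyperplane exactly when this equals -b
```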
PAGE 15
From hyperplanes to halfspaces
PAGE 16
The separating hyperplane hypothesis class
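Only the slide title survives extraction, so for reference here is the standard formalization of the separating hyperplane (halfspace) hypothesis class:
\[
\mathcal{H} \;=\; \bigl\{\, h_{\mathbf{w},b} : \mathbf{x} \mapsto \operatorname{sign}\bigl(\langle \mathbf{w}, \mathbf{x} \rangle + b\bigr) \;\bigm|\; \mathbf{w} \in \mathbb{R}^{d},\; b \in \mathbb{R} \,\bigr\}.
\]
Each hypothesis labels the halfspace on one side of its hyperplane positive and the other side negative.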
PAGE 17
Biological interpretation
PAGE 18
Lecture Outline
I. What is needed in order to learn a classifier?
The structure of observations and hypotheses
II. How can we learn a hypothesis from data?
The Perceptron Algorithm
III. Why does this work?
Convergence analysis and other properties
IV. Summary + Housekeeping
PAGE 19
Statistical (Batch) Learning
PAGE 20
Online Learning
PAGE 21
The Perceptron Algorithm
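The slide carries only the title, so here is a minimal sketch of the standard Perceptron update rule in Python (function and variable names are illustrative, not from the lecture):

```python
import numpy as np

def perceptron(X, y, max_epochs=100):
    """Train a halfspace classifier sign(<w, x> + b) with the Perceptron rule.

    X: (n, d) array of feature vectors; y: (n,) array of labels in {-1, +1}.
    """
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x_i, y_i in zip(X, y):
            if y_i * (w @ x_i + b) <= 0:   # misclassified (or exactly on the boundary)
                w += y_i * x_i             # w <- w + y_i * x_i
                b += y_i                   # b <- b + y_i
                mistakes += 1
        if mistakes == 0:                  # a full pass with no errors: converged
            break
    return w, b
```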
PAGE 22
Get ready to watch the Perceptron algorithm in action
[Figure: the Perceptron on the dataset, plotted over the features (x1, x2); positive and negative examples are shown together with the current parameters and the old vs. new decision boundary after an update.]
PAGE 23
PAUSE (1 min)
write down your predictions
PAGE 24
The Perceptron algorithm in Action
PAGE 25
Lecture Outline
I. What is needed in order to learn?
The structure of observations and hypotheses
II. How can we learn a hypothesis from data?
The Perceptron Algorithm
III. Why does this work?
Convergence analysis and other properties
IV. Summary + Housekeeping
PAGE 26
The Perceptron convergence theorem (informal)
PAGE 27
How can we define linear separability?
PAGE 28
The padding trick simplifies analysis
\[
\begin{bmatrix} x_1 & x_2 & x_3 & x_4 \end{bmatrix}
\begin{bmatrix} w_1 \\ w_2 \\ w_3 \\ w_4 \end{bmatrix} + b
\;=\;
\begin{bmatrix} x_1 & x_2 & x_3 & x_4 & 1 \end{bmatrix}
\begin{bmatrix} w_1 \\ w_2 \\ w_3 \\ w_4 \\ b \end{bmatrix}
\]
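The same trick in code, as a small illustration (array names and values are assumptions):

```python
import numpy as np

x = np.array([1.0, 3.0, 0.5, 4.0])     # original features
w = np.array([0.5, -1.0, 2.0, 0.25])   # original weights
b = -2.0                               # bias term

x_pad = np.append(x, 1.0)              # pad the features with a constant 1
w_pad = np.append(w, b)                # absorb the bias into the weight vector

assert np.isclose(w @ x + b, w_pad @ x_pad)  # same affine function, now purely linear
```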
PAGE 29
Biological interpretation
PAGE 30
Biological interpretation with padding
PAGE 31
Linear separability and the margin, 𝛾, of a dataset, D
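Only the slide title survives extraction; the standard definition it refers to is: for a dataset D and a unit-norm separator w* (using padded feature vectors, so no explicit bias term),
\[
\gamma \;=\; \min_{(\mathbf{x}_i, y_i) \in D} \; y_i\, \langle \mathbf{w}^{*}, \mathbf{x}_i \rangle , \qquad \|\mathbf{w}^{*}\| = 1,
\]
and D is linearly separable exactly when some such w* achieves γ > 0.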
PAGE 32
The Oracle Vector
PAGE 33
Linear separability and the margin, 𝛾, of a dataset, D
PAGE 34
Finite number of errors on linearly separable data
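The formal claim behind this slide is the classic Novikoff bound; stated here for reference in its standard form, with R an upper bound on the norms of the (padded) feature vectors:
\[
\text{if } \|\mathbf{x}_i\| \le R \text{ for all } i \text{ and } D \text{ has margin } \gamma > 0,
\text{ then the Perceptron makes at most } \left(\tfrac{R}{\gamma}\right)^{2} \text{ updates.}
\]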
PAGE 35
A proof in two parts (Novikoff, 1962)
I. Do updates necessarily result in progress?
II. Will it ever stop?
PAGE 36
A proof in two parts (Novikoff, 1962)
I. Do updates necessarily result in progress?
II. Will it ever stop?
PAGE 37
Do updates necessarily result in progress?
PAGE 38
Do updates necessarily result in progress?
PAGE 39
Do updates necessarily result in progress?
PAGE 40
Do updates necessarily result in progress?
Yes. The weight vector increases its alignment with the oracle at every update.
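The calculation behind this claim is the standard one (w_t denotes the weights before the t-th update, w* the unit-norm oracle, and the weights are assumed to start at zero):
\[
\langle \mathbf{w}_{t+1}, \mathbf{w}^{*} \rangle
\;=\; \langle \mathbf{w}_{t} + y_i \mathbf{x}_i,\; \mathbf{w}^{*} \rangle
\;=\; \langle \mathbf{w}_{t}, \mathbf{w}^{*} \rangle + y_i \langle \mathbf{x}_i, \mathbf{w}^{*} \rangle
\;\ge\; \langle \mathbf{w}_{t}, \mathbf{w}^{*} \rangle + \gamma ,
\]
so after T updates \(\langle \mathbf{w}_{T+1}, \mathbf{w}^{*} \rangle \ge T\gamma\).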
PAGE 41
A proof in two parts (Novikoff, 1962)
I. Do updates necessarily result in progress?
Yes. The weight vector increases its alignment with the oracle at every update.
PAGE 42
A proof in two parts (Novikoff, 1962)
I. Do updates necessarily result in progress?
Yes. The weight vector increases its alignment with the oracle at every update.
PAGE 43
Will it ever stop?
Can now be interpreted as: “Is there an upper bound on the norm of the parameter vector?”
PAGE 44
Will it ever stop?
Upper bound:
Lower bound:
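The formulas on this slide did not survive extraction; in the standard argument they read as follows. Upper bound: an update is made only when \(y_i \langle \mathbf{w}_t, \mathbf{x}_i \rangle \le 0\), so
\[
\|\mathbf{w}_{t+1}\|^{2} \;=\; \|\mathbf{w}_{t}\|^{2} + 2\, y_i \langle \mathbf{w}_{t}, \mathbf{x}_i \rangle + \|\mathbf{x}_i\|^{2} \;\le\; \|\mathbf{w}_{t}\|^{2} + R^{2},
\qquad\text{hence}\qquad \|\mathbf{w}_{T+1}\| \le \sqrt{T}\, R .
\]
Lower bound: from the progress step above, \(T\gamma \le \langle \mathbf{w}_{T+1}, \mathbf{w}^{*} \rangle \le \|\mathbf{w}_{T+1}\|\) by Cauchy–Schwarz (since \(\|\mathbf{w}^{*}\| = 1\)). Combining the two,
\[
T\gamma \;\le\; \sqrt{T}\, R \quad\Longrightarrow\quad T \;\le\; \left(\frac{R}{\gamma}\right)^{2},
\]
so the algorithm stops after a finite number of updates.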
PAGE 45
Finite number of errors on linearly separable data
PAGE 46
QUICK BREAK (2 mins)
go over your notes or discuss with your neighbor
PAGE 47
A few more properties
▪ The solution found by the Perceptron algorithm is not unique
▪ There are infinitely many solutions
▪ No guarantee of optimality, in terms of maximizing the margin
▪ The Perceptron algorithm will not converge if data are not linearly separable
▪ The algorithm will never halt; it will cycle
▪ The algorithm is inappropriate for such problems
▪ Multiple valid termination conditions (see the sketch after this list)
▪ Weights have stopped changing
▪ Exhausted some update budget
▪ Error on the training or validation set has stopped decreasing
▪ There are different strategies for controlling the order of arrival of samples
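A sketch of how these termination conditions can be combined in practice (illustrative only; the update budget and the no-change check are assumptions, not prescribed by the slides):

```python
import numpy as np

def perceptron_with_budget(X, y, max_updates=1000):
    """Perceptron loop that also terminates on non-separable data.

    Stops when (a) a full pass makes no mistakes (the weights stop changing),
    or (b) an update budget is exhausted.
    """
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    updates = 0
    while updates < max_updates:
        mistakes = 0
        for x_i, y_i in zip(X, y):
            if y_i * (w @ x_i + b) <= 0:
                w += y_i * x_i
                b += y_i
                mistakes += 1
                updates += 1
                if updates >= max_updates:   # budget exhausted (data may not be separable)
                    return w, b
        if mistakes == 0:                    # weights stopped changing: converged
            return w, b
    return w, b
```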
PAGE 48
Multiclass classification
Yu, Xiaoqun, Jaehyuk Jang, and Shuping Xiong. Frontiers in Aging Neuroscience 13 (2021): 692865.
PAGE 49
Learning a multiclass classifier with Perceptron
One vs all
▪ Train a classifier for each class
▪ Output: arg maxᵢ ⟨wᵢ, x⟩ (see the sketch after this list)
One vs. one
▪ Train a classifier for each pair of classes
▪ e.g. if 4 classes, 6 possible pairs
▪ Output: Majority vote
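A minimal sketch of one-vs-all prediction with per-class Perceptron weights (names are illustrative; each wₖ, bₖ is assumed to be trained exactly as in the binary case, with class k as +1 and all other classes as −1):

```python
import numpy as np

def one_vs_all_predict(W, b, x):
    """Predict a class label from K binary halfspace classifiers.

    W: (K, d) array with one weight vector per class; b: (K,) array of biases.
    Returns the index of the class whose score <w_k, x> + b_k is largest.
    """
    scores = W @ x + b          # score of each class on input x
    return int(np.argmax(scores))
```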
PAGE 50
Lecture Outline
I. What is needed in order to learn?
The structure of observations and hypotheses
II. How can we learn a hypothesis from data?
The Perceptron Algorithm
III. Why does this work?
Convergence analysis and other properties
IV. Summary + Housekeeping
PAGE 51
Aims
We should now be able to:
✓ Identify the components of a dataset required for supervised learning.
✓ Interpret the separating hyperplane hypothesis class geometrically.
✓ Implement the Perceptron algorithm and list its properties.
✓ Reproduce Novikoff’s proof of the Perceptron convergence theorem.
PAGE 52
PAGE 53
On the horizon
posting today →
examples are up →