15 SVM
15 SVM
University of Toronto
Margin
Max-margin classification
(
1 if (wT x + b) ≥ 0
y=
−1 if (wT x + b) < 0
Zemel, Urtasun, Fidler (UofT) CSC 411: 15-SVM I 4 / 15
Max Margin Classification
1
if wT x + b ≥ 1
y = −1 if wT x + b ≤ −1
Undefined if − 1 ≤ wT x + b ≤ 1
1
if wT x + b ≥ 1
y = −1 if wT x + b ≤ −1
Undefined if − 1 ≤ wT x + b ≤ 1
wT (u − v) = 0
wT (u − v) = 0
wT (u − v) = 0
x+ = λw + x−
x+ = λw + x−
wT x+ + b = 1
x+ = λw + x−
wT x+ + b = 1
wT (λw + x− ) + b = 1
x+ = λw + x−
wT x+ + b = 1
wT (λw + x− ) + b = 1
wT x− + b + λwT w = 1
x+ = λw + x−
wT x+ + b = 1
wT (λw + x− ) + b = 1
wT x− + b + λwT w = 1
− 1 + λwT w = 1
x+ = λw + x−
wT x+ + b = 1
wT (λw + x− ) + b = 1
wT x− + b + λwT w = 1
− 1 + λwT w = 1
Therefore
2
λ=
wT w
M = ||x+ − x− ||
M = ||x+ − x− ||
= ||λw|| =
M = ||x+ − x− ||
√
= ||λw|| = λ wT w
M = ||x+ − x− ||
√
= ||λw|| = λ wT w
√
wT w
=2 =
wT w
M = ||x+ − x− ||
√
= ||λw|| = λ wT w
√
wT w 2
=2 T
=√ =
w w wT w
M = ||x+ − x− ||
√
= ||λw|| = λ wT w
√
wT w 2 2
=2 T
=√ =
w w T
w w ||w||
1
min ||w||2
w,b 2
1
min ||w||2
w,b 2
1
min ||w||2
w,b 2
1
min ||w||2
w,b 2
Let: N
1 X
J(w, b; α) = ||w||2 + αi [1 − (wT x(i) + b)t (i) ]
2
i=1
Let: N
1 X
J(w, b; α) = ||w||2 + αi [1 − (wT x(i) + b)t (i) ]
2
i=1
Let: N
1 X
J(w, b; α) = ||w||2 + αi [1 − (wT x(i) + b)t (i) ]
2
i=1
We obtain N
X
w= αi t (i) x(i)
i=1
We obtain N
X
w= αi t (i) x(i)
i=1
Then substitute back to get final optimization:
N N
X 1 X (i) (j) T
L = max{ αi − t t αi αj (x(i) · x(j) )}
αi ≥0
i=1
2 i,j=1
Only a small subset of αi ’s will be nonzero, and the corresponding x(i) ’s are
the support vectors S
Only a small subset of αi ’s will be nonzero, and the corresponding x(i) ’s are
the support vectors S
Prediction on a new example:
N
X X
y = sign[b + x · ( αi t (i) x(i) )] = sign[b + x · ( αi t (i) x(i) )]
i=1 i∈S