
10. Kernel Methods | Project 2: Digit recognition ... https://courses.edx.org/courses/course-v1:MITx+...


10. Kernel Methods


As you can see, implementing a direct mapping to the high-dimensional features is a lot of work (imagine using an even
higher-dimensional feature mapping). This is where the kernel trick becomes useful.

Recall the kernel perceptron algorithm we learned in the lecture. The weights θ can be represented by a linear combination of
features:

θ = ∑_{i=1}^{n} α^{(i)} y^{(i)} ϕ(x^{(i)})

In the softmax regression formulation, we can also apply this representation of the weights:

θ_j = ∑_{i=1}^{n} α_j^{(i)} y^{(i)} ϕ(x^{(i)}).

The softmax probabilities then become

h(x) = (1 / ∑_{j=1}^{k} e^{[θ_j·ϕ(x)/τ] − c}) · [ e^{[θ_1·ϕ(x)/τ] − c}, e^{[θ_2·ϕ(x)/τ] − c}, …, e^{[θ_k·ϕ(x)/τ] − c} ]^T

and, substituting the representation of θ_j,

h(x) = (1 / ∑_{j=1}^{k} e^{[∑_{i=1}^{n} α_j^{(i)} y^{(i)} ϕ(x^{(i)})·ϕ(x)/τ] − c}) · [ e^{[∑_{i=1}^{n} α_1^{(i)} y^{(i)} ϕ(x^{(i)})·ϕ(x)/τ] − c}, …, e^{[∑_{i=1}^{n} α_k^{(i)} y^{(i)} ϕ(x^{(i)})·ϕ(x)/τ] − c} ]^T.

Notice that we never need the mapping ϕ(x) itself, only the inner product between two mapped features, ϕ(x^{(i)})·ϕ(x),
where x^{(i)} is a point in the training set and x is the new data point for which we want to compute the probability. If we can
create a kernel function K(x, y) = ϕ(x)·ϕ(y) for any two points x and y, we can then kernelize our softmax regression
algorithm.
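To see the trick concretely, take the homogeneous quadratic map ϕ(x) = (x₁², x₂², √2·x₁x₂) in two dimensions (a simplified example for illustration, not the course's mapping): its inner products can be computed directly in the original space as (x·y)², with no explicit mapping.

```python
import numpy as np

# Explicit quadratic feature map for a 2-D point: phi(x) = (x1^2, x2^2, sqrt(2) x1 x2)
def phi(x):
    return np.array([x[0] ** 2, x[1] ** 2, np.sqrt(2) * x[0] * x[1]])

x = np.array([1.0, 2.0])
y = np.array([3.0, 0.5])

lhs = phi(x) @ phi(y)   # inner product in the mapped space
rhs = (x @ y) ** 2      # kernel evaluated in the original space
print(np.isclose(lhs, rhs))  # True: K(x, y) = phi(x) . phi(y)
```

The kernel evaluation touches only the 2-dimensional inputs, which is why the trick scales to very high-dimensional (even infinite-dimensional) feature maps.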

1 of 6 2020-03-25, 7:33 p.m.


You will be working in the files part1/main.py and part1/kernel.py in this problem.

Implementing Polynomial Kernel


In the last section, we explicitly created a cubic feature mapping. Now, suppose we want to map d-dimensional features into a
polynomial space; for degree 2, the mapping is

ϕ(x) = ⟨x_d², …, x_1², √2·x_d x_{d−1}, …, √2·x_d x_1, √2·x_{d−1} x_{d−2}, …, √2·x_{d−1} x_1, …, √2·x_2 x_1, √(2c)·x_d, …, √(2c)·x_1, c⟩.

Write a function polynomial_kernel that takes in two matrices X and Y and computes the polynomial kernel K(x, y) for
every pair of rows x in X and y in Y.

Available Functions: You have access to the NumPy python library as np


Correct

def polynomial_kernel(X, Y, c, p):
    """
    Compute the polynomial kernel between two matrices X and Y::
        K(x, y) = (<x, y> + c)^p
    for each pair of rows x in X and y in Y.

    Args:
        X - (n, d) NumPy array (n datapoints each with d features)
        Y - (m, d) NumPy array (m datapoints each with d features)
        c - a coefficient to trade off high-order and low-order terms (scalar)
        p - the degree of the polynomial kernel

    Returns:
        kernel_matrix - (n, m) NumPy array containing the kernel matrix
    """
    # Pairwise inner products for every (row of X, row of Y), shifted by c,
    # then raised to the degree p
    K = (X @ Y.T + c) ** p
    return K
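A quick way to sanity-check the vectorized kernel is to compare entries against the per-pair formula; the snippet below re-defines polynomial_kernel so it runs on its own (the shapes and random data are just for illustration):

```python
import numpy as np

def polynomial_kernel(X, Y, c, p):
    # Vectorized polynomial kernel: K[i, j] = (<X[i], Y[j]> + c)^p
    return (X @ Y.T + c) ** p

rng = np.random.default_rng(0)
X = rng.standard_normal((4, 3))
Y = rng.standard_normal((5, 3))
K = polynomial_kernel(X, Y, c=1.0, p=2)

# Entry (i, j) should equal (<x_i, y_j> + c)^p computed one pair at a time
i, j = 2, 4
assert np.isclose(K[i, j], (X[i] @ Y[j] + 1.0) ** 2)
print(K.shape)  # (4, 5)
```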



Gaussian RBF Kernel


Another commonly used kernel is the Gaussian RBF kernel. Similarly, write a function rbf_kernel that takes in two matrices
X and Y and computes the RBF kernel K(x, y) for every pair of rows x in X and y in Y.

Available Functions: You have access to the NumPy python library as np


Correct


def rbf_kernel(X, Y, gamma):
    """
    Compute the Gaussian RBF kernel between two matrices X and Y::
        K(x, y) = exp(-gamma ||x-y||^2)
    for each pair of rows x in X and y in Y.

    Args:
        X - (n, d) NumPy array (n datapoints each with d features)
        Y - (m, d) NumPy array (m datapoints each with d features)
        gamma - the gamma parameter of the Gaussian function (scalar)

    Returns:
        kernel_matrix - (n, m) NumPy array containing the kernel matrix
    """
    # ||x - y||^2 = ||x||^2 + ||y||^2 - 2<x, y>, computed for all pairs at once
    sq_norms_X = np.sum(X ** 2, axis=1)[:, np.newaxis]  # shape (n, 1)
    sq_norms_Y = np.sum(Y ** 2, axis=1)[np.newaxis, :]  # shape (1, m)
    sq_dists = sq_norms_X + sq_norms_Y - 2 * (X @ Y.T)  # shape (n, m)
    return np.exp(-gamma * sq_dists)
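As a quick sanity check, the kernel matrix of a dataset with itself should have ones on the diagonal (since K(x, x) = e⁰ = 1) and be symmetric. The snippet below re-derives the kernel from the squared-distance identity so it is self-contained; the data and gamma value are arbitrary:

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    # ||x - y||^2 = ||x||^2 + ||y||^2 - 2<x, y> for all pairs, then exponentiate
    sq_dists = (np.sum(X ** 2, axis=1)[:, np.newaxis]
                + np.sum(Y ** 2, axis=1)[np.newaxis, :]
                - 2 * (X @ Y.T))
    return np.exp(-gamma * sq_dists)

rng = np.random.default_rng(1)
X = rng.standard_normal((3, 2))
K = rbf_kernel(X, X, gamma=0.5)

assert np.allclose(np.diag(K), 1.0)  # K(x, x) = exp(0) = 1
assert np.allclose(K, K.T)           # a kernel matrix on one dataset is symmetric
```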


Now, try implementing softmax regression using kernelized features. You will have to rewrite the softmax_regression
function in softmax.py, as well as the auxiliary functions compute_cost_function, compute_probabilities, and
run_gradient_descent_iteration.
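Concretely, the rewrite replaces every θ_j·ϕ(x) with a dual sum over training points, which a precomputed kernel matrix supplies. One possible shape for a kernelized compute_probabilities is sketched below; the signature, argument names, and the choice to learn the dual weights α directly are assumptions for illustration, not the course's reference solution:

```python
import numpy as np

def compute_probabilities_kernel(K, alpha, temp):
    """Softmax probabilities in the dual (kernelized) parameterization.

    K     - (n_train, m) kernel matrix, K[i, t] = kernel(x_train_i, x_t)
    alpha - (k, n_train) dual weights, one row per class (assumed names)
    temp  - temperature parameter tau
    Returns a (k, m) matrix of class probabilities for the m query points.
    """
    logits = (alpha @ K) / temp        # theta_j . phi(x) = sum_i alpha_ji K(x_i, x)
    logits -= np.max(logits, axis=0)   # the constant c: subtract max for stability
    exp_logits = np.exp(logits)
    return exp_logits / np.sum(exp_logits, axis=0)

# Toy shapes: 6 training points, 4 query points, 3 classes
rng = np.random.default_rng(0)
K = rng.standard_normal((6, 4))
alpha = rng.standard_normal((3, 6))
P = compute_probabilities_kernel(K, alpha, temp=1.0)
print(P.shape)  # (3, 4)
```

Each column of P sums to 1, and the cost and gradient-descent functions can be rewritten against α and K in the same way.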

How does the test error change?

In this project, you have become familiar with the MNIST dataset for digit recognition, a popular task in computer vision.

You have implemented a linear regression, which turned out to be inadequate for this task. You have also learned how to use
scikit-learn's SVM for binary classification and multiclass classification.

Then, you have implemented your own softmax regression using gradient descent.

Finally, you experimented with different hyperparameters, different labels, and different features, including kernelized
features.

In the next project, you will apply neural networks to this task.

