Stats 231 / CS229T Homework 3 Solutions


Question 1: Let $k : \mathcal{X} \times \mathcal{X} \to \mathbb{R}$ be a valid kernel function. Define
\[
k_{\mathrm{norm}}(x, z) := \frac{k(x, z)}{\sqrt{k(x, x)}\,\sqrt{k(z, z)}}.
\]
Is $k_{\mathrm{norm}}$ a valid kernel? Justify your answer.


Answer: Yes, it is. Let $k(x, z) = \langle \phi(x), \phi(z) \rangle$ for some mapping $\phi : \mathcal{X} \to \mathcal{H}$, where $\mathcal{H}$ is a Hilbert space. Then
\[
k_{\mathrm{norm}}(x, z) = \langle \phi(x)/\|\phi(x)\|_2, \ \phi(z)/\|\phi(z)\|_2 \rangle,
\]
so that it is still a valid inner product, where the feature mapping is now $x \mapsto \phi(x)/\|\phi(x)\|_2$ with $\|\phi(x)\|_2^2 = \langle \phi(x), \phi(x) \rangle$.
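
As an illustrative aside, this normalization can be applied directly to a Gram matrix. The following is a minimal NumPy sketch; the polynomial base kernel and the helper names (`gram`, `normalize_gram`) are hypothetical choices made only for concreteness.

```python
import numpy as np

def gram(kernel, X):
    """Gram matrix K[i, j] = kernel(X[i], X[j])."""
    n = len(X)
    return np.array([[kernel(X[i], X[j]) for j in range(n)] for i in range(n)])

def normalize_gram(K):
    """Cosine-normalize: K[i, j] / sqrt(K[i, i] * K[j, j])."""
    d = np.sqrt(np.diag(K))
    return K / np.outer(d, d)

# A polynomial base kernel, chosen only for concreteness.
poly = lambda x, z: (1.0 + np.dot(x, z)) ** 3

X = np.random.randn(6, 2)
K_norm = normalize_gram(gram(poly, X))
# The normalized kernel has unit self-similarity and remains positive semidefinite.
assert np.allclose(np.diag(K_norm), 1.0)
assert np.linalg.eigvalsh(K_norm).min() > -1e-8
```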

Question 2: Consider the class of functions
\[
\mathcal{H} := \left\{ f : f(0) = 0, \ f' \in L^2([0, 1]) \right\},
\]
that is, functions $f : [0, 1] \to \mathbb{R}$ with $f(0) = 0$ that are almost everywhere differentiable, where $\int_0^1 (f'(t))^2 \, dt < \infty$. On this space of functions, we define the inner product by
\[
\langle f, g \rangle = \int_0^1 f'(t) g'(t) \, dt.
\]
Show that $k(x, z) = \min\{x, z\}$ is the reproducing kernel for $\mathcal{H}$, so that it is (i) positive semidefinite and (ii) a valid kernel.
Answer: If we show that $k(x, z) = \min\{x, z\}$ is indeed the reproducing kernel for $\mathcal{H}$, then that suffices to demonstrate that it is a positive definite function. Writing $g_z(t) = k(z, t) = \min\{z, t\}$, we have (almost everywhere) $g_z'(t) = 1\{t \le z\}$, so that
\[
\langle f, k(z, \cdot) \rangle = \int_0^1 f'(t) \, 1\{t \le z\} \, dt = \int_0^z f'(t) \, dt = f(z) - f(0) = f(z).
\]
Thus $k$ is evidently a reproducing kernel, so it must be a positive definite function.

(Another way to see this: we have $\min\{x, z\} = k(x, z) = \int_0^1 1\{t \le x\} \, 1\{t \le z\} \, dt$, so that $\min\{x, z\}$ is evidently an inner product.)
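
As a quick numerical sanity check, one can form the Gram matrix of $k(x, z) = \min\{x, z\}$ on random points in $[0, 1]$ and confirm that its eigenvalues are nonnegative. A minimal sketch, assuming NumPy is available:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=50)

# Gram matrix of k(x, z) = min{x, z}.
K = np.minimum.outer(x, x)

# All eigenvalues should be nonnegative (up to floating-point error),
# consistent with k being positive semidefinite.
eigs = np.linalg.eigvalsh(K)
print(eigs.min())
```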

Question 3: Consider the Sobolev space $\mathcal{F}_k$, which is defined as the set of functions that are $(k - 1)$-times differentiable and have a $k$th derivative almost everywhere on $[0, 1]$, where the $k$th derivative is square-integrable. That is, we define
\[
\mathcal{F}_k := \left\{ f : [0, 1] \to \mathbb{R} \ \big| \ f^{(k)} \in L^2([0, 1]) \right\},
\]
where $f^{(k)}$ denotes the $k$th derivative of $f$. We define the inner product on $\mathcal{F}_k$ by
\[
\langle f, g \rangle = \sum_{i=0}^{k-1} f^{(i)}(0) g^{(i)}(0) + \int_0^1 f^{(k)}(t) g^{(k)}(t) \, dt.
\]

(a) Find the representer of evaluation for this Hilbert space, that is, find a function $r_x : [0, 1] \to \mathbb{R}$ (defined for each $x \in [0, 1]$) such that $r_x \in \mathcal{F}_k$ and
\[
\langle r_x, f \rangle = f(x)
\]
for all $f \in \mathcal{F}_k$.

(b) What is the reproducing kernel $k(x, z)$ associated with this space? (Recall that $k(x, z) = \langle r_x, r_z \rangle$ for an RKHS.)

(c) Show that $\mathcal{F}_k$ is a Hilbert space, meaning that $\|f\|^2 = \langle f, f \rangle$ defines a norm and that $\mathcal{F}_k$ is complete for the norm.

Answer:

(a) By Taylor's theorem, we have
\[
f(x) = f(0) + \sum_{i=1}^{k-1} f^{(i)}(0) \frac{x^i}{i!} + \frac{1}{(k-1)!} \int_0^x f^{(k)}(t) (x - t)^{k-1} \, dt.
\]
Define the function
\[
r_x(t) = \sum_{i=0}^{k-1} \frac{x^i t^i}{i! \, i!} + \frac{(-1)^k}{(2k-1)!} \max\{x - t, 0\}^{2k-1} + \sum_{i=0}^{k-1} (-1)^{k+i+1} \frac{x^{2k-1-i} t^i}{(2k-1-i)! \, i!}.
\]
Then
\[
r_x^{(i)}(0) = \frac{1}{i!} x^i + \frac{(-1)^{k+i}}{(2k-1-i)!} \max\{x, 0\}^{2k-1-i} + \frac{(-1)^{k+i+1}}{(2k-1-i)!} x^{2k-1-i} = \frac{x^i}{i!}
\]
for $i < k$, and
\[
r_x^{(k)}(t) = \frac{1}{(k-1)!} \max\{x - t, 0\}^{k-1}.
\]
Thus we have
\begin{align*}
\langle f, r_x \rangle
&= f(0) + f'(0) x + \frac{1}{2} f''(0) x^2 + \cdots + \frac{1}{(k-1)!} f^{(k-1)}(0) x^{k-1} + \frac{1}{(k-1)!} \int_0^1 f^{(k)}(t) \, [x - t]_+^{k-1} \, dt \\
&= \sum_{i=0}^{k-1} \frac{f^{(i)}(0)}{i!} x^i + \frac{1}{(k-1)!} \int_0^x f^{(k)}(t) (x - t)^{k-1} \, dt \\
&= f(x),
\end{align*}
where the last equality is Taylor's theorem.

(b) For the reproducing kernel, note that
\begin{align*}
k(x, z) = \langle r_x, r_z \rangle
&= \sum_{i=0}^{k-1} \frac{x^i z^i}{i! \, i!} + \frac{1}{(k-1)!(k-1)!} \int_0^1 [x - t]_+^{k-1} [z - t]_+^{k-1} \, dt \\
&= \sum_{i=0}^{k-1} \frac{x^i z^i}{i! \, i!} + \frac{1}{(k-1)!(k-1)!} \int_0^{\min\{x, z\}} (x - t)^{k-1} (z - t)^{k-1} \, dt.
\end{align*}
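
As a numerical sanity check of this formula, one can evaluate the integral by quadrature. The sketch below assumes SciPy is available and uses the fact that for $k = 1$ the formula reduces to $1 + \min\{x, z\}$; the helper name `sobolev_kernel` is made up for the illustration.

```python
from math import factorial
from scipy.integrate import quad

def sobolev_kernel(x, z, k):
    """Evaluate the kernel formula from part (b) by numerical integration."""
    poly = sum((x ** i * z ** i) / factorial(i) ** 2 for i in range(k))
    integrand = lambda t: (x - t) ** (k - 1) * (z - t) ** (k - 1)
    integral, _ = quad(integrand, 0.0, min(x, z))
    return poly + integral / factorial(k - 1) ** 2

# For k = 1 the formula collapses to 1 + min{x, z}, which serves as a quick check.
x, z = 0.3, 0.7
assert abs(sobolev_kernel(x, z, k=1) - (1 + min(x, z))) < 1e-8
print(sobolev_kernel(x, z, k=2))  # kernel value in the k = 2 Sobolev space
```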

(c) To see that $\mathcal{F}_k$ is a Hilbert space, we must show that $\|f\|_{\mathcal{H}}^2 = \langle f, f \rangle$ is a norm and that $\mathcal{F}_k$ is complete for $\|\cdot\|_{\mathcal{H}}$. Non-negativity of $\|\cdot\|_{\mathcal{H}}$ and the triangle inequality are trivial, as it is clear that $\langle \cdot, \cdot \rangle$ is an inner product. Now suppose that $\|f\|_{\mathcal{H}} = 0$. Then $f^{(l)}(0) = 0$ for all $l < k$, and $\int_0^1 f^{(k)}(t)^2 \, dt = 0$, so that $f^{(k)} = 0$ almost everywhere. Of course, this shows that $f^{(k-1)} \equiv 0$ by integration, and so on, so that $f \equiv 0$. To show completeness, let $f_n$ be a Cauchy sequence in $\mathcal{F}_k$. Then since
\[
\|f_n - f_m\|_{\mathcal{H}}^2 = \sum_{l=0}^{k-1} \left( f_n^{(l)}(0) - f_m^{(l)}(0) \right)^2 + \int_0^1 \left( f_n^{(k)}(t) - f_m^{(k)}(t) \right)^2 dt,
\]
it is clear that $f_n^{(l)}(0)$ is a Cauchy sequence in $\mathbb{R}$ and $f_n^{(k)}$ is a Cauchy sequence in $L^2([0, 1])$. Completeness of $\mathbb{R}$ and completeness of $L^2$ then imply the existence of $\lim_n f_n^{(l)}(0)$ for $l < k$ and a $g \in L^2([0, 1])$ such that $f_n^{(k)} \to g$ in $L^2$. Now define the functions $f^{(l)}$ by
\[
f^{(k)}(x) = g(x), \quad f^{(k-1)}(x) = \lim_n f_n^{(k-1)}(0) + \int_0^x g(t) \, dt, \quad \ldots, \quad f(x) = \lim_n f_n(0) + \int_0^x f^{(1)}(t) \, dt.
\]
Since $f^{(k)} \in L^2([0, 1])$, it is clear that each of the $f^{(l)}$ for $l < k$ is absolutely continuous, and the derivative of $f^{(l)}$ is $f^{(l+1)}$. So $f_n$ indeed has a limit $f$.

Question 4: The variation distance between probability distributions $P$ and $Q$ on a space $\mathcal{X}$ is defined by $\|P - Q\|_{\mathrm{TV}} = \sup_{A \subset \mathcal{X}} |P(A) - Q(A)|$.

(a) Show that
\[
2 \|P - Q\|_{\mathrm{TV}} = \sup_{f : \|f\|_\infty \le 1} \left\{ \mathbb{E}_P[f(X)] - \mathbb{E}_Q[f(X)] \right\},
\]
where the supremum is taken over all functions with $f(x) \in [-1, 1]$, and the first expectation is taken with respect to $P$ and the second with respect to $Q$. You may assume that $P$ and $Q$ have densities.

Answer: Using the assumption that we have densities and that $P(A) - Q(A) = 1 - P(A^c) - (1 - Q(A^c)) = Q(A^c) - P(A^c)$, we have
\[
\|P - Q\|_{\mathrm{TV}} = \sup_{A \subset \mathcal{X}} \{P(A) - Q(A)\} = \sup_A \int 1\{x \in A\} (p(x) - q(x)) \, dx = \int 1\{p(x) \ge q(x)\} (p(x) - q(x)) \, dx.
\]
Similarly, we have $\|P - Q\|_{\mathrm{TV}} = \sup_A \{Q(A) - P(A)\}$, and combining these yields
\[
2 \|P - Q\|_{\mathrm{TV}} = \int \left( 1\{p(x) \ge q(x)\} - 1\{p(x) \le q(x)\} \right) (p(x) - q(x)) \, dx.
\]
But of course, $\sup_{a \in [-1, 1]} a (p - q) = (p - q)\left( 1\{p \ge q\} - 1\{p \le q\} \right)$, which proves the result.
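
As an illustrative aside, for discrete distributions both characterizations can be computed exactly. A minimal NumPy sketch with a made-up four-point example follows; the function names are hypothetical.

```python
import numpy as np

def tv_distance(p, q):
    """Total variation distance via the maximizing set A = {x : p(x) >= q(x)}."""
    A = p >= q
    return np.sum((p - q)[A])

def tv_via_l1(p, q):
    """Equivalent formula: half the L1 distance, i.e. (1/2) sup over |f| <= 1."""
    return 0.5 * np.sum(np.abs(p - q))

# Two hypothetical distributions on a four-point space, for illustration only.
p = np.array([0.4, 0.3, 0.2, 0.1])
q = np.array([0.25, 0.25, 0.25, 0.25])
assert np.isclose(tv_distance(p, q), tv_via_l1(p, q))
print(tv_distance(p, q))  # 0.15 + 0.05 = 0.20
```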

Question 5: In a number of experimental situations, it is valuable to determine whether two distributions $P$ and $Q$ are the same or different. For example, $P$ may be the distribution of widgets produced by one machine, $Q$ the distribution of widgets produced by a second machine, and we wish to test whether the two distributions are the same (to within allowable tolerances). Let $\mathcal{H}$ be an RKHS of functions with domain $\mathcal{X}$ and reproducing kernel $k$, and let $P$ and $Q$ be distributions on $\mathcal{X}$.

(a) Let $\|\cdot\|_{\mathcal{H}}$ denote the norm on the Hilbert space $\mathcal{H}$. Show that
\[
D_k(P, Q)^2 := \sup_{f : \|f\|_{\mathcal{H}} \le 1} \left| \mathbb{E}_P[f(X)] - \mathbb{E}_Q[f(Z)] \right|^2 = \mathbb{E}[k(X, X')] + \mathbb{E}[k(Z, Z')] - 2 \mathbb{E}[k(X, Z)],
\]
where $X, X' \stackrel{\mathrm{iid}}{\sim} P$ and $Z, Z' \stackrel{\mathrm{iid}}{\sim} Q$.

(b) A kernel $k : \mathcal{X} \times \mathcal{X} \to \mathbb{R}$ is called universal if the induced RKHS $\mathcal{H}$ of functions $f : \mathcal{X} \to \mathbb{R}$ can arbitrarily approximate continuous functions. That is, for any continuous $\phi : \mathcal{X} \to \mathbb{R}$ and $\epsilon > 0$, there is some $f \in \mathcal{H}$ such that
\[
\sup_{x \in \mathcal{X}} |f(x) - \phi(x)| \le \epsilon.
\]
Show that if $k$ is universal, then
\[
D_k(P, Q) = 0 \quad \text{if and only if} \quad P = Q.
\]
You may assume $\mathcal{X}$ is a metric space and that $P = Q$ iff $P(A) = Q(A)$ for all compact $A \subset \mathcal{X}$.

(c) You wish to estimate $D_k(P, Q)$ given samples from each of the distributions. Assume that $k(x, z) \in [-B, B]$ for all $x, z \in \mathcal{X}$. Let $X_i \stackrel{\mathrm{iid}}{\sim} P$, $i = 1, \ldots, n_1$, and $Z_i \stackrel{\mathrm{iid}}{\sim} Q$, $i = 1, \ldots, n_2$. Define
\[
\widehat{K}(X_{1:n_1}) := \binom{n_1}{2}^{-1} \sum_{1 \le i < j \le n_1} k(X_i, X_j), \qquad
\widehat{K}(Z_{1:n_2}) := \binom{n_2}{2}^{-1} \sum_{1 \le i < j \le n_2} k(Z_i, Z_j),
\]
and
\[
\widehat{K}(X_{1:n_1}, Z_{1:n_2}) := \frac{1}{n_1 n_2} \sum_{i=1}^{n_1} \sum_{j=1}^{n_2} k(X_i, Z_j).
\]

Show that $\mathbb{E}[\widehat{K}(X_{1:n_1})] = \mathbb{E}[k(X, X')]$ and $\mathbb{E}[\widehat{K}(X_{1:n_1}, Z_{1:n_2})] = \mathbb{E}[k(X, Z)]$ for $X, X' \stackrel{\mathrm{iid}}{\sim} P$ and $Z, Z' \stackrel{\mathrm{iid}}{\sim} Q$. Show for some numerical constant $c > 0$ that for all $t \ge 0$,
\[
\mathbb{P}\left( \left| \widehat{K}(X_{1:n}) - \mathbb{E}[k(X, X')] \right| \ge t \right) \le 2 \exp\left( -c \frac{n t^2}{B^2} \right)
\]
and
\[
\mathbb{P}\left( \left| \widehat{K}(X_{1:n_1}, Z_{1:n_2}) - \mathbb{E}[k(X, Z)] \right| \ge t \right) \le 2 \exp\left( -c \frac{n_1 t^2}{B^2} \right) + 2 \exp\left( -c \frac{n_2 t^2}{B^2} \right).
\]

(d) Define the empirical Hilbert distance
\[
\widehat{D}_k^2(P, Q) := \binom{n_1}{2}^{-1} \sum_{1 \le i < j \le n_1} k(X_i, X_j) + \binom{n_2}{2}^{-1} \sum_{1 \le i < j \le n_2} k(Z_i, Z_j) - \frac{2}{n_1 n_2} \sum_{i=1}^{n_1} \sum_{j=1}^{n_2} k(X_i, Z_j).
\]
Show that for all $t \ge 0$,
\[
\mathbb{P}\left( \left| \widehat{D}_k^2(P, Q) - D_k^2(P, Q) \right| \ge t \right) \le C \exp\left( -c \frac{\min\{n_1, n_2\} t^2}{B^2} \right),
\]
where $0 < c, C < \infty$ are numerical constants.

Answer:

(a) As $k : \mathcal{X} \times \mathcal{X} \to \mathbb{R}$ is the reproducing kernel for $\mathcal{H}$, we have for any $f \in \mathcal{H}$ such that $\|f\|_{\mathcal{H}} \le 1$
\begin{align*}
\mathbb{E}[f(X)] - \mathbb{E}[f(Z)]
&= \mathbb{E}[\langle f, k(X, \cdot) \rangle] - \mathbb{E}[\langle f, k(Z, \cdot) \rangle] \\
&\stackrel{(i)}{=} \langle f, \mathbb{E}[k(X, \cdot) - k(Z, \cdot)] \rangle \\
&\stackrel{(ii)}{\le} \|f\|_{\mathcal{H}} \, \|\mathbb{E}[k(X, \cdot) - k(Z, \cdot)]\|_{\mathcal{H}} \le \|\mathbb{E}[k(X, \cdot) - k(Z, \cdot)]\|_{\mathcal{H}},
\end{align*}
where we have used linearity in (i) and Cauchy-Schwarz in (ii), and that $\|f\|_{\mathcal{H}} \le 1$ in the final inequality. Equality holds in step (ii) if
\[
f(\cdot) = \frac{\mathbb{E}[k(X, \cdot) - k(Z, \cdot)]}{\|\mathbb{E}[k(X, \cdot) - k(Z, \cdot)]\|_{\mathcal{H}}},
\]
and we have
\begin{align*}
\|\mathbb{E}[k(X, \cdot) - k(Z, \cdot)]\|_{\mathcal{H}}^2
&= \left\langle \mathbb{E}[k(X, \cdot) - k(Z, \cdot)], \ \mathbb{E}[k(X', \cdot) - k(Z', \cdot)] \right\rangle \\
&= \left\langle \mathbb{E}[k(X, \cdot)], \mathbb{E}[k(X', \cdot)] \right\rangle + \left\langle \mathbb{E}[k(Z, \cdot)], \mathbb{E}[k(Z', \cdot)] \right\rangle - 2 \left\langle \mathbb{E}[k(X, \cdot)], \mathbb{E}[k(Z, \cdot)] \right\rangle \\
&= \mathbb{E}[k(X, X')] + \mathbb{E}[k(Z, Z')] - 2 \mathbb{E}[k(X, Z)],
\end{align*}
where the final equality uses the linearity of the inner product and independence of $X, X', Z, Z'$.
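
As an illustrative aside, the right-hand side of this identity can be estimated directly from samples. The sketch below assumes NumPy and uses a Gaussian RBF kernel purely as an example of a bounded kernel; the helper names are hypothetical.

```python
import numpy as np

def rbf(x, z, sigma=1.0):
    """Gaussian RBF kernel; values lie in (0, 1], so B = 1 in the notation of part (c)."""
    return np.exp(-np.sum((x - z) ** 2) / (2 * sigma ** 2))

def mmd_squared(X, Z, kernel=rbf):
    """Plug-in estimate of E[k(X, X')] + E[k(Z, Z')] - 2 E[k(X, Z)]."""
    kxx = np.mean([kernel(X[i], X[j]) for i in range(len(X)) for j in range(len(X)) if i != j])
    kzz = np.mean([kernel(Z[i], Z[j]) for i in range(len(Z)) for j in range(len(Z)) if i != j])
    kxz = np.mean([kernel(x, z) for x in X for z in Z])
    return kxx + kzz - 2 * kxz

rng = np.random.default_rng(1)
X = rng.normal(0.0, 1.0, size=(100, 1))
Z_same = rng.normal(0.0, 1.0, size=(100, 1))   # same distribution: estimate near 0
Z_diff = rng.normal(1.0, 1.0, size=(100, 1))   # shifted distribution: estimate clearly positive
print(mmd_squared(X, Z_same), mmd_squared(X, Z_diff))
```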
(b) Suppose that $P = Q$. Then certainly $\mathbb{E}_P[f(X)] - \mathbb{E}_Q[f(Z)] = \mathbb{E}_P[f(X)] - \mathbb{E}_P[f(X)] = 0$ for all $f \in \mathcal{H}$. Now suppose $P \neq Q$. Then there exists a compact set $A$ such that $P(A) \neq Q(A)$. For $n \in \mathbb{N}$, define the function
\[
\phi_n(x) = \max\{1 - n \cdot \mathrm{dist}(x, A), 0\} = [1 - n \, \mathrm{dist}(x, A)]_+,
\]
which satisfies $\phi_n(x) = 1$ for $x \in A$, $\phi_n(x) = 0$ for $x$ such that $\mathrm{dist}(x, A) \ge 1/n$, and is Lipschitz continuous. Moreover, we have $\phi_n(x) \downarrow 1\{x \in A\}$ for all $x$ as $n \to \infty$. Thus the monotone convergence theorem gives that
\[
\lim_n \mathbb{E}_P[\phi_n(X)] = P(A) \quad \text{and} \quad \lim_n \mathbb{E}_Q[\phi_n(Z)] = Q(A).
\]
Let $\epsilon > 0$ be such that $|P(A) - Q(A)| \ge 4\epsilon$. Choose $N$ such that $n \ge N$ implies $|\mathbb{E}_P[\phi_n] - P(A)| < \epsilon$ and $|\mathbb{E}_Q[\phi_n] - Q(A)| < \epsilon$, and let $n \ge N$. Choose $f \in \mathcal{H}$ such that $\sup_x |f(x) - \phi_n(x)| \le \epsilon$. Then
\[
|\mathbb{E}_P[f(X)] - \mathbb{E}_Q[f(Z)]| \ge |\mathbb{E}_P[\phi_n(X)] - \mathbb{E}_Q[\phi_n(Z)]| - 2\epsilon > |P(A) - Q(A)| - 4\epsilon \ge 4\epsilon - 4\epsilon = 0.
\]
Dividing by $\|f\|_{\mathcal{H}}$, we have
\[
D_k(P, Q) = \sup_{g : \|g\|_{\mathcal{H}} \le 1} |\mathbb{E}_P[g] - \mathbb{E}_Q[g]| \ge \frac{|\mathbb{E}_P[f(X)] - \mathbb{E}_Q[f(Z)]|}{\|f\|_{\mathcal{H}}} > 0.
\]

(c) The expectation equalities are immediate.

We apply bounded differences for the first statement. We first look at $f(x_{1:n}) = \widehat{K}(x_{1:n})$. As the function is symmetric, we fix index $i = 1$. Then for $x, x' \in \mathcal{X}$, we have
\[
f(x, x_{2:n}) - f(x', x_{2:n}) = \binom{n}{2}^{-1} \sum_{j=2}^n \left( k(x, X_j) - k(x', X_j) \right),
\]
and using that $k(x, x') \in [-B, B]$, the summands are each bounded by $2B$ in magnitude. Thus
\[
|f(x, x_{2:n}) - f(x', x_{2:n})| \le \frac{2}{n(n-1)} \cdot 2B(n - 1) = \frac{4B}{n}.
\]
Bounded differences (McDiarmid's inequality) implies
\[
\mathbb{P}\left( \left| \widehat{K}(X_{1:n}) - \mathbb{E}[\widehat{K}(X_{1:n})] \right| \ge t \right) \le 2 \exp\left( -\frac{n t^2}{8 B^2} \right).
\]

The argument for $\widehat{K}(X_{1:n_1}, Z_{1:n_2})$ is a bit more complex. Define
\[
\widehat{K}(X_{1:n_1}, Q) = \frac{1}{n_1} \sum_{i=1}^{n_1} \mathbb{E}_Q[k(X_i, Z) \mid X_i].
\]
Then we have
\[
\mathbb{E}[\widehat{K}(X_{1:n_1}, Z_{1:n_2}) \mid X_{1:n_1}] = \widehat{K}(X_{1:n_1}, Q)
\]
by the independence of $Z_i, X_j$. Fixing $X_{1:n_1}$, define the function $g(z_{1:n_2} \mid X_{1:n_1})$ by
\[
g(z_{1:n_2} \mid X_{1:n_1}) = \widehat{K}(X_{1:n_1}, z_{1:n_2}).
\]
Then $g$ satisfies bounded differences with parameter $4B/n_2$, as above, and so conditional on $X_{1:n_1}$, we have
\[
\mathbb{P}\left( \left| g(Z_{1:n_2} \mid X_{1:n_1}) - \widehat{K}(X_{1:n_1}, Q) \right| \ge t \ \Big| \ X_{1:n_1} \right) \le 2 \exp\left( -\frac{n_2 t^2}{8 B^2} \right). \tag{1}
\]
Now we argue that
\[
x_{1:n_1} \mapsto \widehat{K}(x_{1:n_1}, Q)
\]
satisfies bounded differences as well. Note that $\mathbb{E}[\widehat{K}(X_{1:n_1}, Q)] = \mathbb{E}[k(X, Z)]$ by construction. Without loss of generality let us fix $x_{2:n_1}$ and modify $x_1 \in \{x, x'\}$. Then
\[
\widehat{K}(x, x_{2:n_1}, Q) - \widehat{K}(x', x_{2:n_1}, Q) = \frac{1}{n_1} \mathbb{E}_Q[k(x, Z) - k(x', Z)] \in \left[ -\frac{2B}{n_1}, \frac{2B}{n_1} \right],
\]
satisfying bounded differences with parameter $2B/n_1$. Thus we have
\[
\mathbb{P}\left( \left| \widehat{K}(X_{1:n_1}, Q) - \mathbb{E}[k(X, Z)] \right| \ge t \right) \le 2 \exp\left( -\frac{n_1 t^2}{2 B^2} \right). \tag{2}
\]
Combining the bounds (1) and (2) and applying the tower property of expectation and the triangle inequality, we have
\begin{align*}
\mathbb{P}\left( \left| \widehat{K}(X_{1:n_1}, Z_{1:n_2}) - \mathbb{E}[k(X, Z)] \right| \ge t \right)
&\le \mathbb{E}\left[ \mathbb{P}\left( \left| g(Z_{1:n_2} \mid X_{1:n_1}) - \widehat{K}(X_{1:n_1}, Q) \right| \ge t/2 \ \Big| \ X_{1:n_1} \right) \right] + \mathbb{P}\left( \left| \widehat{K}(X_{1:n_1}, Q) - \mathbb{E}[k(X, Z)] \right| \ge t/2 \right) \\
&\le 2 \exp\left( -\frac{n_2 t^2}{32 B^2} \right) + 2 \exp\left( -\frac{n_1 t^2}{8 B^2} \right).
\end{align*}
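
As an optional illustration of the concentration just derived, one can simulate $\widehat{K}(X_{1:n})$ for a bounded kernel and check that its fluctuations are on the scale the bound allows. In the sketch below, the kernel $\cos(x - z)$ is chosen only because it is positive semidefinite and bounded in $[-1, 1]$ (so $B = 1$); the setup is hypothetical.

```python
import numpy as np

def k_hat(X):
    """(n choose 2)^{-1} sum_{i<j} k(X_i, X_j) for the bounded kernel k(x, z) = cos(x - z)."""
    # cos(x - z) = <(cos x, sin x), (cos z, sin z)>, so this kernel is PSD with B = 1.
    K = np.cos(np.subtract.outer(X, X))
    iu = np.triu_indices(len(X), k=1)
    return K[iu].mean()

rng = np.random.default_rng(2)
n, reps = 200, 300
estimates = np.array([k_hat(rng.uniform(0.0, np.pi, n)) for _ in range(reps)])
# The bound P(|K_hat - E K_hat| >= t) <= 2 exp(-n t^2 / (8 B^2)) guarantees deviations
# of order at most B / sqrt(n); the empirical spread should respect that scale.
print(estimates.mean(), estimates.std())
```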
