Solutions 2
Dr. A. Alpers
Problem 1.

(a) The method described above is either cdf inversion or rejection sampling. Exclude one of the two alternatives and explain your reasoning.

(b) Determine both the pdf $f_X$ and the cdf $F_X$ of the random variable $X$.
Solution.
(a) It is definitely not rejection sampling, since in rejection sampling one draws a pair (!) of random numbers (and accepts or rejects one of them based on a criterion that involves both samples). The method above draws only a single random number $U$.
(b) The above method is therefore cdf inversion, hence in Step 2 we have $x = F_X^{-1}(u)$. Since $u \in (0, 1)$ we see from Step 2 that $x \in (0, 2)$. Furthermore, by solving the two equations in Step 2 for $u$ we obtain:

For $u < 1/2$ we need to solve $x = 1 - \sqrt{-2u + 1}$ for $u$, and therefore $u = -\frac{1}{2}x^2 + x$.

For $u \ge 1/2$ we need to solve $x = 1 + \sqrt{2u - 1}$ for $u$, and therefore $u = \frac{1}{2}x^2 - x + 1$.
Hence,
$$F_X(x) = \begin{cases} 0 & : x < 0,\\ -\tfrac{1}{2}x^2 + x & : 0 \le x < 1,\\ \tfrac{1}{2}x^2 - x + 1 & : 1 \le x \le 2,\\ 1 & : x > 2. \end{cases}$$
By computing the derivative of FX (x) with respect to x we obtain the pdf
d 1−x
: 0 ≤ x < 1,
fX (x) = FX (x) = x−1 : 1 ≤ x ≤ 2,
dx
0 : x < 0 or x > 2.
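For illustration, the two inversion branches can be coded directly. The following Python sketch (the function name `sample_x` is our own; the branches follow the formulas above) draws one realization of $X$:

```python
import random

def sample_x():
    """Draw X by cdf inversion: x = F_X^{-1}(u) with u ~ Uniform(0, 1)."""
    u = random.random()
    if u < 0.5:
        # invert u = -x^2/2 + x on [0, 1):  x = 1 - sqrt(-2u + 1)
        return 1 - (-2 * u + 1) ** 0.5
    # invert u = x^2/2 - x + 1 on [1, 2]:  x = 1 + sqrt(2u - 1)
    return 1 + (2 * u - 1) ** 0.5
```

A histogram of many such draws should approximate the V-shaped pdf $f_X$ derived above.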
Problem 2. Consider the following collection of seven points $\{v_1, \ldots, v_7\}$ and sets $L_1, \ldots, L_7$:
[Figure: the seven points drawn in a triangular arrangement, with $v_1, v_2, v_3$ along the bottom, $v_5$ and $v_6$ on the sides, $v_4$ in the center, and $v_7$ at the top.]

$$\begin{aligned}
L_1 &= \{v_1, v_2, v_3\}, & L_2 &= \{v_3, v_6, v_7\}, & L_3 &= \{v_1, v_5, v_7\}, & L_4 &= \{v_1, v_4, v_6\},\\
L_5 &= \{v_2, v_4, v_7\}, & L_6 &= \{v_3, v_4, v_5\}, & L_7 &= \{v_2, v_5, v_6\}.
\end{aligned}$$
Solution.
(a)
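The simulation for (a) is not reproduced here. As a minimal sketch, assuming the chain from the problem statement works as recapped in (b) below (remove a uniformly chosen element $a$ of the current basis, then add a uniformly chosen element $b$, possibly $b = a$, that yields a basis again), the sampler could look as follows in Python; all names are our own:

```python
import random

# The seven lines L1, ..., L7 from the problem statement (points as 1..7).
LINES = {frozenset(s) for s in
         [{1, 2, 3}, {3, 6, 7}, {1, 5, 7}, {1, 4, 6},
          {2, 4, 7}, {3, 4, 5}, {2, 5, 6}]}
POINTS = range(1, 8)

def is_basis(s):
    """A basis is any 3-element subset that is not one of the lines."""
    return len(s) == 3 and frozenset(s) not in LINES

def sample_bases(n_trials, seed=0):
    """Run the basis-exchange chain; count visits to each basis."""
    rng = random.Random(seed)
    counts = {}
    basis = frozenset({1, 2, 4})          # any basis works as start state
    for _ in range(n_trials):
        a = rng.choice(sorted(basis))     # pick a in B uniformly
        rest = basis - {a}
        # Step (iib): pick b uniformly among all points (b = a allowed)
        # that extend B \ {a} into a basis again.
        candidates = [b for b in POINTS
                      if b not in rest and is_basis(rest | {b})]
        basis = rest | {rng.choice(candidates)}
        counts[basis] = counts.get(basis, 0) + 1
    return counts
```

Running it with `n_trials = 35000` should return roughly $1{,}250$ visits per basis, in line with part (c).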
(b) (i) Aperiodicity: Each state of the Markov chain has a self-loop, since with non-zero probability $b = a$ is selected in Step (iib).

(ii) Irreducibility: Let $B_1 \neq B_2$ be bases. Starting from $B_1$ and selecting $a \in B_1 \setminus B_2$ and a suitable $b \in B_2 \setminus B_1$, we obtain by the basis exchange property a basis $B'$ that has at least one more element in common with $B_2$ than $B_1$ did. Iterating this argument at most two more times (as all bases have three elements), we arrive, with such moves, at $B_2$.
(iii) Uniform distribution: The proposal move is symmetric: if we are in the current state $B_1$ and $B_2 = (B_1 \setminus \{a\}) \cup \{b\}$ with $a, b \in \{v_1, \ldots, v_7\}$ is proposed, then this happens with probability $P(B_1, B_2) = \frac{1}{3} \cdot \frac{1}{k}$, while proposing $B_1 = (B_2 \setminus \{b\}) \cup \{a\}$ also happens with probability $P(B_2, B_1) = \frac{1}{3} \cdot \frac{1}{k}$ (here $k$ is the number of ways in which $B_1 \setminus \{a\} = B_2 \setminus \{b\}$ can be extended into a basis). It follows directly that the stationary distribution $\pi$ is uniform, by verifying detailed balance (or by quoting Exercise 20): with $M$ denoting the number of bases,
$$\pi(B_1) P(B_1, B_2) = \frac{1}{M} P(B_1, B_2) \overset{\text{symm.}}{=} \frac{1}{M} P(B_2, B_1) = \pi(B_2) P(B_2, B_1).$$
(c) By definition, the bases are all three-element subsets $S \subseteq \{v_1, \ldots, v_7\}$ except for the sets $L_1, \ldots, L_7$. This gives a total of
$$\binom{7}{3} - 7 = 35 - 7 = 28$$
bases. Since they are sampled uniformly in (a), each should be sampled approximately
$$\frac{\texttt{nTrials}}{28} = 1{,}250$$
times.
Problem 3. Consider learning specific sets on the feature space $R = \{a, b, c, d\}$. The hypothesis class $H$ is the following class of sets over the domain $R$:
$$H = \{\{a, b, c\}, \{a, b, d\}, \{a, c\}, \{a, d\}, \{b, d\}, \{c, d\}, \{b\}, \{c\}\}.$$
Solution.
For $S = \{c\}$ we have $\{c\} = \{c\} \cap \{c\}$ and $\emptyset = \{c\} \cap \{a, b, d\}$; hence $\{c\}$ is also shattered by $H$.
(b) $\mathrm{VCD}(H) \ge 3$ since $S = \{a, b, d\}$ is shattered by $H$, i.e., for each subset $s$ of $\{a, b, d\}$ there is a set $h \in H$ such that $S \cap h = s$:
$$\begin{aligned}
\emptyset &= \{a, b, d\} \cap \{c\}, & \{a, b\} &= \{a, b, d\} \cap \{a, b, c\},\\
\{a\} &= \{a, b, d\} \cap \{a, c\}, & \{a, d\} &= \{a, b, d\} \cap \{a, d\},\\
\{b\} &= \{a, b, d\} \cap \{b\}, & \{b, d\} &= \{a, b, d\} \cap \{b, d\},\\
\{d\} &= \{a, b, d\} \cap \{c, d\}, & \{a, b, d\} &= \{a, b, d\} \cap \{a, b, d\}.
\end{aligned}$$
(Remark: The other candidate for a three-element set, {a, b, c}, is not shattered; for instance,
one cannot obtain the empty set as an intersection with an element of H.)
(c) There is no larger set shattered by $H$, since the only candidate is $C = R = \{a, b, c, d\}$ and this set cannot be obtained by intersecting $R$ with an element of $H$ (all elements of $H$ contain fewer than four elements). Hence $\mathrm{VCD}(H) = 3$.
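These intersection checks are easy to automate. A short Python sketch (function names are our own) that tests whether a given set is shattered by $H$:

```python
from itertools import combinations

H = [{"a", "b", "c"}, {"a", "b", "d"}, {"a", "c"}, {"a", "d"},
     {"b", "d"}, {"c", "d"}, {"b"}, {"c"}]

def subsets(s):
    """All subsets of s, from the empty set up to s itself."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_shattered(s, hypotheses):
    """s is shattered iff every subset of s arises as s & h for some h."""
    realized = {frozenset(s & h) for h in hypotheses}
    return all(t in realized for t in subsets(s))

print(is_shattered({"a", "b", "d"}, H))       # True, so VCD(H) >= 3
print(is_shattered({"a", "b", "c"}, H))       # False: the empty set is missed
print(is_shattered({"a", "b", "c", "d"}, H))  # False, hence VCD(H) = 3
```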
Problem 4. A homogeneous Markov chain $\{X_t\}_{t \ge 0}$ with state space $\Omega = \{0, 1, 2\}$ has the following transition matrix (rows and columns indexed by the states $0, 1, 2$):
$$P = \begin{pmatrix} 1 & 0 & 0\\ 1/4 & 1/2 & 1/4\\ 0 & 0 & 1 \end{pmatrix}.$$
(d) State the global balance equation and the normalization condition.
(e) There is more than one stationary distribution. Determine all of them.
Solution.
(a) [Figure: transition diagram on the states 0, 1, 2; state 1 moves to 0 and to 2 with probability 1/4 each and stays with probability 1/2, while states 0 and 2 each stay with probability 1.]
(b) We read this off from the matrix: $\Pr(X_1 = 0 \mid X_0 = 1) = p_{2,1} = 1/4$ (the entry in row 2, column 1 of $P$).
(c) We compute
$$P^2 = \begin{pmatrix} 1 & 0 & 0\\ 3/8 & 1/4 & 3/8\\ 0 & 0 & 1 \end{pmatrix}.$$
Hence $\Pr(X_2 = 0 \mid X_0 = 1) = 3/8 = 0.375$.
(d) The global balance equations are $\pi P = \pi$, i.e., $\sum_{i \in \Omega} \pi(i)\, p_{i,j} = \pi(j)$ for all $j \in \Omega$; the normalization condition is $\sum_{i \in \Omega} \pi(i) = 1$.

(e) Writing out the global balance equations for the given $P$, each of them is equivalent to $\pi(1) = 0$ (the other equations are redundant). This together with the normalization condition shows that we can set $\pi(0)$ to any value $p \in [0, 1]$; then $\pi(2) = 1 - p$ and $\pi(1) = 0$. Hence for each value of $p \in [0, 1]$, any $\pi$ of the form
$$\pi = (p,\ 0,\ 1 - p)$$
is a stationary distribution, and these are all of them.
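Both (c) and (e) are quick to verify numerically; a minimal Python sketch (ours, not part of the original solution):

```python
import numpy as np

P = np.array([[1, 0, 0],
              [1/4, 1/2, 1/4],
              [0, 0, 1]])

# Part (c): two-step transition probabilities.
print((P @ P)[1, 0])  # 0.375 = Pr(X_2 = 0 | X_0 = 1)

# Part (e): every pi = (p, 0, 1 - p) satisfies pi @ P = pi.
for p in (0.0, 0.3, 1.0):
    pi = np.array([p, 0.0, 1.0 - p])
    assert np.allclose(pi @ P, pi)
```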
Problem 5. Consider the set $X_\alpha = \{-6\alpha, 0, 2\alpha\} \subseteq \mathbb{R}$ of data points, depending on a parameter $\alpha \in \mathbb{R}$ with $\alpha > 0$, and the task of clustering $X_\alpha$ into two clusters $C_1$ and $C_2$ using the $k$-means algorithm with $k = 2$.
(a) Start initially with the seeds $s_1 = -3\alpha$ and $s_2 = 6\alpha$, and show the first two iterations of $k$-means, indicating at each iteration which points belong to each cluster and the coordinates of the two new cluster centers. In other words, fill in the tables below.

Iteration 1:

Data Point    Cluster ($C_1$ or $C_2$)
$-6\alpha$
$0$
$2\alpha$

$s_1 =$ _____, $s_2 =$ _____

Iteration 2:
Data Point    Cluster ($C_1$ or $C_2$)
$-6\alpha$
$0$
$2\alpha$

$s_1 =$ _____, $s_2 =$ _____
(b) True or False: There is an $\alpha > 0$ and initial seeds such that the $k$-means algorithm (for $k = 2$) clusters $X_\alpha$ into $C_1 = \{-6\alpha\}$, $C_2 = \{0, 2\alpha\}$ after iteration 1, while after iteration 2 the clustering $C_1 = \{-6\alpha, 0\}$, $C_2 = \{2\alpha\}$ is produced. (Briefly explain your answer.)
Solution.
(a) Iteration 1:

Data Point    Cluster ($C_1$ or $C_2$)
$-6\alpha$    $C_1$
$0$           $C_1$
$2\alpha$     $C_2$

$s_1 = -3\alpha$, $s_2 = 2\alpha$

Iteration 2:

Data Point    Cluster ($C_1$ or $C_2$)
$-6\alpha$    $C_1$
$0$           $C_2$
$2\alpha$     $C_2$

$s_1 = -6\alpha$, $s_2 = \alpha$
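The two iterations can be replayed in a few lines of Python (a sketch of ours, with $\alpha = 1$ for concreteness; `kmeans_step` is a hypothetical helper):

```python
def kmeans_step(points, centers):
    """One k-means iteration: assign each point to the nearest center,
    then recompute each center as the mean of its cluster."""
    clusters = [[], []]
    for x in points:
        nearest = min(range(2), key=lambda i: abs(x - centers[i]))
        clusters[nearest].append(x)
    return clusters, [sum(c) / len(c) for c in clusters]

alpha = 1.0
points = [-6 * alpha, 0.0, 2 * alpha]
centers = [-3 * alpha, 6 * alpha]  # initial seeds s1, s2
for it in (1, 2):
    clusters, centers = kmeans_step(points, centers)
    print(f"Iteration {it}: clusters={clusters}, centers={centers}")
# Iteration 1: clusters=[[-6.0, 0.0], [2.0]], centers=[-3.0, 2.0]
# Iteration 2: clusters=[[-6.0], [0.0, 2.0]], centers=[-6.0, 1.0]
```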
(b) False. This cannot happen. For the first clustering we would have the cluster centers $s_1 = -6\alpha$, $s_2 = \alpha$, while for the second we would have $s_1 = -3\alpha$, $s_2 = 2\alpha$. Computing the sum of squared errors (SSE), for the first clustering we obtain an SSE of $0 + \alpha^2 + \alpha^2 = 2\alpha^2$, while for the second we obtain $9\alpha^2 + 9\alpha^2 + 0 = 18\alpha^2$. The second value is strictly larger than the first; as $k$-means never increases the SSE, this cannot happen.
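The SSE comparison can also be checked directly in Python (again a sketch of ours; `sse` is a hypothetical helper):

```python
def sse(clusters):
    """Sum of squared distances of each point to its cluster mean."""
    return sum((x - sum(c) / len(c)) ** 2
               for c in clusters for x in c)

alpha = 1.0
print(sse([[-6 * alpha], [0.0, 2 * alpha]]))  # 2*alpha**2 = 2.0
print(sse([[-6 * alpha, 0.0], [2 * alpha]]))  # 18*alpha**2 = 18.0
```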