Final 2003
NAME (CAPITALS):
1 Short Questions (16 points)
(a) Traditionally, when we have a real-valued input attribute during decision-tree learning we consider a binary split according to whether the attribute is above or below some threshold. Pat suggests that instead we should just have a multiway split with one branch for each of the distinct values of the attribute. From the list below choose the single biggest problem with Pat's suggestion:
(i) It is too computationally expensive.
(ii) It would probably result in a decision tree that scores badly on the training set and a test set.
(iii) It would probably result in a decision tree that scores well on the training set but badly on a test set.
(iv) It would probably result in a decision tree that scores well on a test set but badly on a training set.
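The phenomenon this part is probing can be seen directly: a multiway split on a real-valued attribute gives nearly every training example its own branch, and an unseen value falls into no branch at all. A minimal Python sketch with made-up data:

# Hypothetical training data: (real-valued attribute, class label).
train = [(2.31, 'yes'), (4.80, 'no'), (5.17, 'yes'), (7.02, 'no')]

# Pat's multiway split: one branch per distinct attribute value.
branches = {x: label for x, label in train}

# Every training example is classified perfectly (one example per branch) ...
print(all(branches[x] == label for x, label in train))
# ... but a previously unseen value matches no branch at all.
print(4.81 in branches)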
(b) You have a dataset with three categorical input attributes A, B and C. There is one categorical output attribute Y. You are trying to learn a Naive Bayes Classifier for predicting Y. Which of these Bayes Net diagrams represents the naive Bayes classifier assumption?
[Diagrams (i)-(iv): four candidate Bayes net structures over the nodes A, B, C and Y.]
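As a reference point (this is the standard definition of the naive Bayes assumption, not something read off the diagrams above), the model factorizes the joint distribution so that the inputs are conditionally independent given the class:

% Naive Bayes factorization over inputs A, B, C and class Y.
P(A, B, C, Y) \;=\; P(Y)\, P(A \mid Y)\, P(B \mid Y)\, P(C \mid Y)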
(c) For a neural network, which one of these structural assumptions is the one that most affects the trade-off between underfitting (i.e. a high bias model) and overfitting (i.e. a high variance model):
(i) The number of hidden nodes
(ii) The learning rate
(iii) The initial choice of weights
(iv) The use of a constant-term unit input
(d) For polynomial regression, which one of these structural assumptions is the one that most affects the trade-off between underfitting and overfitting:
(i) The polynomial degree
(ii) Whether we learn the weights by matrix inversion or gradient descent
(iii) The assumed variance of the Gaussian noise
(iv) The use of a constant-term unit input
(e) For a Gaussian Bayes classifier, which one of these structural assumptions is the one that most affects the trade-off between underfitting and overfitting:
(i) Whether we learn the class centers by Maximum Likelihood or Gradient Descent
(ii) Whether we assume full class covariance matrices or diagonal class covariance matrices
(iii) Whether we have equal class priors or priors estimated from the data.
(iv) Whether we allow classes to have different mean vectors or we force them to share the same mean vector
(f) For Kernel Regression, which one of these structural assumptions is the one that most affects the trade-off between underfitting and overfitting:
(i) Whether the kernel function is Gaussian versus triangular versus box-shaped
(ii) Whether we use Euclidean versus L1 versus L∞ metrics
(iii) The kernel width
(iv) The maximum height of the kernel function
(g) (True or False) Given two classifiers A and B, if A has a lower VC-dimension than B then A almost certainly will perform better on a test set.
(h) P(Good Movie | Includes Tom Cruise) = 0.01
P(Good Movie | Tom Cruise absent) = 0.1
P(Tom Cruise in a randomly chosen movie) = 0.01
What is P(Tom Cruise is in the movie | Not a Good Movie)?
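The arithmetic can be checked with a short Python sketch of the Bayes-rule calculation, using only the three probabilities given above:

# Given quantities.
p_good_given_tc = 0.01      # P(Good Movie | Includes Tom Cruise)
p_good_given_no_tc = 0.1    # P(Good Movie | Tom Cruise absent)
p_tc = 0.01                 # P(Tom Cruise in a randomly chosen movie)

# P(Not Good) by total probability, then Bayes' rule.
p_not_good = (1 - p_good_given_tc) * p_tc + (1 - p_good_given_no_tc) * (1 - p_tc)
p_tc_given_not_good = (1 - p_good_given_tc) * p_tc / p_not_good
print(p_tc_given_not_good)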
2 Markov Decision Processes (13 points)
For this question it might be helpful to recall the following geometric identities, which assume 0 ≤ γ < 1:

    Σ_{i=0}^{k} γ^i = (1 - γ^(k+1)) / (1 - γ)        Σ_{i=0}^{∞} γ^i = 1 / (1 - γ)
The following figure shows an MDP with N states. All states have two actions (North and Right) except Sn, which can only self-loop. Unlike most MDPs, all state transitions are deterministic. Assume discount factor γ.

[Figure: a chain of states S1, S2, S3, ..., Sn-1, Sn; every transition has p = 1; states S1 through Sn-1 have reward r = 1 and Sn has reward r = 10.]

For questions (a)-(e), express your answer as a finite expression (no summation signs or "..."s) in terms of n and/or γ.
(a) What is J*(Sn)?
(c) What is J*(S1)?
(d) Suppose you try to solve this MDP using value iteration. What is J^1(S1)?
(e) Suppose you try to solve this MDP using value iteration. What is J^2(S1)?
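Parts (d) and (e) can be sanity-checked numerically. Below is a minimal value-iteration sketch for a small instance of this chain MDP; the dynamics assumed here for illustration (Right moves one state to the right, North self-loops, reward r = 1 in S1..Sn-1 and r = 10 in Sn) and the choices n = 5, γ = 1/2 are assumptions for the sketch, not part of the question.

# Value iteration on an assumed 5-state version of the chain MDP, gamma = 1/2.
n, gamma = 5, 0.5
rewards = [1.0] * (n - 1) + [10.0]

J = [0.0] * n
for it in range(1, 6):
    new_J = []
    for s in range(n):
        north = rewards[s] + gamma * J[s]                    # assumed self-loop
        right = rewards[s] + gamma * J[min(s + 1, n - 1)]    # assumed move right
        new_J.append(max(north, right))
    J = new_J
    print("after iteration", it, ":", [round(v, 3) for v in J])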
(f) Suppose your computer has exact arithmetic (no rounding errors). How many iterations of value iteration will be needed before all states record their exact (correct to infinite decimal places) J value? Pick one:
(i) Less than 2n
(ii) Between 2n and n^2
(iii) Between n^2 + 1 and 2^n
(iv) It will never happen
(g) Suppose you run policy iteration. During one step of policy iteration you compute the value of the current policy by computing the exact solution to the appropriate system of n equations in n unknowns. Suppose too that when choosing the action during the policy improvement step, ties are broken by choosing North.
Suppose policy iteration begins with all states choosing North.
How many steps of policy iteration will be needed before all states record their exact (correct to infinite decimal places) J value? Pick one:
(i) Less than 2n
(ii) Between 2n and n^2
(iii) Between n^2 + 1 and 2^n
(iv) It will never happen
3 Reinforcement Learning (10 points)
This question uses the same MDP as the previous question, repeated here for your convenience. Again, assume γ = 1/2.

[Figure: the same chain of states S1, S2, S3, ..., Sn-1, Sn; every transition has p = 1; states S1 through Sn-1 have r = 1 and Sn has r = 10.]
Suppose we are discovering the optimal policy via Q-learning. We begin with a Q-table initialized with 0's everywhere:
Q(Si, North) = 0 for all i
Q(Si, Right) = 0 for all i
Because the MDP is deterministic, we run Q-learning with a learning rate α = 1. Assume we start Q-learning at state S1.
(a) Suppose our exploration policy is to always choose a random action. How many steps do we expect to take before we first enter state Sn?
(i) O(n) steps
(ii) O(n^2) steps
(iii) O(n^3) steps
(iv) O(2^n) steps
(v) It will certainly never happen
(b) Suppose our exploration is greedy and we break ties by going North:
Choose North if Q(Si, North) ≥ Q(Si, Right)
Choose Right if Q(Si, North) < Q(Si, Right)
How many steps do we expect to take before we first enter state Sn?
(i) O(n) steps
(ii) O(n^2) steps
(iii) O(n^3) steps
(iv) O(2^n) steps
(v) It will certainly never happen
(c) Suppose our exploration is greedy and we break ties by going Right:
Choose North if Q(Si, North) > Q(Si, Right)
Choose Right if Q(Si, North) ≤ Q(Si, Right)
How many steps do we expect to take before we first enter state Sn?
(i) O(n) steps
(ii) O(n^2) steps
(iii) O(n^3) steps
(iv) O(2^n) steps
(v) It will certainly never happen
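The Q-learning update itself is easy to simulate. The sketch below uses the same assumed dynamics as the value-iteration sketch in the previous question (Right moves one state right, North self-loops, r = 1 in S1..Sn-1 and r = 10 in Sn), learning rate α = 1, and the greedy, ties-to-North exploration rule from part (b). It only demonstrates the update Q(s, a) <- r + γ max_a' Q(s', a'); it does not compute the expected step counts asked for above.

# Q-learning with alpha = 1 on an assumed 5-state chain, greedy exploration,
# ties broken toward North.
n, gamma = 5, 0.5
rewards = [1.0] * (n - 1) + [10.0]
Q = {(s, a): 0.0 for s in range(n) for a in ("North", "Right")}

s = 0
for step in range(30):
    a = "North" if Q[(s, "North")] >= Q[(s, "Right")] else "Right"
    s_next = s if (a == "North" or s == n - 1) else s + 1   # Sn only self-loops
    # Deterministic MDP and alpha = 1: overwrite with the one-step backup.
    Q[(s, a)] = rewards[s] + gamma * max(Q[(s_next, "North")], Q[(s_next, "Right")])
    s = s_next

print(Q)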
WARNING: Question (d) is only worth 1 point so you should probably just
guess the answer unless you have plenty of time.
(d) In this question we work with a similar MDP except that each state other than Sn has a punishment (-1) instead of a reward (+1). Sn keeps the same large reward (10). The new MDP is shown below:

[Figure: the same chain of states S1, ..., Sn with all transitions p = 1; states S1 through Sn-1 now have r = -1 and Sn has r = 10.]
4 Bayesian Networks (11 points)
Construction. Two astronomers in two different parts of the world make measurements M1 and M2 of the number of stars N in some small regions of the sky, using their telescopes. Normally, there is a small possibility of error by up to one star in each direction. Each telescope can be, with a much smaller probability, badly out of focus (events F1 and F2). In such a case the scientist will undercount by three or more stars or, if N is less than three, fail to detect any stars at all.
For questions (a) and (b), consider the four networks shown below.

[Diagrams (i)-(iv): four candidate Bayes net structures over the nodes N, F1, F2, M1 and M2.]
(a) Which of them correctly, but not necessarily efficiently, represents the above information? Note that there may be multiple answers.
Inference. A student of the Machine Learning class notices that people driving SUVs (S) consume large amounts of gas (G) and are involved in more accidents than the national average (A). He also noticed that there are two types of people that drive SUVs: people from Pennsylvania (L) and people with large families (F). After collecting some statistics, he arrives at the following Bayesian network.
[Network structure: L and F are parents of S; S is the parent of both A and G.]
P(L) = 0.4                P(F) = 0.6
P(S | L, F) = 0.8         P(S | ~L, F) = 0.5
P(S | L, ~F) = 0.6        P(S | ~L, ~F) = 0.3
P(A | S) = 0.7            P(G | S) = 0.8
P(A | ~S) = 0.3           P(G | ~S) = 0.2
(c) What is P(S)?
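A short Python sketch of the required computation, marginalizing over the parents L and F using the CPTs listed above:

# P(S) = sum over l, f of P(l) P(f) P(S | l, f).
p_L, p_F = 0.4, 0.6
p_S_given = {(True, True): 0.8, (False, True): 0.5,
             (True, False): 0.6, (False, False): 0.3}

p_S = sum((p_L if l else 1 - p_L) * (p_F if f else 1 - p_F) * p_S_given[(l, f)]
          for l in (True, False) for f in (True, False))
print(p_S)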
Consider the following Bayesian network. State whether the given conditional independences are implied by the net structure.

[Figure: a Bayesian network over the nodes A, B, C, D, E and F.]
5 Instance Based Learning (8 points)
Consider the following dataset with one real-valued input x and one binary output y. We are going to use k-NN with unweighted Euclidean distance to predict y for x.

    x:   -0.1   0.7   1.0   1.6   2.0   2.5   3.2   3.5   4.1   4.9
    y:     -     +     +     -     +     +     -     -     +     +
(a) What is the leave-one-out cross-validation error of 1-NN on this dataset? Give your answer as the number of misclassifications.
(b) What is the leave-one-out cross-validation error of 3-NN on this dataset? Give your answer as the number of misclassifications.
Consider a dataset with N examples {(x_i, y_i) | 1 ≤ i ≤ N}, where both x_i and y_i are real-valued for all i. Examples are generated by y_i = w_0 + w_1 x_i + e_i, where e_i is a Gaussian random variable with mean 0 and standard deviation 1.

(c) We use least-squares linear regression to solve for w_0 and w_1, that is,

    {w_0, w_1} = argmin_{w_0, w_1} Σ_{i=1}^{N} (y_i - w_0 - w_1 x_i)^2.

We assume the solution is unique. Which one of the following statements is true?
(i)   Σ_{i=1}^{N} (y_i - w_0 - w_1 x_i) y_i = 0
(ii)  Σ_{i=1}^{N} (y_i - w_0 - w_1 x_i) x_i^2 = 0
(iii) Σ_{i=1}^{N} (y_i - w_0 - w_1 x_i) x_i = 0
(iv)  Σ_{i=1}^{N} (y_i - w_0 - w_1 x_i)^2 = 0
(d) We change the optimization criterion to include local weights, that is,

    {w_0, w_1} = argmin_{w_0, w_1} Σ_{i=1}^{N} α_i^2 (y_i - w_0 - w_1 x_i)^2,

where α_i is a local weight. Which one of the following statements is true?
(i)   Σ_{i=1}^{N} α_i^2 (y_i - w_0 - w_1 x_i)(x_i + α_i) = 0
(ii)  Σ_{i=1}^{N} α_i (y_i - w_0 - w_1 x_i) x_i = 0
(iii) Σ_{i=1}^{N} α_i^2 (y_i - w_0 - w_1 x_i)(x_i y_i + w_1) = 0
(iv)  Σ_{i=1}^{N} α_i^2 (y_i - w_0 - w_1 x_i) x_i = 0
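The stationarity conditions behind (c) and (d) can be checked numerically. The numpy sketch below uses made-up data and made-up local weights; it fits the two criteria and evaluates the weighted residual-input products, which should come out (numerically) zero at the optimum.

# Checking sum_i (y_i - w0 - w1 x_i) x_i = 0 for ordinary least squares, and
# sum_i a_i^2 (y_i - w0 - w1 x_i) x_i = 0 for the locally weighted criterion.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=20)
y = 1.5 + 2.0 * x + rng.normal(size=20)
a = rng.uniform(0.5, 2.0, size=20)              # hypothetical local weights

X = np.column_stack([np.ones_like(x), x])       # design matrix [1, x]
w0, w1 = np.linalg.lstsq(X, y, rcond=None)[0]
print(np.sum((y - w0 - w1 * x) * x))            # ~ 0

# Minimizing sum a_i^2 (y_i - w0 - w1 x_i)^2 is ordinary least squares on the
# rescaled problem (a_i * row_i, a_i * y_i).
w0w, w1w = np.linalg.lstsq(X * a[:, None], y * a, rcond=None)[0]
print(np.sum(a ** 2 * (y - w0w - w1w * x) * x))  # ~ 0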
6 VC-dimension (9 points)
Let H denote a hypothesis class, and VC(H) denote its VC dimension.
(a) (True or False) If there exists a set of k instances that cannot be shattered by H, then VC(H) < k.
(b) (True or False) If two hypothesis classes H1 and H2 satisfy H1 ⊆ H2, then VC(H1) ≤ VC(H2).
(c) (True or False) If three hypothesis classes H1, H2 and H3 satisfy H1 = H2 ∪ H3, then VC(H1) ≤ VC(H2) + VC(H3).
(f) H is the set of all circles in the 2D plane. Points inside the circles are classified as 1, otherwise 0.
7 SVM and Kernel Methods (8 points)
(a) Kernel functions implicitly define some mapping function φ(·) that transforms an input instance x ∈ R^d to a high-dimensional feature space Q by giving the form of the dot product in Q: K(x_i, x_j) = φ(x_i) · φ(x_j).
Assume we use the radial basis kernel function K(x_i, x_j) = exp(-½ ||x_i - x_j||^2). Thus we assume that there's some implicit unknown function φ(x) such that

    φ(x_i) · φ(x_j) = K(x_i, x_j) = exp(-½ ||x_i - x_j||^2)

Prove that for any two input instances x_i and x_j, the squared Euclidean distance of their corresponding points in the feature space Q is less than 2, i.e. prove that ||φ(x_i) - φ(x_j)||^2 < 2.
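This is not the requested proof, but the claim can be sanity-checked numerically via the kernel trick: ||φ(x_i) - φ(x_j)||^2 expands to K(x_i, x_i) - 2 K(x_i, x_j) + K(x_j, x_j), which needs no explicit φ. The random vectors below are purely illustrative.

# Numeric check that the feature-space squared distance stays below 2.
import numpy as np

def rbf(a, b):
    return np.exp(-0.5 * np.sum((a - b) ** 2))

rng = np.random.default_rng(1)
for _ in range(5):
    xi, xj = rng.normal(size=3), rng.normal(size=3)
    sq_dist = rbf(xi, xi) - 2 * rbf(xi, xj) + rbf(xj, xj)
    print(round(float(sq_dist), 4), sq_dist < 2)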
(b) With the help of a kernel function, SVM attempts to construct a hyperplane in the feature space Q that maximizes the margin between two classes. The classification decision for any x is made on the basis of the sign of

    ŵ^T φ(x) + ŵ_0 = Σ_{i∈SV} α_i y_i K(x_i, x) + ŵ_0 = f(x; α, ŵ_0),

where ŵ and ŵ_0 are parameters for the classification hyperplane in the feature space Q, SV is the set of support vectors, and α_i is the coefficient for support vector x_i.
Again we use the radial basis kernel function. Assume that the training instances are linearly separable in the feature space Q, and assume that the SVM finds a margin that perfectly separates the points.
(True or False) If we choose a test point x_far which is far away from any training instance x_i (distance here is measured in the original space R^d), we will observe that f(x_far; α, ŵ_0) ≈ ŵ_0.
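The setup in (b) can be played with directly. The sketch below uses scikit-learn (an added dependency, not part of the exam) to fit an approximately hard-margin RBF SVM on toy 1-D data, then compares the decision function at a distant point with the learned intercept; gamma = 0.5 matches K(x_i, x_j) = exp(-½ ||x_i - x_j||^2).

# RBF-kernel SVM on toy data; decision value at a far-away point vs intercept.
import numpy as np
from sklearn.svm import SVC

X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([0, 0, 1, 1])

clf = SVC(kernel="rbf", gamma=0.5, C=1e6).fit(X, y)   # large C ~ hard margin

x_far = np.array([[100.0]])
print(clf.decision_function(x_far), clf.intercept_)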
(c) (True or False) The SVM learning algorithm is guaranteed to find the globally optimal hypothesis with respect to its objective function.
(d) (True or False) The VC dimension of a Perceptron is smaller than the VC dimension of a simple linear SVM.
(e) (True or False) After being mapped into feature space Q through a radial basis kernel function, a Perceptron may be able to achieve better classification performance than in its original space (though we can't guarantee this).
(f) (True or False) After being mapped into feature space Q through a radial basis kernel function, 1-NN using unweighted Euclidean distance may be able to achieve better classification performance than in its original space (though we can't guarantee this).
8 GMM (8 points)
Consider the classification problem illustrated in the following figure. The data points in the figure are labeled, where "o" corresponds to class 0 and "+" corresponds to class 1. We now estimate a GMM consisting of 2 Gaussians, one Gaussian per class, with the constraint that the covariance matrices are identity matrices. The mixing proportions (class frequencies) and the means of the two Gaussians are free parameters.

[Figure: scatter plot of the labeled data points in the (x1, x2) plane.]
(a) Plot the maximum likelihood estimates of the means of the two Gaussians in the figure. Mark the means as points "x" and label them "0" and "1" according to the class.
(b) Based on the learned GMM, what is the probability of generating a new data point that belongs to class 0?
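Since the figure's data points are not reproduced here, the sketch below uses made-up labeled points; it only illustrates the general fact behind (a) and (b): with identity covariances and one Gaussian per class, the maximum likelihood mean of each Gaussian is the per-class sample mean, and the maximum likelihood mixing proportion is the class frequency, which is also the probability of generating a new point of that class.

# ML estimates for a per-class Gaussian mixture with identity covariances.
import numpy as np

points = np.array([[0.2, 0.3], [0.5, 0.4], [0.4, 1.0],   # hypothetical class-0 points
                   [1.4, 0.9], [1.7, 1.2]])              # hypothetical class-1 points
labels = np.array([0, 0, 0, 1, 1])

for c in (0, 1):
    print("class", c,
          "mean:", points[labels == c].mean(axis=0),
          "mixing proportion:", float(np.mean(labels == c)))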
9 K-means Clustering (9 points)
There is a set S consisting of 6 points in the plane shown below: a = (0, 0), b = (8, 0), c = (16, 0), d = (0, 6), e = (8, 6), f = (16, 6). Now we run the k-means algorithm on those points with k = 3. The algorithm uses the Euclidean distance metric (i.e. the straight-line distance between two points) to assign each point to its nearest centroid. Ties are broken in favor of the centroid to the left/down. Two definitions:

A k-starting configuration is a subset of k starting points from S that form the initial centroids, e.g. {a, b, c}.

A k-partition is a partition of S into k non-empty subsets, e.g. {a, b, e}, {c, d}, {f} is a 3-partition.

Clearly any k-partition induces a set of k centroids in the natural manner. A k-partition is called stable if a repetition of the k-means iteration with the induced centroids leaves it unchanged.
[Figure: the six points plotted in the plane; a, b, c lie on y = 0 and d, e, f on y = 6, at x = 0, 8 and 16.]
(a) How many 3-starting configurations are there? (Remember, a 3-starting configuration is just a subset, of size 3, of the six datapoints.)
(b) Fill in the following table. For each 3-partition, state whether it is stable, give an example 3-starting configuration that can arrive at the 3-partition after 0 or more iterations of k-means (or write "none" if no such 3-starting configuration exists), and give the number of unique starting configurations that can arrive at the 3-partition.

    3-partition                  Is it stable?   Example 3-starting     Number of starting
                                                 configuration          configurations
    {a, b, e}, {c, d}, {f}
    {a, b}, {d, e}, {c, f}
    {a, d}, {b, e}, {c, f}
    {a}, {d}, {b, c, e, f}
    {a, b}, {d}, {c, e, f}
    {a, b, d}, {c}, {e, f}
10 Hidden Markov Models (8 points)
Consider a hidden Markov model illustrated in the figure shown below, which shows the hidden state transitions and the associated probabilities along with the initial state distribution. We assume that the state-dependent outputs (coin flips) are governed by the following distributions:
P(x = heads | s = 1) = 0.51
P(x = heads | s = 2) = 0.49
P(x = tails | s = 1) = 0.49
P(x = tails | s = 2) = 0.51
In other words, our coin is slightly biased towards heads in state 1 whereas in state 2 tails is a somewhat more probable outcome.

[Figure: the hidden-state transition diagram with its transition probabilities and the initial state distribution.]
(b) What happens to the most likely state sequence if we observe a long sequence of all heads (e.g., 10^6 heads in a row)?
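The behaviour in (b) can be explored with a small Viterbi sketch. The emission probabilities are the ones given above; the transition matrix and initial distribution used here are assumptions filled in for illustration (the exam's transition diagram is not reproduced), chosen so that each state strongly prefers to stay where it is.

# Viterbi decoding for a two-state HMM observing a run of heads.
import numpy as np

emit = np.array([[0.51, 0.49],    # state 1: P(heads), P(tails)
                 [0.49, 0.51]])   # state 2: P(heads), P(tails)
trans = np.array([[0.9, 0.1],     # hypothetical transition matrix
                  [0.1, 0.9]])
init = np.array([0.5, 0.5])       # hypothetical initial distribution

obs = [0] * 20                    # 0 = heads, 1 = tails

logdelta = np.log(init) + np.log(emit[:, obs[0]])
back = []
for o in obs[1:]:
    scores = logdelta[:, None] + np.log(trans)   # scores[i, j]: best path ending i -> j
    back.append(scores.argmax(axis=0))
    logdelta = scores.max(axis=0) + np.log(emit[:, o])

# Backtrack the most likely state sequence (0 = state 1, 1 = state 2).
state = int(logdelta.argmax())
path = [state]
for b in reversed(back):
    state = int(b[state])
    path.append(state)
print(list(reversed(path)))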
(c) Consider the following 3-state HMM, where π1, π2 and π3 are the probabilities of starting from each state S1, S2 and S3. Give a set of values so that the resulting HMM maximizes the likelihood of the output sequence ABA.

[Figure: the 3-state HMM with states S1, S2 and S3 and outputs A and B; the initial probabilities π1, π2, π3, the transition probabilities and the emission probabilities are left blank, to be filled in.]
(d) We're going to use EM to learn the parameters for the following HMM. Before the first iteration of EM we have initialized the parameters as shown in the following figure.
(True or False) For these initial values, EM will successfully converge to the model that maximizes the likelihood of the training sequence ABA.

[Figure: the HMM with its initial parameter values.]
(e) (True or False) In general, when we are trying to learn an HMM with a small number of states from a large number of observations, we can almost always increase the training-data likelihood by permitting more hidden states.