CS229 Midterm Review
11/2/2018
Outline
Today:
SVMs
Kernels
Tree Ensembles
Key assumption:
➔ “Optimal margin”
Optimal margin classifier
Which linear decision boundary?
Support vectors
Observations on the margin or violating the margin.
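A minimal sketch of how support vectors could be picked out, assuming a hyperplane is already given. The weights `w`, offset `b`, and toy data are hypothetical values chosen for illustration, not taken from the slides. With labels in {-1, +1}, a point is on the margin when its functional margin y(w·x + b) equals 1, and violates the margin when it is below 1.

```python
# Hypothetical hyperplane w.x + b = 0 and toy data (labels are +1 / -1).
w, b = (1.0, 1.0), -3.0
X = [(0.0, 0.0), (1.0, 1.0), (1.5, 1.0), (2.0, 2.0), (3.0, 3.0)]
y = [-1, -1, -1, 1, 1]


def functional_margin(x, label):
    """Return y * (w.x + b) for one observation."""
    return label * (sum(wi * xi for wi, xi in zip(w, x)) + b)


margins = [functional_margin(x, t) for x, t in zip(X, y)]

# Support vectors: observations on the margin (== 1) or violating it (< 1).
tol = 1e-9
support = [i for i, m in enumerate(margins) if m <= 1.0 + tol]
print(support)
```

Points with a functional margin well above 1 do not appear in `support`; moving them slightly would not change the boundary.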
Quiz
Non-linear decision boundary
Disadvantage
Decision boundary will be non-linear in original feature space (ellipse), but linear
in the expanded feature space.
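As a rough illustration of the ellipse case, the sketch below assumes the expanded features are the squared coordinates (x1², x2²). The semi-axes `a` and `b`, the map `phi`, and the weights are hypothetical choices for this example. The classifier is linear in the expanded features, yet its boundary in the original space is the ellipse x1²/a² + x2²/b² = 1.

```python
# Hypothetical ellipse semi-axes in the original (x1, x2) space.
a, b = 2.0, 1.0


def phi(x1, x2):
    """Expanded feature map: squared coordinates."""
    return (x1 * x1, x2 * x2)


# A linear classifier in the expanded space: score = w . phi(x) + c.
w = (1.0 / a**2, 1.0 / b**2)
c = -1.0


def score(x1, x2):
    z = phi(x1, x2)
    return w[0] * z[0] + w[1] * z[1] + c


def predict(x1, x2):
    """+1 outside the ellipse, -1 inside or exactly on it."""
    return 1 if score(x1, x2) > 0 else -1


print(predict(0.0, 0.0), predict(3.0, 0.0))
```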
A large number of features becomes computationally challenging.
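One way to see how a kernel sidesteps this cost is sketched below. It assumes a quadratic feature map made of all ordered products x_i x_j, which has d² features for d inputs. The names `phi_quadratic` and `quadratic_kernel` are made up for this example. The inner product of the expanded vectors equals (x·z)², which needs only d multiplications.

```python
from itertools import product


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def phi_quadratic(x):
    """Explicit expansion: all ordered products x_i * x_j (d^2 features)."""
    return [xi * xj for xi, xj in product(x, repeat=2)]


def quadratic_kernel(x, z):
    """Same inner product, computed without building the expansion."""
    return dot(x, z) ** 2


x = [1.0, 2.0, 3.0]
z = [0.5, -1.0, 2.0]

explicit = dot(phi_quadratic(x), phi_quadratic(z))  # O(d^2) work
implicit = quadratic_kernel(x, z)                   # O(d) work
print(explicit, implicit)
```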
Bagging: resample of the same size as the original dataset, with replacement.
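The resampling step can be sketched as follows. The helper name `bootstrap_sample` and the toy dataset are assumptions for illustration. In a full bagging procedure, one model would be fit per resample and the predictions averaged or voted, which is not shown here.

```python
import random


def bootstrap_sample(data, rng):
    """Draw len(data) items from data uniformly at random, with replacement."""
    n = len(data)
    return [data[rng.randrange(n)] for _ in range(n)]


rng = random.Random(0)   # seeded only to make the sketch repeatable
data = list(range(10))   # stand-in for the original dataset
sample = bootstrap_sample(data, rng)
print(sample)
```

Because sampling is with replacement, some observations typically appear more than once and others not at all; for large datasets roughly 63% of the distinct original points show up in a given resample.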
EM Algorithm / Mixtures
Mixture Models
Gaussian Mixture Model
Here,
We hypothesize
GMMs can be extremely effective at modeling distributions!
It turns out that we’re maximizing a lower bound on the true log-likelihood.
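The bound can be checked numerically on a small case. The sketch below assumes a one-dimensional, two-component Gaussian mixture with hypothetical parameters `phi`, `mu`, and `sigma`. For any distribution q over the component label, Jensen's inequality gives sum_j q_j log(p(x, z=j) / q_j) <= log p(x), with equality when q is the posterior over the label given x.

```python
import math


def normal_pdf(x, mu, sigma):
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))


# Hypothetical 1-D mixture: mixing weights, means, standard deviations.
phi = [0.3, 0.7]
mu = [-1.0, 2.0]
sigma = [1.0, 0.5]


def joint(x, j):
    """p(x, z=j) = p(z=j) * p(x | z=j)."""
    return phi[j] * normal_pdf(x, mu[j], sigma[j])


def log_likelihood(x):
    return math.log(sum(joint(x, j) for j in range(len(phi))))


def lower_bound(x, q):
    """sum_j q_j * log(p(x, z=j) / q_j), skipping zero-weight terms."""
    return sum(qj * math.log(joint(x, j) / qj) for j, qj in enumerate(q) if qj > 0)


def posterior(x):
    total = sum(joint(x, j) for j in range(len(phi)))
    return [joint(x, j) / total for j in range(len(phi))]


x = 0.5
ll = log_likelihood(x)
loose = lower_bound(x, [0.5, 0.5])     # arbitrary q: strictly below ll here
tight = lower_bound(x, posterior(x))   # q = posterior: bound is tight
print(loose, tight, ll)
```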
Why?
In general, need to hallucinate one new data point per possible value of the latent variable.