Lecture 5
Lecture 5
• All the legal possible ways in which the coordinate plane can be divided to
predict the outcome of the test data composes of the Hypothesis Space
(H). Prof. Subir Kumar Das, Dept. of CSE
• Each individual possible way is known as the hypothesis (h).
VC Dimension
• A dichotomy is a split of a set into two mutually exclusive subsets whose
union is the original set.
• A dichotomous variable is a type of variable that only takes on two
possible values.
• Gender: Male or Female
• Coin Flip: Heads or Tails
• Property Type: Residential or Commercial
• If for any set of labels {y(1), … , y(d)}, there exists some hЄH so that h(x(i)) =
y(i) for all i = 1, …, m
• There are 2m different ways to separate the sample into two sub-samples
(a dichotomy)
• While the number of hypotheses H can be ∞, the number of dichotomies
H{X1, X2, ..., XN} is at most 2N
• VC dimension, short for Vapnik-Chervonenkis dimension, is a measure of
the complexity of a machine learning model.
• It is named after the mathematicians Vladimir Vapnik and Alexey
Chervonenkis, who developed the concept in the 1970s as part of their
work on statistical learningProf. Subir Kumar Das, Dept. of CSE
theory.
VC Dimension
• VC dimension is defined as the largest number of points that can be
shattered by a binary classifier without misclassification.
• In other words, it is a measure of the model’s capacity to fit arbitrary
labeled datasets.
• A subset S of instances of a set X is shattered by a collection of function F
if ∀ S'⊆ S there is a function f ∈ F such data:
• f (x) = 1 x∈S′
• 0 x∈S−S′
• It is termed informally as a measure of a model’s capacity.
• It is used frequently to guide the model selection process while
developing machine learning applications.
• If there exists a set of n points that can be shattered by the classifier
and there is no set of n+1 points that can be shattered by the classifier,
then the VC dimension of the classifier is n.
• VC Dim helps us compare the models using this bias variance tradeoff.
• Bias represents the inherent error due to the model's assumptions, while
variance measures the model'sProf. Subirsensitivity toCSEtraining data.
Kumar Das, Dept. of
Example
• VC dimension for a linear classifier is at least 3, since it can shatter this
configuration of 3 points.
• In each of the 2³ = 8 possible assignment of positive and negative, the
classifier is able to perfectly separate the two classes.