Lecture 5

VC Dimension and PAC

Prof. Subir Kumar Das, Dept. of CSE


Hypothesis Space
• The hypothesis space is the set of all legal hypotheses that you can describe using the features you have chosen and the language you have chosen.
• It is the set from which the learning algorithm will pick a hypothesis.
• A hypothesis space is represented by H, and the learning algorithm outputs a hypothesis h belonging to H; this h is the output of the learning algorithm.
• The inductive bias (also known as learning bias) of a learning algorithm is
the set of assumptions that the learner uses to predict outputs.
• In machine learning, one aims to construct algorithms that are able to learn to predict a certain target output.
• The hypothesis that an algorithm comes up with depends on the data and on the restrictions and bias imposed on it.

• All the legal possible ways in which the coordinate plane can be divided to predict the outcome of the test data compose the Hypothesis Space (H).
• Each individual possible way is known as a hypothesis (h).
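As a small, self-contained illustration of these ideas (not from the slides; the threshold hypotheses and the toy data below are hypothetical), a hypothesis space can be written down explicitly, and the learner simply picks one member h ∈ H that is consistent with the training examples:

```python
# A toy hypothesis space H of 1-D threshold classifiers: h_t(x) = 1 if x >= t else 0.
def make_hypothesis(t):
    """Return one hypothesis h in H, parameterized by its threshold t."""
    return lambda x: 1 if x >= t else 0

# Hypothesis space H: one hypothesis for every half-integer threshold in [0, 10].
H = [make_hypothesis(t / 2) for t in range(21)]

# Hypothetical training data: (feature value, label) pairs.
data = [(1.0, 0), (2.5, 0), (5.0, 1), (7.5, 1)]

# The "learning algorithm": output some h in H consistent with every training example.
h_learned = next(h for h in H if all(h(x) == y for x, y in data))
print(h_learned(2.0), h_learned(6.0))  # predictions of the chosen hypothesis on new points
```

The restriction to threshold functions is the inductive bias in this sketch: it is what lets the learner generalize from four examples to unseen points.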
VC Dimension
• A dichotomy is a split of a set into two mutually exclusive subsets whose
union is the original set.
• A dichotomous variable is a type of variable that only takes on two
possible values.
• Gender: Male or Female
• Coin Flip: Heads or Tails
• Property Type: Residential or Commercial
• A set of m points {x(1), …, x(m)} is shattered by H if, for any assignment of labels {y(1), …, y(m)}, there exists some h ∈ H so that h(x(i)) = y(i) for all i = 1, …, m.
• There are 2^m different ways to separate the sample into two sub-samples (dichotomies); see the sketch below.
• While the number of hypotheses in H can be ∞, the number of dichotomies H(X1, X2, ..., XN) induced on N points is at most 2^N.
• VC dimension, short for Vapnik-Chervonenkis dimension, is a measure of
the complexity of a machine learning model.
• It is named after the mathematicians Vladimir Vapnik and Alexey Chervonenkis, who developed the concept in the 1970s as part of their work on statistical learning theory.
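To make the 2^N counting concrete, the short sketch below (illustrative only; the point names are placeholders) enumerates every dichotomy of a small sample:

```python
from itertools import product

# Enumerate every dichotomy (two-way split) of a small sample of N points.
points = ["x1", "x2", "x3"]                       # N = 3
for labels in product([0, 1], repeat=len(points)):
    positives = {p for p, y in zip(points, labels) if y == 1}
    negatives = set(points) - positives
    print(labels, positives, negatives)
# Exactly 2**3 = 8 dichotomies are printed, matching the 2^N bound above.
```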
VC Dimension
• VC dimension is defined as the largest number of points that can be
shattered by a binary classifier without misclassification.
• In other words, it is a measure of the model’s capacity to fit arbitrary
labeled datasets.
• A subset S of instances of a set X is shattered by a collection of functions F if ∀ S' ⊆ S there is a function f ∈ F such that:
• f(x) = 1 if x ∈ S′, and f(x) = 0 if x ∈ S − S′.
• It is termed informally as a measure of a model’s capacity.
• It is used frequently to guide the model selection process while
developing machine learning applications.
• If there exists a set of n points that can be shattered by the classifier
and there is no set of n+1 points that can be shattered by the classifier,
then the VC dimension of the classifier is n.
• The VC dimension helps us compare models in terms of this bias-variance tradeoff.
• Bias represents the inherent error due to the model's assumptions, while variance measures the model's sensitivity to the training data.
Example
• The VC dimension of a linear classifier in the plane is at least 3, since it can shatter a configuration of 3 points in general position (not collinear).
• In each of the 2³ = 8 possible assignments of positive and negative labels, the classifier is able to perfectly separate the two classes.

• The VC dimension of a linear classifier in the plane is, however, lower than 4.


• In a configuration of 4 points (for example, the corners of a square with an XOR-style labeling), the classifier is unable to separate the positive and negative classes in at least one assignment.
• Two lines would be necessary to separate the two classes in this situation.
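The two configurations discussed above can also be checked programmatically. The sketch below is illustrative rather than definitive: it assumes NumPy and SciPy are available, uses a linear-programming feasibility test as the notion of "separable by a line", and the point coordinates (three points in general position, four corners of a square) are hypothetical stand-ins for the figures on the slide.

```python
from itertools import product
import numpy as np
from scipy.optimize import linprog

def linearly_realizable(points, labels):
    """Check, via an LP feasibility problem, whether some line w.x + b
    separates the points labeled 1 from the points labeled 0 with margin >= 1."""
    y = np.where(np.array(labels) == 1, 1.0, -1.0)
    X = np.asarray(points, dtype=float)
    # Constraint y_i * (w . x_i + b) >= 1  <=>  -y_i * [x_i, 1] . [w, b] <= -1
    A_ub = -y[:, None] * np.hstack([X, np.ones((len(X), 1))])
    b_ub = -np.ones(len(X))
    res = linprog(c=np.zeros(3), A_ub=A_ub, b_ub=b_ub,
                  bounds=[(None, None)] * 3, method="highs")
    return res.success

def shattered(points):
    """True if a linear classifier can realize every labeling of the points."""
    return all(linearly_realizable(points, labels)
               for labels in product([0, 1], repeat=len(points)))

# Hypothetical coordinates: 3 points in general position vs. 4 corners of a square.
three = [(0, 0), (1, 0), (0, 1)]
four  = [(0, 0), (1, 1), (0, 1), (1, 0)]
print(shattered(three))   # True  -> all 2^3 = 8 labelings are realizable
print(shattered(four))    # False -> at least one of the 2^4 labelings fails
```

Because every one of the 2³ labelings of the three points is realizable while at least one labeling of the square (the XOR-style one) is not, the code reports that the first set is shattered and the second is not, consistent with a VC dimension of 3 for lines in the plane.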
VC Dimension Application
• In most cases, the exact VC dimension of a classifier is not so important.
• Rather, it is used more to classify different types of algorithms by their complexities;
• for example, the class of simple classifiers could include basic shapes like lines, circles, or rectangles,
• whereas a class of complex classifiers could include classifiers such as multilayer perceptrons, boosted trees, or other nonlinear classifiers.
• The complexity of a classification algorithm, which is directly related to
its VC dimension, is related to the trade-off between bias and variance.
• A model with a higher VC dimension will require more training data to properly train, but will be able to identify more complex relationships in the data.
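• One commonly quoted form of this relationship (a standard VC generalization bound; it is not stated on the slide, and its exact constants should be read as an assumption here) is: with probability at least 1 − 𝛿, a classifier from a class with VC dimension d trained on N examples satisfies
• test error ≤ training error + sqrt( ( d·(ln(2N/d) + 1) + ln(4/𝛿) ) / N )
• so a larger d makes the second term larger, and more training data N is needed before the guaranteed gap between training and test error becomes small.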
Computational Learning Theory

• Computational learning theory (CoLT) is a branch of AI concerned with applying mathematical methods to the design and analysis of computer learning programs.
• It involves using mathematical frameworks for the purpose of quantifying
learning tasks and algorithms.
• It can be considered to be an extension of statistical learning theory (SLT),
that makes use of formal methods for the purpose of quantifying learning
algorithms.
• Computational Learning Theory (CoLT): Formal study of learning tasks.
• Statistical Learning Theory (SLT): Formal study of learning algorithms.
• This division of learning tasks vs. learning algorithms is arbitrary, and in
practice, there is quite a large degree of overlap between these two fields.
• It is essentially a sub-field of artificial intelligence (AI) that focuses on studying the design and analysis of machine learning algorithms.
Probably Approximately Correct Learning
• PAC learning, also known as Probably Approximately Correct learning, is a theoretical machine learning framework created by Leslie Valiant.
• PAC learning aims to quantify the difficulty involved in a learning task and
it might be considered to be the main sub-field of computational learning
theory.
• PAC learning is concerned with the amount of computational effort needed in order to identify a hypothesis (fit a model) that is a close match for the unknown target function.
• PAC learning offers guarantees on the true error, i.e. how different your hypothesis (the learned function) is from the concept (the target function, your task), given that you can only measure the empirical error, the one that you get from your training sample.
• PAC learning aims to determine whether a learning algorithm can, with
high probability, produce a hypothesis that is approximately correct.
• This means the algorithm should, with a probability of at least 1−𝛿, find a
hypothesis ℎ whose error rate is within 𝜖 of the best possible hypothesis.
• Here, 𝛿 represents the confidence parameter, and 𝜖 represents the
accuracy parameter.
• In the context of Machine Learning, a problem is PAC-learnable if there is an algorithm A that, when given some independently drawn samples, will produce a hypothesis with a small error for any distribution D and any concept c, and that too with a high probability.
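• Written compactly, the requirement described above is Pr[ error(h) ≤ ε ] ≥ 1 − 𝛿, where the probability is taken over the random draw of the m training samples from D (and any internal randomness of the algorithm).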
Formal Definition
• f is the function that we want to learn, the target function.
• F is the class of functions from which f can be selected. f is an element of
F.
• X is the set of possible individuals. It is the domain of f.
• N is the cardinality of X.
• D is a probability distribution on X; this distribution is used both when the
training set is created and when the test set is created.
• ORACLE(f,D), a function that in a unit of time returns a pair of the form
(x,f(x)), where x is selected from X according to D.
• H is the set of possible hypotheses.
• h is the specific hypothesis that has been learned. h is an element of H.
• m is the cardinality of the training set.
• error(h) = Probability[f(x) != h(x), x chosen from X according to D]
• Definition: A class of functions F is Probably Approximately Correct (PAC) Learnable if:
• there is a learning algorithm L that for all f in F, all distributions D on X,
• all epsilon (0 < ε < 1) and delta (0 < 𝛿 < 1), will produce a hypothesis h, such that
• the probability is at most 𝛿 that error(h) > ε.
• L has access to the values of ε and 𝛿, and to ORACLE(f,D)
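For the special case of a finite hypothesis class and a learner that outputs a hypothesis consistent with the training set, a standard sample-complexity bound (assumed here; it is not part of the slide) states that m ≥ (1/ε)·(ln|H| + ln(1/𝛿)) examples suffice. A minimal sketch of that calculation:

```python
import math

def pac_sample_size(hypothesis_count, epsilon, delta):
    """Standard bound for a finite hypothesis class H and a consistent learner:
    m >= (1/epsilon) * (ln|H| + ln(1/delta)) training examples suffice so that,
    with probability at least 1 - delta, the learned hypothesis has error <= epsilon."""
    return math.ceil((math.log(hypothesis_count) + math.log(1.0 / delta)) / epsilon)

# Hypothetical numbers: |H| = 10**6 hypotheses, epsilon = 0.05, delta = 0.01.
print(pac_sample_size(10**6, epsilon=0.05, delta=0.01))  # a few hundred examples
```

Note that the bound grows only logarithmically in |H| and 1/𝛿 but linearly in 1/ε, which is one concrete way a learner can stay polynomial in the quantities listed on the next slide.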
Probably Approximately Correct Learning
• F is Efficiently PAC Learnable if the running time of L is polynomial in 1/ε, 1/𝛿, and ln(N).
• It is Polynomial PAC Learnable if m is polynomial in 1/ε, 1/𝛿, and the size of the (minimal) descriptions of individuals and of the concept.
• In PAC learning, the learning process involves several key components:
• Concept Class (C): The set of all possible target functions.
• Hypothesis Class (H): The set of all possible hypotheses that the learning
algorithm can produce.
• Distribution (D): The probability distribution over the input space.
• Sample Size (m): The number of training examples drawn independently
from D.
• The goal is to find a hypothesis ℎ∈𝐻 that approximates the target concept
𝑐∈𝐶 well enough.



Applications and Limitations
• PAC learning provides a theoretical basis for designing and evaluating
machine learning algorithms.
• It helps in understanding the trade-offs between sample size, accuracy,
and confidence, guiding the development of efficient and reliable
algorithms.
• PAC learning aids in model selection by providing criteria for choosing
models that balance complexity and performance.
• It helps in determining the appropriate hypothesis class and regularization
techniques to avoid overfitting.
• PAC learning principles can be applied to determine the optimal querying
strategy to achieve the desired accuracy and confidence with fewer
labeled or unlabeled examples.
• PAC learning assumes that the data distribution is fixed (stationary), which may not hold in real-world scenarios where distributions can change over time.
• Calculating the exact VC dimension for complex hypothesis classes can be
computationally infeasible, limiting practical applications.
• The hypothesis class needs to be expressive enough to contain a good approximation of the target function, which can be challenging for complex problems.
Thank You

Prof. Subir Kumar Das, Dept. of CSE
