
Machine Learning

Machine learning (ML) is a branch of Artificial Intelligence (AI) that focuses on developing models that learn from data. It is a dynamic field of AI concerned with developing algorithms capable of learning from data and making predictions or decisions without being explicitly programmed to do so. ML algorithms are designed to improve their performance as they are exposed to more data, enabling them to solve a wide range of problems, from image and speech recognition to fraud detection and recommendation systems.

Types of Machine Learning

Machine learning is categorized into several types, each with distinct characteristics and applications. The primary types include supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning.

• Supervised Learning: This type of learning involves training a model on a dataset that includes both input features and corresponding labels. The goal is for the model to learn the relationship between inputs and outputs so it can predict the correct output for new, unseen data. Supervised learning is further divided into classification and regression tasks. Classification involves predicting discrete labels, such as whether an email is spam or not, while regression involves predicting continuous values, like a property's price based on its features. (A short code sketch of supervised and unsupervised learning follows this list.)

• Unsupervised Learning: Unlike supervised learning, unsupervised learning deals with unlabeled data. The goal is to find hidden patterns or intrinsic structures in the data. Common unsupervised learning techniques include clustering, where data points are grouped into clusters based on similarities, and dimensionality reduction, which simplifies high-dimensional data into a lower-dimensional space.

• Semi-Supervised Learning: This approach combines both labeled and unlabeled data to improve learning accuracy. It is particularly useful when labeled data is scarce or expensive to obtain. Semi-supervised learning can enhance the performance of models by leveraging the structure of the unlabeled data to guide the learning process.

• Reinforcement Learning: In reinforcement learning, an agent learns to make decisions by interacting with an environment. The agent receives feedback in the form of rewards or penalties based on its actions. The goal is to maximize the cumulative reward over time. This type of learning is particularly useful for decision-making tasks where the agent must learn optimal actions through trial and error.
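
To make the first two categories concrete, here is a minimal sketch using scikit-learn (the library choice, dataset, and parameters are illustrative assumptions; the notes name no specific tools). It fits a classifier on labeled data, then clusters the same points while ignoring the labels.

# A minimal sketch of supervised vs. unsupervised learning.
# Library choice (scikit-learn) and all parameters are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

# Synthetic dataset: X holds input features, y holds the corresponding labels.
X, y = make_classification(n_samples=200, n_features=4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Supervised learning: fit on (features, labels), then predict labels for unseen data.
clf = LogisticRegression().fit(X_train, y_train)
print("classification accuracy:", clf.score(X_test, y_test))

# Unsupervised learning: the labels y are ignored; KMeans groups points
# into clusters based purely on similarity in feature space.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print("cluster assignments (first 10 points):", kmeans.labels_[:10])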

Machine Learning Perspectives


Perspectives in Machine Learning (ML) include several key areas of
focus:
• Algorithms for Learning General Target Functions: These algorithms aim to learn a general function from specific training examples. Different algorithms converge to the desired function under various conditions, and specific algorithms perform best for certain types of problems and representations.

• Training Data Sufficiency: Determining how much training data is necessary for effective learning is crucial. General bounds can be established to relate the confidence in learned hypotheses to the amount of training data and the character of the learner's hypothesis space.

• Prior Knowledge Utilization: Understanding how prior knowledge can guide the generalization process from examples is important. Prior knowledge can be beneficial even if it is only approximately correct.

• Strategies for Choosing Training Experiences: Deciding on the best strategy for selecting the next training experience is a critical aspect. The choice of this strategy can significantly alter the complexity of the learning problem.

Issues in Machine Learning
Machine Learning (ML) faces several significant issues that can impact its effectiveness and adoption. The key challenges are:

• Data Quality and Availability: ML models heavily rely on high-quality data. Issues such as the absence of good-quality data, unclean data, and noisy data can hinder the accuracy of predictions and the overall performance of the models.

• Bias and Fairness: ML models can inherit biases present in the training data, leading to unfair outcomes. For instance, biased algorithms can result in discriminatory decisions in areas like hiring, lending, and criminal justice.

• Interpretability and Explainability: Many ML models, particularly deep learning models, are often described as "black boxes" due to the difficulty of understanding how they make decisions. This lack of explainability is a significant issue in industries where transparency is crucial, such as healthcare and finance.

• Regulatory Compliance: In regulated industries, models must provide clear explanations for their decisions. The lack of explainability can hinder the adoption of ML in these fields.

• Trust: Stakeholders may be reluctant to trust ML predictions if they do not understand how the model arrives at its decisions.

• Ethical Considerations: ML raises ethical questions about privacy, consent, and the impact on jobs. For example, the use of automated content generation by large language models (LLMs) can affect jobs in content-related fields, and ensuring responsible use of these models is crucial.

• Data Privacy: There are growing concerns about how ML models handle and protect sensitive data. Ensuring data privacy is essential to maintain user trust and comply with regulations.

🔍 What is a Hypothesis Space?
In machine learning, a hypothesis is a function or model that maps
input data to output labels.
The hypothesis space is the set of all hypotheses that a learning algorithm can choose from when fitting data.
• Denoted as H.
• A learning algorithm searches through H to find the best hypothesis h ∈ H that fits the training data.

Finite Hypothesis Space

A finite hypothesis space means the number of possible hypotheses in H is countable and limited.

Examples:

1. Binary decision rules over 3 boolean features:

◦ Each feature can be true/false.

◦ The total number of possible combinations is limited (a code sketch of this example appears at the end of this subsection).

2. Decision trees with depth ≤ 2:

◦ The number of distinct trees is finite due to the depth constraint.

Characteristics:
• Easier to search through.
• Generalization is easier to analyze.
• Theoretical guarantees (e.g., PAC learning) are easier to
establish.
• Suitable for simpler problems with well-bounded structure.

Advantages:
• Efficient computation.
• Lower risk of overfitting.
• Easier to interpret.

Disadvantages:
• Limited expressive power: can only capture simple patterns.
• May underfit if the true function is more complex.
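
The 3-boolean-feature example above can be made concrete with a minimal sketch (the hypothesis class of single-feature rules plus constant rules is an illustrative assumption) that enumerates the whole space and searches it for a hypothesis consistent with the training data:

# Minimal sketch: enumerate a small finite hypothesis space over 3 boolean
# features and select a hypothesis consistent with the training examples.

# Each hypothesis maps a 3-tuple of booleans to a boolean prediction.
hypotheses = [("always_true", lambda x: True),
              ("always_false", lambda x: False)]
for i in range(3):
    hypotheses.append((f"x{i}", lambda x, i=i: x[i]))          # predict feature i
    hypotheses.append((f"not_x{i}", lambda x, i=i: not x[i]))  # predict its negation

# Labeled training data: (features, label). The target concept here is x0.
train = [((True, False, True), True),
         ((False, True, False), False),
         ((True, True, False), True)]

# Search H exhaustively for a hypothesis consistent with every example.
for name, h in hypotheses:
    if all(h(x) == y for x, y in train):
        print("consistent hypothesis:", name)  # -> x0

Because this H has only eight members, exhaustive search is trivial, which is exactly the "easier to search through" property listed above.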

♾ Infinite Hypothesis Space

An infinite hypothesis space contains an uncountably large or countably infinite number of possible hypotheses.

Examples:
1. Linear functions over real numbers:

◦ Any function of the form y = w1·x1 + w2·x2 + ⋯ + wn·xn + b, with weights and bias drawn from ℝ.

◦ There are infinitely many combinations of weights.

2. Neural networks:

◦ With real-valued weights and an unbounded range of possible architectures, they represent a massive (even uncountable) space of functions.

Characteristics:
• Much more expressive and flexible.
• Can model highly complex relationships.
• Typically needed for real-world, high-dimensional data.

Advantages:
• Can approximate complex, real-world functions.
• Higher accuracy potential if trained correctly.

Disadvantages:
• Risk of overfitting without proper regularization (see the sketch below).
• Harder to search or optimize.
• Theoretical analysis (e.g., generalization guarantees) is more
complex.
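
As a concrete illustration, here is a minimal sketch (scikit-learn and all parameters are illustrative assumptions) of searching an infinite hypothesis space: the weights of a linear model range over all real numbers, and an L2 penalty (ridge regression) is one standard way to control the overfitting risk noted above.

# Minimal sketch: linear hypotheses y = w1*x1 + ... + wn*xn + b form an
# uncountably infinite space, since the weights range over the reals.
# Ridge regression adds an L2 penalty so the optimizer favors small weights.
# Library choice (scikit-learn) and parameters are illustrative assumptions.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))                      # 50 samples, 5 features
true_w = np.array([1.5, -2.0, 0.0, 0.5, 3.0])
y = X @ true_w + rng.normal(scale=0.1, size=50)   # noisy linear target

model = Ridge(alpha=1.0).fit(X, y)                # alpha sets the penalty strength
print("learned weights:", np.round(model.coef_, 2))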

Finite Hypothesis Space:


+-------------------+
| Hypothesis 1 |
+-------------------+
| Hypothesis 2 |
+-------------------+
| Hypothesis 3 |
+-------------------+
| ... |
+-------------------+
| Hypothesis N |
+-------------------+

In nite Hypothesis Space:


+-------------------+
| Hypothesis 1 |
+-------------------+
| Hypothesis 2 |
+-------------------+
| Hypothesis 3 |
+-------------------+
| ... |
+-------------------+
| Hypothesis ∞ |
+-------------------+

📊 Comparison Table
Feature             | Finite Hypothesis Space                     | Infinite Hypothesis Space
Size                | Countable and limited                       | Countably or uncountably infinite
Examples            | Boolean logic rules, simple decision trees  | Linear models, neural networks
Expressiveness      | Limited                                     | High
Risk of Overfitting | Low                                         | High (needs regularization)
Search Complexity   | Simple (can enumerate all)                  | Complex (needs optimization techniques)
Generalization      | Easier to guarantee                         | Harder to ensure without constraints

What is PAC Learning?


Probably Approximately Correct (PAC) learning is a theoretical framework introduced by Leslie Valiant in 1984. It addresses the problem of learning a function from a set of samples in a way that is both probably correct and approximately correct. In simpler terms, PAC learning formalizes the conditions under which a learning algorithm can be expected to perform well on new, unseen data after being trained on a finite set of examples.

Importance of PAC Learning

PAC learning is important because it provides a rigorous foundation for understanding the behavior and performance of learning algorithms. It helps determine the conditions under which a learning algorithm can generalize well from a limited number of samples, offering insights into the trade-offs between accuracy, confidence, and sample size.

The PAC framework is widely applicable and serves as a basis for analyzing and designing many machine learning algorithms. It offers theoretical guarantees that are crucial for assessing the reliability and robustness of these algorithms. By understanding PAC learning, researchers and practitioners can develop more efficient and effective models that are capable of making accurate predictions on new data.

PAC Learning Theorem


The PAC learning theorem provides formal guarantees about the performance of learning algorithms. It states that for a given accuracy (ε) and confidence (δ), there exists a sample size (m) such that any learning algorithm that returns a hypothesis consistent with the training samples will, with probability at least 1 − δ, have an error rate less than ε on unseen data.
For a finite hypothesis space H, the theorem can be expressed mathematically as:

m ≥ (1/ε) (ln|H| + ln(1/δ))
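
A tiny sketch (assuming the finite-H bound above; the example numbers are illustrative) that computes the sample size m for given ε, δ, and |H|:

# Minimal sketch: sample-size bound for a consistent learner over a
# finite hypothesis space, m >= (1/eps) * (ln|H| + ln(1/delta)).
import math

def pac_sample_size(h_size: int, eps: float, delta: float) -> int:
    """Samples sufficient for error < eps with probability >= 1 - delta."""
    return math.ceil((math.log(h_size) + math.log(1.0 / delta)) / eps)

# Example: |H| = 2**10 hypotheses, 5% error, 95% confidence.
print(pac_sample_size(h_size=2**10, eps=0.05, delta=0.05))  # -> 199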
The VC dimension is a measure of the capacity or complexity of the hypothesis space. It quantifies the maximum number of points that can be shattered (i.e., correctly classified in all possible ways) by the hypotheses in the space. A higher VC dimension indicates a more complex hypothesis space, which may require more samples to ensure good generalization.
The PAC learning theorem provides a powerful tool for analyzing and designing learning algorithms. It helps determine the sample size needed to achieve a desired level of accuracy and confidence, guiding the development of efficient and effective models.

Challenges of PAC Learning

Real-world Applicability

While PAC learning provides a solid theoretical foundation, applying it to real-world problems can be challenging. The assumptions made in PAC learning, such as the availability of a finite hypothesis space and the existence of a true underlying function, may not always hold in practice.

In real-world scenarios, data distributions can be complex and unknown, and the hypothesis space may be infinite or unbounded. These factors can complicate the application of PAC learning, requiring additional techniques and considerations to achieve practical results.

Computational Complexity

Finding the optimal hypothesis within the PAC framework can be computationally expensive, especially for large and complex hypothesis spaces. This can limit the practical use of PAC learning for certain applications, particularly those involving high-dimensional data or complex models.

Efficient algorithms and optimization techniques are needed to make PAC learning feasible for practical use. Researchers are continually developing new methods to address the computational challenges of PAC learning and improve its applicability to real-world problems.

Vapnik-Chervonenkis Dimension

The Vapnik-Chervonenkis (VC) dimension is a measure of the capacity of a hypothesis set to fit different data sets. It was introduced by Vladimir Vapnik and Alexey Chervonenkis in the 1970s and has become a fundamental concept in statistical learning theory. As a measure of model complexity, it helps us understand how well a model can fit different data sets.

The VC dimension of a hypothesis set H is the largest number of points that can be shattered by H. A hypothesis set H shatters a set of points S if, for every possible labeling of the points in S, there exists a hypothesis in H that correctly classifies the points. In other words, a hypothesis set shatters a set of points if it can fit any possible labeling of those points (see the sketch below).
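
To make shattering concrete, here is a minimal brute-force check (the 1-D threshold class and the point sets are illustrative assumptions). Threshold classifiers h_t(x) = [x ≥ t] have VC dimension 1: one point can be shattered, but two cannot.

# Minimal sketch: brute-force shattering check for 1-D threshold
# classifiers h_t(x) = (x >= t). This class has VC dimension 1.
from itertools import product

def can_shatter(points):
    """True if every +/- labeling of points is realized by some threshold."""
    xs = sorted(points)
    # Candidate thresholds: below all points, between neighbors, above all points.
    thresholds = [xs[0] - 1] + [(a + b) / 2 for a, b in zip(xs, xs[1:])] + [xs[-1] + 1]
    for labeling in product([False, True], repeat=len(points)):
        if not any(all((x >= t) == lab for x, lab in zip(points, labeling))
                   for t in thresholds):
            return False  # this labeling is unreachable for every threshold
    return True

print(can_shatter([2.0]))       # True: a single point can be shattered
print(can_shatter([1.0, 3.0]))  # False: labeling (+, -) would need 1 >= t > 3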

Bounds of the VC Dimension

The VC dimension provides both upper and lower bounds on the number of training examples required to achieve a given level of accuracy. Both bounds grow roughly linearly in the VC dimension d: a standard upper bound on the sufficient sample size is on the order of (1/ε)(d·log(1/ε) + log(1/δ)), while the lower bound on the necessary sample size is on the order of d/ε.
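
As a hedged sketch of how such a bound is used in practice, the function below computes one common formulation of the VC-based upper bound, m ≥ (1/ε)(4·log2(2/δ) + 8·d·log2(13/ε)); the constants follow Mitchell's statement of the result and should be treated as one formulation among several.

# Minimal sketch: VC-dimension-based sample-size upper bound,
# m >= (1/eps) * (4*log2(2/delta) + 8*d*log2(13/eps)).
# Constants follow one common textbook formulation and vary by source.
import math

def vc_sample_size(d: int, eps: float, delta: float) -> int:
    """Samples sufficient (per this bound) for error < eps w.p. >= 1 - delta."""
    return math.ceil((4 * math.log2(2 / delta)
                      + 8 * d * math.log2(13 / eps)) / eps)

# Example: hypothesis class with VC dimension 3, 10% error, 95% confidence.
print(vc_sample_size(d=3, eps=0.1, delta=0.05))  # -> 1899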

Applications of the VC Dimension
The VC dimension has a wide range of applications in machine learning and statistics. For example, it is used to analyze the complexity of neural networks, support vector machines, and decision trees. The VC dimension can also be used to design new learning algorithms that are robust to noise and can generalize well to unseen data.

The VC dimension can be extended to more complex learning scenarios, such as multiclass classification and regression. The concept of the VC dimension can also be applied to other areas of computer science, such as computational geometry and graph theory.

