DSOST3

Machine learning (ML) is a branch of artificial intelligence that enables computers to learn from data and make predictions. It includes three main types: supervised learning, unsupervised learning, and reinforcement learning, each with distinct methodologies and applications. The document also discusses the importance of training, validation, and test datasets in model evaluation and tuning, along with concepts like learning curves, overfitting, and regularization strategies.


Unit-3

Machine Learning
Introduction
Machine learning (ML) is a subset of artificial
intelligence (AI) that focuses on developing algorithms
that allow computers to learn from data and make
decisions or predictions without being explicitly
programmed.

It involves training models using large amounts of data so they can detect patterns, recognize trends, and improve over time based on new inputs.
There are three main types of machine learning:

Supervised Learning
Unsupervised Learning
Reinforcement Learning
Supervised Learning:
The model is trained on labeled data, where the correct output is already known. The goal is to map input data to the correct output (e.g., predicting house prices based on features like square footage).
Unsupervised Learning: The model is given data
without labels and must find hidden patterns or
structures on its own (e.g., clustering customers based
on purchasing behavior).
Reinforcement Learning: The model learns by
interacting with an environment and receiving feedback
in the form of rewards or penalties, which helps it
optimize its actions over time (e.g., training a robot to
navigate a maze).
Supervised Learning
Introduction
Machine learning involves writing programs that automatically improve their performance as they are exposed to information in data.
Supervised learning: Algorithms which learn from a
training set of labeled examples (exemplars) to
generalize to the set of all possible inputs.
Examples of techniques in supervised learning: logistic
regression, support vector machines, decision trees, random
forest, etc.
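As a toy illustration of supervised learning, the sketch below fits a straight line to labeled (square footage, price) pairs by ordinary least squares. The data and numbers are invented for illustration, not taken from the text.

```python
# A minimal supervised-learning sketch: fit y = w*x + b to labeled
# examples by ordinary least squares (pure Python, no libraries).

def fit_line(xs, ys):
    """Return slope w and intercept b minimizing squared error."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    w = cov / var
    b = mean_y - w * mean_x
    return w, b

# Labeled training set: square footage -> price (toy numbers).
sqft = [1000, 1500, 2000, 2500]
price = [200, 300, 400, 500]

w, b = fit_line(sqft, price)
print(w, b)          # learned parameters, roughly w=0.2, b=0
print(w * 1800 + b)  # prediction for an unseen input, roughly 360
```

Once w and b are learned from the labeled examples, the model can map a new, unlabeled input (1800 square feet) to a predicted output.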

Unsupervised learning: Algorithms that learn from a training set of unlabeled examples, used to explore data according to some statistical, geometric, or similarity criterion.
Examples of unsupervised learning include k-means clustering and kernel density estimation.
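A minimal k-means sketch in plain Python (Lloyd's algorithm on 1-D points); the data, initial centers, and iteration count are illustrative assumptions:

```python
# Toy unsupervised learning: k-means clustering on 1-D points.
# No labels are given; the algorithm finds the groups on its own.

def kmeans_1d(points, centers, iters=20):
    """Lloyd's algorithm: alternate assignment and centroid update."""
    for _ in range(iters):
        clusters = [[] for _ in centers]
        for p in points:
            # Assign each point to its nearest center.
            i = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[i].append(p)
        # Move each center to the mean of its assigned points.
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return centers

# Two obvious groups of "purchase amounts" (invented data).
data = [1.0, 1.2, 0.8, 10.0, 10.5, 9.5]
print(kmeans_1d(data, centers=[0.0, 5.0]))  # centers converge near 1 and 10
```

The similarity criterion here is plain distance; no correct output is ever shown to the algorithm.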
Reinforcement learning: Algorithms that learn via reinforcement from criticism that provides information on the quality of a solution, but not on how to improve it. Improved solutions are achieved by iteratively exploring the solution space.
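The idea of improving by reward feedback alone can be sketched with a two-action bandit: the agent only observes rewards (the quality of its choice), not how to improve, and explores the action space with an epsilon-greedy rule. The reward probabilities, epsilon, and step count are invented for illustration.

```python
# Minimal reinforcement-learning sketch: epsilon-greedy action selection.
import random

random.seed(0)
true_reward = {"left": 0.2, "right": 0.8}   # hidden from the agent
estimates = {"left": 0.0, "right": 0.0}     # the agent's value estimates
counts = {"left": 0, "right": 0}
epsilon = 0.1                               # exploration rate

for step in range(1000):
    # Explore occasionally; otherwise exploit the current best estimate.
    if random.random() < epsilon:
        action = random.choice(["left", "right"])
    else:
        action = max(estimates, key=estimates.get)
    reward = 1.0 if random.random() < true_reward[action] else 0.0
    # Incremental average: the reward signal refines the estimate.
    counts[action] += 1
    estimates[action] += (reward - estimates[action]) / counts[action]

print(estimates)  # the agent should come to prefer "right"
```

No example ever tells the agent which action is correct; iterated exploration of the action space plus reward feedback is what improves the policy.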
Learning Curves
A learning curve is a graphical representation of how a
model's performance improves as it is exposed to more
training data or as it goes through more iterations of
training.

Learning curves are helpful for understanding how well a model is learning and for diagnosing issues like overfitting or underfitting.
There are typically two types of learning curves in
supervised learning:
Training Error Curve: Shows how the error (or loss) of the model changes as more training data or iterations are used. Ideally, this curve should decrease as the model learns more about the data.
Validation Error Curve: Shows how the model's error behaves on a separate validation set that it hasn't seen during training. This curve is crucial for detecting overfitting: if the training error decreases while the validation error starts increasing, the model is overfitting.
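The two curves can be computed as sketched below for a 1-nearest-neighbour classifier on an invented 1-D dataset; the deliberately mislabeled point makes the training/validation gap visible.

```python
# Sketch of training vs. validation error for a memorizing model (1-NN).

def predict_1nn(train, x):
    """Label of the training point closest to x."""
    return min(train, key=lambda t: abs(t[0] - x))[1]

def error(model_data, data):
    wrong = sum(predict_1nn(model_data, x) != y for x, y in data)
    return wrong / len(data)

# Class 0 lives near x=1, class 1 near x=9; the last point is noise.
train_set = [(0.5, 0), (1.0, 0), (1.5, 0), (9.0, 1), (9.5, 1), (1.6, 1)]
val_set = [(0.8, 0), (1.7, 0), (8.8, 1), (9.9, 1)]

for n in range(2, len(train_set) + 1):
    subset = train_set[:n]
    print(n, error(subset, subset), error(subset, val_set))
# The training error is always 0 (1-NN memorizes its data), while the
# validation error exposes the memorized noise point: low training error
# with higher validation error is the overfitting signature.
```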
Underfitting (High Training and Validation Error):
This happens when the model is too simple (e.g., using too few features, or too simple a model, such as linear regression for a non-linear problem).
Overfitting (Low Training Error, High Validation
Error):
The model becomes too complex and starts to
memorize the training data, capturing noise instead of
general patterns.
The value both errors converge towards is also called the bias, and the difference between this value and the test error is called the variance.
The bias/variance decomposition of the learning curve
is an alternative approach to the training and
generalization view.
A good heuristic for selecting the model is to choose
the value of the hyperparameters that yields the
smallest estimated test error.

We may also change the formulation of the objective function to penalize complex models. This is called regularization. Regularization accounts for estimating the value of Ω in our out-of-sample error inequality. It usually becomes implicit in the algorithm but has huge consequences in real applications.
The most common regularization strategies are as
follows:
L2 weight regularization: Adding an L2 penalization
term to the weights of a weight-controlled model
implies looking for solutions with small weight values.
L1 weight regularization: Adding an L1 regularization
term forces sparsity in the weights of the model.
These penalty terms are added to the objective function, weighted by a regularization strength. Thus, we still have to select this parameter by means of model selection.
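The effect of an L2 penalty can be sketched with gradient descent on a one-parameter model y = w*x: the larger the regularization strength lam, the smaller the learned weight. The data, learning rate, and step count are illustrative choices.

```python
# Sketch of L2 weight regularization: minimize squared error plus
# lam * w**2 by gradient descent.

def fit(xs, ys, lam, lr=0.01, steps=2000):
    """Gradient descent on sum_i (w*x_i - y_i)**2 + lam * w**2."""
    w = 0.0
    for _ in range(steps):
        # Gradient of the data term plus the L2 penalty term.
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) + 2 * lam * w
        w -= lr * grad
    return w

xs, ys = [1.0, 2.0, 3.0], [2.0, 4.0, 6.0]  # exact relation y = 2x
print(fit(xs, ys, lam=0.0))    # ≈ 2.0 (no penalty)
print(fit(xs, ys, lam=14.0))   # ≈ 1.0 (the penalty shrinks the weight)
```

With an L1 penalty (lam * abs(w) instead of lam * w**2) the pressure toward zero does not fade as w gets small, which is why L1 regularization drives weights exactly to zero and produces sparse models.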
Training, Validation and Test
The concepts of training, validation, and test datasets
are essential to properly evaluate and tune models.
We need to select a model and control its complexity according to the amount of training data. Selecting the best hyperparameters means choosing the classifier whose parameter settings perform best. In practice, we pick a set of candidate hyperparameter values and use cross-validation to select the best configuration. The process of selecting the best hyperparameters is called validation.

This introduces a new set into our simulation scheme; we now need to divide the data we have into three sets: training, validation, and test sets.
The process of assessing the performance of the classifier by estimating the generalization error is called testing. And the process of selecting a model using the estimation of the generalization error is called validation.
Test data is used exclusively for assessing performance at the end of the process and will never be used in the learning process.
Validation data is used explicitly to select the parameters/models with the best performance according to an estimation of the generalization error. This is a form of learning.
Training data are used to learn the instance of the model from a model class.

Training ensures that the model learns from data.
Validation helps in model selection and hyperparameter tuning, ensuring that the model generalizes well and avoids overfitting.
Testing provides a final, unbiased performance assessment, allowing you to estimate how the model will perform on real-world data.
Given training data, in the most general case we explicitly have to tune some hyperparameters. To find the best model, we may try different splits using cross-validation on our training dataset and select the model with the best performance. A practical issue: once we have selected the model, we use the complete training set to train the final model.
Thus, we may proceed in the following way:
1. Split the original dataset into training and test data. For example, use 30% of the original dataset for testing purposes. This data is held back and will only be used to assess the performance of the method.
2. Use the remaining training data to select the hyperparameters by means of cross-validation.
3. Train the model with the selected parameters and assess the performance using the test dataset.
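The three steps above can be sketched end-to-end on an invented 1-D toy problem, using a simple threshold classifier whose candidate thresholds play the role of the hyperparameter being validated:

```python
# Sketch: hold out a test set, pick a hyperparameter by k-fold
# cross-validation on the rest, then assess once on the test set.
import random

random.seed(1)
# Toy data: the true label is 1 exactly when x > 5.
data = [(i * 0.1, int(i * 0.1 > 5)) for i in range(100)]
random.shuffle(data)

# 1. Hold back 30% of the data as the test set.
split = int(0.7 * len(data))
train, test = data[:split], data[split:]

def accuracy(threshold, subset):
    return sum((x > threshold) == bool(y) for x, y in subset) / len(subset)

def cv_score(threshold, folds=5):
    # 2. Estimate generalization error by k-fold cross-validation on the
    # training data only; the test set is never touched here.
    size = len(train) // folds
    return sum(accuracy(threshold, train[k * size:(k + 1) * size])
               for k in range(folds)) / folds

candidates = [3.0, 4.0, 5.0, 6.0, 7.0]
best = max(candidates, key=cv_score)

# 3. Assess the selected model once on the held-back test data.
print(best, accuracy(best, test))
```

Because the data are noiseless, cross-validation selects the true threshold (5.0) and the final test accuracy confirms it on data never used during selection.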
Learning Models
