
DEEP LEARNING: LEARNING CURVES TO DIAGNOSE PERFORMANCE

HEMANT THAPA
Learning curves are a widely used diagnostic tool in machine learning for algorithms that
learn from a training dataset incrementally. The model can be evaluated on the training
dataset and on a holdout validation dataset after each update during training, and plots
of the measured performance can be created to show learning curves.
Reviewing learning curves during training can help diagnose problems with
learning, such as an underfit or overfit model, as well as reveal whether the
training and validation datasets are suitably representative.
In this document, you will discover learning curves and how they can be used to
diagnose the learning and generalization behavior of machine learning models, with
example plots showing common learning problems.
After reading this document, you will know:
1. Learning curves are plots that show changes in learning performance over time, in terms of experience.
2. Learning curves of model performance on the training and validation datasets can be used to diagnose an underfit, overfit, or well-fit model.
3. Learning curves of model performance can be used to diagnose whether the training or validation dataset is relatively unrepresentative of the problem domain.
1. Understanding Learning Curves in Machine Learning
A learning curve is like a friendly graph that captures how you're getting better at
something over time. Imagine the x-axis as time or experience, and the y-axis as your
progress or improvement.
Learning curves (LCs) are deemed effective tools for monitoring the performance of
workers exposed to a new task. LCs provide a mathematical representation of the
learning process that takes place as task repetition occurs.
— Learning curve models and applications: Literature review and research directions,
2011.
Here's a real-world example: if you were learning to play a musical
instrument, someone could give you a score every week for a year.
Plotting those scores over the 52 weeks would create a learning curve.
This curve would help you see how you're doing on the instrument as time
goes by.
So, what exactly is a learning curve? It's just a line graph that tells you how much you're
learning (that's on the up-and-down side) as you gather more experience (that's on the
left-to-right side).
In the world of machine learning, we use learning curves to keep an eye on algorithms
that learn and improve gradually, like those fancy neural networks in deep learning.
The way we measure learning might be like trying to get the highest score possible,
where bigger numbers mean more learning. Think of it as a game of maximizing.
But sometimes, we use a score where smaller numbers are better, like errors or losses.
Here, lower numbers mean you're getting better. If the score hits 0.0, you've aced the
training and made zero mistakes.
When we're training a machine learning model, we can check how well it's doing at each
step. We test it on the training dataset to see how much it's "learning." Then we test it
on a separate validation dataset that wasn't part of the training. This tells us how well
the model is "generalizing," or applying its learning to new stuff.
There are two types of learning curves we often make:
1. Train Learning Curve: This one uses the training data to show how well the model is learning.
2. Validation Learning Curve: Here, we use the validation data to see how well the model is doing on new things.
We usually make both curves while the model is learning, using both the
training and validation datasets, as in the sketch below.
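
As a concrete illustration, here is a minimal sketch of collecting both curves
with Keras. The toy data, model, and hyperparameters are placeholder
assumptions chosen only so the snippet runs end to end.

    import numpy as np
    from tensorflow import keras

    # Toy binary-classification data standing in for a real dataset.
    X = np.random.rand(1000, 20)
    y = (X.sum(axis=1) > 10).astype(int)

    model = keras.Sequential([
        keras.Input(shape=(20,)),
        keras.layers.Dense(16, activation="relu"),
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])

    # validation_split holds out 20% of the data; Keras evaluates on it
    # after every epoch, producing both learning curves at once.
    history = model.fit(X, y, epochs=50, validation_split=0.2, verbose=0)

    # history.history now holds per-epoch values under the keys
    # 'loss', 'val_loss', 'accuracy', and 'val_accuracy'.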
Sometimes, we even make curves for more than one thing, like in problems where we
predict categories. Imagine tuning a model based on both how wrong it is (loss) and how
many things it gets right (accuracy). You'd end up with two plots, each showing two
learning curves – one for training and one for validation.
So, we've got:
1. Optimization Learning Curves: These show how the model's parameters are getting better over time (using a measure like loss).
2. Performance Learning Curves: These tell us how the model is doing, based on the evaluation criteria we care about (like accuracy).
Learning curves are like our learning buddies, helping us watch how well our models are
catching on and how good they are becoming.
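
To make the two kinds of curves concrete, here is a minimal matplotlib
sketch, assuming a Keras-style history object like the one produced in the
earlier snippet.

    import matplotlib.pyplot as plt

    fig, (ax_loss, ax_acc) = plt.subplots(1, 2, figsize=(10, 4))

    # Optimization learning curves (loss: lower is better).
    ax_loss.plot(history.history["loss"], label="train")
    ax_loss.plot(history.history["val_loss"], label="validation")
    ax_loss.set_xlabel("epoch")
    ax_loss.set_ylabel("loss")
    ax_loss.legend()

    # Performance learning curves (accuracy: higher is better).
    ax_acc.plot(history.history["accuracy"], label="train")
    ax_acc.plot(history.history["val_accuracy"], label="validation")
    ax_acc.set_xlabel("epoch")
    ax_acc.set_ylabel("accuracy")
    ax_acc.legend()

    plt.tight_layout()
    plt.show()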
2. Understanding Model Behavior Through Learning Curves
The structure and patterns within a learning curve provide valuable insights for
diagnosing the behavior of a machine learning model. This, in turn, can guide
recommendations for potential configuration adjustments to enhance both learning and
performance.
Three prevalent dynamics often manifest in learning curves. They are as follows:
1. Underfitting
2. Overfitting
3. Optimal Fit

3. Identifying Underfitting Through Learning Curves


Underfitting refers to a model's inability to grasp the training dataset.
This issue arises when the model struggles to achieve a sufficiently low error value on
the training set.
— Cited from "Deep Learning," 2016, Page 111.
Recognition of an underfit model typically revolves around the learning curve related to
training loss.
This curve might display a flat line or exhibit fluctuating values that represent relatively
high loss. Such behavior indicates the model's failure to comprehend the intricacies of
the training dataset.
This commonly occurs when the model lacks the requisite complexity to handle
the intricacies inherent in the dataset.
An underfit model can also reveal itself through a specific pattern in the training loss
curve. If the training loss keeps decreasing and continues to do so at the plot's end, it's
a sign.
This pattern suggests that the model has more room to learn and improve, and that the
training process might have been stopped prematurely.
A plot displaying learning curves indicates underfitting when:
1. The training loss remains stagnant despite ongoing training.
2. The training loss keeps decreasing steadily until the conclusion of the training.
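
A rough heuristic for the two signatures above can be run directly against
the per-epoch training losses. The function name, tolerance, and window below
are illustrative assumptions, not standard values.

    def diagnose_underfitting(train_loss, flat_tol=0.01, tail=5):
        # Signature 1: loss barely moved over the whole run.
        if abs(train_loss[0] - train_loss[-1]) < flat_tol * train_loss[0]:
            return "possibly underfit: training loss is nearly flat"
        # Signature 2: loss is still falling steadily at the end,
        # suggesting training was stopped prematurely.
        recent = train_loss[-tail:]
        if all(a > b for a, b in zip(recent, recent[1:])):
            return "possibly underfit: loss still decreasing; train longer"
        return "no underfitting signature detected"

    print(diagnose_underfitting(history.history["loss"]))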

4. Recognizing Overfitting Using Learning Curves


Overfitting occurs when a model becomes too attuned to the training dataset, even
capturing its statistical noise and random fluctuations.
fitting a more flexible model requires estimating a greater number of
parameters. These more complex models can lead to a phenomenon
known as overfitting the data, which essentially means they follow the
errors, or noise, too closely.
— Extract from "An Introduction to Statistical Learning: with Applications in R," 2013,
Page 22.
The challenge with overfitting is that as the model becomes increasingly specialized to
the training data, its capacity to generalize to new data diminishes. Consequently,
generalization error rises. The magnitude of this increase in generalization error can be
assessed through the model's performance on the validation dataset.
This is an example of overfitting the data. It is an undesirable situation
because the fit obtained will not yield accurate estimates of the response
on new observations that were not part of the original training data set.
— Extract from "An Introduction to Statistical Learning: with Applications in R," 2013,
Page 24.
Overfitting often arises when the model possesses more capacity than required for the
task, resulting in excessive flexibility. It can also manifest due to excessive training.
Learning curve plots indicate overfitting when:
1. The training loss curve continues to decline alongside accumulated experience.
2. The validation loss curve initially drops but subsequently starts ascending.
The inflection point in the validation loss marks the juncture where training
should potentially cease, as subsequent experience showcases overfitting
tendencies.
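
In practice, that inflection point can be located after training with a simple
argmin, or avoided during training with Keras's EarlyStopping callback. The
patience value below is an illustrative assumption, and the model, X, and y
are carried over from the earlier sketch.

    import numpy as np
    from tensorflow import keras

    # After the fact: the epoch at which validation loss bottomed out
    # marks the inflection point described above.
    best_epoch = int(np.argmin(history.history["val_loss"]))
    print(f"validation loss was lowest at epoch {best_epoch}")

    # During training: stop once val_loss has not improved for
    # `patience` epochs and roll back to the best weights seen.
    early_stop = keras.callbacks.EarlyStopping(
        monitor="val_loss", patience=5, restore_best_weights=True
    )
    history = model.fit(X, y, epochs=200, validation_split=0.2,
                        callbacks=[early_stop], verbose=0)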

5. Identifying Good Fit Through Learning Curves


Achieving a good fit represents the objective of the learning algorithm, positioned
between the extremes of overfitting and underfitting.
A good fit is recognized by both training and validation losses descending to a state of
stability, accompanied by a minimal disparity between their respective concluding loss
values.
Typically, a model's loss on the training dataset will be lower than that on the validation
dataset. Consequently, a discernible gap tends to exist between the learning curves of
training and validation losses—a gap termed the "generalization gap."
Learning curve plots illustrate a good fit when:
1. The training loss curve descends to a state of stability.
2. The validation loss curve similarly attains stability and maintains a narrow gap with the training loss.
Further training of a model displaying a good fit is liable to lead to
overfitting.
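
A quick numeric check of these criteria, again assuming the history object
from the earlier sketches; the 10% threshold is an illustrative assumption
rather than a standard.

    train_loss = history.history["loss"]
    val_loss = history.history["val_loss"]

    # The generalization gap: the difference between the final
    # validation and training losses.
    gap = val_loss[-1] - train_loss[-1]
    print(f"final training loss:   {train_loss[-1]:.4f}")
    print(f"final validation loss: {val_loss[-1]:.4f}")
    print(f"generalization gap:    {gap:.4f}")

    # A gap that is small relative to the training loss is
    # consistent with a good fit (threshold chosen for illustration).
    if abs(gap) < 0.1 * train_loss[-1]:
        print("small gap: consistent with a good fit")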

6. Diagnosing Unrepresentative Datasets Through Learning Curves

Learning curves extend their utility to diagnosing dataset attributes and assessing their
relative representativeness.
An unrepresentative dataset is one that fails to capture the statistical
characteristics of another dataset drawn from the same domain. This
discrepancy often arises between a training and a validation dataset, and it
can stem from an insufficient number of samples in one dataset when
contrasted with the other.
Two prevailing scenarios warrant consideration:
1. Relatively Unrepresentative Training Dataset
2. Relatively Unrepresentative Validation Dataset

7. Detecting Unrepresentative Training Datasets


An unrepresentative training dataset signifies that the provided training data lacks
adequate information to effectively grasp the problem, especially when juxtaposed with
the validation dataset utilized for assessment.
This circumstance may arise due to a scarcity of training examples when compared to
the validation dataset.
Such a scenario becomes evident through learning curves: both the training
loss curve and the validation loss curve show improvement, yet a substantial
gap persists between the two.
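
This signature can be checked programmatically as a rough sketch; the
function name, warm-up window, and 0.8 ratio below are illustrative
assumptions.

    def gap_is_persistent(train_loss, val_loss, window=5, ratio=0.8):
        """Both curves improve, yet the gap between them never closes."""
        early_gap = val_loss[window] - train_loss[window]
        late_gap = val_loss[-1] - train_loss[-1]
        improving = (train_loss[-1] < train_loss[0]
                     and val_loss[-1] < val_loss[0])
        # If the late gap is still nearly as large as the early gap
        # even though both losses improved, the training set may be
        # too small or unrepresentative.
        return improving and late_gap > ratio * early_gap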

8. Identifying Unrepresentative Validation Datasets


An unrepresentative validation dataset denotes a situation where the validation data
lacks the necessary information to assess the model's capacity to generalize.
Such instances can arise if the validation dataset contains an insufficient number of
examples compared to the training dataset.
This scenario can be recognized through learning curves: the training loss curve adopts
a pattern similar to a good fit (or other fits), while the validation loss curve exhibits
erratic fluctuations around the training loss curve.
It can also be discerned by a validation loss that registers lower than the training loss. In
such instances, this points towards the possibility that the model finds the validation
dataset comparatively easier to predict than the training dataset.
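
One way to quantify "erratic fluctuations" is to compare the epoch-to-epoch
volatility of the two curves. The function name and the 3x ratio below are
illustrative assumptions.

    import numpy as np

    def validation_is_noisy(train_loss, val_loss, ratio=3.0):
        """Flag a validation curve that bounces around far more than
        the training curve, suggesting too few validation examples."""
        train_jumps = np.abs(np.diff(train_loss)).mean()
        val_jumps = np.abs(np.diff(val_loss)).mean()
        return val_jumps > ratio * train_jumps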

Summary
1. Learning curves manifest changes in learning performance over time, corresponding
to experience.
2. Learning curves of model performance across both training and validation datasets
serve as diagnostic tools for identifying underfitting, overfitting, or optimal fitting
models.
3. Learning curves of model performance also help in ascertaining the potential
mismatch between the train or validation datasets and the problem domain's
representation.

References:
Machine Learning Mastery. (n.d.). Learning Curves for Diagnosing Machine
Learning Model Performance. Retrieved from
https://machinelearningmastery.com/learning-curves-for-diagnosing-machine-learning-model-performance/
Dimleve. (n.d.). Back Propagation Explained. Retrieved from
https://dimleve.medium.com/back-propagation-explained-9720c2d4a566
Stanford University. (n.d.). MultiLayer Neural Networks. Deep Learning
Tutorial. Retrieved from
http://deeplearning.stanford.edu/tutorial/supervised/MultiLayerNeuralNetworks/
