
NB4-09: Biomedical Imaging and PyTorch IV

Goal: Create a notebook in which we use data augmentation and early stopping, i.e., we
stop training once the model has not improved for a certain number of epochs. We will
use a skin lesion dataset similar to the previous one, but with 7 classes.

1. Introduction

Data Augmentation and Early Stopping are two crucial techniques in machine
learning and deep learning that help improve the performance and generalization of
models. Here’s an explanation of their importance:

A. Data Augmentation

Definition: Data augmentation involves creating new training examples from the
existing data by applying random transformations such as rotation, translation,
flipping, scaling, noise addition, and more.

Importance:

 Enhances Model Generalization: By introducing variability in the training
data, data augmentation helps the model generalize better to unseen data. This
reduces overfitting, where the model performs well on the training data but
poorly on new data.

 Improves Performance on Small Datasets: When there is limited data, data
augmentation effectively increases the size of the training dataset, giving the
model more diverse examples to learn from.

 Increases Model Robustness: Augmented data often simulates real-world
variations and distortions, making the model more robust to such scenarios
during inference.

 Supports Balanced Learning: In cases of imbalanced datasets (where some
classes have significantly fewer examples), data augmentation can help balance
the dataset by generating more examples of the minority classes.

Example: In image classification, rotating an image slightly or flipping it horizontally
creates a new training example. The model learns to recognize the object regardless
of its orientation, making it more versatile.
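
For instance, a couple of the transformations above can be expressed with torchvision
as in the following sketch (the exact transforms and parameters used later in the
notebook may differ):

```python
# A minimal augmentation sketch with torchvision; parameters are illustrative only.
import torchvision.transforms as T

augment = T.Compose([
    T.RandomHorizontalFlip(p=0.5),                 # flip the image half of the time
    T.RandomRotation(degrees=15),                  # rotate by a small random angle
    T.ColorJitter(brightness=0.2, contrast=0.2),   # mild photometric changes
])
# Applying `augment` to a PIL image returns a new, randomly transformed image,
# effectively creating an extra training example on the fly.
```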

B. Early Stopping

Definition: Early stopping is a regularization technique where the training of a model
is halted when the performance on a validation dataset stops improving or begins to
degrade.

Importance:

 Prevents Overfitting: As training progresses, a model might start to
memorize the training data, leading to overfitting. Early stopping detects when
this occurs by monitoring the validation loss or accuracy and stops training
before the model overfits.

 Saves Computational Resources: Training deep models can be
computationally expensive. Early stopping can prevent unnecessary training
epochs once optimal performance is reached, saving time and resources.

 Optimizes Model Performance: By stopping at the point where the model
performs best on the validation data, early stopping ensures that the model is
as generalized as possible without compromising on performance.

 Reduces the Need for Manual Tuning: Finding the right number of training
epochs manually can be challenging. Early stopping automates this process by
dynamically determining the best point to stop training.

Example: Suppose you're training a neural network and notice that the validation
loss starts increasing after 50 epochs, while the training loss continues to decrease.
Early stopping would halt the training process at this point, preventing the model from
overfitting to the training data.
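
A minimal early-stopping helper could look like the following sketch, assuming we
monitor the validation loss and tolerate `patience` epochs without improvement:

```python
# Stop training once the validation loss has not improved for `patience` epochs.
class EarlyStopping:
    def __init__(self, patience=5, min_delta=0.0):
        self.patience = patience
        self.min_delta = min_delta
        self.best_loss = float('inf')
        self.counter = 0
        self.stop = False

    def step(self, val_loss):
        if val_loss < self.best_loss - self.min_delta:
            self.best_loss = val_loss   # improvement: remember it and reset the counter
            self.counter = 0
        else:
            self.counter += 1           # no improvement this epoch
            if self.counter >= self.patience:
                self.stop = True        # signal the training loop to halt
        return self.stop
```

Inside the training loop we would call `early_stopping.step(val_loss)` after each
validation pass and break out of the loop when it returns True.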

2. Setting up Our Workspace

First, we check whether a GPU is connected. The nvidia-smi command (NVIDIA System
Management Interface) is a command-line utility provided by NVIDIA to monitor and
manage NVIDIA GPUs (Graphics Processing Units). It provides detailed information
about the status and performance of the GPUs, including GPU utilization, temperature,
memory usage, and the processes currently using the GPU.
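
In a Colab cell this check might look as follows (the `!` prefix runs a shell command
from the notebook):

```python
# Inspect the GPU from the shell and confirm that PyTorch can see it.
!nvidia-smi

import torch
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
print('Using device:', device)
```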

Setting our workspace: /content and /content/datasets

Setting our Home

We save the root directory of the project '/content' as 'HOME' since we will be
navigating through the directory to have multiple projects under the same HOME.
Additionally, we will have the datasets in the 'datasets' directory, so all datasets are
easily accessible for any project.
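
A sketch of this setup, using the paths mentioned above:

```python
# Save the project root as HOME and create the shared datasets directory.
import os

HOME = '/content'
DATASETS = os.path.join(HOME, 'datasets')
os.makedirs(DATASETS, exist_ok=True)   # create /content/datasets if it does not exist
os.chdir(HOME)                         # work from the project root
print('HOME:', HOME)
```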

Mount Google Drive

Next, we import the drive module from the google.colab library, which provides
functionality for mounting Google Drive in Google Colab.

Additionally, Google Drive is mounted in Google Colab and made available at the
path /content/drive. The user will be prompted to authorize access to Google Drive.
Once authorized, the content of Google Drive will be accessible from that point
onwards in the Colab notebook.
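
The standard Colab snippet for this is:

```python
# Mount Google Drive; an authorization prompt will appear the first time.
from google.colab import drive
drive.mount('/content/drive')
```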

3. Load a Dataset (Dataloader)

Create a directory where we can save our dataset

Create the dataset directory (if it doesn't exist), where we are going to save the
dataset with which we are going to train our CNN.

Inspect the Dataset: Skin Lesion Detection in 7 Classes


The dataset contains several thousand skin lesion images organized into seven
subdirectories, one per class. The class mapping is as follows (a small snippet for
listing it is sketched after the class list):

 0: 'akiec' - actinic keratosis

 1: 'bcc' - basal cell carcinoma

 2: 'bkl' - benign keratosis

 3: 'df' - dermatofibroma

 4: 'mel' - melanoma

 5: 'nv' - melanocytic nevus

 6: 'vasc' - vascular lesion
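
A sketch of such a snippet, assuming the dataset was extracted to a folder like
'/content/datasets/skin_lesions' (the actual path in the notebook may differ):

```python
# List the class subdirectories and map them to integer indices.
import os

data_dir = '/content/datasets/skin_lesions'       # assumed location of the dataset
classes = sorted(os.listdir(data_dir))            # e.g. ['akiec', 'bcc', 'bkl', ...]
idx_to_class = {i: c for i, c in enumerate(classes)}
print(idx_to_class)
```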

Setting a Dataloader

A DataLoader is fundamental in machine learning and deep learning, especially when
working with large or complex datasets. Its main purpose is to facilitate the efficient
loading and manipulation of data during model training.

Transform the dataloaders for data augmentation

 Initial Transformations: Various augmentation techniques like flipping,
cropping, rotating, color jittering, affine transformations, blurring, and
perspective changes are defined.

 Enhancing Transformations: Each augmentation technique is enhanced by
converting images to tensors and normalizing them.

 Final Output: The list of transformations is updated and printed.

These transformations are typically applied during the training process to increase the
diversity of the training data, helping to improve the generalization of the deep
learning model.
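
A sketch of this pipeline, covering a few of the augmentations listed above; the
normalization statistics here are placeholders until we compute the real ones in the
next step:

```python
# Build one Compose pipeline per augmentation, each ending in ToTensor + Normalize.
import torchvision.transforms as T

mean, std = [0.5, 0.5, 0.5], [0.5, 0.5, 0.5]   # placeholder statistics

augmentations = [
    T.RandomHorizontalFlip(p=1.0),
    T.RandomRotation(degrees=20),
    T.ColorJitter(brightness=0.3, contrast=0.3),
    T.RandomAffine(degrees=0, translate=(0.1, 0.1)),
    T.GaussianBlur(kernel_size=3),
    T.RandomPerspective(distortion_scale=0.2, p=1.0),
]

train_transforms = [
    T.Compose([aug, T.ToTensor(), T.Normalize(mean, std)]) for aug in augmentations
]
print(train_transforms)
```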

Normalize the dataloaders using Statistics

 Normalization: Normalization is crucial for ensuring that pixel values across
images are on a similar scale [0, 1], which helps in stabilizing and speeding up
the training process of deep neural networks.

 Dataset Preparation: Each dataset (train_data, val_set, test_set) is prepared
with consistent transformations and normalization, facilitating uniformity in data
processing across training, validation, and testing phases.

This setup ensures that the datasets are properly preprocessed and ready to be used
in training and evaluating machine learning models, particularly deep neural
networks, using PyTorch.
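
One way to obtain such statistics is to compute the per-channel mean and standard
deviation over the training set, as in this sketch (it assumes `train_data` already
yields tensor images, e.g. via ToTensor):

```python
# Accumulate per-channel sums over the whole training set to get mean and std.
import torch
from torch.utils.data import DataLoader

loader = DataLoader(train_data, batch_size=64, shuffle=False)
channel_sum = torch.zeros(3)
channel_sq_sum = torch.zeros(3)
n_pixels = 0
for images, _ in loader:                               # images: (B, C, H, W)
    n_pixels += images.numel() / images.shape[1]       # pixels per channel in this batch
    channel_sum += images.sum(dim=[0, 2, 3])
    channel_sq_sum += (images ** 2).sum(dim=[0, 2, 3])

mean = channel_sum / n_pixels
std = (channel_sq_sum / n_pixels - mean ** 2).sqrt()
print('mean:', mean, 'std:', std)
```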

The train set is unchanged in size because ``transform`` only transforms the data on
the fly; it does not add new samples to the dataset.

Data Augmentation

The train set grows in size because ConcatDataset() concatenates the original dataset
with its augmented copies, as sketched below.
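
A sketch of this augmentation step, assuming `train_dir` points to the training images
and `train_transforms` is the list of composed transforms built earlier:

```python
# Load the same image folder once per transform and concatenate the results,
# so the number of training samples is multiplied by the number of pipelines.
from torch.utils.data import ConcatDataset
from torchvision import datasets

augmented_sets = [datasets.ImageFolder(train_dir, transform=t) for t in train_transforms]
train_data = ConcatDataset(augmented_sets)
print('Augmented train set size:', len(train_data))
```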

Displaying all classes

Let us show one example for each class, for fun. As we've transformed the image by
normalizing it, we should undo the transformation before visualizing the image.
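
A sketch of undoing the normalization before plotting, using the same `mean` and `std`
that were passed to Normalize:

```python
# Reverse (x - mean) / std channel-wise so the image is displayable again.
import matplotlib.pyplot as plt

def denormalize(img, mean, std):
    for c in range(img.shape[0]):          # img has shape (C, H, W)
        img[c] = img[c] * std[c] + mean[c]
    return img.clamp(0, 1)

img, label = train_data[0]
plt.imshow(denormalize(img.clone(), mean, std).permute(1, 2, 0))
plt.title(idx_to_class[label])
plt.axis('off')
plt.show()
```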

Setting Hyperparameters

Define batch_size, epochs and obtain the number of classes

We are going to define some training parameters for the network, such as the batch
size, the number of epochs, and the number of classes in the dataset, since they are
needed by the dataloaders and the training loop. We will run only 10 epochs to check
functionality. Later, we will load a model that has already been trained for 30 epochs.
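
For example (the exact batch size is an assumption):

```python
# Training parameters used by the dataloaders and the training loop.
batch_size = 32
epochs = 10                     # a quick functional check, as explained above
num_classes = len(classes)      # 7 classes for this dataset
print(batch_size, epochs, num_classes)
```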

Display all images and their ground truth from a random batch

To see how the DataLoader works and how it handles the loaded data, we will select a
random batch and display it, indicating the class label of each image. In other words,
dataloaders make it easy to display all images of a random batch together with their
ground truth.

Alongside 'normal' images, we should observe transformed images.
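
A sketch of this visualization, assuming a `train_loader` DataLoader built from the
augmented train set and the `denormalize` helper defined earlier:

```python
# Take one batch, undo the normalization, and show the images with their labels.
import torch
import matplotlib.pyplot as plt
from torchvision.utils import make_grid

images, labels = next(iter(train_loader))
imgs = torch.stack([denormalize(im.clone(), mean, std) for im in images])
plt.figure(figsize=(12, 6))
plt.imshow(make_grid(imgs, nrow=8).permute(1, 2, 0))
plt.title(' '.join(idx_to_class[int(l)] for l in labels))
plt.axis('off')
plt.show()
```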

4. Define a Convolutional Neural Network

Define the Model

To enhance the previous CNN model, we can make several adjustments:

1. Dropout Layers: Add dropout layers to reduce overfitting.

2. Residual Connections: Adding residual connections can improve the learning
capability of deep networks.

3. Global Average Pooling: Replace the Flatten layer with a global average
pooling layer to reduce the number of parameters and prevent overfitting.

4. Learning Rate Scheduler: Implement a scheduler to dynamically adjust the
learning rate during training.

5. Weight Initialization: Properly initialize the weights to improve convergence.

6. Data Augmentation: While not part of the model architecture, performing data
augmentation during training can improve performance.

Explanation of Changes:

1. Dropout: Added nn.Dropout after the linear layers to reduce overfitting.

2. Global Average Pooling: Replaced nn.Flatten with nn.AdaptiveAvgPool2d to
reduce dimensionality without explicitly flattening.

3. Weight Initialization: Added a weight initialization function that uses Kaiming
initialization to improve convergence.

4. Residual Connections: Not included in this version but can be considered for a
more advanced version.

These changes can improve the model's generalization capability and efficiency.
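
The sketch below illustrates a model with these changes (dropout, global average
pooling, Kaiming initialization); it is an example in this spirit, not the exact
architecture used in the notebook:

```python
# A small CNN with dropout, global average pooling, and Kaiming weight initialization.
import torch.nn as nn

class SkinLesionCNN(nn.Module):
    def __init__(self, num_classes=7):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.BatchNorm2d(32), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.BatchNorm2d(64), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.BatchNorm2d(128), nn.ReLU(),
        )
        self.pool = nn.AdaptiveAvgPool2d(1)    # global average pooling instead of Flatten
        self.classifier = nn.Sequential(
            nn.Dropout(0.5),                   # dropout to reduce overfitting
            nn.Linear(128, num_classes),
        )
        self.apply(self._init_weights)

    @staticmethod
    def _init_weights(m):
        # Kaiming initialization for conv and linear layers
        if isinstance(m, (nn.Conv2d, nn.Linear)):
            nn.init.kaiming_normal_(m.weight, nonlinearity='relu')
            if m.bias is not None:
                nn.init.zeros_(m.bias)

    def forward(self, x):
        x = self.pool(self.features(x))
        return self.classifier(x.flatten(1))
```

A learning-rate scheduler such as torch.optim.lr_scheduler.ReduceLROnPlateau can then
be attached to the optimizer, reducing the learning rate when the validation loss
plateaus.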

5. Train the network

Create Train directories

To create directories named train1, train2, etc., each time you execute a training loop,
you can modify the code to check the number of existing training directories and then
create the next directory in sequence. Here's an example of how you could do this:
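
A sketch of that logic, assuming the run directories live under a 'runs' folder (the
notebook may create them elsewhere):

```python
# Find the highest existing trainN directory and create the next one in sequence.
import os

def create_run_dir(root='runs'):
    os.makedirs(root, exist_ok=True)
    existing = [d for d in os.listdir(root) if d.startswith('train') and d[5:].isdigit()]
    next_idx = max((int(d[5:]) for d in existing), default=0) + 1
    run_dir = os.path.join(root, f'train{next_idx}')
    os.makedirs(run_dir)
    return run_dir

run_dir = create_run_dir()
print('Saving this run to:', run_dir)
```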

6. Validating our model

This section consists entirely of code; a minimal validation-pass sketch is shown below.
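
The sketch assumes `model`, `val_loader`, `criterion`, and `device` are defined earlier
in the notebook:

```python
# One validation pass: average loss and accuracy over the validation set.
import torch

def validate(model, val_loader, criterion, device):
    model.eval()
    total_loss, correct, total = 0.0, 0, 0
    with torch.no_grad():
        for images, labels in val_loader:
            images, labels = images.to(device), labels.to(device)
            outputs = model(images)
            total_loss += criterion(outputs, labels).item() * images.size(0)
            correct += (outputs.argmax(dim=1) == labels).sum().item()
            total += labels.size(0)
    return total_loss / total, correct / total
```

The returned validation loss is the quantity that the early-stopping helper from
Section 1 would monitor.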

7. Predictions (Inference)

Testing

For this multiclass classification test, we need to change a few things:

1. Class Number Check:

o The code now checks if the confusion matrix is 2x2 to determine if it is a
binary classification problem. If so, it extracts the values of TN, FP, FN,
and TP. If not, it simply prints the entire confusion matrix.

2. Confusion Matrix Visualization:

o It uses seaborn.heatmap to visualize the confusion matrix, which helps to
better understand the model's performance.

3. Classification Report:

o Prints a detailed classification report using classification_report from sklearn,
providing additional metrics like precision, recall, and F1-score for each class.

These changes should help you better understand your model's results and ensure
that the code handles different numbers of classes correctly.
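
A sketch of that evaluation, assuming `all_labels` and `all_preds` were collected
during the test loop and `classes` is the list of class names:

```python
# Build the confusion matrix, plot it with seaborn, and print per-class metrics.
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.metrics import confusion_matrix, classification_report

cm = confusion_matrix(all_labels, all_preds)
if cm.shape == (2, 2):
    tn, fp, fn, tp = cm.ravel()       # binary case: unpack the four counts
    print('TN, FP, FN, TP:', tn, fp, fn, tp)
else:
    print(cm)                         # multiclass: print the full matrix

sns.heatmap(cm, annot=True, fmt='d', cmap='Blues',
            xticklabels=classes, yticklabels=classes)
plt.xlabel('Predicted')
plt.ylabel('True')
plt.show()

print(classification_report(all_labels, all_preds, target_names=classes))
```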
