
PROGRAM 5

AIM- Introduction to Transfer Learning and Fine-Tuning in Deep CNNs. Also, discuss
various pre-trained networks with their advantages and drawbacks.

THEORY-
1. Introduction to Transfer Learning: Transfer learning is a machine learning technique
where a model developed for one task is reused as the starting point for a model on a second
task. In deep learning, particularly with Convolutional Neural Networks (CNNs), transfer
learning has proven highly effective because CNNs trained on large datasets like ImageNet
have already learned useful features such as edge detection, shape recognition, and texture
identification. These features can be generalized across many different types of image
recognition tasks.
Instead of training a deep learning model from scratch (which can be computationally
expensive and time-consuming, requiring large datasets), transfer learning allows us to
leverage the knowledge learned by a pre-trained model.
2. Concept of Transfer Learning in Deep CNN: In deep CNNs, lower layers learn general
features like edges and textures, while higher layers learn more task-specific features. In
transfer learning, we use the lower layers of a pre-trained network as a feature extractor and
modify the higher layers to suit the new task.
Transfer learning in CNNs typically follows these steps:
• Using a Pre-trained Model: Load a model that has already been trained on a large
dataset, like ImageNet.
• Modifying the Architecture: Replace the output layer (classifier) of the model to suit
the new dataset's classes.
• Freezing Some Layers: Freeze the initial layers of the model to retain the learned
features.
• Training on New Data: Train the remaining layers and the new classifier on the new
dataset.
This significantly reduces training time and provides better performance, especially when the
new dataset is smaller.
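
These steps can be sketched in Keras. The snippet below is only a minimal illustration, not part of the original program: the choice of VGG16 as the base model, the number of classes, and the training dataset object train_ds are all assumptions.

import tensorflow as tf

# Load a model pre-trained on ImageNet, without its classification head (step 1).
base_model = tf.keras.applications.VGG16(weights='imagenet',
                                         include_top=False,
                                         input_shape=(224, 224, 3))

# Freeze the pre-trained layers so their learned features are retained (step 3).
base_model.trainable = False

# Replace the classifier with one sized for the new dataset (step 2; num_classes is assumed).
num_classes = 5
model = tf.keras.Sequential([
    base_model,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(num_classes, activation='softmax')
])

# Train only the new classifier on the new data (step 4);
# train_ds is a hypothetical tf.data.Dataset yielding images and integer labels.
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
# model.fit(train_ds, epochs=5)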
3. Fine-Tuning: Fine-tuning is a specific type of transfer learning where you not only replace
the output layer but also allow some (or all) of the pre-trained model's layers to be updated
during training.
In typical transfer learning:
• The pre-trained model's layers are frozen (i.e., their weights are not updated).
• Only the newly added classifier (top layer) is trained on the new dataset.
In fine-tuning:

• Some of the pre-trained layers (usually higher layers closer to the output) are
unfrozen, meaning they can learn and adjust weights based on the new dataset.
• The learning rate during fine-tuning is usually lower to ensure the changes in the
weights are small and don’t drastically affect the model’s pre-learned features.
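
As a rough sketch (assuming the base_model and model objects from the example above, plus the same hypothetical train_ds dataset), fine-tuning typically unfreezes the top of the base network and recompiles with a smaller learning rate:

# Unfreeze the base model, then re-freeze all but its top few layers.
# Keeping only the last 20 layers trainable is an arbitrary illustrative choice.
base_model.trainable = True
for layer in base_model.layers[:-20]:
    layer.trainable = False

# Recompile with a much lower learning rate so the pre-trained weights
# change only slightly during fine-tuning.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
# model.fit(train_ds, epochs=5)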
4. Types of Transfer Learning: There are two primary types of transfer learning in deep
learning:
a. Feature Extraction (Frozen Layers):
• Here, the pre-trained model is used as a feature extractor. The layers are frozen, and
the output layer is modified to suit the new dataset. Only the new classifier is trained
on the new data.
• This method works well when the dataset is small and the new task is similar to the
one the model was pre-trained on.
b. Fine-Tuning (Unfreezing Layers):
• Fine-tuning involves unfreezing some of the layers of the pre-trained model, allowing
them to learn from the new dataset.
• Fine-tuning is typically used when the dataset is larger, or the new task is somewhat
different from the original task.
• It allows the model to adapt the learned representations to the specifics of the new
task.
5. Common Pre-trained Networks:
Several CNN architectures are widely used in transfer learning and fine-tuning. These
networks are typically trained on large datasets like ImageNet, which contains over 1 million
images and 1000 classes.
a. VGG (Visual Geometry Group) Networks:
• VGG16 and VGG19 are among the most popular CNN architectures, with 16 and 19
weight layers, respectively.
• They use a very simple and uniform architecture, consisting of stacked 3×3
convolutional layers followed by max-pooling.
• Advantages: VGG networks are straightforward, with consistent layer structures.
• Disadvantages: They are computationally expensive, requiring a lot of memory and
processing power.
b. ResNet (Residual Networks):
• ResNet solves the vanishing gradient problem by introducing residual connections,
which allow gradients to flow more easily through the network.
• ResNet models come in various sizes, such as ResNet-50, ResNet-101, etc., indicating
the depth of the model.

• Advantages: Allows for much deeper networks while maintaining good performance.
It generalizes well and achieves high accuracy.
• Disadvantages: ResNet models are larger and require more computational resources
than simpler architectures.
c. Inception (GoogLeNet):
• InceptionNet uses different-sized convolutional filters in the same layer, allowing the
network to capture multi-scale features in the input.
• Inception models (e.g., Inception-V3) are computationally efficient, using fewer
parameters.
• Advantages: High accuracy with reduced computational requirements.
• Disadvantages: The architecture is more complex to implement and fine-tune.
d. MobileNet:
• MobileNet is designed for mobile and embedded applications where computational
resources are limited.
• It uses depthwise separable convolutions, significantly reducing the number of
parameters and computations required.
• Advantages: Low computational cost, small size, and optimized for real-time
applications.
• Disadvantages: Slightly lower accuracy compared to larger models like ResNet or
Inception.
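
All of the architectures above ship with tf.keras.applications and are loaded in the same way. The short sketch below is only an illustration; the input sizes shown are the common defaults, and the ImageNet weights are downloaded on first use.

import tensorflow as tf

vgg16     = tf.keras.applications.VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
resnet50  = tf.keras.applications.ResNet50(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
inception = tf.keras.applications.InceptionV3(weights='imagenet', include_top=False, input_shape=(299, 299, 3))
mobilenet = tf.keras.applications.MobileNetV2(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

# Rough size comparison: larger parameter counts mean more memory and compute.
for name, net in [('VGG16', vgg16), ('ResNet50', resnet50),
                  ('InceptionV3', inception), ('MobileNetV2', mobilenet)]:
    print(name, net.count_params())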
6. Advantages of Transfer Learning:
• Reduced Training Time: Since the base layers are already pre-trained, training time
is significantly reduced.
• Better Performance with Less Data: Transfer learning allows better generalization,
even with smaller datasets.
• Overcoming the Need for Large Datasets: Deep learning models require vast
amounts of labeled data for training, which may not always be available. Transfer
learning allows you to apply models that were trained on large datasets to your own
smaller datasets.
7. Drawbacks of Transfer Learning:
• Domain Specificity: If the task or domain of the new dataset is very different from
the original dataset (e.g., medical images vs. natural images), transfer learning may
not work well.
• Large Models: Many pre-trained networks are quite large, requiring more
computational resources, even if you're only using them as a feature extractor.

• Overfitting: Fine-tuning a large pre-trained model on a small dataset can lead to
overfitting, because the model has many parameters relative to the amount of new data.
8. Applications of Transfer Learning:
a. Image Classification: Transfer learning is frequently used in image classification tasks
where models like VGG, ResNet, and Inception have shown excellent performance.
b. Object Detection and Segmentation: Pre-trained models can be fine-tuned for tasks like
detecting objects in an image or segmenting images into different classes.
c. Natural Language Processing (NLP): Transfer learning is also widely used in NLP tasks
where models like BERT and GPT-3 are pre-trained on vast corpora of text and fine-tuned for
specific language-related tasks.
d. Medical Imaging: Transfer learning has shown promise in medical imaging tasks like
disease diagnosis, where data is scarce but critical.

VIVA QUESTIONS
1. What is transfer learning?
o Transfer learning refers to using a model trained on one task as the starting
point for a model on another related task.
2. Why do we use transfer learning in CNNs?
o It is used to leverage the learned features from large datasets to reduce training
time and improve performance on tasks with smaller datasets.
3. What is fine-tuning in CNNs?
o Fine-tuning is the process of retraining some or all layers of a pre-trained
network on a new dataset to adapt it to the new task.
4. What is the difference between transfer learning and fine-tuning?
o In basic transfer learning (feature extraction), the pre-trained layers stay frozen
and only the newly added classifier is trained. In fine-tuning, some or all of the
pre-trained weights are also updated on the new dataset.
5. Give an example of a pre-trained CNN model.
o Examples include VGG16, ResNet, InceptionNet.

PROGRAM-6
AIM - Implementation of Transfer Learning using a Pre-trained Model (MobileNetV2) for
Image Classification in Python

THEORY-
Introduction to Transfer Learning:
Transfer learning is a machine learning technique where a model trained on one task is
repurposed to solve another related task. In deep learning, especially with Convolutional
Neural Networks (CNNs), transfer learning is commonly used to leverage models pre-trained
on large datasets such as ImageNet for smaller and more specific tasks. This saves time and
computational resources and can often result in higher accuracy than training a model from
scratch, particularly when the new task has limited data.
MobileNetV2:
MobileNetV2 is a lightweight deep neural network architecture designed by Google for
mobile and embedded vision applications. It is optimized to perform well even on devices
with limited computational power, such as smartphones. The architecture is based on
depthwise separable convolutions, which significantly reduce the number of parameters and
computational cost while retaining model accuracy.
MobileNetV2 is particularly suitable for transfer learning and fine-tuning because it balances
performance and efficiency, making it an excellent choice for real-time applications or
resource-constrained environments.

MobileNetV2 Architecture:
MobileNetV2 builds on its predecessor, MobileNetV1, and introduces several key
improvements:
• Depthwise Separable Convolutions: This technique separates convolution into two
operations:
o Depthwise convolution: Applies a single filter to each input channel.
o Pointwise convolution: Applies a 1×1 convolution to combine the outputs of
the depthwise convolution. Together, these two steps reduce the number of
computations and parameters, making the model lighter (see the comparison
sketch after this list).

• Inverted Residuals and Linear Bottlenecks: In traditional residual networks (like
ResNet), the residual blocks allow the input to bypass layers and directly connect to
the output. In MobileNetV2, these residual connections are "inverted" and involve a
linear bottleneck layer. This helps in preserving the representational power while
reducing complexity.
• Efficient Layer Design: The overall design focuses on making the network smaller
and faster, allowing it to be deployed on edge devices while maintaining reasonable
accuracy for a wide range of tasks.
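
The saving from depthwise separable convolutions can be checked directly in Keras. The sketch below compares a standard 3×3 convolution with its depthwise-separable equivalent; the spatial size and channel counts are arbitrary illustrative values.

import tensorflow as tf

inputs = tf.keras.Input(shape=(112, 112, 32))

# Standard 3x3 convolution mapping 32 channels to 64 channels.
standard = tf.keras.layers.Conv2D(64, 3, padding='same')(inputs)

# Depthwise separable equivalent: a 3x3 depthwise convolution per input channel,
# followed by a 1x1 pointwise convolution that mixes channels.
depthwise = tf.keras.layers.DepthwiseConv2D(3, padding='same')(inputs)
separable = tf.keras.layers.Conv2D(64, 1, padding='same')(depthwise)

# Compare parameter counts of the two paths.
print(tf.keras.Model(inputs, standard).count_params())   # 3*3*32*64 + 64 = 18,496
print(tf.keras.Model(inputs, separable).count_params())  # (3*3*32 + 32) + (32*64 + 64) = 2,432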
Transfer Learning Using MobileNetV2:
When implementing transfer learning with MobileNetV2 for image classification, the steps
typically involve:
1. Loading the Pre-trained Model: MobileNetV2, pre-trained on ImageNet (a dataset
with over 1 million images across 1,000 classes), is loaded without the final
classification layer.
2. Freezing Base Layers: The base layers of the model (those trained on ImageNet) are
frozen to preserve the learned features (such as edges, textures, shapes).
3. Adding a Custom Classifier: A new classification head is added, typically consisting
of a fully connected layer (Dense layer) followed by a softmax activation function.
This classifier corresponds to the number of categories in the new task.
4. Training the New Classifier: Only the added classifier layers are trained on the new
dataset, while the pre-trained layers remain frozen.
5. Fine-Tuning (Optional): In some cases, after training the classifier, some of the top
layers of the base model can be unfrozen and the model trained further with a lower
learning rate. This fine-tuning helps adapt the model to the new task, improving
performance (a sketch of this step follows the code below).

CODE AND OUTPUT-


import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras.callbacks import EarlyStopping

# Data generators with basic augmentation for the training set.
# Note: MobileNetV2's ImageNet weights were trained on inputs scaled to [-1, 1]
# (tf.keras.applications.mobilenet_v2.preprocess_input); rescale=1./255 gives [0, 1],
# which still trains but is not the canonical preprocessing.
train_data_generator = ImageDataGenerator(rescale=1./255, shear_range=0.2,
                                          zoom_range=0.2, horizontal_flip=True)
training_set = train_data_generator.flow_from_directory(r"output\train",
                                                        target_size=(224, 224),
                                                        batch_size=32,
                                                        class_mode='sparse')  # integer labels to match the sparse loss below

test_data_generator = ImageDataGenerator(rescale=1./255)
testing_set = test_data_generator.flow_from_directory(r"output\val",
                                                      target_size=(224, 224),
                                                      batch_size=32,
                                                      class_mode='sparse')

# Load MobileNetV2 pre-trained on ImageNet, without its classification head,
# and freeze it so only the new classifier is trained.
base_model = tf.keras.applications.MobileNetV2(input_shape=(224, 224, 3),
                                               include_top=False,
                                               weights='imagenet')
base_model.trainable = False

model = tf.keras.Sequential([
    base_model,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation='softmax')  # adjust the output units to match the number of classes
])

model.compile(optimizer=tf.keras.optimizers.Adam(),
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# early_stopping = EarlyStopping(patience=3)

model.fit(training_set, epochs=100, validation_data=testing_set)
# To enable early stopping: model.fit(..., callbacks=[early_stopping])
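
Step 5 of the theory above (optional fine-tuning) is not part of the program's code. Below is a minimal sketch of how it could follow the training run, assuming the base_model and model objects defined above; unfreezing from layer 100 onward and the epoch count are arbitrary illustrative choices.

# Unfreeze the top of MobileNetV2 while keeping the earlier layers frozen.
base_model.trainable = True
for layer in base_model.layers[:100]:
    layer.trainable = False

# Recompile with a lower learning rate so the pre-trained weights shift only slightly.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

# Continue training for a few more epochs on the same data generators.
model.fit(training_set, epochs=5, validation_data=testing_set)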

VIVA QUESTIONS-
1. What is MobileNetV2?
• MobileNetV2 is a lightweight convolutional neural network designed for mobile
and edge devices, utilizing depthwise separable convolutions to reduce complexity
and size.
2. Why do we use MobileNetV2 for transfer learning?
• Due to its efficiency and low computational requirements, it is suitable for
resource-constrained environments like mobile and embedded devices.
3. What are depthwise separable convolutions?
• Depthwise separable convolutions divide a standard convolution into two simpler
operations: depthwise convolution and pointwise convolution, drastically reducing
the number of parameters.
4. What is the importance of freezing layers in transfer learning?
• Freezing layers ensures that the pre-trained weights are not modified during initial
training, preserving the learned features.
5. At which point should you unfreeze layers during transfer learning?
• After training the new classifier on top of the frozen base model, you may
unfreeze some layers and fine-tune the model to improve performance on the new
task.

Arnav Bhatia 02114811622