04 Transfer Learning with TensorFlow Part 1: Feature Extraction

This document discusses transfer learning with TensorFlow. It begins with an overview of transfer learning and some common use cases in computer vision and natural language processing. It then explains that transfer learning allows leveraging existing neural network architectures and learned patterns from similar data to build models that perform better than training from scratch with less data. The document outlines how transfer learning works by training earlier layers on a large dataset and then tuning the later layers on a new smaller dataset. It also introduces TensorFlow Hub as a place to find pre-trained models for transfer learning.


Transfer Learning with TensorFlow

Part 1: Feature extraction


Where can you get help?

“If in doubt, run the code”

• Follow along with the code
• Try it for yourself
• Press SHIFT + CMD + SPACE to read the docstring
• Search for it
• Try again
• Ask (don’t forget the Discord chat! Yes, including the “dumb” questions)
“What is transfer learning?”
Surely someone has spent the time crafting the right model for the job…
Example transfer learning use cases

Computer vision

Natural language processing (e.g. spam detection):

Not spam:
To: [email protected]
Hey Daniel,
This deep learning course is incredible! I can’t wait to use what I’ve learned!

Spam:
To: [email protected]
Hay daniel…
C0ongratu1ations! U win $1139239230

A model learns patterns/weights from a similar problem space, then those patterns get used/tuned for a specific problem.
“Why use transfer learning?”

• Can leverage an existing neural network architecture proven to work on problems similar to our own

• Can leverage a working network architecture which has already learned patterns on similar data to our own (often results in great results with less data)

Learn patterns in a wide variety of images (using ImageNet) → EfficientNet architecture (already works really well on computer vision tasks) → tune patterns/weights to our own problem (Food Vision) → model performs better than from scratch
What we’re going to cover (broadly)

• Introduce transfer learning with TensorFlow
• Use a small dataset to experiment faster (10% of training samples)
• Build a transfer learning feature extraction model with TensorFlow Hub
• Use TensorBoard to track modelling experiments and results

👩🍳 👩🔬 (we’ll be cooking up lots of code!)

How:
Let’s code!
What are callbacks?
• Callbacks are tools which add helpful functionality to your models during training, evaluation or inference

• Some popular callbacks include:

Callback name: TensorBoard
Code: tf.keras.callbacks.TensorBoard()
Use case: Log the performance of multiple models and then view and compare these models in a visual way on TensorBoard (a dashboard for inspecting neural network parameters). Helpful to compare the results of different models on your data.

Callback name: Model checkpointing
Code: tf.keras.callbacks.ModelCheckpoint()
Use case: Save your model as it trains so you can stop training if needed and come back to continue off where you left. Helpful if training takes a long time and can't be done in one sitting.

Callback name: Early stopping
Code: tf.keras.callbacks.EarlyStopping()
Use case: Leave your model training for an arbitrary amount of time and have it stop training automatically when it ceases to improve. Helpful when you've got a large dataset and don't know how long training will take.
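The three callbacks above can be sketched in code like so (the file paths, monitored metric and patience value here are illustrative choices, not prescribed by the slides):

```python
import tensorflow as tf

# TensorBoard: log model performance so runs can be compared visually
tensorboard_cb = tf.keras.callbacks.TensorBoard(log_dir="logs/experiment_1")

# ModelCheckpoint: save the model's weights as it trains
checkpoint_cb = tf.keras.callbacks.ModelCheckpoint(
    filepath="checkpoints/model.weights.h5",  # hypothetical path
    save_weights_only=True,
    save_best_only=True,   # keep only the best weights seen so far
    monitor="val_loss")

# EarlyStopping: stop automatically when the monitored metric stops improving
early_stopping_cb = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss",
    patience=3)            # wait 3 epochs of no improvement before stopping

# These would then be passed to model.fit(..., callbacks=callbacks)
callbacks = [tensorboard_cb, checkpoint_cb, early_stopping_cb]
```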
What is TensorFlow Hub?

• A place to find a plethora of pre-trained machine learning models (ready to be applied and fine-tuned for your own problems)

🤔 “Does my problem exist on TensorFlow Hub?”

https://tfhub.dev/tensorflow/efficientnet/b0/feature-vector/1

TensorFlow Hub makes using a pre-trained model as simple as calling a URL
ResNet50* feature extractor

Input data: 10 classes of Food101.
The pre-trained backbone stays the same (frozen, pre-trained on ImageNet); only the output layer changes (same shape as the number of classes: 10).

*Note: In the code, we’re actually using ResNet50, a slightly larger architecture than ResNet34.
Image source: https://arxiv.org/abs/1512.03385
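A minimal sketch of this kind of feature extractor using tf.keras.applications (here weights=None keeps the sketch offline-friendly; in practice you would load weights="imagenet" to get the frozen ImageNet-trained patterns the slide describes):

```python
import tensorflow as tf

NUM_CLASSES = 10  # 10 classes of Food101

# Pre-trained backbone without its original 1000-class top
base_model = tf.keras.applications.ResNet50(include_top=False,
                                            weights=None,  # normally "imagenet"
                                            input_shape=(224, 224, 3))
base_model.trainable = False  # freeze: patterns stay the same during training

# Attach a new trainable output layer, shaped to our number of classes
inputs = tf.keras.Input(shape=(224, 224, 3))
x = base_model(inputs, training=False)           # run backbone in inference mode
x = tf.keras.layers.GlobalAveragePooling2D()(x)  # condense feature maps to a vector
outputs = tf.keras.layers.Dense(NUM_CLASSES, activation="softmax")(x)
model = tf.keras.Model(inputs, outputs)
```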
Original Model vs. Feature Extraction

Original model: Input Layer → Layer 2 → … → Layer 234 → Layer 235 → Output Layer (shape = 1000). A working architecture (e.g. EfficientNet) where every layer changes during training on a large dataset (e.g. ImageNet).

Feature extraction transfer learning model: the same layers, but the original model’s layers stay the same (frozen, they don’t update during training); only the new output layer(s) (shape = 10) get trained on a different dataset (e.g. 10 classes of food).
Kinds of Transfer Learning

Original model: every layer changes during training on a large dataset (e.g. ImageNet); Output Layer (shape = 1000).

Feature extraction: all of the original layers stay the same (frozen); only the new output layer (shape = 10) gets trained on the new data.

Fine-tuning: the lower layers stay the same (frozen), while the top layers (e.g. Layer 234, Layer 235) are unfrozen and might change, along with the new output layer (shape = 10), when trained on the different dataset (e.g. 10 classes of food).

Fine-tuning usually requires more data than feature extraction.
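Freezing and unfreezing layers can be sketched in code like this (EfficientNetB0, weights=None and the number of unfrozen layers are illustrative choices; in practice you would load weights="imagenet"):

```python
import tensorflow as tf

# A backbone as used in the feature-extraction step
base_model = tf.keras.applications.EfficientNetB0(include_top=False,
                                                  weights=None)  # normally "imagenet"
base_model.trainable = False  # feature extraction: everything frozen

# Fine-tuning: unfreeze only the top N layers so they can adapt to the new data
N_UNFROZEN = 10  # illustrative value
base_model.trainable = True
for layer in base_model.layers[:-N_UNFROZEN]:
    layer.trainable = False  # all layers below the top N stay frozen

trainable_layers = [l for l in base_model.layers if l.trainable]
```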
Kinds of Transfer Learning

Type: Original model (“As is”)
Description: Take a pretrained model as it is and apply it to your task without any changes.
What happens: The original model remains unchanged.
When to use: Helpful if you have the exact same kind of data the original model was trained on.

Type: Feature extraction
Description: Take the underlying patterns (also called weights) a pretrained model has learned and adjust its outputs to be more suited to your problem.
What happens: Most of the layers in the original model remain frozen during training (only the top 1-3 layers get updated).
When to use: Helpful if you have a small amount of custom data (similar to what the original model was trained on) and want to utilise a pretrained model to get better results on your specific problem.

Type: Fine-tuning
Description: Take the weights of a pretrained model and adjust (fine-tune) them to your own problem.
What happens: Some, many or all of the layers in the pretrained model are updated during training.
When to use: Helpful if you have a large amount of custom data and want to utilise a pretrained model and improve its underlying patterns to your specific problem.
What is TensorBoard?

• A way to visually explore your machine learning model’s performance and internals

• Host, track and share your machine learning experiments on TensorBoard.dev

(TensorBoard also integrates with websites like Weights & Biases)

Example: comparing the results of two different model architectures (ResNet50V2 & EfficientNetB0) on the same dataset.

Source: https://tensorboard.dev/experiment/73taSKxXQeGPQsNBcVvY3g/#scalars
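To keep experiments comparable on TensorBoard, a common pattern (the helper name and timestamp format here are illustrative) is to log each run to its own timestamped directory:

```python
import datetime
import tensorflow as tf

def create_tensorboard_callback(dir_name, experiment_name):
    """Create a TensorBoard callback that logs to a timestamped directory,
    so each experiment gets its own logs to view and compare later."""
    log_dir = dir_name + "/" + experiment_name + "/" + \
              datetime.datetime.now().strftime("%Y%m%d-%H%M%S")
    tensorboard_callback = tf.keras.callbacks.TensorBoard(log_dir=log_dir)
    print(f"Saving TensorBoard log files to: {log_dir}")
    return tensorboard_callback
```

The returned callback is then passed to model.fit(..., callbacks=[...]) for each experiment you want to track.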
🍔👁 Food Vision: Dataset(s) we’re using
Note: For randomly selected data, the Food101 dataset was downloaded and modified using the Image Data Modification Notebook.

Dataset name: pizza_steak
Source: Food101 | Classes: pizza, steak (2)
Training data: 750 images of pizza and steak (same as original Food101 dataset)
Testing data: 250 images of pizza and steak (same as original Food101 dataset)

Dataset name: 10_food_classes_1_percent
Source: Food101 | Classes: chicken curry, chicken wings, fried rice, grilled salmon, hamburger, ice cream, pizza, ramen, steak, sushi (10)
Training data: 7 randomly selected images of each class (1% of original training data)
Testing data: 250 images of each class (same as original Food101 dataset)

Dataset name: 10_food_classes_10_percent
Source and classes: same as above
Training data: 75 randomly selected images of each class (10% of original training data)
Testing data: same as above

Dataset name: 10_food_classes_100_percent
Source and classes: same as above
Training data: 750 images of each class (100% of original training data)
Testing data: same as above

Dataset name: 101_food_classes_10_percent
Source: Food101 | Classes: all classes from Food101 (101)
Training data: 75 images of each class (10% of original Food101 training dataset)
Testing data: 250 images of each class (same as original Food101 dataset)
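The per-class subset sizes in the table follow directly from Food101’s 750 training images per class; a quick sketch of the arithmetic (the helper function is purely illustrative):

```python
# Food101 has 750 training images per class; each subset keeps a percentage
FULL_TRAIN_PER_CLASS = 750

def subset_size(percent, full=FULL_TRAIN_PER_CLASS):
    """Number of training images per class for a given percentage subset."""
    return int(full * percent / 100)

one_percent = subset_size(1)      # 7 images per class (10_food_classes_1_percent)
ten_percent = subset_size(10)     # 75 images per class (10_food_classes_10_percent)
full_set = subset_size(100)       # 750 images per class (10_food_classes_100_percent)
```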
Useful computer vision architectures

• tf.keras.applications and keras.applications have many of the most popular and best performing computer vision architectures built-in & pre-trained, ready to use for your own problems

Source: https://keras.io/api/applications/
Source: https://www.tensorflow.org/api_docs/python/tf/keras/applications

Improving a model (from a model’s perspective)

Common ways to improve a deep model (going from a smaller model to a larger model):

• Add layers
• Increase the number of hidden units
• Change the activation functions
• Change the optimization function
• Change the learning rate
• Fit on more data
• Fit for longer

(Because you can alter each of these, they’re hyperparameters.)
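A few of those knobs expressed in code (the input shape, layer sizes and learning rate are illustrative values, not recommendations):

```python
import tensorflow as tf

# A slightly "larger" model: an extra hidden layer and more hidden units
model = tf.keras.Sequential([
    tf.keras.Input(shape=(784,)),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(64, activation="relu"),   # "add layers"
    tf.keras.layers.Dense(10, activation="softmax")  # "change the activation functions"
])

# "Change the optimization function" and "change the learning rate":
# pass a custom learning rate to the optimizer when compiling
model.compile(loss="sparse_categorical_crossentropy",
              optimizer=tf.keras.optimizers.Adam(learning_rate=0.0001),
              metrics=["accuracy"])
```

Fitting on more data and fitting for longer happen at model.fit() time (e.g. by raising the epochs parameter).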
