This unit provides an overview of Transfer Learning (TL): its definition, advantages, and strategies for applying it in machine learning. It discusses the main categories of TL methods (inductive, unsupervised, and transductive transfer) and the approaches used to transfer knowledge, along with methodologies such as feature extraction, fine-tuning, and the use of pretrained models, and applications in text, computer vision, and speech recognition.
UNIT-I
Transfer Learning Fundamentals
1. Introduction to Transfer Learning (TL)
• Traditional ML trains every model in isolation, based on a specific domain, dataset, and task.
• TL is a method of reusing a model, or the knowledge it has learned, for another related task.
• Definition: a situation where what has been learned in one setting is exploited to improve generalization in another setting.
• Ex: Task T1: identifying objects in images within a restaurant domain. Task T2: identifying objects in images from a park or café.
• TL enables us to utilize knowledge from previously learned tasks and apply it to newer, related ones.
• If we have more data for task T1, we can utilize what was learned there and generalize it for task T2.
• In image classification, certain low-level features, such as edges, shapes, and lighting, can be shared across tasks.

Advantages of TL
• Improved baseline performance: when we augment the knowledge of an isolated learner with knowledge from a source model, the baseline performance may improve due to this knowledge transfer.
• Reduced model development time: development time is lower when the source model helps in learning the target task, compared to a target model that learns from scratch.
• Improved final performance: a higher final performance can be attained by leveraging TL.
Note that any one or more of these gains may be realized; they are typically illustrated as better baseline performance (higher start), efficiency gains (higher slope), and better final performance (higher asymptote).

2. Transfer Learning Strategies
• A domain, D, is defined as a two-element tuple consisting of a feature space, X, and a marginal probability distribution, P(X), where X = {x1, x2, ..., xn} is a set of sample data points. Thus D = {X, P(X)}.
• A task, T, is defined as a two-element tuple consisting of a label space, Y, and an objective function, f. From a probabilistic point of view, the objective function can be denoted as P(Y|X). Thus T = {Y, P(Y|X)}.
• Using this framework, TL can be defined as a process aimed at improving the target objective function, f_t (or target task, T_t), in the target domain, D_t, using knowledge from the source task, T_s, in the source domain, D_s. This leads to the following scenarios:
• Feature space: the feature spaces of the source and target domains are different from each other, such as Xs != Xt. For instance, if our tasks are related to document classification, this scenario refers to source and target tasks in different languages.
• Marginal probability: the marginal probabilities of the source and target domains are different from each other, such as P(Xs) != P(Xt). This setting is also known as domain adaptation.
• Label space: the label spaces of the source and target domains are different from each other, such as Ys != Yt.
• Conditional probability: the conditional probabilities of the source and target domains are different from each other, such as P(Ys|Xs) != P(Yt|Xt).
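The definitions above can be restated compactly in LaTeX notation; this only summarizes the text, with the subscripts s and t denoting source and target.

```latex
% Domain: feature space and marginal distribution over it
\mathcal{D} = \{\mathcal{X}, P(X)\}, \qquad X = \{x_1, x_2, \dots, x_n\}

% Task: label space and objective (predictive) function
\mathcal{T} = \{\mathcal{Y}, f(\cdot)\}, \qquad f(\cdot) \approx P(Y \mid X)

% Transfer learning: improve the target function f_t of (\mathcal{D}_t, \mathcal{T}_t)
% using knowledge from (\mathcal{D}_s, \mathcal{T}_s), where at least one of the
% following mismatches holds:
\mathcal{X}_s \neq \mathcal{X}_t, \quad
P(X_s) \neq P(X_t), \quad
\mathcal{Y}_s \neq \mathcal{Y}_t, \quad
P(Y_s \mid X_s) \neq P(Y_t \mid X_t)
```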
2. Transfer Learning Strategies – Key Questions
During transfer learning, the following three important questions must be answered:
• What to transfer:
• The first important step.
• Seek to identify which part of the knowledge can be transferred from the source to the target in order to improve the performance of the target task.
• Try to identify which part of the knowledge is source-specific and what is common between the source and target.
• When to transfer:
• Transferring knowledge just for the sake of it may make matters worse rather than better (negative transfer).
• Aim at utilizing transfer learning to improve target task performance, not to degrade it.
• Be careful about when to transfer and when not to.
• How to transfer:
• Identify ways of actually transferring the knowledge across domains/tasks.
• This involves changes to existing algorithms and different techniques.

2. Transfer Learning Strategies – Transfer Categories
TL methods can be categorized based on the type of traditional ML setting involved:
Inductive transfer:
▪ The source and target domains are the same, yet the source and target tasks are different from each other.
▪ The algorithms try to utilize the inductive biases of the source domain to help improve the target task.
▪ Depending on whether the source domain contains labeled data or not, this can be further divided into two subcategories: multitask learning and self-taught learning, respectively.
Unsupervised transfer:
▪ Similar to inductive transfer, with a focus on unsupervised tasks in the target domain.
▪ The source and target domains are similar, but the tasks are different. In this scenario, labeled data is unavailable in either of the domains.
Transductive transfer:
▪ There are similarities between the source and target tasks, but the corresponding domains are different.
▪ The source domain has a lot of labeled data while the target domain has none.
▪ Further classified into subcategories, where either the feature spaces or the marginal probabilities are different.

2. Transfer Learning Strategies – Transfer Categories – What to Transfer – Approaches
• Instance transfer: certain instances from the source domain can be reused along with the target data to improve results.
• Feature-representation transfer: aims to minimize domain divergence and reduce error rates by identifying good feature representations that can be utilized from the source to the target domain. Either supervised or unsupervised methods may be applied for feature-representation-based transfer.
• Parameter transfer: works on the assumption that models for related tasks share some parameters or a prior distribution over hyperparameters. We may apply additional weight to the loss of the target domain to improve overall performance.
• Relational-knowledge transfer: unlike the other three approaches, this method attempts to handle non-IID data, that is, data that is not independent and identically distributed, where each data point has a relationship with other data points. Social network data is a typical use case for relational-knowledge transfer techniques.

Transfer Learning and Deep Learning
• Inductive learning:
• The objective of inductive learning algorithms is to infer a mapping from a set of training examples. In the case of classification, the model learns a mapping between input features and class labels.
• To generalize well on unseen data, the algorithm works with a set of assumptions about the distribution of the training data; this set of assumptions is known as the inductive bias.
• The inductive bias can be characterized by multiple factors, such as the hypothesis space the model is restricted to and the search process through that hypothesis space. These biases therefore determine how, and what, the model learns on the given task and domain.
• Inductive transfer:
• Utilizes the inductive biases of the source task to assist the target task. This can be done in different ways, such as adjusting the inductive bias of the target task by limiting the model space, narrowing down the hypothesis space, or making adjustments to the search process itself with the help of knowledge from the source task.

3. Transfer Learning Methodologies
• Training time and the amount of data required for deep learning systems are orders of magnitude greater than for traditional ML systems, which makes reusing learned knowledge especially attractive.
• Domains: Computer Vision, Natural Language Processing.
Transfer Learning Methodologies – Feature Extraction
• DL architectures are layered architectures that learn different features at different layers; these layers are finally connected to a last layer to produce the final output.
• This layered architecture allows us to utilize a pretrained network (such as Inception V3 or VGG) without its final layer as a fixed feature extractor for other tasks.
• For example, if we use AlexNet without its final classification layer, it transforms images from a new domain task into a 4096-dimensional vector, enabling us to extract features for the new domain while utilizing the knowledge from the source-domain task.
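The following is a minimal sketch of this feature-extraction idea, assuming TensorFlow/Keras with its bundled VGG16 ImageNet weights; the array X_new and the 8-image batch are hypothetical placeholders for target-domain data.

```python
# Minimal sketch: a pretrained network without its classification head used as a
# fixed feature extractor. Assumes TensorFlow/Keras; X_new is hypothetical data.
import numpy as np
from tensorflow.keras.applications import VGG16
from tensorflow.keras.applications.vgg16 import preprocess_input

# Load the pretrained network without its final classification layers;
# pooling="avg" turns the last convolutional feature map into a single vector.
extractor = VGG16(weights="imagenet", include_top=False, pooling="avg")

# Hypothetical images from the target domain (e.g., café/park photos),
# already resized to the 224x224 input expected by VGG16.
X_new = np.random.rand(8, 224, 224, 3) * 255.0

# Extract fixed features; each image becomes a 512-dimensional vector for VGG16.
features = extractor.predict(preprocess_input(X_new))
print(features.shape)  # (8, 512)

# These vectors can now be fed to any simple classifier (e.g., logistic
# regression) trained only on the small target-domain dataset.
```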
Transfer Learning Methodologies – Fine Tuning
• Here we do not just replace the final layer; we also selectively retrain some of the previous layers.
• Deep neural networks are highly configurable architectures with various hyperparameters.
• Usually, the initial layers capture generic features while the later ones focus on the specific task at hand.
• Using this insight, we may freeze certain layers while training (fix their weights) and fine-tune the rest to suit our needs.
• This helps us achieve better performance with less training time. (A minimal sketch combining a pretrained model with fine-tuning appears after the Applications list below.)

Transfer Learning Methodologies – Pretrained Models
• Various deep learning networks with state-of-the-art performance have been developed and tested across domains such as computer vision and NLP.
• One of the fundamental requirements for TL is the presence of models that perform well on source tasks.
• In most cases, the details of these networks are shared for others to use; these pretrained networks/models form the basis of transfer learning.
• Pretrained models are usually shared in the form of the millions of parameters/weights the model reached when trained to a stable state.
• Common sources for downloading pretrained models: Keras, TensorFlow, Berkeley's Model Zoo.
• Examples of available pretrained networks: Xception, VGG16, InceptionV3.

Applications
• Transfer learning with text data
• Text is transformed or vectorized using different techniques.
• Embeddings such as word2vec have been trained on large source datasets and are then used in target tasks such as sentiment analysis and document classification by transferring the knowledge from the source task.
• Transfer learning with computer vision
• Used in various computer vision tasks, such as object identification, with different CNN architectures.
• Lower layers act as conventional computer vision feature extractors, such as edge detectors, while the final layers work toward task-specific features.
• This has helped in utilizing state-of-the-art models such as VGG, AlexNet, and Inception for target tasks such as style transfer and face detection, which are different from what these models were originally trained for.
• Transfer learning with speech/audio
• Automatic Speech Recognition (ASR) models developed for English have been successfully used to improve speech recognition performance for other languages, such as German.
• Automated speaker identification is another example.
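As referenced in the fine-tuning subsection above, here is a minimal sketch of loading a pretrained model and fine-tuning it on a new task, assuming TensorFlow/Keras; the number of target classes (5), the choice of which layers to freeze, and the training data are all hypothetical.

```python
# Minimal sketch of fine-tuning a pretrained network. Assumes TensorFlow/Keras;
# the target task (5 classes), the freeze point, and the data are hypothetical.
import tensorflow as tf
from tensorflow.keras.applications import InceptionV3
from tensorflow.keras import layers, models

NUM_TARGET_CLASSES = 5  # hypothetical target label space

# Load the pretrained source model without its final classification layer.
base = InceptionV3(weights="imagenet", include_top=False, pooling="avg")

# Freeze the early layers (generic features: edges, shapes, lighting) and leave
# the last few layers trainable so they can adapt to the target task.
for layer in base.layers[:-30]:
    layer.trainable = False

# Attach a new classification head for the target task.
model = models.Sequential([
    base,
    layers.Dropout(0.3),
    layers.Dense(NUM_TARGET_CLASSES, activation="softmax"),
])

# A small learning rate is typical when fine-tuning, to avoid destroying
# the transferred weights.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# model.fit(target_images, target_labels, epochs=5)  # hypothetical target data
```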
4. Types of Deep Transfer Learning
• Domain adaptation
• Usually refers to scenarios where the marginal probabilities of the source and target domains are different, such as P(Xs) != P(Xt).
• There is an inherent shift or drift in the data distribution of the source and target domains that requires tweaks to transfer the learning.
• For example, a corpus of movie reviews labeled as positive or negative would be different from a corpus of product-review sentiments; a classifier trained on movie-review sentiment would therefore see a different distribution if utilized to classify product reviews.
• Domain adaptation techniques are used in these scenarios.
• Domain confusion
• Different layers in a deep learning network capture different sets of features.
• We can utilize this fact to learn domain-invariant features and improve their transferability across domains.
• Instead of allowing the model to learn any representation, we nudge the representations of both domains to be as similar as possible.
• This can be achieved by applying certain preprocessing steps directly to the representations themselves.
• The basic idea behind this technique is to add another objective to the source model that encourages similarity by confusing the model about which domain an input came from.
• Multitask learning
• Slightly different from TL: several tasks are learned simultaneously, without distinction between source and target. The learner receives information about multiple tasks at once, whereas in TL the learner initially has no idea about the target task. (A minimal two-head sketch appears at the end of this unit.)
• One-shot learning
• DL systems are data hungry, needing many training examples to learn their weights.
• One-shot learning instead infers the required output based on just one or a few training examples.
• Helpful for real-world scenarios where it is not possible to have labeled data for every possible class, and where new classes are added often.
• Zero-shot learning
• Relies on no labeled examples of the target classes to learn a task.
• These methods make clever adjustments during the training stage itself to exploit additional information in order to understand unseen data.
• Used, for example, in machine translation.

5. Challenges of Transfer Learning
• Negative transfer
• Symptoms: a drop in performance, or no improvement at all.
• Reason: the source task is not sufficiently related to the target task.
• Possible remedies: Bayesian approaches, clustering-based solutions.
• Transfer bounds
• Quantifying the quality of the transfer and its viability.
• Approaches: Kolmogorov complexity, graph-based approaches.
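As referenced in the multitask learning item above, the following is a minimal sketch of learning two related tasks simultaneously with a shared trunk, assuming TensorFlow/Keras; the input size (32 features), the two hypothetical tasks (a 3-class and a binary classification), and the random data are placeholders.

```python
# Minimal sketch of multitask learning: one shared trunk, two task-specific heads,
# trained simultaneously. Assumes TensorFlow/Keras; tasks and data are hypothetical.
import numpy as np
from tensorflow.keras import layers, Model

inputs = layers.Input(shape=(32,))

# Shared representation learned jointly by both tasks.
shared = layers.Dense(64, activation="relu")(inputs)
shared = layers.Dense(64, activation="relu")(shared)

# Task-specific output heads.
task_a = layers.Dense(3, activation="softmax", name="task_a")(shared)
task_b = layers.Dense(1, activation="sigmoid", name="task_b")(shared)

model = Model(inputs=inputs, outputs=[task_a, task_b])
model.compile(
    optimizer="adam",
    loss={"task_a": "sparse_categorical_crossentropy",
          "task_b": "binary_crossentropy"},
)

# Hypothetical data: both label sets describe the same 100 input examples.
X = np.random.rand(100, 32)
y_a = np.random.randint(0, 3, size=(100,))
y_b = np.random.randint(0, 2, size=(100,)).astype("float32")

model.fit(X, {"task_a": y_a, "task_b": y_b}, epochs=2, verbose=0)
```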