
Multi-Task Learning (MTL) for Deep Learning
Dr G Manikandan
• Multi-Task Learning (MTL) is a type of machine learning technique where a
model is trained to perform multiple tasks simultaneously.

• In deep learning, MTL refers to training a neural network to perform multiple tasks by sharing some of the network’s layers and parameters across tasks.
• In MTL, the goal is to improve the generalization performance of the model by
leveraging the information shared across tasks.

• By sharing some of the network’s parameters, the model can learn a more efficient and compact representation of the data, which can be beneficial when the tasks are related or have some commonalities.

• There are different ways to implement MTL in deep learning, but the most common approach is to use a shared feature extractor and multiple task-specific heads.

• The shared feature extractor is a part of the network that is shared across tasks
and is used to extract features from the input data.

• The task-specific heads are used to make predictions for each task and are typically connected to the shared feature extractor, as in the sketch below.
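Below is a minimal sketch of this shared-extractor-plus-heads design in PyTorch. The class name, layer sizes, and the two hypothetical classification tasks are illustrative assumptions, not a prescribed implementation:

```python
import torch
import torch.nn as nn

class SharedTrunkMTL(nn.Module):
    """Hard parameter sharing: one shared feature extractor, two task heads."""
    def __init__(self, in_dim=64, hidden=128, n_classes_a=10, n_classes_b=5):
        super().__init__()
        # Shared feature extractor: its parameters are reused by every task.
        self.trunk = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # Task-specific heads: each one makes predictions for a single task.
        self.head_a = nn.Linear(hidden, n_classes_a)  # hypothetical task A
        self.head_b = nn.Linear(hidden, n_classes_b)  # hypothetical task B

    def forward(self, x):
        features = self.trunk(x)                # shared representation
        return self.head_a(features), self.head_b(features)

model = SharedTrunkMTL()
logits_a, logits_b = model(torch.randn(8, 64))  # dummy batch of 8 inputs
print(logits_a.shape, logits_b.shape)           # -> [8, 10] and [8, 5]
```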


• Another approach is to use a shared decision-making layer, where the decision-making layer is shared across tasks, and the task-specific layers are connected to the shared decision-making layer.

• MTL can be useful in many applications such as natural language processing, computer vision, and healthcare, where multiple tasks are related or have some commonalities.

• MTL is also useful when the data is limited: it can help to improve the generalization performance of the model by leveraging the information shared across tasks.

• However, MTL also has its own limitations; for example, when the tasks are very different, sharing parameters can hurt performance.

• Multi-Task Learning is a sub-field of Deep Learning. It is recommended that you familiarize yourself with the concepts of neural networks to understand what multi-task learning means.

• What is Multi-Task Learning? Multi-Task Learning is a sub-field of Machine Learning that aims to solve multiple different tasks at the same time, by taking advantage of the similarities between different tasks.


• This can improve the learning efficiency and also act as a regularizer which we
will discuss in a while. Formally, if there are n tasks (conventional deep learning
approaches aim to solve just 1 task using 1 particular model), where
these n tasks or a subset of them are related to each other but not exactly
identical, Multi-Task Learning (MTL) will help in improving the learning of a
particular model by using the knowledge contained in all the n tasks.
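One hedged way to state this objective formally (the notation here is our own, not from the slides): with shared parameters θ_sh, task-specific parameters θ_i for each task i, per-task losses L_i, and assumed task weights w_i, joint training solves

```latex
\min_{\theta_{sh},\,\theta_1,\dots,\theta_n} \; \sum_{i=1}^{n} w_i \, \mathcal{L}_i(\theta_{sh}, \theta_i)
```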

• Intuition behind Multi-Task Learning (MTL): By using Deep Learning models, we usually aim to learn a good representation of the features or attributes of the input data to predict a specific value.
• Formally, we aim to optimize for a particular function by training a model and
fine-tuning the hyperparameters till the performance can’t be increased further.
By using MTL, it might be possible to increase performance even further by
forcing the model to learn a more generalized representation as it learns (updates
its weights) not just for one specific task but a bunch of tasks.
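As an illustration of this joint update, here is a minimal PyTorch training-loop sketch; the architecture, synthetic data, and per-task loss weights are all assumed for demonstration:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
trunk = nn.Sequential(nn.Linear(64, 128), nn.ReLU())   # shared layers
head_a = nn.Linear(128, 10)                            # task A head
head_b = nn.Linear(128, 5)                             # task B head
params = list(trunk.parameters()) + list(head_a.parameters()) + list(head_b.parameters())
opt = torch.optim.Adam(params, lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
w_a, w_b = 1.0, 0.5                                    # assumed per-task weights

for step in range(100):
    x = torch.randn(32, 64)                            # synthetic shared input
    y_a = torch.randint(0, 10, (32,))                  # synthetic task A labels
    y_b = torch.randint(0, 5, (32,))                   # synthetic task B labels
    feats = trunk(x)
    # One combined objective: the shared trunk is updated by both tasks.
    loss = w_a * loss_fn(head_a(feats), y_a) + w_b * loss_fn(head_b(feats), y_b)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Because loss.backward() runs on the weighted sum of both losses, the shared trunk receives gradient signal from every task, which is what pushes it toward the more generalized representation described above.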

• Biologically, humans learn in the same way. We learn better if we learn multiple
related tasks instead of focusing on one specific task for a long time.

• MTL as a regularizer: In the lingo of Machine Learning, MTL can also be looked at as a way of inducing bias. It is a form of inductive transfer: using multiple tasks induces a bias that prefers hypotheses that can explain all the n tasks.
• MTL acts as a regularizer by introducing inductive bias as stated above. It
significantly reduces the risk of overfitting and also reduces the model’s ability
to accommodate random noise during training.

• Now, let’s discuss the major and prevalent techniques to use MTL. Hard Parameter Sharing – A common hidden layer is used for all tasks, but several task-specific layers are kept intact towards the end of the model.

• This technique is very useful: by learning a representation for various tasks through a common hidden layer, we reduce the risk of overfitting.
• Soft Parameter Sharing – Each model has its own set of weights and biases, and the distance between these parameters across the different models is regularized so that the parameters become similar and can represent all the tasks; a sketch follows below.
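A minimal sketch of such a distance regularizer, assuming two small task models whose shared layers have matching shapes (all names and the penalty weight lam are illustrative):

```python
import torch
import torch.nn as nn

model_a = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 10))
model_b = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 5))

def soft_sharing_penalty(m_a, m_b, skip_last=1):
    # Sum of squared distances between corresponding "shared" parameters;
    # the last skip_last Linear layers (weight + bias) are task-specific
    # and therefore excluded from the penalty.
    params_a = list(m_a.parameters())[:-2 * skip_last]
    params_b = list(m_b.parameters())[:-2 * skip_last]
    return sum(((p - q) ** 2).sum() for p, q in zip(params_a, params_b))

lam = 1e-2                                   # assumed regularization strength
penalty = lam * soft_sharing_penalty(model_a, model_b)
# total_loss = task_loss_a + task_loss_b + penalty  (added to the usual losses)
print(penalty.item())
```

The penalty would be added to the sum of the two task losses during training, so the models stay separate but are pulled toward similar shared-layer weights.
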
• Assumptions and Considerations – Using MTL to share knowledge among tasks is very useful only when the tasks are very similar; when this assumption is violated, performance can decline significantly.

• Applications: MTL techniques have found various uses; some of the major applications are:

• Object detection and facial recognition

• Self-driving cars: pedestrians, stop signs, and other obstacles can be detected together

• Multi-domain collaborative filtering for web applications

• Stock prediction
• Here are some important points to consider when implementing Multi-Task Learning
(MTL) for deep learning:

1. Task relatedness: MTL is most effective when the tasks are related or have some commonalities, as is often the case in natural language processing, computer vision, and healthcare.

2. Data limitation: MTL can be useful when the data is limited, as it allows the model to leverage the information shared across tasks to improve the generalization performance.

3. Shared feature extractor: A common approach in MTL is to use a shared feature extractor, which is a part of the network that is shared across tasks and is used to extract features from the input data.

4. Task-specific heads: Task-specific heads are used to make predictions for each task and are typically connected to the shared feature extractor.

5. Shared decision-making layer: Another approach is to use a shared decision-making layer, where the decision-making layer is shared across tasks, and the task-specific layers are connected to it.

6. Careful architecture design: The architecture of an MTL model should be carefully designed to accommodate the different tasks and to make sure that the shared features are useful for all tasks.

7. Overfitting: MTL models can be prone to overfitting if the model is not regularized properly.

8. Avoiding negative transfer: When the tasks are very different or independent, MTL can lead to suboptimal performance compared to training a single-task model. Therefore, it is important to make sure that the shared features are useful for all tasks to avoid negative transfer; one possible diagnostic is sketched below.
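As a hedged illustration (this diagnostic is not from the slides): one known signal of conflict between tasks is a negative cosine similarity between the tasks’ gradients with respect to the shared parameters:

```python
import torch
import torch.nn as nn

trunk = nn.Linear(64, 128)                  # shared parameters
head_a, head_b = nn.Linear(128, 10), nn.Linear(128, 5)
loss_fn = nn.CrossEntropyLoss()

x = torch.randn(32, 64)                     # synthetic batch
y_a = torch.randint(0, 10, (32,))
y_b = torch.randint(0, 5, (32,))

def shared_grad(loss):
    # Gradient of one task's loss w.r.t. the shared trunk, flattened.
    grads = torch.autograd.grad(loss, list(trunk.parameters()), retain_graph=True)
    return torch.cat([g.flatten() for g in grads])

feats = torch.relu(trunk(x))
g_a = shared_grad(loss_fn(head_a(feats), y_a))
g_b = shared_grad(loss_fn(head_b(feats), y_b))
cos = torch.dot(g_a, g_b) / (g_a.norm() * g_b.norm())
print(f"gradient cosine similarity: {cos.item():.3f}")  # < 0 hints at conflict
```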
