CS771 MINI PROJECT-2

Group 95 : Cerebro

Lifelong Domain Adaptation via Consolidated Internal Distribution

BY: VARSHA PILLAI, KISHORE S, ANIRVAN TYAGI

IIT KANPUR
Introduction
We develop an algorithm to address unsupervised
domain adaptation (UDA) in continual learning (CL)
settings. The goal is to continually update a model to
handle distributional shifts across sequentially arriving
tasks with unlabeled data, while retaining knowledge of
previously learned tasks. Our solution consolidates the
learned internal distribution to improve model
generalization on new domains and uses experience
replay to overcome catastrophic forgetting.
Challenges in Robust Generalization in Deep Learning
Deep Neural Networks and Domain-Specific Learning:

Deep neural networks (DNNs) are highly effective at identifying intricate patterns in
large datasets, allowing them to automate feature extraction and classification.
However, DNNs often overfit to the source domain, meaning they become too
specialized in the data they were trained on and struggle to perform well on new,
unseen data.
Domain Shift and the Generalization Challenge:

Domain shift occurs when the distribution of data changes between the training
phase and real-world testing, as seen when models are deployed in new
environments or with different data sources.
Under domain shift, DNNs tend to fail in producing accurate predictions, making
robust generalization a key challenge in fields like autonomous driving, healthcare
diagnostics, and finance.
Challenges in Continual Learning
What is Continual Learning (CL)?
Continual Learning refers to the ability to learn from non-stationary information streams
incrementally. “Non-stationary” represents continuously changing data distributions.
“Incremental” learning refers to preserving previous knowledge while continuously learning
new information.
In CL settings, models must adapt to a sequence of tasks or domains over time, each with
unique characteristics.
Traditional supervised learning methods are inefficient for CL as they require fully labeled data
and complete retraining, which is often impractical for evolving data streams.

Challenge in CL
Most current CL algorithms focus on tasks, or domains, that have fully labeled datasets. As a
result, they rely on extensive data labeling for each new domain encountered. However, manual
data annotation is often impractical, as it is both time-consuming and costly.
Challenges in Unsupervised Domain Adaptation (UDA)
What is UDA?
Shared Latent Space Alignment: Many UDA methods align the data distributions of both the
source and target domains in a shared embedding space, allowing a classifier trained on the
source domain to generalize to the target domain.
Domain Alignment Techniques: Domain alignment can be achieved through generative
adversarial learning or by directly minimizing the distance between the two distributions.

Challenge of Catastrophic Forgetting


Traditional UDA methods are not suitable for continual learning because they typically require
access to both source and target datasets simultaneously and often handle only a single source
and target domain. Simply updating the model for each new domain can lead to "catastrophic
forgetting," where the network loses information from previously learned domains due to
retroactive interference.
New Proposed Model
The algorithm aims to enable lifelong, unsupervised adaptation of a model to new
domains with only unannotated data. This means the model can continuously learn
from changing environments without labeled data.
Core Idea - Internal Distribution Consolidation: The approach consolidates the internal
representation or distribution learned by the model from the initial source domain (where
labeled data is available). This internal distribution acts as a memory of learned knowledge
and helps the model adapt to new, unlabeled domains.
Multimodal Distribution for Coupled Learning: By treating this internal representation as a
multimodal distribution, the algorithm updates the model to ensure that the knowledge from
the original domain is "coupled" with the new domains it encounters. This coupling allows the
model to generalize effectively across multiple unseen domains.
Addressing Catastrophic Forgetting with Experience Replay: To prevent the model from
forgetting past knowledge (catastrophic forgetting), it saves key representative samples from
previous tasks. When learning new tasks, it replays these samples, reinforcing past
knowledge as it adapts to new data.
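As a rough illustration of the experience-replay component described above, the sketch below keeps a small, class-balanced memory of past-domain samples (the slides later mention 10 samples per class per domain). The class name ReplayBuffer and the closest-to-class-mean selection heuristic are assumptions for illustration; the slides do not specify the exact selection rule.

```python
import numpy as np

class ReplayBuffer:
    """Class-balanced memory of representative samples from past domains.

    Selection heuristic (an assumption, not necessarily the method's rule):
    keep the samples whose embeddings lie closest to their class mean.
    """

    def __init__(self, per_class=10):
        self.per_class = per_class
        self.memory = []  # list of (input, label-or-pseudo-label) pairs

    def add_domain(self, embeddings, inputs, labels):
        # For each class, store the samples nearest the class centroid in Z.
        for c in np.unique(labels):
            idx = np.where(labels == c)[0]
            center = embeddings[idx].mean(axis=0)
            dists = np.linalg.norm(embeddings[idx] - center, axis=1)
            keep = idx[np.argsort(dists)[: self.per_class]]
            self.memory.extend((inputs[i], labels[i]) for i in keep)

    def sample(self, batch_size, rng=np.random):
        # Draw a random mini-batch of stored samples for replay.
        picks = rng.choice(len(self.memory),
                           size=min(batch_size, len(self.memory)),
                           replace=False)
        xs, ys = zip(*(self.memory[i] for i in picks))
        return np.stack(xs), np.array(ys)
```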
Problem Statement
Source Domain Setup:
We start with a source domain S, where we have a labeled training
dataset $D_S = \{(x_i^s, y_i^s)\}_{i=1}^{N}$. This dataset is drawn from an
unknown distribution $p_S(x)$, and we can train a deep neural network
$f_\theta$ on this data using empirical risk minimization (ERM), which
minimizes the classification error between predicted and actual
labels on the source domain. If the dataset is large and the
network is complex enough, the model will generalize well on new
samples from the same distribution.

Optimum $\theta$ using ERM:

$\hat{\theta} = \arg\min_{\theta}\; \frac{1}{N}\sum_{i=1}^{N} \mathcal{L}\big(f_\theta(x_i^s),\, y_i^s\big)$
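A minimal sketch of this ERM step in PyTorch, assuming a generic model and labeled source-domain loader (both hypothetical); it simply minimizes average cross-entropy between predictions and ground-truth source labels.

```python
import torch
import torch.nn as nn

def train_source_erm(model, source_loader, epochs=10, lr=1e-3, device="cpu"):
    """Standard ERM on the labeled source domain: minimize the average
    cross-entropy between predicted and actual labels."""
    model.to(device).train()
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    ce = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in source_loader:
            x, y = x.to(device), y.to(device)
            loss = ce(model(x), y)   # empirical risk on this mini-batch
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```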


Problem Statement
Challenge of Continual Learning: In continual learning (CL), we face
the additional challenge that the input distribution is not stationary.
Over time, the data distribution may shift, leading to a distributional
gap (or "domain shift") between the training data distribution and
the new, incoming data. If the model is not adapted to these
changes, it may perform poorly on the new, shifted data.
Sequential Target Domains: To simulate real-world scenarios, we
consider a series of target domains $\mathcal{T}_1, \dots, \mathcal{T}_T$, each with an unlabeled
dataset $D_t = \{x_i^t\}_{i=1}^{N_t}$. The samples in each target domain are drawn
from a different distribution $p_t(x)$, meaning that the data
distribution changes over time. Since these datasets are
unlabeled, we cannot use standard ERM methods, which rely on
labeled data for training.
Problem Statement
To address domain shift and catastrophic forgetting, we
decompose our network $f_\theta$ into a deep encoder $\phi_v$ (which maps
data to an embedding space Z) and a classifier $h_w$. After
training on the source domain, the classes are well-separated in
Z. Our goal is to maintain this separation as new, unlabeled
target domains are introduced, allowing the model to generalize
to new domains.
We achieve this by keeping the distribution in Z stable across
domains, i.e., by minimizing the distance between the source and
target distributions in Z:

$\min_{v}\; D\big(\phi_v(p_S(x)),\, \phi_v(p_t(x))\big)$

However, unlike standard UDA methods, we can’t directly access the source
data during continual learning, which makes it challenging to align
distributions without forgetting past knowledge.
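One standard way to estimate such a distance between batches of embeddings is the sliced Wasserstein distance; the sketch below is an illustrative estimator (equal batch sizes assumed), not necessarily the exact metric used by the method.

```python
import torch

def sliced_wasserstein(z_src, z_tgt, n_projections=64):
    """Approximate the Wasserstein distance between two batches of
    embeddings by averaging 1-D Wasserstein distances along random
    projection directions. Assumes z_src and z_tgt have equal batch sizes."""
    d = z_src.shape[1]
    # Random unit directions used to slice the distributions.
    theta = torch.randn(n_projections, d, device=z_src.device)
    theta = theta / theta.norm(dim=1, keepdim=True)
    p_src = z_src @ theta.T   # (n, n_projections)
    p_tgt = z_tgt @ theta.T
    # In 1-D, the Wasserstein distance reduces to comparing sorted samples.
    p_src, _ = torch.sort(p_src, dim=0)
    p_tgt, _ = torch.sort(p_tgt, dim=0)
    return ((p_src - p_tgt) ** 2).mean()
```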
Proposed Solution
The solution involves learning a discriminative embedding space in which
the learned internal distribution is consolidated to improve
generalizability. The encoder maps the input source distribution into a
multimodal distribution $p_J(z)$ in the embedding space, with each mode
representing a class: data points from a specific class are mapped to
the corresponding cluster in the embedding space.
To model the learned distribution, a Gaussian Mixture Model (GMM) with
k components is used:

$p_J(z) = \sum_{j=1}^{k} \alpha_j\, \mathcal{N}\big(z \mid \mu_j, \Sigma_j\big)$

where $\alpha_j$ are the mixture weights, and $\mu_j$ and $\Sigma_j$ are the mean and
covariance of the components, respectively.
Since the class labels are available, the parameters for each mode can
be computed independently using Maximum A Posteriori (MAP) estimates.
For each mode j, the support set $S_j$ consists of the data points
belonging to that class. The MAP estimates for the GMM parameters are:

$\hat{\alpha}_j = \frac{|S_j|}{N}, \qquad \hat{\mu}_j = \frac{1}{|S_j|}\sum_{(x_i, y_i)\in S_j} \phi_v(x_i), \qquad \hat{\Sigma}_j = \frac{1}{|S_j|}\sum_{(x_i, y_i)\in S_j} \big(\phi_v(x_i) - \hat{\mu}_j\big)\big(\phi_v(x_i) - \hat{\mu}_j\big)^{\top}$
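A minimal sketch of these per-class estimates and of drawing pseudo-samples from the resulting GMM. The function names and the small covariance regularizer are assumptions added for illustration; Z is assumed to already hold the encoder outputs.

```python
import numpy as np

def fit_internal_gmm(Z, y, num_classes):
    """MAP/ML estimates of a class-conditional GMM over the embedding
    space: one Gaussian mode per class, fitted from labeled embeddings Z."""
    N, d = Z.shape
    alphas, means, covs = [], [], []
    for j in range(num_classes):
        Sj = Z[y == j]                    # support set of mode j
        alphas.append(len(Sj) / N)        # mixture weight
        mu = Sj.mean(axis=0)
        means.append(mu)
        diff = Sj - mu
        # Small diagonal term added for numerical stability (an assumption).
        covs.append(diff.T @ diff / len(Sj) + 1e-6 * np.eye(d))
    return np.array(alphas), np.array(means), np.array(covs)

def sample_internal_gmm(alphas, means, covs, n_samples, rng=np.random):
    """Draw a labeled pseudo-dataset (z, y) from the consolidated internal GMM."""
    ys = rng.choice(len(alphas), size=n_samples, p=alphas)
    zs = np.stack([rng.multivariate_normal(means[c], covs[c]) for c in ys])
    return zs, ys
```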
Theoretical Analysis
In the PAC-learning framework, this theorem provides
a bound on the expected error a classifier will
experience on new, unseen tasks based on previous
learning experiences and data distribution changes
across tasks. Specifically, it considers a set of
possible classifiers (or hypothesis class) and
describes the errors: $e_0$ for the initial (source) domain,
$e_t$ for target domains (new tasks), and $e_{t,J}$ for a
pseudo-dataset that approximates target performance.
The theorem then relates the error on target domains
to several factors, including the pseudo-dataset error,
the difference in data distributions between
consecutive tasks (measured as shifts in feature
distributions), an ideal error bound achievable by an
optimal classifier, and an additional residual error
term. By accounting for these elements, the theorem
provides insight into how well the classifier can adapt
to new tasks while retaining past knowledge, thus
helping guide continual learning by limiting error as
tasks evolve.
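The bound described verbally above (what the next slide refers to as Eq. 5) can be written schematically as below; the symbols and exact form are assumptions for illustration, not the theorem's verbatim statement.

```latex
% Schematic form of the bound described above (symbols assumed, not verbatim):
%   e_t      expected error on target task t
%   e_{t,J}  error on the pseudo-dataset drawn from the internal distribution
%   W(.,.)   Wasserstein distance between embedding-space distributions
%   e_C      error of an ideal joint classifier;  \xi  residual term
\[
  e_t \;\lesssim\; e_{t,J}
        \;+\; W\!\big(\hat{p}_J,\, \hat{q}_t\big)
        \;+\; \sum_{s=2}^{t} W\!\big(\hat{q}_{s-1},\, \hat{q}_s\big)
        \;+\; e_C \;+\; \xi
\]
```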
Theoretical Analysis
The model is trained by minimizing

$\min_{v,w}\; \underbrace{\frac{1}{N_p}\sum_{i} \mathcal{L}\big(h_w(z_i^p), y_i^p\big)}_{\text{pseudo-samples from } \hat{p}_J} \;+\; \underbrace{\frac{1}{N_b}\sum_{i} \mathcal{L}\big(h_w(\phi_v(x_i^b)), y_i^b\big)}_{\text{experience replay}} \;+\; \underbrace{\lambda\, D\big(\phi_v(\hat{q}_t),\, \hat{p}_J\big)}_{\text{distribution alignment}} \qquad (4)$

where $(z_i^p, y_i^p)$ are pseudo-samples drawn from the internal GMM distribution, $(x_i^b, y_i^b)$ are
experience-replay samples, and $D(\cdot,\cdot)$ denotes the Wasserstein distance (WD).

Theorem 1 explains why the LDAuCID algorithm is effective. The major terms on the right-hand side of Eq. 5, an
upper bound on the expected error for each task (domain), are continually minimized by LDAuCID. The first term
is minimized because random samples from the internal distribution are used to minimize the empirical error
term, i.e., the first term of Eq. (4). The second term is minimized by the third term of Eq. (4) when the task
distribution is aligned with the empirical internal distribution in the embedding space at time t. The third
term, a summation over the distances between consecutive task distributions, models the accumulated effect of
distributional shifts across the task sequence and remains small when consecutive distributions stay aligned.
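Below is a sketch of one adaptation step combining the three ingredients discussed above: classification loss on pseudo-samples drawn from the internal GMM, classification loss on replay samples, and the alignment term between target embeddings and the internal distribution. It reuses the sliced_wasserstein sketch shown earlier; the signature, the λ weighting, and feeding GMM pseudo-samples directly to the classifier are assumptions for illustration, not the authors' exact implementation.

```python
import torch
import torch.nn.functional as F

def adaptation_step(encoder, classifier, x_target, gmm_z, gmm_y,
                    x_replay, y_replay, opt, lam=1.0):
    """One adaptation step on an unlabeled target batch, combining:
      (1) classification loss on pseudo-samples (gmm_z, gmm_y) drawn from
          the consolidated internal GMM in the embedding space,
      (2) classification loss on stored experience-replay samples,
      (3) an alignment term between target embeddings and the GMM
          pseudo-samples (sliced_wasserstein from the earlier sketch,
          equal batch sizes assumed).
    """
    z_t = encoder(x_target)                                   # unlabeled target embeddings
    loss_internal = F.cross_entropy(classifier(gmm_z), gmm_y)                  # term (1)
    loss_replay = F.cross_entropy(classifier(encoder(x_replay)), y_replay)     # term (2)
    loss_align = sliced_wasserstein(gmm_z, z_t)                                # term (3)
    loss = loss_internal + loss_replay + lam * loss_align
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```

In practice, the pseudo-samples drawn from the GMM would typically be filtered by the confidence parameter τ discussed in the ablation slides, keeping only high-confidence draws.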
Empirical Validation
We address lifelong unsupervised domain adaptation (UDA) using four classic
UDA benchmark datasets, adapted for sequential tasks. We adhere to the
one-source, one-target domain setup and standard evaluation protocols.

Digit Recognition: Using MNIST (M), USPS (U), and SVHN (S), we test on
M → U, U → M, and S → M, plus two sequential tasks: S → M → U and
S → U → M.

ImageCLEF-DA: With 12 shared classes from Caltech-256 (C), ILSVRC 2012
(I), and Pascal VOC 2012 (P), we evaluate on C → I → P.

Office-Home: Consisting of 15,500 images across Artistic (A), Clip Art
(C), Product (P), and Real-World (R) domains, we test A → C → P → R and
R → P → C → A.

Office-Caltech: Using 10 shared classes from Office-31 and Caltech-256,
we test A → C → D → W and W → D → C → A.
Network Structure and Evaluation Protocol:

We use VGG16 as the base model for digit recognition tasks and
Decaf6 features for the Office-Caltech tasks. For ImageCLEF-DA and
Office-Home, we use ResNet-50 pre-trained on ImageNet as the
backbone. To analyze model learning dynamics over time, we generate
learning curves showing test performance across training epochs,
simulating continual training. After each target domain task, we report
the average classification accuracy and standard deviation on the
target domain over five runs. Initial performance is measured with only
source data to show domain shift impact; then, we adapt the model
using the LDAuCID algorithm on the target data.
Results
Learning Curves: Figure 2 shows the learning curves for eight sequential
UDA tasks, with the model trained for 100 epochs on each task.
Experience replay uses 10 samples per class per domain.
Domain Shift: Initially, domain shift causes a performance drop, but
subsequent tasks show improved performance due to knowledge
transfer.
LDAuCID Effectiveness: LDAuCID boosts performance on all target
tasks, with reduced catastrophic forgetting. However, the Office-Home
dataset, with larger domain gaps, shows less improvement.
Catastrophic Forgetting: The model retains performance on previously
learned tasks, with some forgetting observed in the SVHN dataset.
Comparison with Classic UDA: LDAuCID is compared to classic UDA
methods (Table 1) and often outperforms or is competitive with
methods like ETD and CDAN.
Balanced Datasets: LDAuCID performs particularly well on
balanced datasets like ImageCLEF-DA.
Lifelong UDA: LDAuCID effectively addresses lifelong UDA tasks,
outperforming many classic UDA methods, despite the limitation
of not having access to all source domain data.
Analytic and Ablative Studies
1. Data Representation and Learning
Progress:
Each data point in the embedding
space is represented by a point in a 2D
plot, with colors denoting the ten digit
classes.
The rows in Figure 3 correspond to the
data geometry at different time-steps,
with the second row showing the state
after learning the SVHN and MNIST
datasets.
By inspecting the columns vertically,
the impact of learning multiple tasks
over time can be observed. In
particular, when a new task is added,
the model retains the knowledge
learned in previous tasks, indicated by
the separability of classes across
rows.
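Plots of this kind are usually produced by projecting the learned embeddings to 2D; the sketch below uses t-SNE from scikit-learn as one possible projection (the projection method actually used for the figure is not specified here and is an assumption).

```python
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

def plot_embedding(Z, y, title=""):
    """Project embedding-space features Z (N, d) to 2D and color points by
    class, reproducing the style of plot described above."""
    z2d = TSNE(n_components=2, init="pca", random_state=0).fit_transform(Z)
    plt.figure(figsize=(4, 4))
    sc = plt.scatter(z2d[:, 0], z2d[:, 1], c=y, cmap="tab10", s=5)
    plt.colorbar(sc, label="digit class")
    plt.title(title)
    plt.show()
```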
Analytic and Ablative Studies
1. Catastrophic Forgetting Mitigation:
The model shows stability in retaining knowledge, suggesting that catastrophic
forgetting is being effectively mitigated. Even as new tasks are added (moving to
the next rows in the figure), the learned knowledge does not fade, and the model
adapts to new tasks while preserving previous information.

2. Alignment of Data Distributions:

By the final row of Figure 3, the distributions of all domains align closely,
resembling the internally learned Gaussian Mixture Model (GMM) distribution.
This alignment suggests that the model successfully adapts the target domains
to share the same distribution.
For example, comparing the distribution of the MNIST dataset in the first row
(before adaptation) and the second row (after adaptation) shows that the MNIST
distribution increasingly resembles that of the SVHN dataset, the source domain.
This supports the hypothesis that the model’s domain adaptation mechanism
works as expected.
3. Hyperparameter Sensitivity:
The effect of two hyperparameters, λ and τ, on model performance is also studied,
particularly for the binary UDA task (S → M) and illustrated in Figures 3e and 3f.
λ (Trade-off Parameter): The results show that λ has a minimal effect on performance.
This is because the ERM (Empirical Risk Minimization) loss term is relatively small at the
start of the alignment process due to pre-training on the source domain, making λ less
critical for fine-tuning. The dominant optimization term is the domain-alignment term,
meaning that λ does not need careful adjustment.
τ (Confidence Parameter): The parameter τ, which controls the confidence in the
alignment process, shows that when τ is set to approximately 1, the model performs
better on the target domain. This is due to the reduction of label pollution caused by
outlier samples in the GMM distribution, which can negatively affect domain alignment.
When τ ≈ 1, the model becomes more robust to such outliers.
4. Empirical Support for Theoretical Analysis:
The observed results align with the theoretical analysis presented earlier (Theorem 1),
confirming the effectiveness of the domain adaptation method.
The overall conclusion from these empirical evaluations is that the LDAuCID method
works as expected, demonstrating effective domain adaptation and mitigated
catastrophic forgetting, as well as improved performance with the proper choice of
hyperparameters.
Conclusion
We propose a domain adaptation algorithm for continual learning,
where input distributions are mapped to an internal distribution in
an embedding space via a neural network. Our method aligns
these distributions across tasks, ensuring that new tasks do not
hinder generalization. Catastrophic forgetting is mitigated through
experience replay, which stores and replays informative input
samples for updating the internal distribution. While our approach
uses a simple distribution estimation, we anticipate that better
methods could further enhance performance. Future work will
explore the impact of task order and extend the approach to
incremental learning, allowing for new class discovery after initial
training.