Transfer Learning On Quora Dataset
Transfer Learning On Quora Dataset
in Quora
Abstract
Transfer learning is a research problem in ML where the knowledge gained by solving a problem related
to a task or domain is applied to solve different but related problem.Transfer Learning techniques have been
effectively used in fields like Image Processing and were able to achieve good results. But in NLP Transfer
Learning has been loosely applied and conclusions are not consistent.Neural networks are shown to obtain
state of art performance on text pair classification tasks like SNLI task. In this project we explored different
neural network based Transfer Learning schemes on a variety of datasets.
4 Datasets:
IMBD: A large dataset for binary sentiment classifi-
cation (positive vs. negative) - 25k sentences. Figure 1: (Ref [3]) Architecture used by Lili Mou et
MR: A small dataset for binary sentiment classifica- al., ’a’ for Experiment 1 and ’b’ for Experiment 2
tion - 10662 sentences.
QC A (small) ) dataset for 6-way question classifica-
tion (e.g., location, time, and number) - 5000 ques-
tions.
SNLI: A large dataset for sentence entailment recog-
nition. The classification objectives are entailment,
contradiction, and neutral - 500k pairs. 6 Experiments
SICK: A small dataset with exactly the same classifi-
cation objective as SNLI - 10k pairs.
MSRP: A (small) dataset for paraphrase detec-
tion.The objective is binary classification: judging 6.1 Experiment 1:
whether two sentences have the same meaning - 5000
pairs.
Quora dataset: It contains duplicate questions pairs For this experiment we have used LSTM archi-
with labels indicating whether the pair of questions tecture.We have trained the model on IMDB and
request the same information - 400k question pairs. then transferred the three layers embeddings, hidden
(LSTM) and output layer to MR dataset and em-
beddings and hidden (LSTM) layers to QC and the
5 Our Contributions results are shown in table 1.When we have trans-
ferred the parameters from IMDB to MR, the ac-
We have tried to replicate the results from [3] using curacy has improved by 1.57% and from IMDB to
INIT method. For experiment 1, we have trained a QC, there isn’t much change in accuracy. The rea-
LSTM model on IMDB and transferred the parame- son for this is that IMDB and MR are semantically
ters to MR and QC datasets and the results are shown similar datasets whereas IMDB and QC are semanti-
in table 1 and in figure 1. For experiment 2, we have cally different.Figure 2 shows accuracy for MR dataset
trained CNN model on SNLI dataset and transferred without transfer and with transfer of parameters from
the parameters to SICK and MRSP datasets. IMDB dataset.Figure 3 shows accuracy for QC dataset
We have then experimented with both INIT and without transfer and with transfer of parameters from
MULT on SNLI(source) and Quora(target) datasets. IMDB dataset.
Paper
Paper
(with Wiithout With
Dataset (without
Trans- Transfer Transfer
Transfer)
fer)
IMDB 87.0 - 84.50 -
MR 75.1 81.4 76.89 78.46
QC 90.8 93.2 95.3 94.73
Figure 2: IMDB to MR
Figure 5: Quora to SICK
Accuracy
Transfer Learning Scheme
(in %)
INIT 82.41
MULT 82.12
NMT context vector 77.11
No Transfer Learning 82.01