Sentiment Classification
Phrase-level subtask A
Dataset      Score   Rank
Twitter 15   84.79   1

Message-level subtask B
Dataset      Score   Rank
Twitter 15   64.59   2
Sentiment Specific Word Embeddings
LM likelihood + Polarity
Three-phase procedure:
1. Creation of word embeddings to initialize the first layer:
   word2vec trained on an unlabeled corpus of 200M tweets.
2. Distant-supervised phase, in which the network weights and word
   embeddings are trained to capture aspects related to sentiment:
   emoticons are used to infer the polarity of a balanced set of 90M tweets.
3. Supervised phase, in which the network is trained on the provided
   supervised training data.
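The emoticon-based labeling in phase 2 can be sketched roughly as follows. This is a minimal illustration, not the system's actual preprocessing: the emoticon lists, the function names `distant_label` and `balanced`, and the rule of discarding tweets with no emoticon or with conflicting emoticons are all assumptions.

```python
import re

# Hypothetical emoticon lists; the source does not say which emoticons were used.
POSITIVE = (":)", ":-)", ":D", "=)")
NEGATIVE = (":(", ":-(", ":'(", "=(")


def _pattern(emoticons):
    """Build a regex that matches any of the given emoticons literally."""
    return re.compile("|".join(re.escape(e) for e in emoticons))


POS_RE = _pattern(POSITIVE)
NEG_RE = _pattern(NEGATIVE)


def distant_label(tweet):
    """Return (text_without_emoticons, label), or None if polarity is ambiguous."""
    has_pos = bool(POS_RE.search(tweet))
    has_neg = bool(NEG_RE.search(tweet))
    if has_pos == has_neg:  # no emoticon at all, or conflicting emoticons
        return None
    label = "positive" if has_pos else "negative"
    # Remove the emoticons so the network cannot simply memorize them.
    text = NEG_RE.sub("", POS_RE.sub("", tweet))
    return " ".join(text.split()), label


def balanced(examples):
    """Keep the same number of positive and negative examples."""
    pos = [e for e in examples if e[1] == "positive"]
    neg = [e for e in examples if e[1] == "negative"]
    n = min(len(pos), len(neg))
    return pos[:n] + neg[:n]
```

Stripping the emoticon from the text is a design choice in this sketch: it forces the model to learn polarity from the remaining words rather than from the label source itself.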
Ensemble of Classifiers
The ensemble combines the outputs of two 2-layer CNNs that have similar architectures but
differ in the choice of certain parameters (such as the number of
convolutional filters).
The networks were also initialized with different word embeddings and used
slightly different training data for the distant-supervised phase.
A total of 7 outputs were combined.
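The slides do not say how the 7 outputs were combined. As one simple possibility, the sketch below averages the class-probability vectors of the ensemble members and picks the most probable class. Plain averaging, the function name `average_ensemble`, and the dict-based representation are assumptions made for illustration only.

```python
def average_ensemble(member_probs):
    """Average class-probability vectors from several classifiers.

    member_probs: list of dicts mapping label -> probability, one per member.
    Returns (predicted_label, averaged_probs).
    """
    labels = list(member_probs[0].keys())
    n = len(member_probs)
    # Unweighted mean of each class probability across all members.
    avg = {lab: sum(p[lab] for p in member_probs) / n for lab in labels}
    return max(avg, key=avg.get), avg


# Toy usage with three made-up member outputs.
outputs = [
    {"positive": 0.6, "negative": 0.1, "neutral": 0.3},
    {"positive": 0.2, "negative": 0.2, "neutral": 0.6},
    {"positive": 0.5, "negative": 0.2, "neutral": 0.3},
]
label, probs = average_ensemble(outputs)
```

A trained meta-classifier over the member outputs is a common alternative to averaging when held-out labeled data is available.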
Results

                          2013    2013    2014    2014      2014          2015    2016 Tweet  2016 Tweet
                          Tweet   SMS     Tweet   Sarcasm   LiveJournal   Tweet   Avg F1      Acc
SwissCheese Combination   70.05   63.72   71.62   56.61     69.57         67.11   63.31       64.61
UniPI SWE                 64.2    60.6    68.4    48.1      66.8          63.5    59.2        65.2

SwissCheese single: 67.00 69.12 62.00 71.32 61.01 57.19 (only six values given)
Breakdown over all test sets

             SwissCheese              UniPI 3
             Prec.   Rec.    F1       Prec.   Rec.    F1
positive     67.48   74.14   70.66    70.88   65.35   68.00
negative     53.26   67.86   59.68    50.29   58.93   54.27
neutral      71.47   59.51   64.94    68.02   68.12   68.07
Avg F1                       65.17                    61.14
Accuracy                     64.62                    65.64
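The Avg F1 row appears to be the mean of the positive and negative F1 scores, with the neutral class left out: (70.66 + 59.68) / 2 = 65.17 for SwissCheese. The short sketch below recomputes the per-class F1 from precision and recall and then takes that average. The function names are illustrative, and small differences in the last digit are expected because the table's precision and recall are already rounded.

```python
def f1(precision, recall):
    """Harmonic mean of precision and recall (same scale as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def avg_f1_pos_neg(f1_positive, f1_negative):
    """Average F1 over the positive and negative classes only."""
    return (f1_positive + f1_negative) / 2


# SwissCheese values from the breakdown table.
swiss_pos = f1(67.48, 74.14)
swiss_neg = f1(53.26, 67.86)
swiss_avg = avg_f1_pos_neg(70.66, 59.68)
```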
Sentiment Classification from a single neuron
A char-level LSTM with 4096 units was
trained on 82 million reviews from Amazon.
The model is trained only to predict the next
character in the text.
After training, one of the units had a very
high correlation with sentiment, resulting in
state-of-the-art accuracy when used as a
classifier.
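Using a single unit as a classifier can be sketched as follows: given one hidden-state vector per review and binary sentiment labels, pick the unit whose activation correlates most strongly with the labels, then threshold it. This is a simplified illustration with hypothetical function names; selecting the unit by Pearson correlation and the fixed threshold of 0 are assumptions, not the procedure of the cited work.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5 if vx and vy else 0.0


def most_sentiment_unit(states, labels):
    """Return (index of the unit most correlated with labels, all correlations).

    states: list of hidden-state vectors, one per review.
    labels: list of 0/1 sentiment labels.
    """
    n_units = len(states[0])
    corrs = [pearson([s[u] for s in states], labels) for u in range(n_units)]
    best = max(range(n_units), key=lambda u: abs(corrs[u]))
    return best, corrs


def classify(state, unit, threshold=0.0, sign=1.0):
    """Predict 1 (positive) if the signed activation exceeds the threshold."""
    return 1 if sign * state[unit] > threshold else 0
```

The `sign` argument covers the case where the selected unit is negatively correlated with positive sentiment.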
The model can be used to generate text.
By setting the value of the sentiment unit,
one can control the sentiment of the
resulting text.
Blog post - Radford et al. Learning to
Generate Reviews and Discovering
Sentiment. arXiv 1704.01444