Resolving Ambiguity in Sentiment Classification: The Role of Dependency Features

SHUYUAN DENG, ATISH P. SINHA, and HUIMIN ZHAO, University of Wisconsin-Milwaukee
Sentiment analysis has become popular in business intelligence and analytics applications due to the great
need to learn insights from the vast amounts of user-generated content on the Internet. One major
challenge of sentiment analysis, like most text classification tasks, is finding structures from unstructured
texts. Existing sentiment analysis techniques employ the supervised learning approach and the lexicon
scoring approach, both of which largely rely on the representation of a document as a collection of words
and phrases. The semantic ambiguity (i.e., polysemy) of single words and the sparsity of phrases negatively
affect the robustness of sentiment analysis, especially in the context of short social media texts. In this study,
we propose to represent texts using dependency features. We test the effectiveness of dependency features
in supervised sentiment classification. We compare our method with the current standard practice using
a labeled data set containing 170,874 microblogging messages. The combination of unigram features and
dependency features significantly outperformed other popular types of features.
CCS Concepts: Computing methodologies → Machine learning → Learning paradigms → Supervised learning → Supervised learning by classification
Additional Key Words and Phrases: Sentiment analysis, text mining, dependency, feature extraction, supervised learning
ACM Reference Format:
Shuyuan Deng, Atish P. Sinha, and Huimin Zhao. 2017. Resolving ambiguity in sentiment classification:
The role of dependency features. ACM Trans. Manage. Inf. Syst. 8, 2–3, Article 4 (June 2017), 13 pages.
DOI: http://dx.doi.org/10.1145/3046684
1. INTRODUCTION
Posting on social media platforms has become one of the most popular activities on the
Internet. Social media messages contain rich user opinions and are being generated
in high volume and velocity, providing businesses with a great opportunity to monitor
their environments in real time [Bifet and Frank 2010; Yu et al. 2013]. Sentiment
analysis, also known as opinion mining, has emerged as a useful tool for extracting
subjective information from different types of texts [Liu 2012; Pang and Lee 2008],
such as blogs, reviews, comments, and tweets.
Sentiment analysis typically classifies the directional emotions in texts into different
categories, such as positive, negative, and neutral [Abbasi et al. 2011; Chen et al.
2012]. It relies on natural language processing (NLP) and text-mining techniques.
Since text data are usually unstructured, the biggest challenge of sentiment analysis is
finding meaningful structures from texts. A standard practice is to represent texts as a
collection of words and/or phrases. However, single words can have multiple meanings,
known as polysemy, and their meanings can vary greatly by context. Social media texts are
typically short, which makes contextual information even scarcer. Phrases are much less
ambiguous than single words; however, they lack flexibility since they only capture fixed
word sequences.
This study addresses the polysemy issue in sentiment analysis by introducing dependency
features as sentiment indicators. Dependencies are pairwise word relations [De Marneffe
and Manning 2008]. We argue that a simple dependency representation of texts is more
effective than phrases in sentiment analysis, especially for short documents, if the analysis
is conducted at the document level. Compared to using single words and phrases, using
dependencies has at least two advantages. First, dependencies incorporate contextual
information by using word relations. Second, a word relation can still be established even
if two words are not adjacent. In this study, we introduce dependency features into
supervised sentiment classification. We compare the classification effectiveness of
dependencies with that of the current standard practice of using n-grams and
part-of-speech tagged words on a large test set.
The remainder of this article is organized as follows. In the second section, we review
the standard practices in sentiment analysis and the types of representation of sentence
structures. In the third section, we describe the advantage of dependency features
and propose the dependency-based text representation in sentiment classification. In
the fourth section, we compare the effectiveness of dependency features with that of
different baseline approaches. Then, we review related studies using dependencies and
discuss the contribution of this article. The last section identifies the limitations of our
study and future research directions.
2. BACKGROUND
There are two major approaches to sentiment analysis: supervised learning and lexicon
scoring [Liu 2012; Pang and Lee 2008]. The supervised learning approach represents a
document as a set of linguistic features and trains a machine-learning classifier using
a large annotated corpus in which the sentiment category of each document is known.
The trained model is subsequently used to classify the sentiment of other documents
[Hu and Liu 2004; Pang et al. 2002]. The most popular method used to represent texts
is the word n-gram model [Abbasi et al. 2011; Chou et al. 2010]. Unigram models
represent a document as a vector of word frequencies (i.e., a vector space model). This
is also known as the bag-of-words (BOW) model. Term frequency-inverse document
frequency (TF-IDF), which assigns more weight to words that occur in only a few
documents, has been used as an improvement over plain frequency [Chou et al. 2010;
Ngo-Ye and Sinha 2012]. For short documents, such as social media texts, the binary
value of word presence (i.e., whether a word occurs in a document) has also been used.
In an effort to resolve word semantics, part-of-speech (POS) tagged words have also been
experimented with [Blitzer et al. 2006; Tsai et al. 2016; Zimbra et al. 2015]. A major
drawback of the BOW model is the strong assumption that the order of words (i.e.,
syntax) does not matter. To incorporate syntactical information, existing studies have
also attempted to use phrase patterns, bigrams, and trigrams. However, they have
not found consistent performance improvement from these features.
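For reference, the TF-IDF weight mentioned above is commonly computed as follows (this is one common variant; implementations differ in smoothing and normalization):

\[ \mathrm{tfidf}(t,d) = \mathrm{tf}(t,d) \times \log\frac{N}{\mathrm{df}(t)} \]

where tf(t, d) is the frequency of term t in document d, N is the total number of documents, and df(t) is the number of documents containing t.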
An important type of information that previous research in sentiment analysis has
not effectively captured is syntax, the principles of constructing sentences [Chomsky
1965]. In natural language processing, there are two types of representations of
sentence structures, the constituency grammar and the dependency grammar [Covington
2001]. Constituency grammar, also known as phrase structure grammar, describes a
sentence as a set of constituency relations [Chomsky 2002]. Single words (i.e., leaves)
are the constituents of phrases, which, in turn, are constituents of more complicated
phrases, the eventual constituents of the sentence (i.e., root). The phrase features used
in sentiment classification are a simplified case of constituency features.
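To make the contrast concrete, consider the sentence “The camera is great.” A constituency parse groups words into nested phrases, whereas a dependency parse links word pairs directly (the labels below follow common conventions; the exact output depends on the parser and representation version):

Constituency: (S (NP (DT The) (NN camera)) (VP (VBZ is) (ADJP (JJ great))))
Dependency: det(camera, The), nsubj(great, camera), cop(great, is)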
3. RESEARCH METHOD
In this study, we propose the use of dependency features to improve supervised
sentiment classification. We compare the proposed dependency features with the
features used in previous studies in terms of classification accuracy.
The dependency dobj(stealing, thunder) means that thunder is the direct object of stealing.
Although the two words are not adjacent, their close relationship can still be captured
by this dependency. The last dependency, det(the, thunder), means that the is the
determiner of thunder, which carries little meaning beyond the word thunder itself.
Without loss of generality, we suppose a classifier is trained using Sentence 1 (among
others) and is used to classify two other sentences, both of which are real-world examples:
Sentence 2: Apple keeps stealing Samsung’s thunder.
Sentence 3: Apple stealing user information via Face Time.
We show the different representations of both sentences in Table II. In these
representations, if an element also appears in Sentence 1, we display it in bold.
It is clear that Sentences 1 and 2 both express positive sentiment toward Apple, while
Sentence 3 expresses negative sentiment. Among the different types of feature
representation, only unigrams, POS-tagged words, and dependency relations can reveal the
similarity between Sentences 1 and 2. However, unigrams and POS-tagged words also
capture some similarity between Sentences 1 and 3, which have completely opposite
sentiment polarity. Among all types of representation, dependency is the only one that
can accurately detect the major similarities and differences between Sentence 1 and
the other sentences. We summarize the similar elements of the different representations
for the three sentences in Table III.
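As an illustration of how such dependency triples can be extracted in practice, the following is a minimal sketch using spaCy as a stand-in for the Stanford pipeline used in this study (spaCy's label inventory differs slightly from Stanford typed dependencies):

```python
# Minimal sketch of dependency-triple extraction, using spaCy as a
# stand-in parser. Requires: pip install spacy, then
# python -m spacy download en_core_web_sm
import spacy

nlp = spacy.load("en_core_web_sm")

for text in ["Apple keeps stealing Samsung's thunder.",
             "Apple stealing user information via Face Time."]:
    doc = nlp(text)
    # Each token's relation to its head yields one triple,
    # e.g. dobj(stealing, thunder)
    triples = [f"{t.dep_}({t.head.text}, {t.text})"
               for t in doc if t.dep_ != "ROOT"]
    print(triples)
```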
3.2. Research Design
To use dependency relations as features, a document is parsed into dependencies. The
occurrence of each dependency in the corpus is measured for each document to generate
a vector space model. The dependency features may be quantified using frequency
(continuous) or presence (binary) values. In addition, inverse document frequency (IDF)
weighting may be applied. Then, a training data set containing documents represented by
dependency vectors and their sentiment categories is used to build a classifier.
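The following is a minimal sketch of this representation step, assuming each document has already been parsed and its dependency triples serialized as single whitespace-separated tokens (the toy documents below are hypothetical):

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical pre-parsed documents: each dependency triple is serialized
# as one token so that a standard vectorizer can index it like a word.
docs = [
    "nsubj(stealing,Apple) dobj(stealing,thunder) det(the,thunder)",
    "nsubj(keeps,Apple) xcomp(keeps,stealing) dobj(stealing,thunder)",
]

# Two switches cover the four weighting settings evaluated later:
# binary=True yields presence features; use_idf=False approximates plain
# term frequency (TfidfVectorizer still applies length normalization).
vectorizer = TfidfVectorizer(
    tokenizer=str.split,   # triples are already whitespace-delimited
    token_pattern=None,    # suppress the unused default pattern
    lowercase=False,       # keep word forms inside the triples intact
    binary=False,
    use_idf=True,
)
X = vectorizer.fit_transform(docs)
print(vectorizer.get_feature_names_out())
```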
4. EVALUATION
To evaluate the effectiveness of dependency features for sentiment classification, we
conducted experiments on a large social media data set. The data set contains all user
messages posted on Stocktwits between July 2009 and April 2014. Stocktwits is a leading
social media platform on which investors share opinions about the financial market.
Similar to tweets, Stocktwits messages are limited to 140 characters. Instead of officially
supporting hashtags, Stocktwits uses cashtags (e.g., $AAPL) to track stocks and other
financial assets mentioned in a message. On Stocktwits, users can mark the sentiment of
their postings as bullish or bearish. Our data set contains 87,776 bearish messages and
265,452 bullish messages. We did not use any unmarked messages since the sentiment
in these messages is uncertain. The experimental task is to classify each message
as bullish or bearish. For benchmarking purposes, we sampled an equal number of
bullish and bearish messages for cross validation. Our final data set contains 170,874
messages, half of which are bullish and the other half bearish.
We used Stanford typed dependencies in this study [De Marneffe and Manning 2008].
This representation defines approximately 50 grammatical relations. To generate
dependency features, we first used the CMU ARK Tweet POS Tagger [Owoputi et al.
2013] to tag the Stocktwits messages. Next, we used the Stanford Parser [De Marneffe
et al. 2006], a Java library published by the Stanford NLP group, to parse the tagged
messages into dependencies. The dependency parser was trained using the Wall Street
Journal (WSJ) section of the Penn Treebank, which consists of about one million words of
manually annotated sentences [Marcus et al. 1993]. Each sentence in the treebank
is represented as a constituency tree. The parser first decomposes the constituency
trees into rules representing a context-free grammar [Charniak 1996]. For example, a
sentence (S) consisting of a noun phrase (NP) and a verb phrase (VP) is represented
as the rule S → NP VP; a noun phrase consisting of a determiner (DT) and a noun (NN)
is represented as the rule NP → DT NN. Each rule is assigned a probability based on
how often it occurs in the training corpus. Given a POS-tagged sentence, its parse
tree is constructed by maximizing the joint likelihood of the rules [Johnson 1998].
The search for the maximum-likelihood parse is accomplished using the CKY algorithm
[Martin and Jurafsky 2000]. Next, the dependency relations are extracted using
predefined patterns [De Marneffe et al. 2006]. A previous study has shown that the parser
can achieve about 90% accuracy in parsing English texts [Chen and Manning 2014].
The time complexity of the parser is O(n^3) [Klein and Manning 2003]. Parsing all of
the messages took approximately six hours using a single core of an Intel i7-6700HQ
processor. We then constructed the features and performed the sentiment classification
in Python. Each run (training and testing) took less than one minute.
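For readers who wish to reproduce a similar pipeline entirely in Python, the following rough sketch uses stanza, a more recent library from the Stanford NLP group, as a stand-in for the ARK tagger and the Java parser used here (note that stanza outputs Universal Dependencies rather than Stanford typed dependencies):

```python
# Rough sketch of a tagging-plus-parsing pipeline using stanza as a
# stand-in for the ARK tagger and the Java Stanford Parser.
import stanza

stanza.download("en")  # one-time model download
nlp = stanza.Pipeline("en", processors="tokenize,pos,lemma,depparse")

doc = nlp("Apple stealing user information via Face Time.")
for sent in doc.sentences:
    for word in sent.words:
        # word.head is a 1-based index into sent.words; 0 denotes the root
        head = sent.words[word.head - 1].text if word.head > 0 else "ROOT"
        print(f"{word.deprel}({head}, {word.text})")
```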
For each model, we performed 10-fold cross validation 10 times. We used accuracy to
evaluate the performance of the different models. We conducted five groups of sentiment
classification experiments with different feature settings. Group 1 (1G) uses only word unigrams; Group 2
(1G+2G) uses word unigrams and bigrams; Group 3 (1G+2G+3G) uses word unigrams,
bigrams, and trigrams; Group 4 (POS) uses POS-tagged words; Group 5 (1G+DEP) uses
word unigrams and dependencies. Each dependency is treated as a single term.
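A minimal sketch of this evaluation harness is shown below, with a random stand-in feature matrix in place of the real Stocktwits features:

```python
# Sketch of the 10x10-fold cross-validation harness; X and y are random
# stand-ins for the real feature matrix and bullish/bearish labels.
import numpy as np
from sklearn.model_selection import RepeatedStratifiedKFold, cross_val_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(200, 50))  # toy term-count matrix
y = rng.integers(0, 2, size=200)        # toy labels (0 = bearish, 1 = bullish)

cv = RepeatedStratifiedKFold(n_splits=10, n_repeats=10, random_state=0)
for name, clf in [("SVM", LinearSVC()), ("NB", MultinomialNB())]:
    scores = cross_val_score(clf, X, y, cv=cv, scoring="accuracy")
    print(f"{name}: mean accuracy = {scores.mean():.3f}")
```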
Prior research proposed the use of back-off dependency features in sentence-level
subjectivity detection, i.e., classifying sentences as opinion or non-opinion [Joshi
and Penstein-Rosé 2009]. A back-off dependency is a dependency triplet with the head
or the modifier replaced by its POS tag. They found that the combination of unigrams
and back-off dependency features significantly outperformed the aforementioned
baselines and even the combination of unigrams and lexicalized dependencies (the
counterpart of our Group 5). It would be interesting to examine the usefulness of
back-off dependencies. Thus, we created two additional baseline groups. Group 6
(1G+M-BO) combines unigrams and back-off dependencies with modifier words replaced
by their POS tags. Group 7 (1G+H-BO) consists of unigrams and back-off dependencies
with head words replaced by their POS tags.
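To make the two back-off variants concrete, the following hypothetical helpers illustrate the transformation, following the convention of writing the head word first and the modifier second:

```python
# Hypothetical helpers illustrating the two back-off variants: replacing
# the head word (H-BO) or the modifier word (M-BO) with its POS tag.
def h_back_off(rel, head_pos, modifier):
    return f"{rel}({head_pos}, {modifier})"   # e.g. amod(NN, great)

def m_back_off(rel, head, modifier_pos):
    return f"{rel}({head}, {modifier_pos})"   # e.g. amod(camera, JJ)

# For the dependency amod(camera, great) with POS tags camera/NN, great/JJ:
print(h_back_off("amod", "NN", "great"))   # -> amod(NN, great)
print(m_back_off("amod", "camera", "JJ"))  # -> amod(camera, JJ)
```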
To ensure the robustness of the results, we used two popular text classification
methods, linear Support Vector Machine (SVM) and Naïve Bayes (NB). We also quantified
the features using both frequency (continuous) and presence (binary) values, with
and without inverse document frequency. Because text classification often needs to
deal with high-dimensional data, feature selection may help improve classification
accuracy. Combining dependencies and unigrams would significantly increase the feature
space. Thus, we ran each experiment again with an additional feature selection
procedure to examine the sensitivity of the results to feature selection. In this study,
we chose Chi-squared-based feature selection and information gain-based feature
selection. Both have been shown to be effective in text classification [Chou et al. 2010]. In
Chi-squared-based feature selection, the Chi-squared value between each feature and
the sentiment class was calculated and ranked. In information gain-based selection, the
information gain (or reduction in uncertainty) between each feature and the sentiment
class was calculated and ranked. For each run, the top 10% features were retained to
train and test the model. Although the choice of the threshold, 10%, is arbitrary, it was
used for all groups and settings. Thus, this choice will not cause consistent bias for the
purpose of benchmarking.
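The following sketch illustrates this selection step with scikit-learn, using mutual_info_classif as an information-gain-style criterion and a random stand-in feature matrix:

```python
# Sketch of the top-10% feature selection step; chi2 requires non-negative
# features, and mutual_info_classif stands in for information gain.
import numpy as np
from sklearn.feature_selection import SelectPercentile, chi2, mutual_info_classif

rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(200, 50))  # toy term-count matrix
y = rng.integers(0, 2, size=200)        # toy sentiment labels

for name, score_fn in [("Chi-squared", chi2),
                       ("information gain", mutual_info_classif)]:
    selector = SelectPercentile(score_func=score_fn, percentile=10)
    X_top = selector.fit_transform(X, y)
    print(name, X_top.shape)  # 50 features -> top 5 retained
```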
Table IV shows the classification accuracy without feature selection. The highest
accuracy in each setting is displayed in bold. SVM results were generally better than
NB results, likely because the regularization used in SVM allows it to work well with
high-dimensional data. Using continuous features did not differ significantly from
using binary features. This can be attributed to the fact that many terms occur only
once in a short microblog message. Using only unigram features gave the lowest
accuracy across all settings. Adding bigrams significantly improved the accuracy, by
about 2%, across the different settings. Further adding trigrams led to only slight
improvement (<0.5%). This supports our conjecture that capturing syntactical
information helps in classifying the sentiment of short texts. The performance of POS
features was better than that of unigrams but worse than that of the combination of
unigrams and bigrams. The combination of unigrams and back-off dependencies did not
consistently outperform the combination of unigrams, bigrams, and trigrams.
The combination of unigrams and dependencies gave the best performance. All SVM
results using this feature set achieved over 1% improvement compared to the second
best (1G+2G+3G). The difference between the accuracy of all runs in the two
groups is statistically significant (p < 0.001). This difference translates to more than
1,709 correctly predicted messages in our data set. Given the large message volume on
microblogging websites, this improvement is also practically significant.
Table V shows the results of Groups 1–7 when using only the top 10% of features
based on Chi-squared values. The highest accuracy in each setting is displayed in bold.
The simple Chi-squared feature selection procedure did not improve the classification
accuracy. Nonetheless, our purpose is to show that dependency features improve
classification accuracy over the other baselines even after feature selection.
Table VII. Classification Accuracy (%) of 2G, H-BO, M-BO, and DEP

Classifier  Measure     IDF  2G     H-BO   M-BO   DEP
NB          Continuous  No   75.50  74.25  71.18  76.06
NB          Continuous  Yes  75.43  74.40  71.45  75.97
NB          Binary      No   75.56  74.24  71.26  76.10
NB          Binary      Yes  75.47  74.39  71.50  76.00
SVM         Continuous  No   73.55  73.50  68.40  74.50
SVM         Continuous  Yes  74.92  75.00  70.51  76.08
SVM         Binary      No   73.58  73.48  68.56  74.52
SVM         Binary      Yes  74.92  74.99  70.53  76.13
Consistent with the results in Table IV, using only unigram features yielded the lowest
accuracy. Adding bigrams led to significant improvement, and further adding trigrams
led to only slight improvement. The performance of POS fell between that of 1G and
that of 1G+2G, except in two settings,
where it outperformed all other groups. The combination of unigrams and back-off
dependencies did not consistently outperform the combination of unigrams, bigrams,
and trigrams. Similar patterns have been observed in the results using information
gain-based feature selection (Table VI).
After feature selection, using either Chi-squared values or information gain, the
combination of unigrams and dependencies outperformed all other groups in six of
eight settings. The improvement in the best-performing group over the second best is
over 2%; the improvement is statistically significant (p < 0.001). This shows that the
usefulness of dependency features is robust to feature selection.
Dependency is most similar to bigram since both of them can capture the syntactical
relation between two words. As we mentioned earlier, dependency can further identify
remote word relations. To compare the usefulness of dependency directly against that of
bigram in sentiment classification, we conducted two additional groups of experiments,
using bigram only (2G) and using dependency only (DEP), respectively. We also included
the back-off dependency features (H-BO and M-BO) as baselines. Table VII shows the
classification accuracy without feature selection. The highest accuracy in each setting
is displayed in bold. DEP improved accuracy by about 1% in most settings, compared
to 2G, H-BO, and M-BO (p < 0.001). Table VIII shows the classification accuracy using
the top 10% features based on Chi-squared values. DEP outperformed the baselines
in all settings (p < 0.001). Table IX shows the results using information gain-based
feature selection. DEP outperformed all baselines except in two settings.
5. RELATED STUDIES
There have been a number of studies that explored the usefulness of dependency
structure in text classification (summarized in Table X). Wilson et al. [2004] proposed
syntactical features to classify the strength of opinions as neutral, low, medium, or high.
Table VIII. Classification Accuracy (%) of 2G, H-BO, M-BO, and DEP
(Using top 10% features based on Chi-squared values)

Classifier  Measure     IDF  2G     H-BO   M-BO   DEP
NB          Continuous  No   73.47  73.60  70.57  75.10
NB          Continuous  Yes  73.49  73.75  70.73  75.07
NB          Binary      No   73.52  73.57  70.61  75.11
NB          Binary      Yes  73.52  73.72  70.76  75.11
SVM         Continuous  No   73.08  73.86  69.52  73.96
SVM         Continuous  Yes  74.26  74.95  70.95  75.93
SVM         Binary      No   73.16  73.88  69.50  74.09
SVM         Binary      Yes  74.30  74.94  70.95  76.00
Table IX. Classification Accuracy (%) of 2G, H-BO, M-BO, and DEP
(Using top 10% features based on information gain)

Classifier  Measure     IDF  2G     H-BO   M-BO   DEP
NB          Continuous  No   71.14  72.40  59.51  73.06
NB          Continuous  Yes  71.16  72.34  69.60  73.11
NB          Binary      No   71.22  72.37  69.54  73.08
NB          Binary      Yes  71.22  72.31  69.59  73.13
SVM         Continuous  No   69.54  72.08  67.61  70.68
SVM         Continuous  Yes  70.91  73.21  69.12  73.24
SVM         Binary      No   69.60  72.13  67.73  70.72
SVM         Binary      Yes  70.97  73.22  69.17  73.29
The feature set they proposed includes POS-tagged words, dependencies, and word location
in a dependency tree. The experiments on a manually annotated news corpus showed
about 5% improvement over the baselines. However, it is not clear whether the improvement
can be attributed to the dependency features. Ng et al. [2006] proposed using three
types of dependency relations as features in classifying customer reviews as positive
or negative: adjective-noun, subject-verb, and verb-object. However, these features did
not help with classification accuracy. In Wilson et al.
[2009], dependency information related to a word was used to classify the sentiment of
the word. However, the features cannot be applied to document-level classification.
Joshi and Penstein-Rosé [2009] used dependency features to detect whether a sentence
contains an opinion. They proposed back-off dependency features, which replace one
or both words in a dependency with the corresponding POS tags. One example is amod(NN,
great), which indicates a dependency in which the word “great” modifies a noun.
This example feature works well for identifying the similar sentiment in the following
two sentences:
The camera is great.
The MP3 player is great.
However, such a back-off dependency fails to distinguish between the following
two phrases:
Cure cancer.
Have cancer.
Both phrases are represented as dobj(VB, cancer), i.e., a dependency in which the word
“cancer” is the direct object of a verb, yet they have very different sentiment
polarities. Joshi and Penstein-Rosé [2009] also proposed full back-off features
(e.g., amod(NN, ADJ)) and n-gram back-off features. They found that the back-off
features outperformed the unigram baseline. However, they did not find additional
usefulness of the back-off features beyond the simple dependency features (with words).
Nor did their study attempt to classify positive against negative sentiment.
Pak and Paroubek [2010] were among the first to explore the usefulness of dependency
in sentiment classification. The features they used include two-node and three-node
dependency subgraphs. These subgraphs were selected using manually created
rules. Moreover, they replaced words other than adjectives and verbs with a wildcard.
The idea is similar to the back-off dependencies in Joshi and Penstein-Rosé [2009].
They tested the features on a data set of movie reviews. The experimental results did not
show improvement over the bag-of-words baseline reported in Matsumoto et al. [2005].
Nakagawa et al. [2010] modeled each dependency in a sentence with a hidden variable.
REFERENCES
A. Abbasi, S. France, Z. Zhang, and H. Chen. 2011. Selecting attributes for sentiment classification using
feature relation networks. IEEE Trans. Knowl. Data Eng. 23, 3. 447–462.
A. Bifet and E. Frank. 2010. Sentiment knowledge discovery in Twitter streaming data. In Proceedings of
the 13th International Conference on Discovery Science. Springer. 1–15.
J. Blitzer, R. McDonald, and F. Pereira. 2006. Domain adaptation with structural correspondence learning. In
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. Association
for Computational Linguistics. 120–128.
E. Charniak. 1996. Tree-bank grammars. In Proceedings of the National Conference on Artificial Intelligence.
1031–1036.
D. Chen and C. D. Manning. 2014. A fast and accurate dependency parser using neural networks. In Pro-
ceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP’14).
740–750.
H. Chen, R. H. Chiang, and V. C. Storey. 2012. Business intelligence and analytics: From big data to big
impact, MIS Quart. 36, 4. 1165–1188.
N. Chomsky. 1965. Aspects of the Theory of Syntax. MIT Press.
N. Chomsky. 2002. Syntactic Structures. Walter de Gruyter.
C.-H. Chou, A. P. Sinha, and H. Zhao. 2010. A hybrid attribute selection approach for text classification. J.
Assoc. Informat. Syst. 11, 9. 491–518.
M. A. Covington. 2001. A fundamental algorithm for dependency parsing. In Proceedings of the 39th Annual
ACM Southeast Conference. Citeseer. 95–102.
M.-C. De Marneffe, B. MacCartney, and C. D. Manning. 2006. Generating typed dependency parses
from phrase structure parses. In Proceedings of the Language Resources and Evaluation Conference
(LREC’06). 449–454.
M.-C. De Marneffe and C. D. Manning. 2008. The Stanford typed dependencies representation. In Coling 2008:
Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation. Association for
Computational Linguistics. 1–8.
M. Hu and B. Liu. 2004. Mining and summarizing customer reviews. In Proceedings of the Tenth ACM
SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM. 168–177.
M. Johnson. 1998. PCFG models of linguistic tree representations. Computat. Linguist. 24, 4. 613–632.
M. Joshi and C. Penstein-Rosé. 2009. Generalizing dependency features for opinion mining. In Proceedings of
the ACL-IJCNLP 2009 Conference Short Papers. Association for Computational Linguistics. 313–316.
D. Klein and C. D. Manning. 2003. Accurate unlexicalized parsing. In Proceedings of the 41st Annual Meeting
on Association for Computational Linguistics, Vol. 1. Association for Computational Linguistics. 423–
430.
B. Liu. 2012. Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5, 1. 1–167.
M. P. Marcus, M. A. Marcinkiewicz, and B. Santorini. 1993. Building a large annotated corpus of English:
The Penn Treebank. Computat. Linguist. 19, 2. 313–330.
J. H. Martin and D. Jurafsky. 2000. Speech and Language Processing. International Edition.
S. Matsumoto, H. Takamura, and M. Okumura. 2005. Sentiment classification using word sub-sequences
and dependency sub-trees. In Advances in Knowledge Discovery and Data Mining. Springer. 301–311.
T. Nakagawa, K. Inui, and S. Kurohashi. 2010. Dependency tree-based sentiment classification using CRFs
with hidden variables. In Human Language Technologies: The 2010 Annual Conference of the North
American Chapter of the Association for Computational Linguistics. Association for Computational Lin-
guistics. 786–794.
V. Ng, S. Dasgupta, and S. Arifin. 2006. Examining the role of linguistic knowledge sources in the automatic
identification and classification of reviews. In Proceedings of the COLING/ACL on Main Conference
Poster Sessions. Association for Computational Linguistics. 611–618.
T. L. Ngo-Ye and A. P. Sinha. 2012. Analyzing online review helpfulness using a regressional relieff-enhanced
text mining method. ACM Trans. Manag. Inform. Syst. 3, 2. 1–20.
O. Owoputi, B. O’Connor, C. Dyer, K. Gimpel, N. Schneider, and N. A. Smith. 2013. Improved part-of-
speech tagging for online conversational text with word clusters. In Proceedings of the 2013 Conference
of the North American Chapter of the Association for Computational Linguistics: Human Language
Technologies (NAACL-HLT’13). 380–390.
A. Pak and P. Paroubek. 2010. Twitter as a corpus for sentiment analysis and opinion mining. In Proceedings
of the Language Resources and Evaluation Conference (LREC’10).
B. Pang and L. Lee. 2008. Opinion Mining and Sentiment Analysis. Now Publishers.
B. Pang, L. Lee, and S. Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learn-
ing techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language
Processing, Vol. 10. Association for Computational Linguistics. 79–86.
M.-F. Tsai, C.-J. Wang, and P.-C. Chien. 2016. Discovering finance keywords via continuous-space language
models. ACM Trans. Manag. Inform. Syst. 7, 3. 1–17.
D. Vilares, M. A. Alonso, and C. Gómez-Rodríguez. 2015. On the usefulness of lexical and syntactic processing
in polarity classification of Twitter messages. J. Assoc. Inform. Sci. Technol. 66, 9. 1799–1816.
T. Wilson, J. Wiebe, and P. Hoffmann. 2009. Recognizing contextual polarity: An exploration of features for
phrase-level sentiment analysis. Comput. Linguist. 35, 3. 399–433.
T. Wilson, J. Wiebe, and R. Hwa. 2004. Just how mad are you? Finding strong and weak opinion clauses. In
Proceedings of the 19th National Conference on Artificial Intelligence. 761–767.
Y. Yu, W. Duan, and Q. Cao. 2013. The impact of social and conventional media on firm equity value: A
sentiment analysis approach. Decis. Supp. Syst. 55, 4. 919–926.
D. Zimbra, H. Chen, and R. F. Lusch. 2015. Stakeholder analyses of firm-related web forums: Applications
in stock return prediction. ACM Trans. Manag. Inform. Syst. 6, 1.