Sarcasm Analysis Using Social Media: A Literature Review: ISSN (ONLINE) : 2250-0758, ISSN (PRINT) : 2394-6962
Sarcasm Analysis Using Social Media: A Literature Review: ISSN (ONLINE) : 2250-0758, ISSN (PRINT) : 2394-6962
researchers are based on distinct features and structure of the taking place from the last tweet is included in the current
text. tweet (for eg, positive ! Negative, negative! Positive) as a
Reganti et al [10] have discussed four types of feature (1 feature).Sarcasm as a complex form of expression
satire that are present in the English language that are: Readability
Exaggeration: It is to enlarge, accentuate or portray As sarcasm is widely standardized to be hard to read
something beyond mundane limits so as to highlight faults, and understand, the author adapts identical readability tests
e.g.: “I’m super exhilarated today!! So much that I’d kill to measure the degree of complexity and understandability of
myself” the tweet.
Incongruity: Sarcasm as a means of assigning emotion
It signifies to present things that are malapropos or 1. Mood
preposterous in cognation to its circumventions, e.g.: “The Mood represents the utilizer's state of emotion.
back camera of the phone is so good that I can capture every Naturally, the mood of the individual may be indicative of
atom of scenery”. his propensity to utilize mordancy; if the utilizer is in a
Reversal: deplorable (negative) mood, he could opt to express it in the
Which is presenting diametrical to what is to form of a sarcastic tweet. Consequently, his mood is judged
authentically convey by the user, e.g.: “I’m profoundly utilizing sentiment expressed in his past tweets... They
disappointed. Not as expected! It’s just astounding how the captured the mood utilizing last tweets.
flash works!” 2. Affect and sentiment
Parody: Sarcasm is a coalescence of affect and sentiment
Which is to imitate the demeanor/slang and/or style expression, and consequently, the effect and the sentiment
of some person, place or thing. expressed in mordant tweets are examined.
Jena et al [2] defines Six types of sarcasm which are Sarcasm as the possible function of ease
discussed that occur in the text- 1. Familiarity with the language
T1- Contrasting comments between positive view and Naturally, one would expect a utilizer who utilizes a
negative view. form of language as intricate as sarcasm to have good
T2- Contrasting comments between negative view and command over the language. Consequently, the author
positive view. measures the utilizer’s language skills with features that are
T3- Fact Negation – i.e. text contradicting a fact. inspired by standardized language proficiency Cloze tests. In
T4- Likes and Dislikes Prediction – i.e. behavior based. cloze tests, proficiency is evaluated predicated on lexicon,
T5- Lexical Analysis – i.e. sarcasm hashtag based. grammar, dictation, and reading levels.
T6- Temporal Knowledge i.e. Extracting tweets contradicting 2. Familiarity with the environment
facts about the event. The users can express sarcasm better when they are
Liu et al [4] introduced the concept of Sarcasm which is well acquainted with their environment. Just like people are
discussed below- less liable to utilize sarcasm in an incipient, unfamiliar
Contrast of various sentiments setting, users take time to get habituated with Twitter afore
A complex form of expression posting mordant tweets. Author measure a utilizer's
Means of assigning emotion familiarity with Twitter in terms of his utilization familiarity,
Possible function of ease parlance familiarity, and convivial activity.
A form of written manifestation Sarcasm as a form of written manifestation
Sarcasm as a contrast of various sentiments 1. Prosodic variations
1. Contrasting connotations The users often reiterate letters in words to stress
A mundane denotes of expressing mordancy is to and over accentuate certain components of the tweet (for
utilize words with distinct construals within the same tweet. example, sooooo, awesomeeee) to denote that they mean
In the example, I dote getting spam emails!, spam diametrical to what is indited.
conspicuously has a negative connotation while love is 2. Structural variations
Overwhelmingly positive and to model such occurrences, It is observed that mordant tweets sometimes have a
they construct features predicated on affect and sentiment certain construction wherein the commentor's views are
scores. verbalized in the first few words of the tweet, while in the
2. Contrasting present with the past later components, an explication of a particular scenario is
Sometimes, the utilizer may set up a contrasting put forth, e.g., I dote it when my friends ignore me.
context in his antecedent tweet and then, opt to utilize a
mordacious verbalization in his current tweet. To model such
demeanor, they obtain the sentiment expressed by the utilizer
(i.e., positive, negative, neutral) in the antecedent tweet and
the current tweet. Then, the type of sentiment transition
III. FEATURE EXTRACTION and tweet based feature. However exclaimed on some other
PROCESSES particular type of social graph-based features such as local
clustering coefficient, between centrality, and distance,
To analyze the sarcasm in texts posted on social media, according to them it is not possible for now. Tayal et al [7]
researchers suggested different features which can be used to worked on features such as used punctuation marks! ,?
distinguish the sarcastic comments from normal text. Characters in the sentence, #sarcasm tag and #irony,
Lunando et al[5] have given several features which emoticons and adjective and verb in conjunction with ! on
are taken from the pre-processed text which are Twitter. Along with these, they also considered emoticons
Unigram(According to the author unigram is more suitable with verbs and adjectives since the mere use of emoticons
for Indonesian social media text since the grammars used in like “:P” and “;)”. According to the author the emoticons can
Indonesian social media texts are various and informal.), also be used for humor sentiment analysis.
Negativity (This feature represents the percentage of the Liu et al[4] obtained a set of sarcastic tweets using
negative sentiment in the topic of the text message.) ,Number keywords #sarcasm and #not, altering out non-English tweets
of interjection words (This feature shows the number of and retweets they limited their analysis to tweets which
interjection words from the text such as “aha”, “bah”, “nah”, contain more than three words as they found that tweets with
“wew”, “wow”, “yay”, “uh”, etc.). fewer words were very noisy or clichéd (e.g., yeah, right!
Bouazizi et al [3] extract 2 sets of features: one #sarcasm). Along with these various kinds of features, Rosso
qualified as “non-textual”( Non-textual features: From the et al[6] focused on various tweets which have #irony , not,
“raw” tweets we first extract 6 features by counting the and #sarcasm words in their context. The main concern of
number of positive and negative Hashtags, that of positive the author is to differentiate the ironic tweets from sarcastic
and negative Emoticons, and that of positive and negative ones.
slang words.) and one qualified as “textual”( the tag “NOT ”
(e.g., “not”, “never”, etc.) , positive words such as love, IV. VARIOUS APPROACHES FOR
happy, and negative emotional content such as hate, sad etc.), SARCASM ANALYSIS
Jena et al [2] used these features, lexical, pragmatics and
hyperbole features to recognize sarcasm in text and also A. Time-Based Approach-
discussed the role of different lexical factors, such as lexical This is an approach which chao et al [1] used in
feature includes text properties such as uni-gram, bi-gram, order to predict the behavior of the sarcastic tweets done
tri-gram, and n-gram. before and after a certain fixed time and studied the behavior.
Chao et al[1] worked on various types of features B. Classification Based Approach-
such as age, no. of followers, following/friends, favorites, This approach is commonly used to identify the
lists, tweets sent, retweets done by n twitter users, hashtags target word or sentence as sarcastic or literal. Different
included, user mentions included, URLs included, no. of classifiers mainly machine learning classifiers such as SVM,
characters and no. of digits in a particular tweet also they Naïve Bayes, Maximum entropy, etc. are used, Authors [3],
broadly classified into two categories i.e. user based feature [5], [6] who are using this classification approach.
Non textual(positive and negative (Hashtags, emoticons, slang words)), textual(the tag
2. bouazizi et al [3] “NOT ” (e.g., “not”,
“never”, etc.) , positive words such as love, happy, and negative emotional content such
as hate, sad etc. )
age, no. of followers, following/friends, favorites, lists, tweets sent, retweets done by n
4. Chao et al [1] twitter users,
hashtags included, user mentions included, URLs included, no. of characters and no. of
digits in a particular
Tweet
Used punctuation marks ! , ? , Characters in the sentence, #sarcasm tag and #irony,
5. Tayal et al[7] emoticons and adjective
and verb in conjunction with ! in Twitter
6. Liu et al[4] a set of sarcastic tweets using keywords #sarcasm and #not
Lunando et al[5] Naïve Bayes, Maximum Entropy, and Support Vector Machine.
Bouazizi et al[3] Naive Bayes, Support Vector Machine (SVM), and Maximum Entropy classifiers.
Chao et
al[1] Trend Micro’s WRS, manual inspection.
SCUBA (n-gram,
Liu et al Streaming API, Behavioral Sarcasm Detection behavioral Using SCUBA, social media
29 Copyright © 2018. IJEMR. All Rights Reserved.
www.ijemr.net ISSN (ONLINE): 2250-0758, ISSN (PRINT): 2394-6962
V. CONCLUSION AND FUTURE Micros WRS, manual inspection etc. It has been found that
WORK various techniques applied for sarcasm analysis are domain and
language specific.
In this paper we did a detailed study of sarcasm Taking a finite data set and perform sarcasm detection
detection from various Research papers, we found that the analysis is one of the limitation found, and work on dynamic
data set selected by them varies in each research. Giving data should be focused in future also we found that the sarcasm
the possibility to focus more on a different data set and detection does not limit till text mining but other areas can also
gives a great opportunity in data mining field. Secondly, be focused on such as Audio mining, video mining and
we have discussed various techniques to carry out sarcasm recognition etc. and work in this field can be focused further.
analysis on Twitter data including Naïve Bayes, Maximum
Entropy, and Support Vector Machine, PBLGA (Parsing
Based Lexicon Generation Algorithm), SCUBA, Trend