• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

ICWSM – a great catchy name: Semi-supervised recognition of sarcastic sentences in online product reviews,” (2010)

by Oren Tsur, Dmitry Davidov, Ari Rappoport
Venue:in Proceedings of AAAI,
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 37
Next 10 →

Sentiment Analysis and Opinion Mining

by Bing Liu , 2012
"... ..."
Abstract - Cited by 170 (11 self) - Add to MetaCart
Abstract not found

Enhanced Sentiment Learning Using Twitter Hashtags and Smileys

by Dmitry Davidov, Oren Tsur
"... Automated identification of diverse sentiment types can be beneficial for many NLP systems such as review summarization and public media analysis. In some of these systems there is an option of assigning a sentiment value to a single sentence or a very short text. In this paper we propose a supervis ..."
Abstract - Cited by 120 (3 self) - Add to MetaCart
Automated identification of diverse sentiment types can be beneficial for many NLP systems such as review summarization and public media analysis. In some of these systems there is an option of assigning a sentiment value to a single sentence or a very short text. In this paper we propose a supervised sentiment classification framework which is based on data from Twitter, a popular microblogging service. By utilizing 50 Twitter tags and 15 smileys as sentiment labels, this framework avoids the need for labor intensive manual annotation, allowing identification and classification of diverse sentiment types of short texts. We evaluate the contribution of different feature types for sentiment classification and show that our framework successfully identifies sentiment types of untagged sentences. The quality of the sentiment identification was also confirmed by human judges. We also explore dependencies and overlap between different sentiment types represented by smileys and Twitter hashtags. 1
(Show Context)

Citation Context

... analysis in the phrasal and sentence level, see (Yu and Hatzivassiloglou, 2003; Wilson et al., 2005; McDonald et al., 2007; Titov and McDonald, 2008a; Titov and McDonald, 2008b; Wilson et al., 2009; =-=Tsur et al., 2010-=-) among others. Only a few studies analyze the sentiment and polarity of tweets targeted at major brands. Jansen et al. (2009) used a commercial sentiment analyzer as well as a manually labeled corpus...

Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon

by Dmitry Davidov, Oren Tsur
"... Sarcasm is a form of speech act in which the speakers convey their message in an implicit way. The inherently ambiguous nature of sarcasm sometimes makes it hard even for humans to decide whether an utterance is sarcastic or not. Recognition of sarcasm can benefit many sentiment analysis NLP applica ..."
Abstract - Cited by 51 (3 self) - Add to MetaCart
Sarcasm is a form of speech act in which the speakers convey their message in an implicit way. The inherently ambiguous nature of sarcasm sometimes makes it hard even for humans to decide whether an utterance is sarcastic or not. Recognition of sarcasm can benefit many sentiment analysis NLP applications, such as review summarization, dialogue systems and review ranking systems. In this paper we experiment with semisupervised sarcasm identification on two very different data sets: a collection of 5.9 million tweets collected from Twitter, and a collection of 66000 product reviews from Amazon. Using the Mechanical Turk we created a gold standard sample in which each sentence was tagged by 3 annotators, obtaining F-scores of 0.78 on the product reviews dataset and 0.83 on the Twitter dataset. We discuss the differences between the datasets and how the algorithm uses them (e.g., for the Amazon dataset the algorithm makes use of structured information). We also discuss the utility of Twitter #sarcasm hashtags for the task. 1
(Show Context)

Citation Context

...r and Rappoport, 2009). All systems currently fail to correctly classify the sentiment of sarcastic sentences. In this paper we utilize the semi-supervised sarcasm identification algorithm (SASI) of (=-=Tsur et al., 2010-=-). The algorithm employs two modules: semi supervised pattern acquisition for identifying sarcastic patterns that serve as features for a classifier, and a classification stage that classifies each se...

A multidimensional approach for detecting irony

by Antonio Reyes, Paolo Rosso, Tony Veale, Ó Springer, Science+business Media B. V, A. Reyes, P. Rosso - in Twitter. Language Resources and Evaluation , 2013
"... Abstract Irony is a pervasive aspect of many online texts, one made all the more difficult by the absence of face-to-face contact and vocal intonation. As our media increasingly become more social, the problem of irony detection will become even more pressing. We describe here a set of textual featu ..."
Abstract - Cited by 34 (4 self) - Add to MetaCart
Abstract Irony is a pervasive aspect of many online texts, one made all the more difficult by the absence of face-to-face contact and vocal intonation. As our media increasingly become more social, the problem of irony detection will become even more pressing. We describe here a set of textual features for recognizing irony at a linguistic level, especially in short texts created via social media such as Twitter postings or ‘‘tweets’’. Our experiments concern four freely available data sets that were retrieved from Twitter using content words (e.g. ‘‘Toyota’’) and user-generated tags (e.g. ‘‘#irony’’). We construct a new model of irony detection that is assessed along two dimensions: representativeness and relevance. Initial results are largely positive, and provide valuable insights into the figurative issues facing tasks such as sentiment analysis, assessment of online reputations, or decision making.

Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis

by Diana Maynard , Mark A Greenwood - in Proceedings of LREC , 2014
"... Abstract Sarcasm is a common phenomenon in social media, and is inherently difficult to analyse, not just automatically but often for humans too. It has an important effect on sentiment, but is usually ignored in social media analysis, because it is considered too tricky to handle. While there exis ..."
Abstract - Cited by 10 (2 self) - Add to MetaCart
Abstract Sarcasm is a common phenomenon in social media, and is inherently difficult to analyse, not just automatically but often for humans too. It has an important effect on sentiment, but is usually ignored in social media analysis, because it is considered too tricky to handle. While there exist a few systems which can detect sarcasm, almost no work has been carried out on studying the effect that sarcasm has on sentiment in tweets, and on incorporating this into automatic tools for sentiment analysis. We perform an analysis of the effect of sarcasm scope on the polarity of tweets, and have compiled a number of rules which enable us to improve the accuracy of sentiment analysis when sarcasm is known to be present. We consider in particular the effect of sentiment and sarcasm contained in hashtags, and have developed a hashtag tokeniser for GATE, so that sentiment and sarcasm found within hashtags can be detected more easily. According to our experiments, the hashtag tokenisation achieves 98% Precision, while the sarcasm detection achieved 91% Precision and polarity detection 80%.
(Show Context)

Citation Context

... Table 2. The sarcasm detection performed very well at 91% Precision and Recall. Out 6http://www.trend-miner.eu 4241 of 400 tweets, there were 91 sarcastic sentences, which is quite a high proportion, and many of these were not indicated by any kind of sarcasm marker. Twitter corpus P R F1 Opinionated 65.69 77.31 71.02 Opinion+polarity 52.61 61.92 56.89 Polarity only 80.08 80.03 80.05 Sarcasm detection 91.03 91.04 91.03 Table 2: Experiments on General Tweets 6. Related Work There have been a few recent works attempting to detect sarcasm in tweets and other user-generated content. Tsur et al. (Tsur et al., 2010) use a semi-supervised approach to classify sentences in online product reviews into different sarcastic classes, and report an F-measure of 82.7% on the binary sarcasm detection task (although Precision is much higher than Recall). Liebrecht et al. (Liebrecht et al., 2013) use the Balanced Winnow Algorithm to classify Dutch tweets as sarcastic or not, with 75% accuracy, training over a set of tweets with the #sarcasm hashtag. Reyes et al. (Reyes et al., 2013) use a similar technique on English tweets to detect ironic tweets, using the #irony hashtag, with 70% accuracy. Davidov et al. (Davidov...

Heart and Soul: Sentiment Strength Detection in the Social Web with SentiStrength 1

by Mike Thelwall
"... Emotions are important in communication to effectively convey messages and to understand reactions to messages. Large scale studies of communication need methods to detect sentiment in order to investigate or model the processes involved. This chapter describes the sentiment strength detection progr ..."
Abstract - Cited by 7 (0 self) - Add to MetaCart
Emotions are important in communication to effectively convey messages and to understand reactions to messages. Large scale studies of communication need methods to detect sentiment in order to investigate or model the processes involved. This chapter describes the sentiment strength detection program SentiStrength that was developed during the CyberEmotions project to detect the strength of sentiments expressed in social web texts. SentiStrength uses a lexical approach that exploits a list of sentiment-related terms and has rules to deal with standard linguistic and social web methods to express sentiment, such as emoticons, exaggerated punctuation and deliberate misspellings. This chapter also describes how SentiStrength can be refined for particular topics and contexts and how variants are created for different languages. The chapter also briefly describes some studies that have applied SentiStrength to analyse trends in Twitter and You Tube comments.
(Show Context)

Citation Context

... sarcasm detection, however. Book reviews are one example due to the repeated use of stock sarcastic types of phrase, such as “this book has a great cover” that can be learned from a training corpus (=-=Tsur, Davidov, & Rappoport, 2010-=-). Sarcasm in Portuguese political discussions can also be identified through a combination of features including the use of a politician’s name in diminutive form (Carvalho, Sarmento, Silva, & de Oli...

Automated Content Analysis of Online Discussion Transcripts

by Vitomir Kovanovic, Srecko Joksimovic, Dragan Gasevic, Marek Hatala
"... In this paper we present the results of an exploratory study that ex-amined the use of text mining and text classification for the automa-tion of the content analysis of discussion transcripts in the context of distance education. We used Community of Inquiry (CoI) frame-work and focused on the cont ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
In this paper we present the results of an exploratory study that ex-amined the use of text mining and text classification for the automa-tion of the content analysis of discussion transcripts in the context of distance education. We used Community of Inquiry (CoI) frame-work and focused on the content analysis of the cognitive pres-ence construct given its central position within the CoI model. Our results demonstrate the potentials of proposed approach; The de-veloped classifier achieved 58.4 % accuracy and Cohen’s Kappa of 0.41 for the 5-category classification task. In this paper we analyze different classification features and describe the main problems and lessons learned from the development of such a system. Further-more, we analyzed the use of several novel classification features that are based on the specifics of cognitive presence construct and our results indicate that some of them significantly improve classi-fication accuracy. 1.
(Show Context)

Citation Context

...sence is a latent construct and not clearly observable, we based our work on the previous work that focused also on mining latent constructs. The work done on opinion mining of online product reviews =-=[3, 15, 23]-=-, gender style differences [13] and sentiment analysis [4] are some of the main areas of research that informed our classification approach. The text classification tasks have been studied in the cont...

Authorship Attribution of Micro-Messages

by Roy Schwartz, Oren Tsur
"... Work on authorship attribution has tradition-ally focused on long texts. In this work, we tackle the question of whether the author of a very short text can be successfully iden-tified. We use Twitter as an experimental testbed. We introduce the concept of an au-thor’s unique “signature”, and show t ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
Work on authorship attribution has tradition-ally focused on long texts. In this work, we tackle the question of whether the author of a very short text can be successfully iden-tified. We use Twitter as an experimental testbed. We introduce the concept of an au-thor’s unique “signature”, and show that such signatures are typical of many authors when writing very short texts. We also present a new authorship attribution feature (“flexible pat-terns”) and demonstrate a significant improve-ment over our baselines. Our results show that the author of a single tweet can be identified with good accuracy in an array of flavors of the authorship attribution task. 1
(Show Context)

Citation Context

...et al., 2009), detection of synonyms (Turney, 2008a), disambiguation of nominal compound relations (Davidov and Rappoport, 2008a), sentiment analysis (Davidov et al., 2010b) and detection of sarcasm (=-=Tsur et al., 2010-=-). 9 Conclusion The main goal of this paper is to measure to what extent authors of micro-messages can be identified. We have shown that authors of very short texts can be successfully identified in a...

Mining Subjective Knowledge from Customer Reviews: A Specific Case of Irony Detection

by Antonio Reyes, Paolo Rosso
"... The research described in this work focuses on identifying key components for the task of irony detection. By means of analyzing a set of customer reviews, which are considered as ironic both in social and mass media, we try to find hints about how to deal with this task from a computational point o ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
The research described in this work focuses on identifying key components for the task of irony detection. By means of analyzing a set of customer reviews, which are considered as ironic both in social and mass media, we try to find hints about how to deal with this task from a computational point of view. Our objective is to gather a set of discriminating elements to represent irony. In particular, the kind of irony expressed in such reviews. To this end, we built a freely available data set with ironic reviews collected from Amazon. Such reviews were posted on the basis of an online viral effect; i.e. contents whose effect triggers a chain reaction on people. The findings were assessed employing three classifiers. The results show interesting hints regarding the patterns and, especially, regarding the implications for sentiment analysis. 1

Learning Phrase Patterns for Text Classification Using a Knowledge Graph and Unlabeled Data

by Alex Marin , Roman Holenstein , Ruhi Sarikaya , Mari Ostendorf
"... Abstract This paper explores a novel method for learning phrase pattern features for text classification, employing a mapping of selected words into a knowledge graph and self-training over unlabeled data. Using Support Vector Machine classification, we obtain improvements over lexical and fully-su ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Abstract This paper explores a novel method for learning phrase pattern features for text classification, employing a mapping of selected words into a knowledge graph and self-training over unlabeled data. Using Support Vector Machine classification, we obtain improvements over lexical and fully-supervised phrase pattern features in domain and intent detection for language understanding, particularly in conjunction with the use of unlabeled data. Our best results are obtained using unlabeled data filtered for both model training and feature learning based on the confidence of the baseline classifiers.
(Show Context)

Citation Context

...s dependency parse tuples (e.g. [4]). However, such features require running expensive parsing models during the evaluation phase. Another approach is to hand-craft regular expressions, which require human effort but are cheaper to apply; they can also be used to implement business decisions or system constraints, which could be difficult to model in a statistical system. A method which merges the benefits of these two approaches involves learning phrase patterns, extensions of n-grams allowing gaps between words. Such features have been used successfully in a variety of text processing tasks [6, 7, 8, 9, 10]. In this paper, we propose a novel method of learning phrase patterns. Section 2 describes the core phrase pattern learning algorithm and contrasts it with previous work. The experimental setup and results are described in section 3. We discuss additional experiments aimed at system analysis in section 4 and conclude with suggestions for future work in section 5. 2. Phrase Pattern Learning Given a word sequence [wn1 ], we define a phrase pattern as a subsequence [wi1 , . . . , wik ] with 1 ≤ i1 < . . . ik ≤ n. The words wik for each pattern are referred to here as the “pivots” of the pattern....

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University