Results 1 - 10
of
46
You Are What You Tweet: Analyzing Twitter for Public Health
"... Analyzing user messages in social media can measure different population characteristics, including public health measures. For example, recent work has correlated Twitter messages with influenza rates in the United States; but this has largely been the extent of mining Twitter for public health. In ..."
Abstract
-
Cited by 10 (0 self)
- Add to MetaCart
Analyzing user messages in social media can measure different population characteristics, including public health measures. For example, recent work has correlated Twitter messages with influenza rates in the United States; but this has largely been the extent of mining Twitter for public health. In this work, we consider a broader range of public health applications for Twitter. We apply the recently introduced Ailment Topic Aspect Model to over one and a half million health related tweets and discover mentions of over a dozen ailments, including allergies, obesity and insomnia. We introduce extensions to incorporate prior knowledge into this model and apply it to several tasks: tracking illnesses over times (syndromic surveillance), measuring behavioral risk factors, localizing illnesses by geographic region, and analyzing symptoms and medication usage. We show quantitative correlations with public health data and qualitative evaluations of model output. Our results suggest that Twitter has broad applicability for public health research.
Information Credibility on Twitter
"... We analyze the information credibility of news propagated through Twitter, a popular microblogging service. Previous research has shown that most of the messages posted on Twitter are truthful, but the service is also used to spread misinformation and false rumors, often unintentionally. On this pap ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
We analyze the information credibility of news propagated through Twitter, a popular microblogging service. Previous research has shown that most of the messages posted on Twitter are truthful, but the service is also used to spread misinformation and false rumors, often unintentionally. On this paper we focus on automatic methods for assessing the credibility of a given set of tweets. Specifically, we analyze microblog postings related to “trending ” topics, and classify them as credible or not credible, based on features extracted from them. We use features from users ’ posting and re-posting (“re-tweeting”) behavior, from the text of the posts, and from citations to external sources. We evaluate our methods using a significant number of human assessments about the credibility of items on a recent sample of Twitter postings. Our results shows that there are measurable differences in the way messages propagate, that can be used to classify them automatically as credible or not credible, with precision and recall in the range of 70% to 80%.
Beyond Trending Topics: Real-World Event Identification on Twitter
"... User-contributed messages on social media sites such as Twitter have emerged as powerful, real-time means of information sharing on the Web. These short messages tend to reflect a variety of events in real time, making Twitter particularly well suited as a source of real-time event content. In this ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
User-contributed messages on social media sites such as Twitter have emerged as powerful, real-time means of information sharing on the Web. These short messages tend to reflect a variety of events in real time, making Twitter particularly well suited as a source of real-time event content. In this paper, we explore approaches for analyzing the stream of Twitter messages to distinguish between messages about real-world events and non-event messages. Our approach relies on a rich family of aggregate statistics of topically similar message clusters. Large-scale experiments over millions of Twitter messages show the effectiveness of our approach for surfacing real-world event content on Twitter. 1
Measuring Online Service Availability Using Twitter
"... Real-time micro-blogging services such as Twitter are widely recognized for their social dynamics — how they both encapsulate a social graph and propagate information across it. However, the content of this information is equally interesting since it frequently reflects individual experiences with a ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
Real-time micro-blogging services such as Twitter are widely recognized for their social dynamics — how they both encapsulate a social graph and propagate information across it. However, the content of this information is equally interesting since it frequently reflects individual experiences with a broad variety of real-time events. Indeed, events of broad interest are commonly revealed in correlated spikes of semantically-related posting activity. In this paper, we explore one such application this of phenomenon: using Twitter data to infer on-line Internet service availability. We show that simple techniques are sufficient to extract key semantic content from “tweets” (i.e., service X is down) and also filter out extraneous noise. We demonstrate the efficacy of this approach at identifying a range of large-scale service outages in 2009 for popular services such as Gmail, Bing and PayPal. 1
Event Summarization using Tweets
"... Twitter has become exceedingly popular, with hundreds of millions of tweets being posted every day on a wide variety of topics. This has helped make real-time search applications possible with leading search engines routinely displaying relevant tweets in response to user queries. Recent research ha ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
Twitter has become exceedingly popular, with hundreds of millions of tweets being posted every day on a wide variety of topics. This has helped make real-time search applications possible with leading search engines routinely displaying relevant tweets in response to user queries. Recent research has shown that a considerable fraction of these tweets are about “events”, and the detection of novel events in the tweet-stream has attracted a lot of research interest. However, very little research has focused on properly displaying this real-time information about events. For instance, the leading search engines simply display all tweets matching the queries in reverse chronological order. In this paper we argue that for some highly structured and recurring events, such as sports, it is better to use more sophisticated techniques to summarize the relevant tweets. We formalize the problem of summarizing event-tweets and give a solution based on learning the underlying hidden state representation of the event via Hidden Markov Models. In addition, through extensive experiments on real-world data we show that our model significantly outperforms some intuitive and competitive baselines.
Hip and Trendy: Characterizing Emerging Trends on Twitter
"... Twitter, Facebook, and other related systems that we call social awareness streams are rapidly changing the information and communication dynamics of our society. These systems, where hundreds of millions of users share short messages in real time, expose the aggregate interests and attention of glo ..."
Abstract
-
Cited by 4 (2 self)
- Add to MetaCart
Twitter, Facebook, and other related systems that we call social awareness streams are rapidly changing the information and communication dynamics of our society. These systems, where hundreds of millions of users share short messages in real time, expose the aggregate interests and attention of global and local communities. In particular, emerging temporal trends in these systems, especially those related to a single geographic area, are a significant and revealing source of information for, and about, a local community. This study makes two essential contributions for interpreting emerging temporal trends in these information systems. First, based on a large dataset of Twitter messages from one geographic area, we develop a taxonomy of the trends present in the data. Second, we identify important dimensions according to which trends can be categorized, as well as the key distinguishing features of trends that can be derived from their associated messages. We quantitatively examine the computed features for different categories of trends, and establish that significant differences can be detected across categories. Our study advances the understanding of trends on Twitter and other social awareness streams, which will enable powerful applications and activities, including user-driven real-time information services for local communities.
Social Sensors and Pervasive Services: Approaches and Perspectives
"... Social networks are perhaps the purest example of “Web 2.0 ” services, and offer a sophisticated tool for accessing the preferences and properties of individuals and groups. Thus, they potentially allow up-to-date, richly annotated contextual data to be acquired as a side effect of users ’ everyday ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Social networks are perhaps the purest example of “Web 2.0 ” services, and offer a sophisticated tool for accessing the preferences and properties of individuals and groups. Thus, they potentially allow up-to-date, richly annotated contextual data to be acquired as a side effect of users ’ everyday use of the services. In this paper, we explore how such “social sensing ” could be integrated into pervasive systems. We frame and survey the possible approaches to such an integration, and eventually discuss the open issues and challenges facing researchers. 1
Analyzing the potential of microblogs for spatio-temporal popularity estimation of music artists
- In Proceedings of the IJCAI 2011: International workshop on social web mining
, 2011
"... This paper looks into the suitability of microblogs for an important task in music information research, namely popularity estimation of music artists. The research questions addressed are the following: To which extent are microblogs used to communicate music listening behavior? Are there differenc ..."
Abstract
-
Cited by 3 (3 self)
- Add to MetaCart
This paper looks into the suitability of microblogs for an important task in music information research, namely popularity estimation of music artists. The research questions addressed are the following: To which extent are microblogs used to communicate music listening behavior? Are there differences between different countries of the world? Is it possible to derive a popularity measure from user’s microblogging activities? We found that microblogging does indeed represent an important communication channel for revealing music listening activities, although the intensity of its use vary considerably from country to country. Motivated by this finding, we took first steps towards a geo-aware, social popularity measure for music artists. To this end, we analyzed user posts mined from the microblogging service Twitter over a period of five months. Addressing the problem of determining the popularity of music artists, we employed a gazetteer on extracted posts relevant for particular music artists. The presented approach aims at extracting time- and location-specific artist popularity information. We evaluated the performance of the approach by comparing the popularity rankings derived from Twitter posts against the popularity rankings provided by last.fm, a popular music information system and recommender engine. 1
Why do people retweet? antihomophily wins the day
- In International Conference on Weblogs and Social Media (ICWSM
"... Twitter and other microblogs have rapidly become a significant means by which people communicate with the world and each other in near realtime. There has been a large number of studies surrounding these social media, focusing on areas such as information spread, various centrality measures, topic d ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
Twitter and other microblogs have rapidly become a significant means by which people communicate with the world and each other in near realtime. There has been a large number of studies surrounding these social media, focusing on areas such as information spread, various centrality measures, topic detection and more. However, one area which has not received much attention is trying to better understand what information is being spread and why it is being spread. This work looks to get a better understanding of what makes people spread information in tweets or microblogs through the use of retweeting. Several retweet behavior models are presented and evaluated on a Twitter data set consisting of over 768,000 tweets gathered from monitoring over 30,000 users for a period of one month. We evaluate the proposed models against each user and show how people use different retweet behavior models. For example, we find that although users in the majority of cases do not retweet information on topics that they themselves Tweet about as or from people who are “like them ” (hence anti-homophily), we do find that models which do take homophily, or similarity, into account fits the observed retweet behaviors much better than other more general models which do not take this into account. We further find that, not surprisingly, people’s retweeting behavior is better explained through multiple different models rather than one model. 1
Identifying Content for Planned Events Across Social Media Sites
"... User-contributed Web data contains rich and diverse information about a variety of events in the physical world, such as shows, festivals, conferences and more. This information ranges from known event features (e.g., title, time, location) posted on event aggregation platforms (e.g., Last.fm events ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
User-contributed Web data contains rich and diverse information about a variety of events in the physical world, such as shows, festivals, conferences and more. This information ranges from known event features (e.g., title, time, location) posted on event aggregation platforms (e.g., Last.fm events, EventBrite, Facebook events) to discussions and reactions related to events shared on different social media sites (e.g., Twitter, YouTube, Flickr). In this paper, we focus on the challenge of automatically identifying user-contributed content for events that are planned and, therefore, known in advance, across different social media sites. We mine event aggregation platforms to extract event features, which are often noisy or missing. We use these features to develop query formulation strategies for retrieving content associated with an event on different social media sites. Further, we explore ways in which event content identified on one social media site can be used to retrieve additional relevant event content on other social media sites. We apply our strategies to a large set of user-contributed events, and analyze their effectiveness in retrieving relevant event content from Twitter, YouTube, and Flickr.

