• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

27). Evolution of Online User Behavior During a Social Upheaval. Retrieved April 6, 2015, from http://arxiv.org/abs/1406.7197 (2014)

by O Varol, E Ferrara, C Ogan, F Menczer, A Flammini
Add To MetaCart

Tools

Sorted by:
Results 1 - 2 of 2

Clustering memes in social media streams

by Mohsen Jafariasbagh, Emilio Ferrara, Onur Varol, Alessandro Flammini
"... The problem of clustering content in social media has pervasive appli-cations, including the identification of discussion topics, event detection, and content recommendation. Here we describe a streaming framework for online detection and clustering of memes in social media, specifically Twitter. A ..."
Abstract - Add to MetaCart
The problem of clustering content in social media has pervasive appli-cations, including the identification of discussion topics, event detection, and content recommendation. Here we describe a streaming framework for online detection and clustering of memes in social media, specifically Twitter. A pre-clustering procedure, namely protomeme detection, first isolates atomic tokens of information carried by the tweets. Protomemes are thereafter aggregated, based on multiple similarity measures, to obtain memes as cohesive groups of tweets reflecting actual concepts or topics of discussion. The clustering algorithm takes into account various dimen-sions of the data and metadata, including natural language, the social network, and the patterns of information diffusion. As a result, our sys-tem can build clusters of semantically, structurally, and topically related tweets. The clustering process is based on a variant of Online K-means that incorporates a memory mechanism, used to “forget ” old memes and replace them over time with the new ones. The evaluation of our frame-work is carried out by using a dataset of Twitter trending topics. Over a one-week period, we systematically determined whether our algorithm was able to recover the trending hashtags. We show that the proposed method outperforms baseline algorithms that only use content features, as well as a state-of-the-art event detection method that assumes full knowledge of the underlying follower network. We finally show that our online learning framework is flexible, due to its independence of the adopted clustering algorithm, and best suited to work in a streaming scenario. 1

ADAPTING THE STANDARD SIR DISEASE MODEL IN ORDER TO TRACK AND PREDICT THE SPREADING OF THE EBOLA VIRUS USING TWITTER DATA

by Armin Smailhodzic, Armin Smailhodzic , 2015
"... iii ..."
Abstract - Add to MetaCart
Abstract not found
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University