Results 1 - 10
of
14
Exploring Trajectory-Driven Local Geographic Topics in Foursquare
"... The location based social networking services (LBSNSs) are becoming very popular today. In LBSNSs, such as Foursquare, users can explore their places of interests around their current locations, check in at these places to share their locations with their friends, etc. These check-ins contain rich i ..."
Abstract
-
Cited by 6 (0 self)
- Add to MetaCart
(Show Context)
The location based social networking services (LBSNSs) are becoming very popular today. In LBSNSs, such as Foursquare, users can explore their places of interests around their current locations, check in at these places to share their locations with their friends, etc. These check-ins contain rich information and imply human mobility patterns; thus, they can greatly facilitate mining and analysis of local geographic topics driven by users ’ trajectories. The local geographic topics indicate the potential and intrinsic relations among the locations in accordance with users ’ trajectories. These relations are useful for users in both location and friend recommendations. In this paper, we focus on exploring the local geographic topics through check-ins in Pittsburgh area in Foursquare. We use the Latent Dirichlet Allocation (LDA) model to discover the local geographic topics from the checkins. We also compare the local geographic topics on weekdays with those at weekends. Our results show that LDA works well in finding the related places of interests.
You are where you e-mail: using e-mail data to estimate international migration rates
- In Proc. of the ACM Conference on Web Science (WebSci
, 2012
"... ABSTRACT International migration is one of the major determinants of demographic change. Although efforts to produce comparable statistics are underway, estimates of demographic flows are inexistent, outdated, or largely inconsistent, for most countries. We estimate age and gender-specific migratio ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
(Show Context)
ABSTRACT International migration is one of the major determinants of demographic change. Although efforts to produce comparable statistics are underway, estimates of demographic flows are inexistent, outdated, or largely inconsistent, for most countries. We estimate age and gender-specific migration rates using data extracted from a large sample of Yahoo! e-mail messages. Self-reported age and gender of anonymized e-mail users were linked to the geographic locations (mapped from IP addresses) from where users sent e-mail messages over time (2009)(2010)(2011). The users' country of residence over time was inferred as the one from where most e-mail messages were sent. Our estimates of age profiles of migration are qualitatively consistent with existing administrative data sources. Selection bias generates uncertainty for estimates at one point in time, especially for developing countries. However, our approach allows us to compare in a reliable way migration trends of females and males. We document the recent increase in human mobility and we observe that female mobility has been increasing at a faster pace. Our findings suggest that e-mail data may complement existing migration data, resolve inconsistencies arising from different definitions of migration, and provide new and rich information on mobility patterns and social networks of migrants. The use of digital records for demographic research has the potential to become particularly important for developing countries, where the diffusion of Internet will be faster than the development of mature demographic registration systems. * We thank Josh Goldstein for his helpful suggestions throughout the development of this study. The manuscript also benefited from the comments of James Raymer, our colleagues at MPIDR and Yahoo! Research, the participants to the Alp-Pop meeting (2012) and to the Demography seminar of CEDEPLAR (2011). We thank four anonymous referees for their constructive comments.
Discovering and Predicting User Routines by Differential Analysis of Social Network Traces
"... Abstract—The study of human activity patterns traditionally relies on the continuous tracking of user location. We approach the problem of activity pattern discovery from a new perspective which is rapidly gaining attention. Instead of actively sampling increasing volumes of sensor data, we explore ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
(Show Context)
Abstract—The study of human activity patterns traditionally relies on the continuous tracking of user location. We approach the problem of activity pattern discovery from a new perspective which is rapidly gaining attention. Instead of actively sampling increasing volumes of sensor data, we explore the participatory sensing potential of multiple mobile social networks, on which users often disclose information about their location and the venues they visit. In this paper, we present automated techniques for filtering, aggregating, and processing combined social networking traces with the goal of extracting descriptions of regularly-occurring user activities, which we refer to as “user routines”. We report our findings based on two localized data sets about a single pool of users: the former contains public geotagged Twitter messages, the latter Foursquare check-ins that provide us with meaningful venue information about the locations we observe. We analyze and combine the two datasets to highlight their properties and show how the emergent features can enhance our understanding of users ’ daily schedule. Finally, we evaluate and discuss the potential of routine descriptions for predicting future user activity and location. I.
1 Talking Places: Modelling and Analysing Linguistic Content in Foursquare
"... The advent of online social media and the growing popularity of sensor-equipped mobile devices have created a vast landscape of location-aware applications and services. This goldmine of data, including temporal and spatial information of unprecedented granularity, can help researchers gain insights ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
The advent of online social media and the growing popularity of sensor-equipped mobile devices have created a vast landscape of location-aware applications and services. This goldmine of data, including temporal and spatial information of unprecedented granularity, can help researchers gain insights into the behavioural patterns of people at a global scale. Here we analyse the textual content of millions of comments published alongside Foursquare user check-ins. For this, we extend a standard topic modelling approach so that it explicitly takes into account geographic and temporal side information. The framework is applied to Foursquare data and used to detect the dominant topics in the neighbourhoods of a city. In particular, we present the most prominent topics discussed by Foursquare users in New York, London, Chicago and San Francisco. We characterize the topics ’ spatial coverage and temporal evolution, and we also highlight some cultural idiosyncrasies. Finally, we evaluate the novel spatio-temporal topic model quantitatively. We believe that our model may be a useful tool for social scientists and application developers. I.
Demographic Research with Non-Representative Internet Data’, in this issue.
, 2015
"... We would like to thank Klaus Zimmermann, Nikos Askitas and two anonymous reviewers, for their very helpful comments and suggestions. 2 Abstract Purpose Internet data hold many promises for demographic research, but come with severe drawbacks due to several types of bias. This paper reviews the lite ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
We would like to thank Klaus Zimmermann, Nikos Askitas and two anonymous reviewers, for their very helpful comments and suggestions. 2 Abstract Purpose Internet data hold many promises for demographic research, but come with severe drawbacks due to several types of bias. This paper reviews the literature that uses Internet data for demographic studies and presents a general framework for addressing the problem of selection bias in nonrepresentative samples. Design/methodology/approach We propose two main approaches to reduce bias. When ground truth data are available, we suggest a method that relies on calibration of the online data against reliable official statistics. When no ground truth data are available, we propose a difference in differences approach to evaluate relative trends. Findings We offer a generalization of existing techniques. Although there is not a definite answer to the question of whether statistical inference can be made from non-representative samples, we show that, when certain assumptions are met, we can extract signal from noisy and biased data. Research limitations/implications Our methods are sensitive to a number of assumptions. These include some regularities in the way the bias changes across different locations, different demographic groups, and between time steps. The assumptions that we discuss might not always hold. In particular, the scenario where bias varies in an unpredictable manner and, at the same time, there is no "ground truth" available to continuously calibrate the model, remains challenging and beyond the scope of this article. Originality/value The paper combines a critical review of existing substantive and methodological literature with a generalization of prior techniques. It intends to provide a fresh perspective on the issue and to stimulate the methodological discussion among social scientists.
Geographic Summaries from Crowdsourced Data
- In 11th Extended Semantic Web Conference (ESWC’14
, 2014
"... Abstract. In this paper, we present a research prototype for creating geographic summaries using the whereabouts of Foursquare users. Exploiting the density of the venue types in a particular region, the system adds a layer over any typical cartography geographic maps service, creating a first glan ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
(Show Context)
Abstract. In this paper, we present a research prototype for creating geographic summaries using the whereabouts of Foursquare users. Exploiting the density of the venue types in a particular region, the system adds a layer over any typical cartography geographic maps service, creating a first glance summary over the venues sampled from the Foursquare knowledge base. Each summary is represented by a convex hull. The shape is automatically computed according to the venue densities enclosed in the area. The summary is then labeled with the most prominent category or categories. The prominence is given by the observed venue category density. The prototype provides two outputs: a light-weight representation structured in GeoJSON, and a semantic description using the Open Annotation Ontology. We evaluate the quality of the summaries using the Sum of Squared Errors (SSE) and the Jaccard distance. The system is available at http://geosummly.eurecom.fr.
An Approach to Situation Recognition Based on Learned Semantic Models
, 2014
"... Full metadata for this item is available in ..."
WWW '14 Companion
"... ABSTRACT Data about migration flows are largely inconsistent across countries, typically outdated, and often inexistent. Despite the importance of migration as a driver of demographic change, there is limited availability of migration statistics. Generally, researchers rely on census data to indire ..."
Abstract
- Add to MetaCart
(Show Context)
ABSTRACT Data about migration flows are largely inconsistent across countries, typically outdated, and often inexistent. Despite the importance of migration as a driver of demographic change, there is limited availability of migration statistics. Generally, researchers rely on census data to indirectly estimate flows. However, little can be inferred for specific years between censuses and for recent trends. The increasing availability of geolocated data from online sources has opened up new opportunities to track recent trends in migration patterns and to improve our understanding of the relationships between internal and international migration. In this paper, we use geolocated data for about 500,000 users of the social network website "Twitter". The data are for users in OECD countries during the period May 2011-April 2013. We evaluated, for the subsample of users who have posted geolocated tweets regularly, the geographic movements within and between countries for independent periods of four months, respectively. Since Twitter users are not representative of the OECD population, we cannot infer migration rates at a single point in time. However, we proposed a difference-indifferences approach to reduce selection bias when we infer trends in out-migration rates for single countries. Our results indicate that our approach is relevant to address two longstanding questions in the migration literature. First, our methods can be used to predict turning points in migration trends, which are particularly relevant for migration forecasting. Second, geolocated Twitter data can substantially improve our understanding of the relationships between internal and international migration. Our analysis relies uniquely on publicly available data that could be potentially available in real time and that could be used to monitor migration trends. The Web Science community is well-positioned to address, in future work, a number of methodological and substantive questions that we discuss in this article.
Towards Situated Awareness in Urban Networks: A Bio-inspired Approach
"... Abstract—The possibility to have millions of computational devices interconnected across urban environments opens up novel application areas. In such highly distributed scenarios, applications must gain awareness as a result of opportunistic encounters with co-located devices, a departure from tradi ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract—The possibility to have millions of computational devices interconnected across urban environments opens up novel application areas. In such highly distributed scenarios, applications must gain awareness as a result of opportunistic encounters with co-located devices, a departure from traditional reasoning approaches. We envision situated awareness as an emergent property of such networks, where bio-inspired algorithms are employed to coordinate interactions between devices through managing the lifecycle, distribution, and content of data. A congestion-aware, crowd-steering example illustrates this vision. I.
Towards (Re)Constructing Narratives from Georeferenced Photographs through Visual Analytics
"... We present a study that explores methodological steps towards (re)constructing collective narratives from the photo-taking behaviour of two groups (foreign tourists and inhabitants of Switzerland) by analysing spatial and temporal patterns in user-contributed, georeferenced photographs of Zurich, Sw ..."
Abstract
- Add to MetaCart
(Show Context)
We present a study that explores methodological steps towards (re)constructing collective narratives from the photo-taking behaviour of two groups (foreign tourists and inhabitants of Switzerland) by analysing spatial and temporal patterns in user-contributed, georeferenced photographs of Zurich, Switzerland. We reason that the photographers typically capture a scene or a moment because they want to remember or share it, thus these scenes or moments are meaningful to them. Various scholars suggest that the human experience (i.e. this meaningfulness) is what separates a place from the mathematical descriptions of space. While this notion is well known in larger geographic literature, it is under-explored in cartographic research. We respond to this research gap and reconstruct static and dynamic patterns of photo-taking and-sharing behaviour to assist in capturing the implicit meaning in the studied locations. These locations may be meaningful to only a certain group of people in certain moments; therefore, studying group differences in spatial and temporal photo-taking patterns will help building a collective and comparative story about the studied place. In our study, we focus on experiences of foreign versus domestic visitors, and in the process, we examine the potential (and feasibility) of georeferenced photographs for extracting such collective narratives using qualitative and quantitative visual analytical methods.