1,180 research outputs found
On the Accuracy of Hyper-local Geotagging of Social Media Content
Social media users share billions of items per year, only a small fraction of
which is geotagged. We present a data- driven approach for identifying
non-geotagged content items that can be associated with a hyper-local
geographic area by modeling the location distributions of hyper-local n-grams
that appear in the text. We explore the trade-off between accuracy, precision
and coverage of this method. Further, we explore differences across content
received from multiple platforms and devices, and show, for example, that
content shared via different sources and applications produces significantly
different geographic distributions, and that it is best to model and predict
location for items according to their source. Our findings show the potential
and the bounds of a data-driven approach to geotag short social media texts,
and offer implications for all applications that use data-driven approaches to
locate content.Comment: 10 page
A Survey of Location Prediction on Twitter
Locations, e.g., countries, states, cities, and point-of-interests, are
central to news, emergency events, and people's daily lives. Automatic
identification of locations associated with or mentioned in documents has been
explored for decades. As one of the most popular online social network
platforms, Twitter has attracted a large number of users who send millions of
tweets on daily basis. Due to the world-wide coverage of its users and
real-time freshness of tweets, location prediction on Twitter has gained
significant attention in recent years. Research efforts are spent on dealing
with new challenges and opportunities brought by the noisy, short, and
context-rich nature of tweets. In this survey, we aim at offering an overall
picture of location prediction on Twitter. Specifically, we concentrate on the
prediction of user home locations, tweet locations, and mentioned locations. We
first define the three tasks and review the evaluation metrics. By summarizing
Twitter network, tweet content, and tweet context as potential inputs, we then
structurally highlight how the problems depend on these inputs. Each dependency
is illustrated by a comprehensive review of the corresponding strategies
adopted in state-of-the-art approaches. In addition, we also briefly review two
related problems, i.e., semantic location prediction and point-of-interest
recommendation. Finally, we list future research directions.Comment: Accepted to TKDE. 30 pages, 1 figur
Discovery of Points of Interest with Different Granularities for Tour Recommendation Using a City Adaptive Clustering Framework
Increasing demand for personalized tours for tourists travel in an urban area motivates more attention to points of interest (POI) and tour recommendation services. Recently, the granularity of POI has been discussed to provide more detailed information for tour planning, which supports both inside and outside routes that would improve tourists' travel experience. Such tour recommendation systems require a predefined POI database with different granularities, but existing POI discovery methods do not consider the granularity of POI well and treat all POIs as the same scale. On the other hand, the parameters also need to be tuned for different cities, which is not a trivial process. To this end, we propose a city adaptive clustering framework for discovering POIs with different granularities in this article. Our proposed method takes advantage of two clustering algorithms and is adaptive to different cities due to automatic identification of suitable parameters for different datasets. Experiments on two real-world social image datasets reveal the effectiveness of our proposed framework. Finally, the discovered POIs with two levels of granularity are successfully applied on inner and outside tour planning
A Location-Sentiment-Aware Recommender System for Both Home-Town and Out-of-Town Users
Spatial item recommendation has become an important means to help people
discover interesting locations, especially when people pay a visit to
unfamiliar regions. Some current researches are focusing on modelling
individual and collective geographical preferences for spatial item
recommendation based on users' check-in records, but they fail to explore the
phenomenon of user interest drift across geographical regions, i.e., users
would show different interests when they travel to different regions. Besides,
they ignore the influence of public comments for subsequent users' check-in
behaviors. Specifically, it is intuitive that users would refuse to check in to
a spatial item whose historical reviews seem negative overall, even though it
might fit their interests. Therefore, it is necessary to recommend the right
item to the right user at the right location. In this paper, we propose a
latent probabilistic generative model called LSARS to mimic the decision-making
process of users' check-in activities both in home-town and out-of-town
scenarios by adapting to user interest drift and crowd sentiments, which can
learn location-aware and sentiment-aware individual interests from the contents
of spatial items and user reviews. Due to the sparsity of user activities in
out-of-town regions, LSARS is further designed to incorporate the public
preferences learned from local users' check-in behaviors. Finally, we deploy
LSARS into two practical application scenes: spatial item recommendation and
target user discovery. Extensive experiments on two large-scale location-based
social networks (LBSNs) datasets show that LSARS achieves better performance
than existing state-of-the-art methods.Comment: Accepted by KDD 201
Recommended from our members
Crowdsourced Data Mining for Urban Activity: A Review of Data Sources, Applications and Methods
The penetration of devices integrated with location-based services and internet services has generated massive data about the everyday life of citizens and tracked their activities happening in cities. Crowdsourced data, such as social media data, POIs data and collaborative websites, generated by the crowd, has become fine-grained proxy data of urban activity and widely used in research in urban studies. However, due to the heterogeneity of data types of crowdsourced data and the limitation of previous studies mainly focusing on a specific application, a systematic review of crowdsourced data mining for urban activity is still lacking. In order to fill the gap, this paper conducts a literature search in the Web of Science database, selecting 226 highly related papers published between 2013 and 2019. Based on those papers, the review firstly conducts a bibliometric analysis identifying underpinning domains, pivot scholars and papers around this topic. The review also synthesises previous research into three parts: main applications of different data sources and data fusion; application of spatial analysis in mobility patterns, functional areas and event detection; application of socio-demographic and perception analysis in city attractiveness, demographic characteristics and sentiment analysis. The challenges of this type of data are also discussed in the end. This study provides a systematic and current review for both researchers and practitioners interested in the applications of crowdsourced data mining for urban activity.This research is funded by a scholarship from the China Scholarship Counci
Discovering visiting behaviors and city perceptions by mining semantic trajectory
Tourism is a crucial industry for many cities, necessitating the development of unique attractions to draw in more visitors. Understanding the visiting behaviors and perceptions of visitors helps to uncover the city’s distinctive characteristics, thereby aiding in the further growth of its tourism industry. It’s important to note that different population groups may exhibit varying visiting behaviors depending on the time of their visit, which in turn can shape their impressions of the city. This study explores the dynamic visiting behaviors and city perceptions of locals and tourists throughout different times of the day and week. The study area is London, one of the world’s most famous tourist cities. To conduct this study, User-Generated Content (UGC) is utilized, specifically data from Foursquare check-ins and Flickr tags from April 3, 2012, to September 16, 2013. The study first identifies the spatiotemporal distribution of hotspots for each population group based on their Foursquare check-ins. The relative concentration of locals and tourists is then examined through the difference ratio to understand their unique visiting preferences. Next, the spatiotemporal movements of locals and tourists and their city descriptions during their trips are analyzed by constructing semantic trajectories. The place is the fundamental element of a semantic trajectory, and places are constructed by clustering Foursquare check-ins. The property of the place is defined by three dimensions: location (represented by borough names), locale (represented by place categories), and sense of place (represented by topics generated in topic modeling based on Flickr tags). Semantic trajectories are then clustered based on their semantic dimensions, and typical trajectories are mined for each cluster. The distribution of trajectories and their semantic dimensions are compared between locals and tourists at different time spans to explore how London’s impressions evolve over time. The results suggest distinct visiting behaviors and city perceptions over time for locals and tourists. Both groups primarily concentrate in the city center, with small hotspots around the airport. However, locals tend to visit more suburban areas than tourists. Locals show higher preferences for business districts during the daytime and on weekdays, while tourists consistently show interest in shopping in the city center. In terms of city perceptions, the city center is associated with descriptions of cityscapes and transport during the daytime. At night, people tend to associate the same area with nightlife activities. Furthermore, locals are interested in leisure activities and fitness, while tourists tend to focus on tourist attractions and the Olympics
- …