32 research outputs found

    Geotagging One Hundred Million Twitter Accounts with Total Variation Minimization

    Full text link
    Geographically annotated social media is extremely valuable for modern information retrieval. However, when researchers can only access publicly-visible data, one quickly finds that social media users rarely publish location information. In this work, we provide a method which can geolocate the overwhelming majority of active Twitter users, independent of their location sharing preferences, using only publicly-visible Twitter data. Our method infers an unknown user's location by examining their friend's locations. We frame the geotagging problem as an optimization over a social network with a total variation-based objective and provide a scalable and distributed algorithm for its solution. Furthermore, we show how a robust estimate of the geographic dispersion of each user's ego network can be used as a per-user accuracy measure which is effective at removing outlying errors. Leave-many-out evaluation shows that our method is able to infer location for 101,846,236 Twitter users at a median error of 6.38 km, allowing us to geotag over 80\% of public tweets.Comment: 9 pages, 8 figures, accepted to IEEE BigData 2014, Compton, Ryan, David Jurgens, and David Allen. "Geotagging one hundred million twitter accounts with total variation minimization." Big Data (Big Data), 2014 IEEE International Conference on. IEEE, 201

    Implementation of Classification of Geolocation of Country from Worldwide Tweets

    Get PDF
    Social media are progressively being employed within the scientific community as key supply of knowledge to assist perceive various natural and social phenomena, and this has prompted the event of a good vary of process data processing tools that may extract data from social media for each post-hoc and real time analysis. The rise of interest in mistreatment social media as a supply for analysis has actuated braving the challenge of mechanically geo-locating tweets, given the dearth of specific location data within the majority of tweets. In distinction to abundant previous work that has targeted on location classification of tweets restricted to a selected country, here we tend to undertake the task during a broader context by classifying international tweets at the country level that is up to now undiscovered during a time period situation. We tend to analyze the extent to that a tweet’s country of origin maybe determined by creating use of eight tweet-inherent options for classification

    Social Sensing of Floods in the UK

    Get PDF
    "Social sensing" is a form of crowd-sourcing that involves systematic analysis of digital communications to detect real-world events. Here we consider the use of social sensing for observing natural hazards. In particular, we present a case study that uses data from a popular social media platform (Twitter) to detect and locate flood events in the UK. In order to improve data quality we apply a number of filters (timezone, simple text filters and a naive Bayes `relevance' filter) to the data. We then use place names in the user profile and message text to infer the location of the tweets. These two steps remove most of the irrelevant tweets and yield orders of magnitude more located tweets than we have by relying on geo-tagged data. We demonstrate that high resolution social sensing of floods is feasible and we can produce high-quality historical and real-time maps of floods using Twitter.Comment: 24 pages, 6 figure
    corecore