Article thumbnail

An Author-Topic based Approach to Cluster Tweets and Mine their Location

By Mohamed Morchid, Yonathan Portilla, Didier Josselin, Richard Dufour, Eitan Altman, Marc El Bèze, Jean-Valère Cossu, Georges Linarès and Alexandre Reiffers-Masson

Abstract

Presented as poster at Spatial Statistics Conference 2015, Avignon, France, June 2015International audienceSocial Networks became a major actor in information propagation. Using the Twitter popular platform, mobile users post or relaymessages from different locations. The tweet content, meaning and location show how an event-such as the bursty one“JeSuisCharlie'” happened in France in January 2015 is comprehended in different countries. This research aims at clustering thetweets according to the co-occurrence of their terms, including the country, and forecasting the probable country of a non locatedtweet, knowing its content. First, we present the process of collecting a large quantity of data from the Twitter website. Wefinally have a set of 2.189 located tweets about “Charlie'', from the 7th to the 14th of January. We describe an original methodadapted from the Author-Topic (AT) model based on the Latent Dirichlet Allocation method (LDA). We define a homogeneousspace containing both lexical content (words) and spatial information (country). During a training process on a part of the sample,we provide a set of clusters (topics) based on statistical relations between lexical and spatial terms. During a clustering task, weevaluate the method effectiveness on the rest of the sample that reaches up to 95% of good assignment

Topics: Keywords: Author-Topic model, Tweet location, [SHS.GEO] Humanities and Social Sciences/Geography, [INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]
Publisher: Elsevier
Year: 2015
DOI identifier: 10.1016/j.proenv.2015.07.109
OAI identifier: oai:HAL:hal-01251313v1
Provided by: HAL-UNICE
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.sciencedirect.com/s... (external link)
  • http://www.sciencedirect.com/s... (external link)
  • https://hal.archives-ouvertes.... (external link)
  • https://hal.archives-ouvertes.... (external link)
  • https://hal.archives-ouvertes.... (external link)
  • http://www.sciencedirect.com/s... (external link)
  • http://www.sciencedirect.com/s... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.