Characterizing Geo-located Tweets in Brazilian Megacities

Author: Christina Gagnon (4247860)
Elizabeth Ottoni (4247866)
Luc DesGroseillers (59022)
Rémy Beaujois (314942)
Sami HSine (4247869)
Stéphanie Mollet (4247875)
Wildriss Viranaicken (347964)
Xin Zhang (35492)
Publication date: 01/01/2017

This work presents a framework for collecting, processing and mining geo-located tweets in order to extract meaningful and actionable knowledge in the context of smart cities. We collected and characterized more than 9M tweets from the two biggest cities in Brazil, Rio de Janeiro and São Paulo. We performed topic modeling using the Latent Dirichlet Allocation model to produce an unsupervised distribution of semantic topics over the stream of geo-located tweets as well as a distribution of words over those topics. We manually labeled and aggregated similar topics, obtaining a total of 29 different topics across both cities. Results showed similarities in the majority of topics for both cities, reflecting similar interests and concerns among the population of Rio de Janeiro and São Paulo. Nevertheless, some specific topics are more predominant in one of the cities.

arXiv.org e-Print Archive

Crossref

FigShare

Characterizing Geo-located Tweets in Brazilian Megacities

Author: Cacho Nélio
Pasquali Arian
Pereira João
Rossetti Rosaldo
Saleiro Pedro
Publication date: 06/09/2017

arXiv.org e-Print Archive

Crossref

The Development and the Evaluation of a System for Extracting Events from Web Pages

Author: Constantin AVORNICULUI
Mihai-Constantin AVORNICULUI
Silviu Claudiu POPA

Centralizing information about a particular type of event is primarily useful for running news services. These services should provide up-to-date information, in real time if possible, on a specific type of event. Extracting these events involves automatically analyzing the linguistic structure of documents to determine the possible sequences in which the events occur. This analysis yields structured and semi-structured documents from which the individual events can be extracted automatically. To measure the quality of such a system, a methodology is introduced that describes the stages involved and how an event extraction system is decomposed into components; quality attributes and properties are defined for these components, and finally evaluation metrics are introduced. Keywords: Event, Performance Metric, Event Extraction System

Research Papers in Economics

Exploiting Social Annotation for Automatic Resource Discovery

Author: Lerman Kristina
Plangprasopchok Anon
Publication date: 01/01/2007

Information integration applications, such as mediators or mashups, that require access to information resources currently rely on users manually discovering and integrating them in the application. Manual resource discovery is a slow process, requiring the user to sift through results obtained via keyword-based search. Although search methods have advanced to include evidence from document contents, their metadata, and the contents and link structure of the referring pages, they still do not adequately cover information sources, often called "the hidden Web", that dynamically generate documents in response to a query. The recently popular social bookmarking sites, which allow users to annotate and share metadata about various information sources, provide rich evidence for resource discovery. In this paper, we describe a probabilistic model of the user annotation process in the social bookmarking system del.icio.us. We then use the model to automatically find resources relevant to a particular information domain. Our experimental results on data obtained from del.icio.us show this approach to be a promising method for helping automate the resource discovery task. Comment: 6 pages, submitted to AAAI07 workshop on Information Integration on the We

arXiv.org e-Print Archive

CiteSeerX

An information retrieval approach to ontology mapping

Author: Gulla J.A.
Su X.
Publication venue: Elsevier
Publication date: 01/01/2006

In this paper, we present a heuristic mapping method and a prototype mapping system that support the process of semi-automatic ontology mapping for the purpose of improving semantic interoperability in heterogeneous systems. The approach is based on the idea of semantic enrichment, i.e., using instance information of the ontology to enrich the original ontology and calculate similarities between concepts in two ontologies. The functional settings for the mapping system are discussed and the evaluation of the prototype implementation of the approach is reported.

CiteSeerX

University of Twente Research Information