618 research outputs found

Event detection, tracking, and visualization in Twitter: a mention-anomaly-based approach

Author: Favre Cecile
Guille Adrien
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/05/2015

The ever-growing number of people using Twitter makes it a valuable source of timely information. However, detecting events in Twitter is a difficult task, because tweets that report interesting events are overwhelmed by a large volume of tweets on unrelated topics. Existing methods focus on the textual content of tweets and ignore the social aspect of Twitter. In this paper we propose MABED (i.e. mention-anomaly-based event detection), a novel statistical method that relies solely on tweets and leverages the creation frequency of dynamic links (i.e. mentions) that users insert in tweets to detect significant events and estimate the magnitude of their impact over the crowd. MABED also differs from the literature in that it dynamically estimates the period of time during which each event is discussed, rather than assuming a predefined fixed duration for all events. The experiments we conducted on both English and French Twitter data show that the mention-anomaly-based approach leads to more accurate event detection and improved robustness in the presence of noisy Twitter content. Qualitatively speaking, we find that MABED helps with the interpretation of detected events by providing clear textual descriptions and precise temporal descriptions. We also show how MABED can help in understanding users' interests. Furthermore, we describe three visualizations designed to favor an efficient exploration of the detected events. Comment: 17 pages
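
As a rough illustration of the mention-anomaly idea in the abstract above, the following Python sketch scores one word per time slice by comparing the observed number of tweets that contain the word together with a mention against an expected number, then picks the contiguous period with the largest total anomaly. The expectation formula, the use of a maximum-sum interval search, the function names and the toy counts are all assumptions made for illustration; the abstract does not state MABED's exact formulation.

```python
def mention_anomaly(word_mention_counts, total_counts):
    """Observed minus expected mention-tweet count for one word, per time slice.

    word_mention_counts[i]: tweets in slice i containing the word and a mention.
    total_counts[i]: all tweets in slice i.
    Assumption: the expected count is the word's overall mention-tweet total
    spread over slices in proportion to each slice's share of all tweets.
    """
    if len(word_mention_counts) != len(total_counts):
        raise ValueError("both series must cover the same time slices")
    corpus_total = sum(total_counts)
    if corpus_total == 0:
        return [0.0] * len(total_counts)
    word_total = sum(word_mention_counts)
    return [
        observed - word_total * slice_total / corpus_total
        for observed, slice_total in zip(word_mention_counts, total_counts)
    ]


def max_anomaly_interval(anomaly):
    """Contiguous slice range with the largest summed anomaly (Kadane's scan).

    Returns (start, end, magnitude) with inclusive slice indices.
    """
    if not anomaly:
        raise ValueError("anomaly series is empty")
    best_start = best_end = 0
    best_sum = anomaly[0]
    cur_start = 0
    cur_sum = 0.0
    for i, value in enumerate(anomaly):
        if cur_sum <= 0:
            # A non-positive running total cannot help: restart at this slice.
            cur_start = i
            cur_sum = value
        else:
            cur_sum += value
        if cur_sum > best_sum:
            best_sum = cur_sum
            best_start, best_end = cur_start, i
    return best_start, best_end, best_sum


# Toy data (hypothetical): five equal-sized slices, with a spike in slices 2-3.
anomaly = mention_anomaly([1, 1, 8, 9, 1], [100, 100, 100, 100, 100])
start, end, magnitude = max_anomaly_interval(anomaly)
```

In this sketch the interval bounds are derived from the data rather than fixed in advance, which mirrors the abstract's point about estimating each event's duration dynamically.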

arXiv.org e-Print Archive

HAL

Hal-Diderot

SURGE: Continuous Detection of Bursty Regions Over a Stream of Spatial Objects

Author: Bhowmick Sourav S.
Cong Gao
Feng Kaiyu
Guo Tao
Ma Shuai
Publication date: 28/09/2017

With the proliferation of mobile devices and location-based services, continuous generation of massive volume of streaming spatial objects (i.e., geo-tagged data) opens up new opportunities to address real-world problems by analyzing them. In this paper, we present a novel continuous bursty region detection problem that aims to continuously detect a bursty region of a given size in a specified geographical area from a stream of spatial objects. Specifically, a bursty region shows maximum spike in the number of spatial objects in a given time window. The problem is useful in addressing several real-world challenges such as surge pricing problem in online transportation and disease outbreak detection. To solve the problem, we propose an exact solution and two approximate solutions, and the approximation ratio is $\frac{1-\alpha}{4}$ in terms of the burst score, where $\alpha$ is a parameter to control the burst score. We further extend these solutions to support detection of top-$k$ bursty regions. Extensive experiments with real-world data are conducted to demonstrate the efficiency and effectiveness of our solutions.
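
To make the notion of a bursty region more concrete, here is a minimal Python sketch that buckets streamed objects into fixed grid cells and returns the cell whose object count spikes most in the current time window relative to the preceding one. The fixed grid, the particular score (a weighted mix of the count increase and the current count, controlled by `alpha`), the function names and the sample data are assumptions for illustration only; the sketch does not reproduce the exact or approximate solutions mentioned in the abstract, which search for a region of a given size rather than over a fixed grid.

```python
import math
from collections import Counter


def burst_score(current, past, alpha):
    """Assumed score: alpha weights the increase, (1 - alpha) the current count."""
    return alpha * max(current - past, 0) + (1 - alpha) * current


def burstiest_cell(objects, now, window, cell_w, cell_h, alpha=0.5):
    """Return (cell, score) for the grid cell with the highest burst score.

    objects: iterable of (x, y, t) tuples.
    Current window covers ages [0, window); past window covers [window, 2 * window).
    Returns None when no object falls in the current window.
    """
    current = Counter()
    past = Counter()
    for x, y, t in objects:
        cell = (math.floor(x / cell_w), math.floor(y / cell_h))
        age = now - t
        if 0 <= age < window:
            current[cell] += 1
        elif window <= age < 2 * window:
            past[cell] += 1
    if not current:
        return None
    # Cells with no current objects score 0, so only current cells are scanned.
    best = max(current, key=lambda c: burst_score(current[c], past[c], alpha))
    return best, burst_score(current[best], past[best], alpha)


# Hypothetical stream: cell (0, 0) is steady, cell (1, 0) suddenly fills up.
stream = [
    (1, 1, 85), (2, 2, 86), (3, 3, 87),                  # past window, cell (0, 0)
    (1, 1, 95), (2, 2, 96), (3, 3, 97),                  # current window, cell (0, 0)
    (11, 1, 95), (12, 2, 96), (13, 3, 97), (14, 4, 98),  # current window, cell (1, 0)
]
result = burstiest_cell(stream, now=100, window=10, cell_w=10, cell_h=10)
```

A fixed grid can split a true hotspot across neighbouring cells, which is one reason a sliding-region formulation like the one described in the abstract is the harder and more useful problem.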

arXiv.org e-Print Archive

Crossref

DR-NTU (Digital Repository of NTU)

Finding Bursty Topics From Microblogs

Author: DIAO Qiming
JIANG Jing
LIM Ee Peng
ZHU Feida
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2012

Microblogs such as Twitter reflect the general public’s reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs because compared with other text streams such as news articles and scientific publications, microblog posts are particularly diverse and noisy. To find topics that have bursty patterns on microblogs, we propose a topic model that simultaneously captures two observations: (1) posts published around the same time are more likely to have the same topic, and (2) posts published by the same user are more likely to have the same topic. The former helps find event-driven posts while the latter helps identify and filter out “personal” posts. Our experiments on a large Twitter dataset show that there are more meaningful and unique bursty topics in the top-ranked results returned by our model than an LDA baseline and two degenerate variations of our model. We also show some case studies that demonstrate the importance of considering both the temporal information and users’ personal interests for bursty topic detection from microblogs.

CiteSeerX

Institutional Knowledge at Singapore Management University

Context Modeling for Ranking and Tagging Bursty Features in Text Streams

Author: HE Jing
JIANG Jing
LI Xiaoming
Shan Dongdong
YAN Hongfei
ZHAO Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010

Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting or easy to interpret. In this paper we propose to model the contexts of bursty features using a language modeling approach. We then propose a novel topic diversity-based metric using the context models to find newsworthy bursty features. We also propose to use the context models to automatically assign meaningful tags to bursty features. Using a large corpus of a stream of news articles, we quantitatively show that the proposed context language models for bursty features can effectively help rank bursty features based on their newsworthiness and assign meaningful tags to annotate bursty features. © 2010 ACM.

Institutional Knowledge at Singapore Management University

Detecting newsworthy topics in Twitter

Author: Demeester Thomas
Develder Chris
Dhoedt Bart
Feys Matthias
Schockaert Steven
Van Canneyt Steven
Publication date: 01/01/2014

Ghent University Academic Bibliography