Search CORE

8,449 research outputs found

Verifying baselines for crisis event information classification on Twitter

Author: Crow Justin Michael
Publication venue: 'Virginia Tech Libraries'
Publication date: 01/01/2020
Field of study

Social media are rich information sources during and in the aftermath of crisis events such as earthquakes and terrorist attacks. Despite myriad challenges, with the right tools, significant insight can be gained which can assist emergency responders and related applications. However, most extant approaches are incomparable, using bespoke definitions, models, datasets and even evaluation metrics. Furthermore, it is rare that code, trained models, or exhaustive parametrisation details are made openly available. Thus, even confirmation of self-reported performance is problematic; authoritatively determining the state of the art (SOTA) is essentially impossible. Consequently, to begin addressing such endemic ambiguity, this paper seeks to make 3 contributions: 1) the replication and results confirmation of a leading (and generalisable) technique; 2) testing straightforward modifications of the technique likely to improve performance; and 3) the extension of the technique to a novel and complimentary type of crisis-relevant information to demonstrate it’s generalisability

Statistical Semantic Classification of Crisis Information

Author: Alani Harith
Fernandez Miriam
Khare Prashant
Publication venue
Publication date: 01/10/2017
Field of study

The rise of social media as an information channel during crisis has become key to community response. However, existing crisis awareness applications, often struggle to identify relevant information among the high volume of data that is generated over social platforms. A wide range of statistical features and machine learning methods have been researched in recent years to automatically classify this information. In this paper we aim to complement previous studies by exploring the use of semantics as additional features to identify relevant crisis in- formation. Our assumption is that entities and concepts tend to have a more consistent correlation with relevant and irrelevant information, and therefore can enhance the discrimination power of classifiers. Our results, so far, show that some classification improvements can be obtained when using semantic features, reaching +2.51% when the classifier is applied to a new crisis event (i.e., not in training set)

Community Structure Characterization

Author: A Clauset
A Lancichinetti
A Lancichinetti
C Bothorel
F Radicchi
G Palla
GK Orman
Hongyun Cai
J Creusefond
J Shi
J Yang
L da Fontoura Costa
M Girvan
M Rosvall
M Rosvall
M Tumminello
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
N Dugué
N Kashtan
NR Mabroukeh
P Bródka
R Guimera
S Asur
S Fortunato
S Fortunato
T Aynaud
T-C Fu
V Labatut
Vincent Labatut
X Han
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

This entry discusses the problem of describing some communities identified in a complex network of interest, in a way allowing to interpret them. We suppose the community structure has already been detected through one of the many methods proposed in the literature. The question is then to know how to extract valuable information from this first result, in order to allow human interpretation. This requires subsequent processing, which we describe in the rest of this entry

arXiv.org e-Print Archive

Classifying Crises-Information Relevancy with Semantics

Author: A Tonon
F Abel
G Burel
H Gao
J Rogstadius
J Yin
N Cristianini
R Navigli
R Power
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Social media platforms have become key portals for sharing and consuming information during crisis situations. However, humanitarian organisations and affected communities often struggle to sieve through the large volumes of data that are typically shared on such platforms during crises to determine which posts are truly relevant to the crisis, and which are not. Previous work on automatically classifying crisis information was mostly focused on using statistical features. However, such approaches tend to be inappropriate when processing data on a type of crisis that the model was not trained on, such as processing information about a train crash, whereas the classifier was trained on floods, earthquakes, and typhoons. In such cases, the model will need to be retrained, which is costly and time-consuming. In this paper, we explore the impact of semantics in classifying Twitter posts across same, and different, types of crises. We experiment with 26 crisis events, using a hybrid system that combines statistical features with various semantic features extracted from external knowledge bases. We show that adding semantic features has no noticeable benefit over statistical features when classifying same-type crises, whereas it enhances the classifier performance by up to 7.2% when classifying information about a new type of crisis

Leveraging Social Media and Web of Data for Crisis Response Coordination

Author: Castillo Carlos
Diaz Fernando
Purohit Hemant
Publication venue: CORE Scholar
Publication date: 01/04/2014
Field of study

There is an ever increasing number of users in social media (1B+ Facebook users, 500M+ Twitter users) and ubiquitous mobile access (6B+ mobile phone subscribers) who share their observations and opinions. In addition, the Web of Data and existing knowledge bases keep on growing at a rapid pace. In this scenario, we have unprecedented opportunities to improve crisis response by extracting social signals, creating spatio-temporal mappings, performing analytics on social and Web of Data, and supporting a variety of applications. Such applications can help provide situational awareness during an emergency, improve preparedness, and assist during the rebuilding/recovery phase of a disaster. Data mining can provide valuable insights to support emergency responders and other stakeholders during crisis. However, there are a number of challenges and existing computing technology may not work in all cases. Therefore, our objective here is to present the characterization of such data mining tasks, and challenges that need further research attention