39 research outputs found
Weakly-supervised Learning Approaches for Event Knowledge Acquisition and Event Detection
Capabilities of detecting events and recognizing temporal, subevent, or causality relations among events can facilitate many applications in natural language understanding. However, supervised learning approaches that previous research mainly uses have two problems. First, due to the limited size of annotated data, supervised systems cannot sufficiently capture diverse contexts to distill universal event knowledge. Second, under certain application circumstances such as event recognition during emergent natural disasters, it is infeasible to spend days or weeks to annotate enough data to train a system. My research aims to use weakly-supervised learning to address these problems and to achieve automatic event knowledge acquisition and event recognition.
In this dissertation, I first introduce three weakly-supervised learning approaches that have been shown effective in acquiring event relational knowledge. Firstly, I explore the observation that regular event pairs show a consistent temporal relation despite of their various contexts, and these rich contexts can be used to train a contextual temporal relation classifier to further recognize new temporal relation knowledge. Secondly, inspired by the double temporality characteristic of narrative texts, I propose a weakly supervised approach that identifies 287k narrative paragraphs using narratology principles and then extract rich temporal event knowledge from identified narratives. Lastly, I develop a subevent knowledge acquisition approach by exploiting two observations that 1) subevents are temporally contained by the parent event and 2) the definitions of the parent event can be used to guide the identification of subevents. I collect rich weak supervision to train a contextual BERT classifier and apply the classifier to identify new subevent knowledge.
Recognizing texts that describe specific categories of events is also challenging due to language ambiguity and diverse descriptions of events. So I also propose a novel method to rapidly build a fine-grained event recognition system on social media texts for disaster management. My method creates high-quality weak supervision based on clustering-assisted word sense disambiguation and enriches tweet message representations using preceding context tweets and reply tweets in building event recognition classifiers
Unsupervised Detection of Sub-events in Large Scale Disasters
Social media plays a major role during and after major natural disasters
(e.g., hurricanes, large-scale fires, etc.), as people ``on the ground'' post
useful information on what is actually happening. Given the large amounts of
posts, a major challenge is identifying the information that is useful and
actionable. Emergency responders are largely interested in finding out what
events are taking place so they can properly plan and deploy resources. In this
paper we address the problem of automatically identifying important sub-events
(within a large-scale emergency ``event'', such as a hurricane). In particular,
we present a novel, unsupervised learning framework to detect sub-events in
Tweets for retrospective crisis analysis. We first extract noun-verb pairs and
phrases from raw tweets as sub-event candidates. Then, we learn a semantic
embedding of extracted noun-verb pairs and phrases, and rank them against a
crisis-specific ontology. We filter out noisy and irrelevant information then
cluster the noun-verb pairs and phrases so that the top-ranked ones describe
the most important sub-events. Through quantitative experiments on two large
crisis data sets (Hurricane Harvey and the 2015 Nepal Earthquake), we
demonstrate the effectiveness of our approach over the state-of-the-art. Our
qualitative evaluation shows better performance compared to our baseline.Comment: AAAI-20 Social Impact Trac
Using social media for sub-event detection during disasters
AbstractSocial media platforms have become fundamental tools for sharing information during natural disasters or catastrophic events. This paper presents SEDOM-DD (Sub-Events Detection on sOcial Media During Disasters), a new method that analyzes user posts to discover sub-events that occurred after a disaster (e.g., collapsed buildings, broken gas pipes, floods). SEDOM-DD has been evaluated with datasets of different sizes that contain real posts from social media related to different natural disasters (e.g., earthquakes, floods and hurricanes). Starting from such data, we generated synthetic datasets with different features, such as different percentages of relevant posts and/or geotagged posts. Experiments performed on both real and synthetic datasets showed that SEDOM-DD is able to identify sub-events with high accuracy. For example, with a percentage of relevant posts of 80% and geotagged posts of 15%, our method detects the sub-events and their areas with an accuracy of 85%, revealing the high accuracy and effectiveness of the proposed approach
A Survey on Visual Analytics of Social Media Data
The unprecedented availability of social media data offers substantial opportunities for data owners, system operators, solution providers, and end users to explore and understand social dynamics. However, the exponential growth in the volume, velocity, and variability of social media data prevents people from fully utilizing such data. Visual analytics, which is an emerging research direction, ha..
When Infodemic Meets Epidemic: a Systematic Literature Review
Epidemics and outbreaks present arduous challenges requiring both individual
and communal efforts. Social media offer significant amounts of data that can
be leveraged for bio-surveillance. They also provide a platform to quickly and
efficiently reach a sizeable percentage of the population, hence their
potential impact on various aspects of epidemic mitigation. The general
objective of this systematic literature review is to provide a methodical
overview of the integration of social media in different epidemic-related
contexts. Three research questions were conceptualized for this review,
resulting in over 10000 publications collected in the first PRISMA stage, 129
of which were selected for inclusion. A thematic method-oriented synthesis was
undertaken and identified 5 main themes related to social media enabled
epidemic surveillance, misinformation management, and mental health. Findings
uncover a need for more robust applications of the lessons learned from
epidemic post-mortem documentation. A vast gap exists between retrospective
analysis of epidemic management and result integration in prospective studies.
Harnessing the full potential of social media in epidemic related tasks
requires streamlining the results of epidemic forecasting, public opinion
understanding and misinformation propagation, all while keeping abreast of
potential mental health implications. Pro-active prevention has thus become
vital for epidemic curtailment and containment