4,839 research outputs found
On Identifying Disaster-Related Tweets: Matching-based or Learning-based?
Social media such as tweets are emerging as platforms contributing to
situational awareness during disasters. Information shared on Twitter by both
affected population (e.g., requesting assistance, warning) and those outside
the impact zone (e.g., providing assistance) would help first responders,
decision makers, and the public to understand the situation first-hand.
Effective use of such information requires timely selection and analysis of
tweets that are relevant to a particular disaster. Even though abundant tweets
are promising as a data source, it is challenging to automatically identify
relevant messages since tweet are short and unstructured, resulting to
unsatisfactory classification performance of conventional learning-based
approaches. Thus, we propose a simple yet effective algorithm to identify
relevant messages based on matching keywords and hashtags, and provide a
comparison between matching-based and learning-based approaches. To evaluate
the two approaches, we put them into a framework specifically proposed for
analyzing disaster-related tweets. Analysis results on eleven datasets with
various disaster types show that our technique provides relevant tweets of
higher quality and more interpretable results of sentiment analysis tasks when
compared to learning approach
Public awareness and engagement in relation to the coastal oil spill in northeast Brazil
Social media data is a rich source of information to assess human activities in catastrophic events. Here, we use social media data to understand how the 2019 Brazilian oil spill influenced social attitudes. Data were collected from the globally popular Instagram platform between August 1, 2019 and March 1, 2020. First, we manually identified the 5 most popular (portuguese language) hashtags related to the oil spill #oleonononordeste;#desastreambiental;#ma rsemoleo;#sosnordeste;#ma rsempetroleo. In the sequence, we collected information on captions, post metadata and users associated with posts retrieved using the selected hashtags. We identified a total of 7,413 posts. These posts were grouped in topics: government (47.76%), protest (24.37%), volunteers (24.45%), biodiversity (0.003%), origin (0.006%), tourism (0.008%) and others (0.016%). All topics had the peak of posts in October and November 2019. Nevertheless, interest in the oil spill was temporary, with most posts appearing in the 2-4 months after the beginning of the disaster. Our findings illustrate the enormous potential of using social media data for understanding and monitoring human engagement with environmental disasters, but also suggest that conservationists and environmental groups may only have a limited 'window of opportunity' to engage and mobilize public support for remediation and restoration efforts.Peer reviewe
Recommended from our members
Statistical Semantic Classification of Crisis Information
The rise of social media as an information channel during crisis has become key to community response. However, existing crisis awareness applications, often struggle to identify relevant information among the high volume of data that is generated over social platforms. A wide range of statistical features and machine learning methods have been researched in recent years to automatically classify this information. In this paper we aim to complement previous studies by exploring the use of semantics as additional features to identify relevant crisis in- formation. Our assumption is that entities and concepts tend to have a more consistent correlation with relevant and irrelevant information, and therefore can enhance the discrimination power of classifiers. Our results, so far, show that some classification improvements can be obtained when using semantic features, reaching +2.51% when the classifier is applied to a new crisis event (i.e., not in training set)
Using Twitter to Understand Public Interest in Climate Change: The case of Qatar
Climate change has received an extensive attention from public opinion in the
last couple of years, after being considered for decades as an exclusive
scientific debate. Governments and world-wide organizations such as the United
Nations are working more than ever on raising and maintaining public awareness
toward this global issue. In the present study, we examine and analyze Climate
Change conversations in Qatar's Twittersphere, and sense public awareness
towards this global and shared problem in general, and its various related
topics in particular. Such topics include but are not limited to politics,
economy, disasters, energy and sandstorms. To address this concern, we collect
and analyze a large dataset of 109 million tweets posted by 98K distinct users
living in Qatar -- one of the largest emitters of CO2 worldwide. We use a
taxonomy of climate change topics created as part of the United Nations Pulse
project to capture the climate change discourse in more than 36K tweets. We
also examine which topics people refer to when they discuss climate change, and
perform different analysis to understand the temporal dynamics of public
interest toward these topics.Comment: Will appear in the proceedings of the International Workshop on
Social Media for Environment and Ecological Monitoring (SWEEM'16
Classifying Crises-Information Relevancy with Semantics
Social media platforms have become key portals for sharing and consuming information during crisis situations. However, humanitarian organisations and affected communities often struggle to sieve through the large volumes of data that are typically shared on such platforms during crises to determine which posts are truly relevant to the crisis, and which are not. Previous work on automatically classifying crisis information was mostly focused on using statistical features. However,
such approaches tend to be inappropriate when processing data on a type of crisis that the model was not trained on, such as processing information about a train crash, whereas the classifier was trained on floods, earthquakes, and typhoons. In such cases, the model will need to be retrained, which is costly and time-consuming. In this paper, we explore the impact of semantics in classifying Twitter posts across same, and different, types of crises. We experiment with 26 crisis events, using a hybrid system that combines statistical features with various semantic features extracted from external knowledge bases. We show that adding semantic features has no noticeable benefit over statistical features when classifying same-type crises, whereas it enhances the classifier performance by up to 7.2% when classifying information about a new type of crisis
- …