48 research outputs found

    The Web of False Information: Rumors, Fake News, Hoaxes, Clickbait, and Various Other Shenanigans

    Full text link
    A new era of Information Warfare has arrived. Various actors, including state-sponsored ones, are weaponizing information on Online Social Networks to run false information campaigns with targeted manipulation of public opinion on specific topics. These false information campaigns can have dire consequences to the public: mutating their opinions and actions, especially with respect to critical world events like major elections. Evidently, the problem of false information on the Web is a crucial one, and needs increased public awareness, as well as immediate attention from law enforcement agencies, public institutions, and in particular, the research community. In this paper, we make a step in this direction by providing a typology of the Web's false information ecosystem, comprising various types of false information, actors, and their motives. We report a comprehensive overview of existing research on the false information ecosystem by identifying several lines of work: 1) how the public perceives false information; 2) understanding the propagation of false information; 3) detecting and containing false information on the Web; and 4) false information on the political stage. In this work, we pay particular attention to political false information as: 1) it can have dire consequences to the community (e.g., when election results are mutated) and 2) previous work show that this type of false information propagates faster and further when compared to other types of false information. Finally, for each of these lines of work, we report several future research directions that can help us better understand and mitigate the emerging problem of false information dissemination on the Web

    Can we predict a riot? Disruptive event detection using Twitter

    Get PDF
    In recent years, there has been increased interest in real-world event detection using publicly accessible data made available through Internet technology such as Twitter, Facebook, and YouTube. In these highly interactive systems, the general public are able to post real-time reactions to “real world” events, thereby acting as social sensors of terrestrial activity. Automatically detecting and categorizing events, particularly small-scale incidents, using streamed data is a non-trivial task but would be of high value to public safety organisations such as local police, who need to respond accordingly. To address this challenge, we present an end-to-end integrated event detection framework that comprises five main components: data collection, pre-processing, classification, online clustering, and summarization. The integration between classification and clustering enables events to be detected, as well as related smaller-scale “disruptive events,” smaller incidents that threaten social safety and security or could disrupt social order. We present an evaluation of the effectiveness of detecting events using a variety of features derived from Twitter posts, namely temporal, spatial, and textual content. We evaluate our framework on a large-scale, real-world dataset from Twitter. Furthermore, we apply our event detection system to a large corpus of tweets posted during the August 2011 riots in England. We use ground-truth data based on intelligence gathered by the London Metropolitan Police Service, which provides a record of actual terrestrial events and incidents during the riots, and show that our system can perform as well as terrestrial sources, and even better in some cases

    Diffusion of Falsehoods on Social Media

    Get PDF
    Misinformation has captured the interest of academia in recent years with several studies looking at the topic broadly. However, these studies mostly focused on rumors which are social in nature and can be either classified as false or real. In this research, we attempt to bridge the gap in the literature by examining the impacts of user characteristics and feature contents on the diffusion of (mis)information using verified true and false information. We apply a topic allocation model augmented by both supervised and unsupervised machine learning algorithms to identify tweets on novel topics. We find that retweet count is higher for fake news, novel tweets, and tweets with negative sentiment and lower lexical structure. In addition, our results show that the impacts of sentiment are opposite for fake news versus real news. We also find that tweets on the environment have a lower retweet count than the baseline religious news and real social news tweets are shared more often than fake social news. Furthermore, our studies show the counter intuitive nature of current correction endeavors by FEMA and other fact checking organizations in combating falsehoods. Specifically, we show that even though fake news causes an increase in correction messages, they influenced the propagation of falsehoods. Finally our empirical results reveal that correction messages, positive tweets and emotionally charged tweets morph faster. Furthermore, we show that tweets with positive sentiment or are emotionally charged morph faster over time. Word count and past morphing history also positively affect morphing behavior

    Press media impact of the Cumbre Vieja volcano activity in the island of La Palma (Canary Islands): A machine learning and sentiment analysis of the news published during the volcanic eruption of 2021

    Get PDF
    In this work we have used as a source of information a large sample of the press articles published during 2021 about the eruption of the Cumbre Vieja volcano in the island of La Palma (Canary Islands). In contraposition, the scientific papers evaluating different facets of natural disasters have preferentially used social networks as a source of information. Herein we have shown how the emotions and sentiments expressed in press media can be efficiently analyzed via AI techniques to better assess the social impact of a disaster at the time it takes place. We have also gauged the usefulness of different classifiers combining sentiment analysis with multivariate statistical analysis and machine learning techniques. By applying this methodology, we were able to classify a newspaper article within a certain time frame of the eruption, and we observed significant differences between local news published in Spanish and those of foreign newspapers written in English. We also found different emotional trajectories of articles by applying the Fourier transform onto the inner “valence” progress along each article narrative time. In addition, there appeared a significant relationship between the surface area occupied by lava and the emotions and sentiments expressed in the articles—many other correlations and causalities could be explored too. The main findings of this research may constitute a helpful resource for a better understanding of the way press media react to volcanic activity, and may guide in public decisionmaking under different temporal horizons, including the design of improved strategies in the risk reduction domain

    Event identification in social media using classification-clustering framework

    Get PDF
    In recent years, there has been increased interest in real-world event detection using publicly accessible data made available through Internet technology such as Twitter, Facebook and YouTube. In these highly interactive systems the general public are able to post real-time reactions to “real world" events - thereby acting as social sensors of terrestrial activity. Automatically detecting and categorizing events, particularly smallscale incidents, using streamed data is a non-trivial task, due to the heterogeneity, the scalability and the varied quality of the data as well as the presence of noise and irrelevant information. However, it would be of high value to public safety organisations such as local police, who need to respond accordingly. To address these challenges we present an end-to-end integrated event detection framework which comprises five main components: data collection, pre-processing, classification, online clustering and summarization. The integration between classification and clustering enables events to be detected, especially “disruptive events" - incidents that threaten social safety and security, or that could disrupt social order. We present an evaluation of the effectiveness of detecting events using a variety of features derived from Twitter posts, namely: temporal, spatial and textual content. We evaluate our framework on large-scale, realworld datasets from Twitter and Flickr. Furthermore, we apply our event detection system to a large corpus of tweets posted during the August 2011 riots in England. We show that our system can perform as well as terrestrial sources, such as police reports, traditional surveillance, and emergency calls, even better than local police intelligence in most cases. The framework developed in this thesis provides a scalable, online solution, to handle the high volume of social media documents in different languages including English, Arabic, Eastern languages such as Chinese, and many Latin languages. Moreover, event detection is a concept that is crucial to the assurance of public safety surrounding real-world events. Decision makers use information from a range of terrestrial and online sources to help inform decisions that enable them to develop policies and react appropriately to events as they unfold. Due to the heterogeneity and scale of the data and the fact that some messages are more salient than others for the purposes of understanding any risk to human safety and managing any disruption caused by events, automatic summarization of event-related microblogs is a non-trivial and important problem. In this thesis we tackle the task of automatic summarization of Twitter posts, and present three methods that produce summaries by selecting the most representative posts from real-world tweet-event clusters. To evaluate our approaches, we compare them to the state-of-the-art summarization systems and human generated summaries. Our results show that our proposed methods outperform all the other summarization systems for English and non-English corpora

    Social informatics

    Get PDF
    5th International Conference, SocInfo 2013, Kyoto, Japan, November 25-27, 2013, Proceedings</p

    Big Data Computing for Geospatial Applications

    Get PDF
    The convergence of big data and geospatial computing has brought forth challenges and opportunities to Geographic Information Science with regard to geospatial data management, processing, analysis, modeling, and visualization. This book highlights recent advancements in integrating new computing approaches, spatial methods, and data management strategies to tackle geospatial big data challenges and meanwhile demonstrates opportunities for using big data for geospatial applications. Crucial to the advancements highlighted in this book is the integration of computational thinking and spatial thinking and the transformation of abstract ideas and models to concrete data structures and algorithms

    Courting Disaster: An Analysis of Federal Government Twitter Usage during Hurricane Sandy Resulting in a Suggested Model for Future Disaster Response

    Get PDF
    abstract: ABSTRACT This dissertation examined how seven federal agencies utilized Twitter during a major natural disaster, Hurricane Sandy. Data collected included tweets between October 26-31, 2012 via TweetTracker, as well as federal social media policy doctrines and elite interviews, to discern patterns in the guidance provided to federal public information officers (PIOs). While scholarly research cites successful local and state government efforts utilizing social media to improve response efforts in a two-way communications interaction, no substantive research addresses social media’s role in crisis response capabilities at the federal level. This study contributes to the literature in three ways: it focuses solely on the use of social media by federal agencies in a crisis setting; it illuminates policy directives that often hamper federal crisis communication response efforts; and it suggests a proposed model that channels the flow of social media content for PIOs. This is especially important to the safety of the nation moving forward, since crises have increased. Additionally, Twitter was adopted only recently as an official communications tool in 2013. Prior to 2013, social media was applied informally and inconsistently. The findings of this study reveal a reliance upon a one-way, passive communication approach in social media federal policy directives, as well as vague guidelines in existing crisis communications models. Both dimensions are counter to risk management and crisis communication research, which embrace two-way interactivity with audiences and specific messaging that bolsters community engagement, which are vital to the role of the PIO. The resulting model enables the PIO to provide relevant information to key internal agencies and external audiences in response to a future crisis.Dissertation/ThesisCrisis Tweet Text and DataDoctoral Dissertation Mass Communication 201
    corecore