47,285 research outputs found
Learning Collective Behavior in Multi-relational Networks
With the rapid expansion of the Internet and WWW, the problem of analyzing social media data has received an increasing amount of attention in the past decade. The boom in social media platforms offers many possibilities to study human collective behavior and interactions on an unprecedented scale. In the past, much work has been done on the problem of learning from networked data with homogeneous topologies, where instances are explicitly or implicitly inter-connected by a single type of relationship. In contrast to traditional content-only classification methods, relational learning succeeds in improving classification performance by leveraging the correlation of the labels between linked instances. However, networked data extracted from social media, web pages, and bibliographic databases can contain entities of multiple classes and linked by various causal reasons, hence treating all links in a homogeneous way can limit the performance of relational classifiers. Learning the collective behavior and interactions in heterogeneous networks becomes much more complex. The contribution of this dissertation include 1) two classification frameworks for identifying human collective behavior in multi-relational social networks; 2) unsupervised and supervised learning models for relationship prediction in multi-relational collaborative networks. Our methods improve the performance of homogeneous predictive models by differentiating heterogeneous relations and capturing the prominent interaction patterns underlying the network structure. The work has been evaluated in various real-world social networks. We believe that this study will be useful for analyzing human collective behavior and interactions specifically in the scenario when the heterogeneous relationships in the network arise from various causal reasons
Analyzing digital societal interactions and sentiment classification in Twitter (X) during critical events in Chile
This study explores the influence of social media content on societal attitudes and actions during critical events, with a special focus on occurrences in Chile, such as the COVID-19 pandemic, the 2019 protests, and the wildfires in 2017 and 2023. By leveraging a novel tweet dataset, this study introduces new metrics for assessing sentiment, inclusivity, engagement, and impact, thereby providing a comprehensive framework for analyzing social media dynamics. The methodology employed enhances sentiment classification through the use of a Deep Random Vector Functional Link (D-RVFL) neural network, which demonstrates superior performance over traditional models such as Support Vector Machines (SVM), naive Bayes, and back propagation (BP) neural networks, achieving an overall average accuracy of 78.30% (0.17). This advancement is attributed to deep learning techniques with direct input–output connections that facilitate faster and more precise sentiment classification. This analysis differentiates the roles of influencers, press radio, and television handlers during crises, revealing how various social media actors affect information dissemination and audience engagement. By dissecting online behaviors and classifying sentiments using the RVFL network, this study sheds light on the effects of the digital landscape on societal attitudes and actions during emergencies. These findings underscore the importance of understanding the nuances of social media engagement to develop more effective crisis communication strategies
False News On Social Media: A Data-Driven Survey
In the past few years, the research community has dedicated growing interest
to the issue of false news circulating on social networks. The widespread
attention on detecting and characterizing false news has been motivated by
considerable backlashes of this threat against the real world. As a matter of
fact, social media platforms exhibit peculiar characteristics, with respect to
traditional news outlets, which have been particularly favorable to the
proliferation of deceptive information. They also present unique challenges for
all kind of potential interventions on the subject. As this issue becomes of
global concern, it is also gaining more attention in academia. The aim of this
survey is to offer a comprehensive study on the recent advances in terms of
detection, characterization and mitigation of false news that propagate on
social media, as well as the challenges and the open questions that await
future research on the field. We use a data-driven approach, focusing on a
classification of the features that are used in each study to characterize
false information and on the datasets used for instructing classification
methods. At the end of the survey, we highlight emerging approaches that look
most promising for addressing false news
Topology comparison of Twitter diffusion networks effectively reveals misleading information
In recent years, malicious information had an explosive growth in social
media, with serious social and political backlashes. Recent important studies,
featuring large-scale analyses, have produced deeper knowledge about this
phenomenon, showing that misleading information spreads faster, deeper and more
broadly than factual information on social media, where echo chambers,
algorithmic and human biases play an important role in diffusion networks.
Following these directions, we explore the possibility of classifying news
articles circulating on social media based exclusively on a topological
analysis of their diffusion networks. To this aim we collected a large dataset
of diffusion networks on Twitter pertaining to news articles published on two
distinct classes of sources, namely outlets that convey mainstream, reliable
and objective information and those that fabricate and disseminate various
kinds of misleading articles, including false news intended to harm, satire
intended to make people laugh, click-bait news that may be entirely factual or
rumors that are unproven. We carried out an extensive comparison of these
networks using several alignment-free approaches including basic network
properties, centrality measures distributions, and network distances. We
accordingly evaluated to what extent these techniques allow to discriminate
between the networks associated to the aforementioned news domains. Our results
highlight that the communities of users spreading mainstream news, compared to
those sharing misleading news, tend to shape diffusion networks with subtle yet
systematic differences which might be effectively employed to identify
misleading and harmful information.Comment: A revised new version is available on Scientific Report
TwitterMancer: Predicting Interactions on Twitter Accurately
This paper investigates the interplay between different types of user
interactions on Twitter, with respect to predicting missing or unseen
interactions. For example, given a set of retweet interactions between Twitter
users, how accurately can we predict reply interactions? Is it more difficult
to predict retweet or quote interactions between a pair of accounts? Also, how
important is time locality, and which features of interaction patterns are most
important to enable accurate prediction of specific Twitter interactions? Our
empirical study of Twitter interactions contributes initial answers to these
questions.
We have crawled an extensive dataset of Greek-speaking Twitter accounts and
their follow, quote, retweet, reply interactions over a period of a month.
We find we can accurately predict many interactions of Twitter users.
Interestingly, the most predictive features vary with the user profiles, and
are not the same across all users.
For example, for a pair of users that interact with a large number of other
Twitter users, we find that certain "higher-dimensional" triads, i.e., triads
that involve multiple types of interactions, are very informative, whereas for
less active Twitter users, certain in-degrees and out-degrees play a major
role. Finally, we provide various other insights on Twitter user behavior.
Our code and data are available at https://github.com/twittermancer/.
Keywords: Graph mining, machine learning, social media, social network
Exploratory Analysis of Pairwise Interactions in Online Social Networks
In the last few decades sociologists were trying to explain human behaviour
by analysing social networks, which requires access to data about interpersonal
relationships. This represented a big obstacle in this research field until the
emergence of online social networks (OSNs), which vastly facilitated the
process of collecting such data. Nowadays, by crawling public profiles on OSNs,
it is possible to build a social graph where "friends" on OSN become
represented as connected nodes. OSN connection does not necessarily indicate a
close real-life relationship, but using OSN interaction records may reveal
real-life relationship intensities, a topic which inspired a number of recent
researches. Still, published research currently lacks an extensive exploratory
analysis of OSN interaction records, i.e. a comprehensive overview of users'
interaction via different ways of OSN interaction. In this paper we provide
such an overview by leveraging results of conducted extensive social experiment
which managed to collect records for over 3,200 Facebook users interacting with
over 1,400,000 of their friends. Our exploratory analysis focuses on extracting
population distributions and correlation parameters for 13 interaction
parameters, providing valuable insight in online social network interaction for
future researches aimed at this field of study.Comment: Journal Article published 2 Oct 2017 in Automatika volume 58 issue 4
on pages 422 to 42
Social Fingerprinting: detection of spambot groups through DNA-inspired behavioral modeling
Spambot detection in online social networks is a long-lasting challenge
involving the study and design of detection techniques capable of efficiently
identifying ever-evolving spammers. Recently, a new wave of social spambots has
emerged, with advanced human-like characteristics that allow them to go
undetected even by current state-of-the-art algorithms. In this paper, we show
that efficient spambots detection can be achieved via an in-depth analysis of
their collective behaviors exploiting the digital DNA technique for modeling
the behaviors of social network users. Inspired by its biological counterpart,
in the digital DNA representation the behavioral lifetime of a digital account
is encoded in a sequence of characters. Then, we define a similarity measure
for such digital DNA sequences. We build upon digital DNA and the similarity
between groups of users to characterize both genuine accounts and spambots.
Leveraging such characterization, we design the Social Fingerprinting
technique, which is able to discriminate among spambots and genuine accounts in
both a supervised and an unsupervised fashion. We finally evaluate the
effectiveness of Social Fingerprinting and we compare it with three
state-of-the-art detection algorithms. Among the peculiarities of our approach
is the possibility to apply off-the-shelf DNA analysis techniques to study
online users behaviors and to efficiently rely on a limited number of
lightweight account characteristics
Listening between the Lines: Learning Personal Attributes from Conversations
Open-domain dialogue agents must be able to converse about many topics while
incorporating knowledge about the user into the conversation. In this work we
address the acquisition of such knowledge, for personalization in downstream
Web applications, by extracting personal attributes from conversations. This
problem is more challenging than the established task of information extraction
from scientific publications or Wikipedia articles, because dialogues often
give merely implicit cues about the speaker. We propose methods for inferring
personal attributes, such as profession, age or family status, from
conversations using deep learning. Specifically, we propose several Hidden
Attribute Models, which are neural networks leveraging attention mechanisms and
embeddings. Our methods are trained on a per-predicate basis to output rankings
of object values for a given subject-predicate combination (e.g., ranking the
doctor and nurse professions high when speakers talk about patients, emergency
rooms, etc). Experiments with various conversational texts including Reddit
discussions, movie scripts and a collection of crowdsourced personal dialogues
demonstrate the viability of our methods and their superior performance
compared to state-of-the-art baselines.Comment: published in WWW'1
- …