118 research outputs found
Hot Streaks on Social Media
Measuring the impact and success of human performance is common in various
disciplines, including art, science, and sports. Quantifying impact also plays
a key role on social media, where impact is usually defined as the reach of a
user's content as captured by metrics such as the number of views, likes,
retweets, or shares. In this paper, we study entire careers of Twitter users to
understand properties of impact. We show that user impact tends to have certain
characteristics: First, impact is clustered in time, such that the most
impactful tweets of a user appear close to each other. Second, users commonly
have 'hot streaks' of impact, i.e., extended periods of high-impact tweets.
Third, impact tends to gradually build up before, and fall off after, a user's
most impactful tweet. We attempt to explain these characteristics using various
properties measured on social media, including the user's network, content,
activity, and experience, and find that changes in impact are associated with
significant changes in these properties. Our findings open interesting avenues
for future research on virality and influence on social media.Comment: Accepted as a full paper at ICWSM 2019. Please cite the ICWSM versio
Co-Following on Twitter
We present an in-depth study of co-following on Twitter based on the
observation that two Twitter users whose followers have similar friends are
also similar, even though they might not share any direct links or a single
mutual follower. We show how this observation contributes to (i) a better
understanding of language-agnostic user classification on Twitter, (ii)
eliciting opportunities for Computational Social Science, and (iii) improving
online marketing by identifying cross-selling opportunities.
We start with a machine learning problem of predicting a user's preference
among two alternative choices of Twitter friends. We show that co-following
information provides strong signals for diverse classification tasks and that
these signals persist even when (i) the most discriminative features are
removed and (ii) only relatively "sparse" users with fewer than 152 but more
than 43 Twitter friends are considered.
Going beyond mere classification performance optimization, we present
applications of our methodology to Computational Social Science. Here we
confirm stereotypes such as that the country singer Kenny Chesney
(@kennychesney) is more popular among @GOP followers, whereas Lady Gaga
(@ladygaga) enjoys more support from @TheDemocrats followers.
In the domain of marketing we give evidence that celebrity endorsement is
reflected in co-following and we demonstrate how our methodology can be used to
reveal the audience similarities between Apple and Puma and, less obviously,
between Nike and Coca-Cola. Concerning a user's popularity we find a
statistically significant connection between having a more "average"
followership and having more followers than direct rivals. Interestingly, a
\emph{larger} audience also seems to be linked to a \emph{less diverse}
audience in terms of their co-following.Comment: full version of a short paper at Hypertext 201
A Motif-based Approach for Identifying Controversy
Among the topics discussed in Social Media, some lead to controversy. A
number of recent studies have focused on the problem of identifying controversy
in social media mostly based on the analysis of textual content or rely on
global network structure. Such approaches have strong limitations due to the
difficulty of understanding natural language, and of investigating the global
network structure. In this work we show that it is possible to detect
controversy in social media by exploiting network motifs, i.e., local patterns
of user interaction. The proposed approach allows for a language-independent
and fine- grained and efficient-to-compute analysis of user discussions and
their evolution over time. The supervised model exploiting motif patterns can
achieve 85% accuracy, with an improvement of 7% compared to baseline
structural, propagation-based and temporal network features
WhatsApp Explorer: A Data Donation Tool To Facilitate Research on WhatsApp
In recent years, reports and anecdotal evidence pointing at the role of
WhatsApp in a variety of events, ranging from elections to collective violence,
have emerged. While academic research should examine the validity of these
claims, obtaining WhatsApp data for research is notably challenging,
contrasting with the relative abundance of data from platforms like Facebook
and Twitter, where user "information diets" have been extensively studied. This
lack of data is particularly problematic since misinformation and hate speech
are major concerns in the set of Global South countries in which WhatsApp
dominates the market for messaging. To help make research on these questions,
and more generally research on WhatsApp, possible, this paper introduces
WhatsApp Explorer, a tool designed to enable WhatsApp data collection on a
large scale. We discuss protocols for data collection, including potential
sampling approaches, and explain why our tool (and adjoining protocol) arguably
allow researchers to collect WhatsApp data in an ethical and legal manner, at
scale
Unraveling the Dynamics of Television Debates and Social Media Engagement: Insights from an Indian News Show
The relationship between television shows and social media has become
increasingly intertwined in recent years. Social media platforms, particularly
Twitter, have emerged as significant sources of public opinion and discourse on
topics discussed in television shows. In India, news debates leverage the
popularity of social media to promote hashtags and engage users in discussions
and debates on a daily basis.
This paper focuses on the analysis of one of India's most prominent and
widely-watched TV news debate shows: "Arnab Goswami-The Debate". The study
examines the content of the show by analyzing the hashtags used to promote it
and the social media data corresponding to these hashtags. The findings reveal
that the show exhibits a strong bias towards the ruling Bharatiya Janata Party
(BJP), with over 60% of the debates featuring either pro-BJP or anti-opposition
content. Social media support for the show primarily comes from BJP supporters.
Notably, BJP leaders and influencers play a significant role in promoting the
show on social media, leveraging their existing networks and resources to
artificially trend specific hashtags. Furthermore, the study uncovers a
reciprocal flow of information between the TV show and social media. We find
evidence that the show's choice of topics is linked to social media posts made
by party workers, suggesting a dynamic interplay between traditional media and
online platforms.
By exploring the complex interaction between television debates and social
media support, this study contributes to a deeper understanding of the evolving
relationship between these two domains in the digital age. The findings hold
implications for media researchers and practitioners, offering insights into
the ways in which social media can influence traditional media and vice versa.Comment: Accepted at ICWSM 2024. Please cite the ICWSM versio
Professional Gender Gaps Across US Cities
Gender imbalances in work environments have been a long-standing concern.
Identifying the existence of such imbalances is key to designing policies to
help overcome them. In this work, we study gender trends in employment across
various dimensions in the United States. This is done by analyzing anonymous,
aggregate statistics that were extracted from LinkedIn's advertising platform.
The data contain the number of male and female LinkedIn users with respect to
(i) location, (ii) age, (iii) industry and (iv) certain skills. We studied
which of these categories correlate the most with high relative male or female
presence on LinkedIn. In addition to examining the summary statistics of the
LinkedIn data, we model the gender balance as a function of the different
employee features using linear regression. Our results suggest that the gender
gap varies across all feature types, but the differences are most profound
among industries and skills. A high correlation between gender ratios of people
in our LinkedIn data set and data provided by the US Bureau of Labor Statistics
serves as external validation for our results.Comment: Accepted at a poster at ICWSM 2018. Please cite the ICWSM versio
- …