239 research outputs found
Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries
How many listens will an artist receive on a online radio? How about plays on
a YouTube video? How many of these visits are new or returning users? Modeling
and mining popularity dynamics of social activity has important implications
for researchers, content creators and providers. We here investigate the effect
of revisits (successive visits from a single user) on content popularity. Using
four datasets of social activity, with up to tens of millions media objects
(e.g., YouTube videos, Twitter hashtags or LastFM artists), we show the effect
of revisits in the popularity evolution of such objects. Secondly, we propose
the Phoenix-R model which captures the popularity dynamics of individual
objects. Phoenix-R has the desired properties of being: (1) parsimonious, being
based on the minimum description length principle, and achieving lower root
mean squared error than state-of-the-art baselines; (2) applicable, the model
is effective for predicting future popularity values of objects.Comment: To appear on European Conference on Machine Learning and Principles
and Practice of Knowledge Discovery in Databases 201
Gender Matters! Analyzing Global Cultural Gender Preferences for Venues Using Social Sensing
Gender differences is a phenomenon around the world actively researched by
social scientists. Traditionally, the data used to support such studies is
manually obtained, often through surveys with volunteers. However, due to their
inherent high costs because of manual steps, such traditional methods do not
quickly scale to large-size studies. We here investigate a particular aspect of
gender differences: preferences for venues. To that end we explore the use of
check-in data collected from Foursquare to estimate cultural gender preferences
for venues in the physical world. For that, we first demonstrate that by
analyzing the check-in data in various regions of the world we can find
significant differences in preferences for specific venues between gender
groups. Some of these significant differences reflect well-known cultural
patterns. Moreover, we also gathered evidence that our methodology offers
useful information about gender preference for venues in a given region in the
real world. This suggests that gender and venue preferences observed may not be
independent. Our results suggests that our proposed methodology could be a
promising tool to support studies on gender preferences for venues at different
spatial granularities around the world, being faster and cheaper than
traditional methods, besides quickly capturing changes in the real world
Tagging and Tag Recommendation
Tagging has emerged as one of the best ways of associating metadata with objects (e.g., videos, texts) in Web 2.0 applications. Consisting of freely chosen keywords assigned to objects by users, tags represent a simpler, cheaper, and a more natural way of organizing content than a fixed taxonomy with a controlled vocabulary. Moreover, recent studies have demonstrated that among other textual features such as title, description, and user comments, tags are the most effective to support information retrieval (IR) services such as search, automatic classification, and content recommendation. In this context, tag recommendation services aim at assisting users in the tagging process, allowing users to select some of the recommended tags or to come up with new ones. Besides improving user experience, tag recommendation services potentially improve the quality of the generated tags, benefiting IR services that rely on tags as data sources. Besides the obvious benefit of improving the description of the objects, tag recommendation can be directly applied in IR services such as search and query expansion. In this chapter, we will provide the main concepts related to tagging systems, as well as an overview of tag recommendation techniques, dividing them into two stages of the tag recommendation process: (1) the candidate tag extraction and (2) the candidate tag ranking
A Dataset of Fact-Checked Images Shared on WhatsApp During the Brazilian and Indian Elections
Recently, messaging applications, such as WhatsApp, have been reportedly
abused by misinformation campaigns, especially in Brazil and India. A notable
form of abuse in WhatsApp relies on several manipulated images and memes
containing all kinds of fake stories. In this work, we performed an extensive
data collection from a large set of WhatsApp publicly accessible groups and
fact-checking agency websites. This paper opens a novel dataset to the research
community containing fact-checked fake images shared through WhatsApp for two
distinct scenarios known for the spread of fake news on the platform: the 2018
Brazilian elections and the 2019 Indian elections.Comment: 7 pages. This is a preprint version of an accepted paper on ICWSM'20.
Please, consider to cite the conference version instead of this on
- …