263 research outputs found
Insights from Machine-Learned Diet Success Prediction
To support people trying to lose weight and stay healthy, more and more
fitness apps have sprung up including the ability to track both calories intake
and expenditure. Users of such apps are part of a wider ``quantified self''
movement and many opt-in to publicly share their logged data. In this paper, we
use public food diaries of more than 4,000 long-term active MyFitnessPal users
to study the characteristics of a (un-)successful diet. Concretely, we train a
machine learning model to predict repeatedly being over or under self-set daily
calories goals and then look at which features contribute to the model's
prediction. Our findings include both expected results, such as the token
``mcdonalds'' or the category ``dessert'' being indicative for being over the
calories goal, but also less obvious ones such as the difference between pork
and poultry concerning dieting success, or the use of the ``quick added
calories'' functionality being indicative of over-shooting calorie-wise. This
study also hints at the feasibility of using such data for more in-depth data
mining, e.g., looking at the interaction between consumed foods such as mixing
protein- and carbohydrate-rich foods. To the best of our knowledge, this is the
first systematic study of public food diaries.Comment: Preprint of an article appearing at the Pacific Symposium on
Biocomputing (PSB) 2016 in the Social Media Mining for Public Health
Monitoring and Surveillance trac
#greysanatomy vs. #yankees: Demographics and Hashtag Use on Twitter
Demographics, in particular, gender, age, and race, are a key predictor of
human behavior. Despite the significant effect that demographics plays, most
scientific studies using online social media do not consider this factor,
mainly due to the lack of such information. In this work, we use
state-of-the-art face analysis software to infer gender, age, and race from
profile images of 350K Twitter users from New York. For the period from
November 1, 2014 to October 31, 2015, we study which hashtags are used by
different demographic groups. Though we find considerable overlap for the most
popular hashtags, there are also many group-specific hashtags.Comment: This is a preprint of an article appearing at ICWSM 201
Co-Following on Twitter
We present an in-depth study of co-following on Twitter based on the
observation that two Twitter users whose followers have similar friends are
also similar, even though they might not share any direct links or a single
mutual follower. We show how this observation contributes to (i) a better
understanding of language-agnostic user classification on Twitter, (ii)
eliciting opportunities for Computational Social Science, and (iii) improving
online marketing by identifying cross-selling opportunities.
We start with a machine learning problem of predicting a user's preference
among two alternative choices of Twitter friends. We show that co-following
information provides strong signals for diverse classification tasks and that
these signals persist even when (i) the most discriminative features are
removed and (ii) only relatively "sparse" users with fewer than 152 but more
than 43 Twitter friends are considered.
Going beyond mere classification performance optimization, we present
applications of our methodology to Computational Social Science. Here we
confirm stereotypes such as that the country singer Kenny Chesney
(@kennychesney) is more popular among @GOP followers, whereas Lady Gaga
(@ladygaga) enjoys more support from @TheDemocrats followers.
In the domain of marketing we give evidence that celebrity endorsement is
reflected in co-following and we demonstrate how our methodology can be used to
reveal the audience similarities between Apple and Puma and, less obviously,
between Nike and Coca-Cola. Concerning a user's popularity we find a
statistically significant connection between having a more "average"
followership and having more followers than direct rivals. Interestingly, a
\emph{larger} audience also seems to be linked to a \emph{less diverse}
audience in terms of their co-following.Comment: full version of a short paper at Hypertext 201
Sponsored Search, Market Equilibria, and the Hungarian Method
Matching markets play a prominent role in economic theory. A prime example of
such a market is the sponsored search market. Here, as in other markets of that
kind, market equilibria correspond to feasible, envy free, and bidder optimal
outcomes. For settings without budgets such an outcome always exists and can be
computed in polynomial-time by the so-called Hungarian Method. Moreover, every
mechanism that computes such an outcome is incentive compatible. We show that
the Hungarian Method can be modified so that it finds a feasible, envy free,
and bidder optimal outcome for settings with budgets. We also show that in
settings with budgets no mechanism that computes such an outcome can be
incentive compatible for all inputs. For inputs in general position, however,
the presented mechanism---as any other mechanism that computes such an outcome
for settings with budgets---is incentive compatible
- …