494 research outputs found
Supervised Random Walks: Predicting and Recommending Links in Social Networks
Predicting the occurrence of links is a fundamental problem in networks. In
the link prediction problem we are given a snapshot of a network and would like
to infer which interactions among existing members are likely to occur in the
near future or which existing interactions are we missing. Although this
problem has been extensively studied, the challenge of how to effectively
combine the information from the network structure with rich node and edge
attribute data remains largely open.
We develop an algorithm based on Supervised Random Walks that naturally
combines the information from the network structure with node and edge level
attributes. We achieve this by using these attributes to guide a random walk on
the graph. We formulate a supervised learning task where the goal is to learn a
function that assigns strengths to edges in the network such that a random
walker is more likely to visit the nodes to which new links will be created in
the future. We develop an efficient training algorithm to directly learn the
edge strength estimation function.
Our experiments on the Facebook social graph and large collaboration networks
show that our approach outperforms state-of-the-art unsupervised approaches as
well as approaches that are based on feature extraction
Goal-setting And Achievement In Activity Tracking Apps: A Case Study Of MyFitnessPal
Activity tracking apps often make use of goals as one of their core
motivational tools. There are two critical components to this tool: setting a
goal, and subsequently achieving that goal. Despite its crucial role in how a
number of prominent self-tracking apps function, there has been relatively
little investigation of the goal-setting and achievement aspects of
self-tracking apps.
Here we explore this issue, investigating a particular goal setting and
achievement process that is extensive, recorded, and crucial for both the app
and its users' success: weight loss goals in MyFitnessPal. We present a
large-scale study of 1.4 million users and weight loss goals, allowing for an
unprecedented detailed view of how people set and achieve their goals. We find
that, even for difficult long-term goals, behavior within the first 7 days
predicts those who ultimately achieve their goals, that is, those who lose at
least as much weight as they set out to, and those who do not. For instance,
high amounts of early weight loss, which some researchers have classified as
unsustainable, leads to higher goal achievement rates. We also show that early
food intake, self-monitoring motivation, and attitude towards the goal are
important factors. We then show that we can use our findings to predict goal
achievement with an accuracy of 79% ROC AUC just 7 days after a goal is set.
Finally, we discuss how our findings could inform steps to improve goal
achievement in self-tracking apps
Multiple-channel generalization of Lellouch-Luscher formula
We generalize the Lellouch-Luscher formula, relating weak matrix elements in
finite and infinite volumes, to the case of multiple strongly-coupled decay
channels into two scalar particles. This is a necessary first step on the way
to a lattice QCD calculation of weak decay rates for processes such as D -> pi
pi and D -> KK. We also present a field theoretic derivation of the
generalization of Luscher's finite volume quantization condition to multiple
two-particle channels. We give fully explicit results for the case of two
channels, including a form of the generalized Lellouch-Luscher formula
expressed in terms of derivatives of the energies of finite volume states with
respect to the box size. Our results hold for arbitrary total momentum and for
degenerate or non-degenerate particles.Comment: 16 pages, 2 figures. v3: Added references, clarified relation to and
corrected comments about previous work, and minor stylistic improvements. v4:
Minor clarifications added, typos fixed, references updated---matches
published versio
Loyalty in Online Communities
Loyalty is an essential component of multi-community engagement. When users
have the choice to engage with a variety of different communities, they often
become loyal to just one, focusing on that community at the expense of others.
However, it is unclear how loyalty is manifested in user behavior, or whether
loyalty is encouraged by certain community characteristics.
In this paper we operationalize loyalty as a user-community relation: users
loyal to a community consistently prefer it over all others; loyal communities
retain their loyal users over time. By exploring this relation using a large
dataset of discussion communities from Reddit, we reveal that loyalty is
manifested in remarkably consistent behaviors across a wide spectrum of
communities. Loyal users employ language that signals collective identity and
engage with more esoteric, less popular content, indicating they may play a
curational role in surfacing new material. Loyal communities have denser
user-user interaction networks and lower rates of triadic closure, suggesting
that community-level loyalty is associated with more cohesive interactions and
less fragmentation into subgroups. We exploit these general patterns to predict
future rates of loyalty. Our results show that a user's propensity to become
loyal is apparent from their first interactions with a community, suggesting
that some users are intrinsically loyal from the very beginning.Comment: Extended version of a paper appearing in the Proceedings of ICWSM
2017 (with the same title); please cite the official ICWSM versio
Graph Convolutional Neural Networks for Web-Scale Recommender Systems
Recent advancements in deep neural networks for graph-structured data have
led to state-of-the-art performance on recommender system benchmarks. However,
making these methods practical and scalable to web-scale recommendation tasks
with billions of items and hundreds of millions of users remains a challenge.
Here we describe a large-scale deep recommendation engine that we developed and
deployed at Pinterest. We develop a data-efficient Graph Convolutional Network
(GCN) algorithm PinSage, which combines efficient random walks and graph
convolutions to generate embeddings of nodes (i.e., items) that incorporate
both graph structure as well as node feature information. Compared to prior GCN
approaches, we develop a novel method based on highly efficient random walks to
structure the convolutions and design a novel training strategy that relies on
harder-and-harder training examples to improve robustness and convergence of
the model. We also develop an efficient MapReduce model inference algorithm to
generate embeddings using a trained model. We deploy PinSage at Pinterest and
train it on 7.5 billion examples on a graph with 3 billion nodes representing
pins and boards, and 18 billion edges. According to offline metrics, user
studies and A/B tests, PinSage generates higher-quality recommendations than
comparable deep learning and graph-based alternatives. To our knowledge, this
is the largest application of deep graph embeddings to date and paves the way
for a new generation of web-scale recommender systems based on graph
convolutional architectures.Comment: KDD 201
- …