399 research outputs found
ChoiceRank: Identifying Preferences from Node Traffic in Networks
Understanding how users navigate in a network is of high interest in many
applications. We consider a setting where only aggregate node-level traffic is
observed and tackle the task of learning edge transition probabilities. We cast
it as a preference learning problem, and we study a model where choices follow
Luce's axiom. In this case, the marginal counts of node visits are a
sufficient statistic for the transition probabilities. We show how to
make the inference problem well-posed regardless of the network's structure,
and we present ChoiceRank, an iterative algorithm that scales to networks that
contains billions of nodes and edges. We apply the model to two clickstream
datasets and show that it successfully recovers the transition probabilities
using only the network structure and marginal (node-level) traffic data.
Finally, we also consider an application to mobility networks and apply the
model to one year of rides on New York City's bicycle-sharing system.Comment: Accepted at ICML 201
Just Sort It! A Simple and Effective Approach to Active Preference Learning
We address the problem of learning a ranking by using adaptively chosen
pairwise comparisons. Our goal is to recover the ranking accurately but to
sample the comparisons sparingly. If all comparison outcomes are consistent
with the ranking, the optimal solution is to use an efficient sorting
algorithm, such as Quicksort. But how do sorting algorithms behave if some
comparison outcomes are inconsistent with the ranking? We give favorable
guarantees for Quicksort for the popular Bradley-Terry model, under natural
assumptions on the parameters. Furthermore, we empirically demonstrate that
sorting algorithms lead to a very simple and effective active learning
strategy: repeatedly sort the items. This strategy performs as well as
state-of-the-art methods (and much better than random sampling) at a minuscule
fraction of the computational cost.Comment: Accepted at ICML 201
The Entropy of Conditional Markov Trajectories
To quantify the randomness of Markov trajectories with fixed initial and
final states, Ekroot and Cover proposed a closed-form expression for the
entropy of trajectories of an irreducible finite state Markov chain. Numerous
applications, including the study of random walks on graphs, require the
computation of the entropy of Markov trajectories conditioned on a set of
intermediate states. However, the expression of Ekroot and Cover does not allow
for computing this quantity. In this paper, we propose a method to compute the
entropy of conditional Markov trajectories through a transformation of the
original Markov chain into a Markov chain that exhibits the desired conditional
distribution of trajectories. Moreover, we express the entropy of Markov
trajectories - a global quantity - as a linear combination of local entropies
associated with the Markov chain states.Comment: Accepted for publication in IEEE Transactions on Information Theor
Mitigating Epidemics through Mobile Micro-measures
Epidemics of infectious diseases are among the largest threats to the quality
of life and the economic and social well-being of developing countries. The
arsenal of measures against such epidemics is well-established, but costly and
insufficient to mitigate their impact. In this paper, we argue that mobile
technology adds a powerful weapon to this arsenal, because (a) mobile devices
endow us with the unprecedented ability to measure and model the detailed
behavioral patterns of the affected population, and (b) they enable the
delivery of personalized behavioral recommendations to individuals in real
time. We combine these two ideas and propose several strategies to generate
such recommendations from mobility patterns. The goal of each strategy is a
large reduction in infections, with a small impact on the normal course of
daily life. We evaluate these strategies over the Orange D4D dataset and show
the benefit of mobile micro-measures, even if only a fraction of the population
participates. These preliminary results demonstrate the potential of mobile
technology to complement other measures like vaccination and quarantines
against disease epidemics.Comment: Presented at NetMob 2013, Bosto
Uncovering latent behaviors in ant colonies
Many biological systems exhibit collective behaviors that strengthen their adaptability to their environment, compared to more solitary species. Describing these behaviors is challenging yet necessary in order to understand these biological systems. We propose a probabilistic model that enables us to uncover the collective behaviors observed in a colony of ants. This model is based on the assumption that the behavior of an individual ant is a time-dependent mixture of latent behaviors that are specific to the whole colony. We apply this model to a large-scale dataset obtained by observing the mobility of nearly 1000 Camponotus fellah ants from six different colonies. Our results indicate that a colony typically exhibits three classes of behaviors, each characterized by a specific spatial distribution and a level of activity. Moreover, these spatial distributions, which are uncovered automatically by our model, match well with the ground truth as manually annotated by domain experts. We further explore the evolution of the behavior of individual ants and show that it is well captured by a second order Markov chain that encodes the fact that the future behavior of an ant depends not only on its current behavior but also on its preceding one
Regression Networks for Meta-Learning Few-Shot Classification
We propose regression networks for the problem of few-shot classification,
where a classifier must generalize to new classes not seen in the training set,
given only a small number of examples of each class. In high dimensional
embedding spaces the direction of data generally contains richer information
than magnitude. Next to this, state-of-the-art few-shot metric methods that
compare distances with aggregated class representations, have shown superior
performance. Combining these two insights, we propose to meta-learn
classification of embedded points by regressing the closest approximation in
every class subspace while using the regression error as a distance metric.
Similarly to recent approaches for few-shot learning, regression networks
reflect a simple inductive bias that is beneficial in this limited-data regime
and they achieve excellent results, especially when more aggregate class
representations can be formed with multiple shots.Comment: 7th ICML Workshop on Automated Machine Learning (2020
Self-Supervised Prototypical Transfer Learning for Few-Shot Classification
Most approaches in few-shot learning rely on costly annotated data related to
the goal task domain during (pre-)training. Recently, unsupervised
meta-learning methods have exchanged the annotation requirement for a reduction
in few-shot classification performance. Simultaneously, in settings with
realistic domain shift, common transfer learning has been shown to outperform
supervised meta-learning. Building on these insights and on advances in
self-supervised learning, we propose a transfer learning approach which
constructs a metric embedding that clusters unlabeled prototypical samples and
their augmentations closely together. This pre-trained embedding is a starting
point for few-shot classification by summarizing class clusters and
fine-tuning. We demonstrate that our self-supervised prototypical transfer
learning approach ProtoTransfer outperforms state-of-the-art unsupervised
meta-learning methods on few-shot tasks from the mini-ImageNet dataset. In
few-shot experiments with domain shift, our approach even has comparable
performance to supervised methods, but requires orders of magnitude fewer
labels.Comment: Extended version of work presented at the 7th ICML Workshop on
Automated Machine Learning (2020). Code available at
https://github.com/indy-lab/ProtoTransfer ; 17 pages, 3 figures, 12 table
- …
