578 research outputs found

Thompson Sampling: An Asymptotically Optimal Finite Time Analysis

Author: A. Salomon
B.C. May
J.-Y. Audibert
O.C. Granmo
P. Auer
T.L. Lai
W.R. Thompson
Publication date: 01/01/2012

The question of the optimality of Thompson Sampling for solving the stochastic multi-armed bandit problem had been open since 1933. In this paper we answer it positively for the case of Bernoulli rewards by providing the first finite-time analysis that matches the asymptotic rate given in the Lai and Robbins lower bound for the cumulative regret. The proof is accompanied by a numerical comparison with other optimal policies, experiments that have been lacking in the literature until now for the Bernoulli case.

Comment: 15 pages, 2 figures, submitted to ALT (Algorithmic Learning Theory)
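For readers unfamiliar with the policy this abstract analyzes, the following is a minimal sketch of Thompson Sampling for Bernoulli rewards. It is not taken from the paper: the uniform Beta(1, 1) prior, the function name `thompson_sampling`, and the simulated arm means are assumptions made for illustration.

```python
import random


def thompson_sampling(true_means, horizon, seed=0):
    """Simulate Thompson Sampling on Bernoulli arms (illustrative sketch).

    true_means: hypothetical success probabilities, unknown to the policy.
    Returns the number of pulls per arm and the cumulative pseudo-regret.
    """
    rng = random.Random(seed)
    k = len(true_means)
    successes = [0] * k
    failures = [0] * k
    best_mean = max(true_means)
    regret = 0.0

    for _ in range(horizon):
        # Draw one sample per arm from its Beta posterior
        # (assumed uniform Beta(1, 1) prior).
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(k)
        ]
        # Play the arm whose posterior sample is largest.
        arm = max(range(k), key=samples.__getitem__)
        reward = 1 if rng.random() < true_means[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        # Pseudo-regret: gap between the best mean and the played arm's mean.
        regret += best_mean - true_means[arm]

    pulls = [successes[a] + failures[a] for a in range(k)]
    return pulls, regret
```

Calling `thompson_sampling([0.1, 0.9], 2000)` returns the pull counts and the pseudo-regret of one simulated run; the regret bound discussed in the abstract concerns the expectation of that quantity.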

arXiv.org e-Print Archive

HAL - Lille 3

Crossref

INRIA a CCSD electronic archive server

Age-acquired resistance and predisposition to reinfection with Schistosoma haematobium after treatment with Praziquantel in Mali

Author: Audibert M.
Dabo A.
Etard Jean-François
Publication date: 01/01/1995

Horizon / Pleins textes

Spectral Sparsification and Regret Minimization Beyond Matrix Multiplicative Updates

Author: Audibert J.-Y.
Ben-Tal A.
Hazan E.
Naor A.
Orecchia L.
Rakhlin A.
Shalev-Shwartz S.
Zinkevich M.
Publication date: 16/06/2015

In this paper, we provide a novel construction of the linear-sized spectral sparsifiers of Batson, Spielman and Srivastava [BSS14]. While previous constructions required $\Omega(n^4)$ running time [BSS14, Zou12], our sparsification routine can be implemented in almost-quadratic running time $O(n^{2+\varepsilon})$. The fundamental conceptual novelty of our work is the leveraging of a strong connection between sparsification and a regret minimization problem over density matrices. This connection was known to provide an interpretation of the randomized sparsifiers of Spielman and Srivastava [SS11] via the application of matrix multiplicative weight updates (MWU) [CHS11, Vis14]. In this paper, we explain how matrix MWU naturally arises as an instance of the Follow-the-Regularized-Leader framework and generalize this approach to yield a larger class of updates. This new class allows us to accelerate the construction of linear-sized spectral sparsifiers, and give novel insights on the motivation behind Batson, Spielman and Srivastava [BSS14].

arXiv.org e-Print Archive

Crossref

Functional Sequential Treatment Allocation

Author: Audibert J.
Cassel A.
Degenne R.
Garivier A.
Lambert P. J.
Rosenbluth G.
Sani A.
Schutz R. R.
Thurow L. C.
Publication date: 29/01/2020

Consider a setting in which a policy maker assigns subjects to treatments, observing each outcome before the next subject arrives. Initially, it is unknown which treatment is best, but the sequential nature of the problem permits learning about the effectiveness of the treatments. While the multi-armed-bandit literature has shed much light on the situation when the policy maker compares the effectiveness of the treatments through their mean, much less is known about other targets. This is restrictive, because a cautious decision maker may prefer to target a robust location measure such as a quantile or a trimmed mean. Furthermore, socio-economic decision making often requires targeting purpose-specific characteristics of the outcome distribution, such as its inherent degree of inequality, welfare or poverty. In the present paper we introduce and study sequential learning algorithms when the distributional characteristic of interest is a general functional of the outcome distribution. Minimax expected regret optimality results are obtained within the subclass of explore-then-commit policies, and for the unrestricted class of all policies.

arXiv.org e-Print Archive

Crossref

DI-fusion

An efficient algorithm for learning with semi-bandit feedback

Author: A. György
A. Kalai
C. Allenberg
D. Suehiro
E. Takimoto
H.B. McMahan
J. Hannan
J. Poland
J.-Y. Audibert
N. Cesa-Bianchi
P. Auer
Publication date: 01/01/2013

We consider the problem of online combinatorial optimization under semi-bandit feedback. The goal of the learner is to sequentially select its actions from a combinatorial decision set so as to minimize its cumulative loss. We propose a learning algorithm for this problem based on combining the Follow-the-Perturbed-Leader (FPL) prediction method with a novel loss estimation procedure called Geometric Resampling (GR). Contrary to previous solutions, the resulting algorithm can be efficiently implemented for any decision set where efficient offline combinatorial optimization is possible at all. Assuming that the elements of the decision set can be described with d-dimensional binary vectors with at most m non-zero entries, we show that the expected regret of our algorithm after T rounds is O(m sqrt(dT log d)). As a side result, we also improve the best known regret bounds for FPL in the full information setting to O(m^(3/2) sqrt(T log d)), gaining a factor of sqrt(d/m) over previous bounds for this algorithm.

Comment: submitted to ALT 201
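The combination of FPL with Geometric Resampling described in this abstract can be illustrated with a small sketch. Everything concrete here is an assumption made for illustration rather than the authors' implementation: the decision set is taken to be all subsets of exactly m out of d components, the perturbations are standard exponentials, and the names `fpl_action`, `geometric_resampling`, `estimate_losses`, and the truncation parameter `cap` are hypothetical.

```python
import random


def fpl_action(cum_loss_est, m, eta, rng):
    """Follow-the-Perturbed-Leader over m-subsets (assumed decision set).

    Each component's scaled cumulative loss estimate is perturbed by an
    exponential random variable; the m smallest perturbed values are chosen.
    """
    d = len(cum_loss_est)
    perturbed = [eta * cum_loss_est[i] - rng.expovariate(1.0) for i in range(d)]
    return set(sorted(range(d), key=perturbed.__getitem__)[:m])


def geometric_resampling(cum_loss_est, m, eta, played, cap, rng):
    """For each played component, count fresh FPL draws until it reappears.

    The count (truncated at `cap`) stands in for the inverse of the
    probability of selecting that component, which is never computed.
    """
    waiting = {}
    pending = set(played)
    for k in range(1, cap + 1):
        if not pending:
            break
        fresh = fpl_action(cum_loss_est, m, eta, rng)
        for i in pending & fresh:
            waiting[i] = k
        pending -= fresh
    for i in pending:
        # Component did not reappear within `cap` draws: truncate.
        waiting[i] = cap
    return waiting


def estimate_losses(observed, waiting):
    """Loss estimate per played component: waiting time times observed loss."""
    return {i: waiting[i] * observed[i] for i in observed}
```

In a full learning loop, one would draw an action with `fpl_action`, observe the losses of its components, call `geometric_resampling` with the same cumulative estimates, and add the output of `estimate_losses` to those estimates before the next round.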

arXiv.org e-Print Archive

Crossref

Do Deep Neural Networks Contribute to Multivariate Time Series Anomaly Detection?

Author: Audibert Julien
Guyard Frédéric
Marti Sébastien
Michiardi Pietro
Zuluaga Maria A.
Publication date: 04/04/2022

Anomaly detection in time series is a complex task that has been widely studied. In recent years, the ability of unsupervised anomaly detection algorithms has received much attention. This trend has led researchers to compare only learning-based methods in their articles, abandoning some more conventional approaches. As a result, the community in this field has been encouraged to propose increasingly complex learning-based models mainly based on deep neural networks. To our knowledge, there are no comparative studies between conventional, machine learning-based, and deep neural network methods for the detection of anomalies in multivariate time series. In this work, we study the anomaly detection performance of sixteen conventional, machine learning-based, and deep neural network approaches on five real-world open datasets. By analyzing and comparing the performance of each of the sixteen methods, we show that no family of methods outperforms the others. Therefore, we encourage the community to reincorporate the three categories of methods in benchmarks for anomaly detection in multivariate time series.

arXiv.org e-Print Archive

Déterminants de la demande de soins en milieu péri-urbain dans un contexte de subvention à Pikine, Sénégal

Author: Audibert M.
Dieng M.
Le Hesran Jean-Yves
Ta Dial A.
Publication venue: CERDI
Publication date: 01/01/2014

Since the 2000s, Senegal has adopted national policies aimed at gradually eliminating direct payment at the point of service in order to make health care more accessible. Implementing these subsidy and free-care policies in a dense, heterogeneous, even motley area presents a particular situation. To understand these interactions and study household behavior regarding demand for care, 5,520 individuals were surveyed four times over the period 2010-2011 in the Dakar suburb of Pikine, and a multinomial probit is estimated to study the population's demand for care when faced with an episode of illness. The results show that the negative effect of price is fairly small on average, but that it varies with income level and the severity of the illness. The perceived quality of care has a positive effect on the use of private health services, for which quality is observed to offset the negative effect of price. The effect of age is not linear, and children, who are more affected by illness, receive few exemptions, or at best partial exemptions, unlike elderly people, who receive full exemption (Plan SESAME).

HAL Clermont Université

HAL Descartes

HAL-IRD

Horizon / Pleins textes

Syndrome de détresse respiratoire aiguë secondaire à une infection à Toxocara cati

Author: A. Chaudeurge
A. Novara
E. Guérot
J. Audibert
J.M. Tadié
J.Y. Fagon
N. Lerolle
Publication venue: Elsevier BV
Publication date: 01/01/2010

Human toxocarosis is a helminthozoonosis due to the migration of Toxocara species larvae throughout the human body. Lung manifestations vary and range from asymptomatic infection to severe disease. Dry cough and chest discomfort are the most common respiratory symptoms. Clinical manifestations include a transient form of Loeffler's syndrome or an eosinophilic pneumonia. We report a case of bilateral pneumonia in an 80-year-old Caucasian man who very rapidly developed acute respiratory distress syndrome, with a PaO2/FiO2 ratio of 55, requiring mechanical ventilation and adrenergic support. There was increased eosinophilia in both blood and bronchoalveolar lavage fluid. Positive Toxocara serology and the clinical picture confirmed the diagnosis of the "visceral larva migrans" syndrome. Intravenous corticosteroid therapy produced a rapid rise in PaO2/FiO2 before the administration of specific treatment. A few cases of acute pneumonia requiring mechanical ventilation due to Toxocara have been published, but this is, to our knowledge, the first reported case of ARDS with multi-organ failure.

HAL Descartes

Okina

Hal-Diderot

PAC-Bayesian Bounds for Randomized Empirical Risk Minimizers

Author: A. Tsybakov
C. Cortes
D. A. McAllester
E. Mammen
J. H. Friedman
J. Rissanen
J.-Y. Audibert
L. Devroye
P. Alquier
R. Schapire
S. Boucheron
T. Zhang
W. Hoeffding
Publication venue: Allerton Press
Publication date: 01/01/2008

The aim of this paper is to generalize the PAC-Bayesian theorems proved by Catoni in the classification setting to more general problems of statistical inference. We show how to control the deviations of the risk of randomized estimators. Particular attention is paid to randomized estimators drawn in a small neighborhood of classical estimators, whose study leads to control of the risk of the latter. These results allow one to bound the risk of very general estimation procedures, as well as to perform model selection.

arXiv.org e-Print Archive

Crossref

Hal-Diderot

HAL-Polytechnique

Gain properties of dye-doped polymer thin films

Author: Audibert J.-F.
Boudreau M.
Brosseau A.
Chénais S.
Djellali N.
Forget S.
Gauvin S.
Gozhyk I.
Haghighi H. Rabbani
Lebental M.
Pansu R.
Ulysse C.
Zyss J.
Publication venue: American Physical Society (APS)
Publication date: 15/07/2015

Hybrid pumping appears as a promising compromise in order to reach the much coveted goal of an electrically pumped organic laser. In such configuration the organic material is optically pumped by an electrically pumped inorganic device on chip. This engineering solution requires therefore an optimization of the organic gain medium under optical pumping. Here, we report a detailed study of the gain features of dye-doped polymer thin films. In particular we introduce the gain efficiency $K$, in order to facilitate comparison between different materials and experimental conditions. The gain efficiency was measured with various setups (pump-probe amplification, variable stripe length method, laser thresholds) in order to study several factors which modify the actual gain of a layer, namely the confinement factor, the pump polarization, the molecular anisotropy, and the re-absorption. For instance, for a 600 nm thick 5 wt% DCM doped PMMA layer, the different experimental approaches give a consistent value $K \simeq 80$ cm.MW$^{-1}$. On the contrary, the usual model predicting the gain from the characteristics of the material leads to an overestimation by two orders of magnitude, which raises a serious problem in the design of actual devices. In this context, we demonstrate the feasibility to infer the gain efficiency from the laser threshold of well-calibrated devices. Besides, temporal measurements at the picosecond scale were carried out to support the analysis.

Comment: 15 pages, 17 figures

arXiv.org e-Print Archive