Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback
We study a multi-armed bandit problem in a dynamic environment where arm
rewards evolve in a correlated fashion according to a Markov chain. Unlike
much of the work on related problems, in our formulation a learning
algorithm does not have access to either a priori information or observations
of the state of the Markov chain and only observes smoothed reward feedback
following time intervals we refer to as epochs. We demonstrate that existing
methods such as UCB and ε-greedy can suffer linear regret in such
an environment. Employing mixing-time bounds on Markov chains, we develop
algorithms called EpochUCB and EpochGreedy that draw inspiration from the
aforementioned methods, yet which admit sublinear regret guarantees for the
problem formulation. Our proposed algorithms proceed in epochs in which an arm
is played repeatedly for a number of iterations that grows linearly as a
function of the number of times an arm has been played in the past. We analyze
these algorithms under two types of smoothed reward feedback at the end of each
epoch: a reward that is the discount-average of the discounted rewards within
an epoch, and a reward that is the time-average of the rewards within an epoch.

Comment: Significant revision of prior version, including a deeper discussion of related work, gap-independent regret bounds, and regret bounds for discounted reward.
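The epoch structure described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes an epoch-length slope of 1 (epoch length equal to one plus the number of past plays of the arm), uses time-averaged reward feedback, and substitutes i.i.d. Bernoulli arms for the paper's correlated Markovian rewards.

```python
import math
import random

def epoch_ucb(means, horizon, seed=0):
    """Sketch of an EpochUCB-style algorithm: each arm selection lasts an
    epoch whose length grows linearly with how often that arm was chosen
    before, and only the time-averaged reward over the epoch is observed.

    `means` are Bernoulli arm means, a simplifying stand-in for the
    paper's Markovian reward processes."""
    rng = random.Random(seed)
    k = len(means)
    plays = [0] * k   # number of epochs each arm has been selected
    est = [0.0] * k   # running mean of epoch-averaged rewards per arm
    t = 0             # total iterations consumed so far
    while t < horizon:
        if min(plays) == 0:
            arm = plays.index(0)  # select each arm once to initialize
        else:
            n = sum(plays)
            arm = max(range(k),
                      key=lambda a: est[a] + math.sqrt(2 * math.log(n) / plays[a]))
        epoch_len = plays[arm] + 1  # linear epoch growth (assumed slope 1)
        # Only the time-average of rewards within the epoch is revealed.
        avg = sum(rng.random() < means[arm] for _ in range(epoch_len)) / epoch_len
        plays[arm] += 1
        est[arm] += (avg - est[arm]) / plays[arm]
        t += epoch_len
    return plays
```

Over a long horizon the index rule should concentrate epochs on the better arm, e.g. `epoch_ucb([0.9, 0.1], 5000)` selects the first arm in most epochs.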
Decompensation during chronic respiratory failure in children
Optimizing the pre-hospital management of hemorrhagic limb wounds in children