Search CORE

435 research outputs found

Iterated least squares in multiperiod control

Author: Lai T.L.
Robbins Herbert
Publication venue: Published by Elsevier Inc.
Publication date: 31/03/1982
Field of study

Thompson Sampling: An Asymptotically Optimal Finite Time Analysis

Author: A. Salomon
B.C. May
J.-Y. Audibert
J.-Y. Audibert
O.C. Granmo
P. Auer
T.L. Lai
W.R. Thompson
Publication venue
Publication date: 01/01/2012
Field of study

The question of the optimality of Thompson Sampling for solving the stochastic multi-armed bandit problem had been open since 1933. In this paper we answer it positively for the case of Bernoulli rewards by providing the first finite-time analysis that matches the asymptotic rate given in the Lai and Robbins lower bound for the cumulative regret. The proof is accompanied by a numerical comparison with other optimal policies, experiments that have been lacking in the literature until now for the Bernoulli case.Comment: 15 pages, 2 figures, submitted to ALT (Algorithmic Learning Theory

arXiv.org e-Print Archive

HAL - Lille 3

Crossref

INRIA a CCSD electronic archive server

Study of Interaction Modes in Pyrene-Based Fluorescent Organogels

Author: D. Canevet
F. Pop
M. Sallé
N. Avarvari
T.L. Lai
Publication venue
Publication date: 01/01/2014
Field of study

International audienc

HAL Descartes

Okina

Precautionary Measures for Credit Risk Management in Jump Models

Author: Bertoin J.
Dao B.
Kazutoshi Yamazaki
Kyprianou A.E.
Lai T.L.
Lipton A.
Masahiko Egami
Peskir G.
Schmidli H.
Shiryaev A.N.
Øksendal B.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2010
Field of study

Sustaining efficiency and stability by properly controlling the equity to asset ratio is one of the most important and difficult challenges in bank management. Due to unexpected and abrupt decline of asset values, a bank must closely monitor its net worth as well as market conditions, and one of its important concerns is when to raise more capital so as not to violate capital adequacy requirements. In this paper, we model the tradeoff between avoiding costs of delay and premature capital raising, and solve the corresponding optimal stopping problem. In order to model defaults in a bank's loan/credit business portfolios, we represent its net worth by Levy processes, and solve explicitly for the double exponential jump diffusion process and for a general spectrally negative Levy process.Comment: 31 pages, 4 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Asymptotic Normality of a Class of Adaptive Statistics with Applications to Synthetic Data Methods for Censored Regression

Author: Lai T.L.
Ying Z.L.
Zheng Z.K.
Publication venue: Academic Press.
Publication date: 28/02/1995
Field of study

AbstractMotivated by regression analysis of censored survival data, we develop herein a general asymptotic distribution theory for estimators defined by estimating equations of the form ∑ni=1ξ (wi, θ, Ĝn) = 0, in which wi represents observed data, θ is an unknown parameter to be estimated, and Ĝn represents an estimate of some unknown underlying distribution. This general theory is used to establish asymptotic normality of synthetic least squares estimates in censored regression models and to evaluate the covariance matrices of the limiting normal distributions

Elsevier - Publisher Connector

A Neural Networks Committee for the Contextual Bandit Problem

Author: D.E. Rumelhart
E. Kaufmann
G. Tesauro
K. Hornik
L. Bottou
L. Kocsis
P. Auer
P. Auer
P. Auer
R. Feraud
S.M. Kakade
T.L. Lai
W. Thompson
Publication venue
Publication date: 01/01/2014
Field of study

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards. Several neural networks are trained to modelize the value of rewards knowing the context. Two variants, based on multi-experts approach, are proposed to choose online the parameters of multi-layer perceptrons. The proposed algorithms are successfully tested on a large dataset with and without stationarity of rewards.Comment: 21st International Conference on Neural Information Processin

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Revisiting urea-based gelators: strong solvent- and casting-microstructure dependencies and organogel processing using an alumina template

Author: D. Canevet
J.Y. Mevellec
M. Sallé
N. Avarvari
R. Barille
T.L. Lai
Y. Almohamed
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/01/2014
Field of study

Urea-based gelators have been thoroughly characterized through various techniques and exhibit a strong solvent-structuration dependency in both the gel and the xerogel states. In a ground-breaking manner, gels were introduced in alumina membranes, which act as templates, in order to shape these materials and force the alignment of the corresponding self-assembled nanofibers by confinement

Okina

Internal Probing of the Supramolecular Organization of Pyrene-Based Organogelators

Author: D. Canevet
M. Sallé
N. Avarvari
T.L. Lai
Publication venue: 'Wiley'
Publication date: 01/01/2016
Field of study

A thorough study of the unexpected spectroscopic behavior of two new luminescent pyrene-urea-based organogelators is rationalized as a function of their aggregation state and provides a key method to probe the supramolecular organization of the material

Okina