
34 research outputs found

Penalized maximum likelihood and semiparametric second-order efficiency

Author: Dalalyan A. S.
Golubev G. K.
Tsybakov A. B.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2006

We consider the problem of estimation of a shift parameter of an unknown symmetric function in Gaussian white noise. We introduce a notion of semiparametric second-order efficiency and propose estimators that are semiparametrically efficient and second-order efficient in our model. These estimators are of a penalized maximum likelihood type with an appropriately chosen penalty. We argue that second-order efficiency is crucial in semiparametric problems since only the second-order terms in the asymptotic expansion for the risk account for the behavior of the "nonparametric component" of a semiparametric procedure, and they are not dramatically smaller than the first-order terms. Comment: Published at http://dx.doi.org/10.1214/009053605000000895 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

arXiv.org e-Print Archive

CiteSeerX

Crossref

Pivotal estimation in high-dimensional regression via linear programming

Author: A. Antoniadis
A. Belloni
A. Dalalyan
B. Efron
B.Y. Jing
E. Candès
F. Ye
N. Städler
P. Bertail
P. Bickel
P. Bühlmann
P. Rigollet
Publication date: 01/01/2013

We propose a new method of estimation in the high-dimensional linear regression model. It allows for very weak distributional assumptions, including heteroscedasticity, and does not require knowledge of the variance of the random errors. The method is based on linear programming only, so that its numerical implementation is faster than for previously known techniques using conic programs, and it allows one to deal with higher-dimensional models. We provide upper bounds for the estimation and prediction errors of the proposed estimator, showing that it achieves the same rate as in the more restrictive situation of fixed design and i.i.d. Gaussian errors with known variance. Following Gautier and Tsybakov (2011), we obtain the results under weaker sensitivity assumptions than the restricted eigenvalue or assimilated conditions.

arXiv.org e-Print Archive

Crossref

Toulouse Capitole Publications

Toulouse 1 Capitole Publications

HAL-Polytechnique

Time series prediction via aggregation : an oracle bound including numerical cost

Author: A. S. Dalalyan
C. Andrieu
C. Coulon-Prieur
E. Moulines
E. R. Beadle
E. Rio
G. Leung
G. O. Roberts
J. Dedecker
K. L. Mengersen
K. Łatuszyński
N. Cesa-Bianchi
P. Alquier
Y. F. Atchadé
Publication date: 26/05/2014

We address the problem of forecasting a time series meeting the Causal Bernoulli Shift model, using a parametric set of predictors. The aggregation technique provides a predictor with well-established and quite satisfying theoretical properties expressed by an oracle inequality for the prediction risk. The numerical computation of the aggregated predictor usually relies on a Markov chain Monte Carlo method whose convergence should be evaluated. In particular, it is crucial to bound the number of simulations needed to achieve a numerical precision of the same order as the prediction risk. In this direction we present a fairly general result which can be seen as an oracle inequality including the numerical cost of the predictor computation. The numerical cost appears by letting the oracle inequality depend on the number of simulations required in the Monte Carlo approximation. Some numerical experiments are then carried out to support our findings.

arXiv.org e-Print Archive

Crossref

Systems of Hess-Appel'rot Type and Zhukovskii Property

Author: Appel'rot G. G.
Beljaev A. V.
Benenti S.
Bolsinov A. V.
BORISLAV GAJIĆ
BOŽIDAR JOVANOVIĆ
Dalalyan S. G.
Dragović V.
Dubrovin B. A.
Gavrilov L.
Hess W.
Levi-Civita T.
Lichnerowicz A.
Liouville J.
Manakov S. V.
Mescherkov M. V.
Mykytyuk I. V.
Nekrasov P. A.
Panasyuk A.
Routh E. J.
Trofimov V. V.
VLADIMIR DRAGOVIĆ
Zhukovskii N. E.
Zung N. T.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 09/12/2009

We start with a review of a class of systems with invariant relations, the so-called systems of Hess-Appel'rot type, which generalize the classical Hess-Appel'rot rigid body case. The systems of Hess-Appel'rot type carry an interesting combination of both integrable and non-integrable properties. Further, following the integrable line, we study partial reductions and systems having what we call the Zhukovskii property: these are Hamiltonian systems with invariant relations, such that partially reduced systems are completely integrable. We prove that the Zhukovskii property is a quite general characteristic of systems of Hess-Appel'rot type. The partial reduction neglects the most interesting and challenging part of the dynamics of the systems of Hess-Appel'rot type, the non-integrable part, some analysis of which may be seen as a reconstruction problem. We show that an integrable system, the magnetic pendulum on the oriented Grassmannian $Gr^+(4,2)$, has a natural interpretation within the Zhukovskii property and is equivalent to a partial reduction of a certain system of Hess-Appel'rot type. We perform a classical and an algebro-geometric integration of the system, as an example of an isoholomorphic system. The paper presents many examples of systems of Hess-Appel'rot type, giving an additional argument in favor of further study of this class of systems. Comment: 42 pages

arXiv.org e-Print Archive

Crossref

Revisiting clustering as matrix factorisation on the Stiefel manifold

Author: A Durmus
A Edelman
AS Bandeira
AS Dalalyan
E Arias-Castro
G McLachlan
GO Roberts
MX Goemans
N Verzelen
O Guédon
P Alquier
S Burer
Publication venue: HAL CCSD
Publication date: 11/03/2019

This paper studies clustering for possibly high-dimensional data (e.g. images, time series, gene expression data, and many other settings), and rephrases it as low-rank matrix estimation in the PAC-Bayesian framework. Our approach leverages the well-known Burer-Monteiro factorisation strategy from large-scale optimisation, in the context of low-rank estimation. Moreover, our Burer-Monteiro factors are shown to lie on a Stiefel manifold. We propose a new generalized Bayesian estimator for this problem and prove novel prediction bounds for clustering. We also devise a componentwise Langevin sampler on the Stiefel manifold to compute this estimator.

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

UCL Discovery

HAL Descartes

Noisy Monte Carlo: Convergence of Markov chains with approximate transition kernels

Author: A Caimo
A Dalalyan
A. Boland
AY Mitrophanov
C Andrieu
G Golub
G Robins
GO Roberts
H Robbins
J Møller
J Propp
J-M Marin
JE Besag
L Bottou
L Valiant
M Girolami
MA Beaumont
N Friel
N. Friel
NV Kartashov
P Bühlmann
P. Alquier
R Reeves
R Tibshirani
R. Everitt
S Geman
S Meyn
W Gilks
Publication date: 15/04/2014

Monte Carlo algorithms often aim to draw from a distribution $\pi$ by simulating a Markov chain with transition kernel $P$ such that $\pi$ is invariant under $P$. However, there are many situations for which it is impractical or impossible to draw from the transition kernel $P$. For instance, this is the case with massive datasets, where it is prohibitively expensive to calculate the likelihood, and is also the case for intractable likelihood models arising from, for example, Gibbs random fields, such as those found in spatial statistics and network analysis. A natural approach in these cases is to replace $P$ by an approximation $\hat{P}$. Using theory from the stability of Markov chains, we explore a variety of situations where it is possible to quantify how 'close' the chain given by the transition kernel $\hat{P}$ is to the chain given by $P$. We apply these results to several examples from spatial statistics and network analysis. Comment: This version: results extended to non-uniformly ergodic Markov chains

arXiv.org e-Print Archive

Central Archive at the University of Reading

Crossref

Research Repository UCD

Irish Universities

Warwick Research Archives Portal Repository

Aggregation by exponential weighting, sharp PAC-Bayesian bounds and sparsity

Author: A. B. Juditsky
A. B. Tsybakov
A. Dalalyan
A. Dembo
A. Nemirovski
B. Efron
D. L. Donoho
D. Revuz
E. Candes
E. Greenshtein
E. L. Lehmann
F. Bunea
G. Leung
I. E. Frank
J. Kivinen
J. Obloj
J.-Y. Audibert
N. Cesa-Bianchi
N. Littlestone
O. Catoni
T. Zhang
V. V. Petrov
V. Vovk
Y. Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/03/2008

We study the problem of aggregation under the squared loss in the model of regression with deterministic design. We obtain sharp PAC-Bayesian risk bounds for aggregates defined via exponential weights, under general assumptions on the distribution of errors and on the functions to aggregate. We then apply these results to derive sparsity oracle inequalities.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Hal-Diderot

Systems of Hess-Appel'rot type

Author: A. Beauville
A. Goriely
A.G. Reyman
A.I. Bobenko
A.V. Borisov
B.A. Dubrovin
Borislav Gajić
D. Mumford
E. Arbarello
E. Leimanis
E.D. Belokolos
E.T. Whittaker
H. Yoshida
I.M. Gel’fand
I.M. Krichever
J.D. Fay
L. Gavrilov
M. Adler
M. Audin
N.E. Zhukovski
O.I. Bogoyavlensky
P. Moerbeke van
P.A. Griffiths
P.A. Nekrasov
S. Kowalevski
S.G. Dalalyan
S.L. Ziglin
S.V. Manakov
T. Ratiu
V. Dragović
V.I. Arnol’d
V.V. Kozlov
V.V. Shokurov
V.V. Trofimov
Vladimir Dragović
W. Hess
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/02/2006

We construct higher-dimensional generalizations of the classical Hess-Appel'rot rigid body system. We give a Lax pair with a spectral parameter leading to an algebro-geometric integration of this new class of systems, which is closely related to the integration of the Lagrange bitop performed by us recently and uses the Mumford relation for theta divisors of double unramified coverings. Based on the basic properties satisfied by such a class of systems related to bi-Poisson structure, quasi-homogeneity, and conditions on the Kowalevski exponents, we suggest an axiomatic approach leading to what we call the "class of systems of Hess-Appel'rot type". Comment: 40 pages. Comm. Math. Phys. (to appear)

arXiv.org e-Print Archive

Crossref

Asymptotic equivalence of discretely observed diffusion processes and their Euler scheme: small variance case

Author: A Dalalyan
A Meister
A Rohde
AV Carter
B Buhmann
B Øksendal
C Laredo
Ester Mariucci
F Comte
G Milstein
GK Golubev
I Grama
I Karatzas
J Jacod
J Picard
L Le Cam
LD Brown
M Hoffmann
M Jähnisch
M Nussbaum
M Reiß
M Uchida
S Delattre
S Efromovich
V Genon-Catalot
V Volkonskii
Y Wang
YA Kutoyants
Publication date: 13/02/2015

This paper establishes the global asymptotic equivalence, in the sense of the Le Cam $\Delta$-distance, between scalar diffusion models with unknown drift function and small variance on the one side, and nonparametric autoregressive models on the other side. The time horizon $T$ is kept fixed, and both the cases of discrete and continuous observation of the path are treated. We allow a nonconstant diffusion coefficient, bounded but possibly tending to zero. The asymptotic equivalences are established by constructing explicit equivalence mappings. Comment: 21 pages

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

Non-parametric Bayesian drift estimation for stochastic differential equations

Author: A Dalalyan
E Gobet
E Schmisser
F Comte
F Meulen van der
FH Meulen van der
H Zanten van
J Jacod
L Panzar
O Papaspiliopoulos
P Diaconis
Peter Spreij
S Ghosal
S Walker
Shota Gugushvili
Y Pokern
Y Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/02/2013

We consider non-parametric Bayesian estimation of the drift coefficient of a one-dimensional stochastic differential equation from discrete-time observations on the solution of this equation. Under suitable regularity conditions that are weaker than those previously suggested in the literature, we establish posterior consistency in this context. Furthermore, we show that posterior consistency extends to the multidimensional setting as well, which, to the best of our knowledge, is a new result in this setting. Comment: 27 pages

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications