Detection of a sparse submatrix of a high-dimensional noisy matrix
We observe an N x M matrix Y_ij = s_ij + xi_ij with xi_ij ~ N(0,1) i.i.d. in i, j, and s_ij in R. We test the
null hypothesis s_ij = 0 for all i, j against the alternative that there
exists some submatrix of size n x m with significant elements in the
sense that s_ij >= a > 0. We propose a test procedure and compute the
asymptotic detection boundary a* so that the maximal testing risk tends to 0
as M -> infinity, N -> infinity, n/N -> 0, m/M -> 0. We prove that this
boundary is asymptotically sharp minimax under some additional constraints.
Relations with other testing problems are discussed. We propose a testing
procedure which adapts to unknown (n, m) within some given set and compute the
adaptive sharp rates. The implementation of our test procedure on synthetic
data shows excellent behavior for sparse, not necessarily square matrices. We
extend our sharp minimax results in different directions: first, to Gaussian
matrices with unknown variance, next, to matrices of random variables having a
distribution from an exponential family (non-Gaussian) and, finally, to a
two-sided alternative for matrices with Gaussian elements.
Comment: Published at http://dx.doi.org/10.3150/12-BEJ470 in the Bernoulli
(http://isi.cbs.nl/bernoulli/) by the International Statistical
Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm).
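The kind of statistic underlying such submatrix detection can be illustrated with a brute-force scan over all n x m submatrices. This is only a minimal sketch, not the authors' procedure; the function name and the normalization are assumptions, and the exhaustive scan is feasible only for very small matrices:

```python
from itertools import combinations
import numpy as np

def scan_stat(Y, n, m):
    """Scan statistic: maximum over all n x m submatrices of the
    standardized sum of entries.  Large values suggest a planted
    submatrix of elevated means.  (Illustrative brute force.)"""
    N, M = Y.shape
    best = -np.inf
    for rows in combinations(range(N), n):
        for cols in combinations(range(M), m):
            s = Y[np.ix_(rows, cols)].sum() / np.sqrt(n * m)
            best = max(best, s)
    return float(best)
```

Under the null (pure noise) this maximum concentrates near its Gaussian extreme-value level, while a planted n x m block with entries at least a shifts it by roughly a*sqrt(n*m), which is the trade-off the detection boundary quantifies.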
Statistical inference in compound functional models
We consider a general nonparametric regression model called the compound
model. It includes, as special cases, sparse additive regression and
nonparametric (or linear) regression with many covariates but possibly a small
number of relevant covariates. The compound model is characterized by three
main parameters: the structure parameter describing the "macroscopic" form of
the compound function, the "microscopic" sparsity parameter indicating the
maximal number of relevant covariates in each component and the usual
smoothness parameter corresponding to the complexity of the members of the
compound. We find the non-asymptotic minimax rate of convergence of estimators in
such a model as a function of these three parameters. We also show that this
rate can be attained in an adaptive way.
Adaptation in minimax nonparametric hypothesis testing for ellipsoids and Besov bodies
We observe an infinite-dimensional Gaussian random vector x = xi + v, where xi
is a sequence of standard Gaussian variables and v is an unknown mean. The sets
of alternatives correspond to l_q-ellipsoids of power semi-axes with an
l_p-ellipsoid of semi-axes removed, or to similar Besov bodies with Besov
bodies removed. Here p and q are the parameters which define the sets for given
radiuses, and the noise level tending to zero is the asymptotic parameter.
For the case where these parameters are known, the hypothesis testing problem
H_0: v = 0 versus such alternatives was considered by Ingster and Suslina [11]
in the minimax setting. It was shown that there is a partition of the set of
parameters into regions with different types of asymptotics: classical,
trivial, degenerate and Gaussian (of two main and some "boundary" types). There
is also an essential dependence of the structure of asymptotically minimax
tests on the parameters in the case of Gaussian asymptotics.
In this paper we consider the alternative given by the union of such sets over
the parameters. This corresponds to the adaptive setting: the parameters are
unknown and vary over a compact set contained in the regions of the main types
of Gaussian asymptotics. Problems of this type were first studied by Spokoiny
[16, 17]. For the ellipsoidal case we study sharp asymptotics of the minimax
second kind errors and construct asymptotically minimax tests. These
asymptotics are analogous to the degenerate type. For the case of Besov bodies
we obtain exact rates and construct minimax consistent tests. Analogous exact
rates are obtained in a signal detection problem for the continuous variant of
the white Gaussian noise model: the alternatives correspond to Besov or Sobolev
balls with Sobolev or Besov balls removed. The study is based on the results of
[11] and on an extension of the methods of that paper to the degenerate case.
Adaptive detection of high-dimensional signal
Let the n-dimensional Gaussian random vector x = ξ + v be observed, where ξ is a standard n-dimensional Gaussian vector and v ∈ R^n is the unknown mean. In the papers [3,5] minimax hypothesis testing problems were studied: to test the null hypothesis H0 : v = 0 against two types of alternatives H1 = H1(θn): v ∈ Vn(θn). The first corresponds to the multi-channel signal detection problem for a given value b of a signal and a number k of channels containing a signal, θn = (b, k). The second corresponds to an l_q^n-ball of radius R1,n with the l_p^n-ball of radius R2,n removed, θn = (R1,n, R2,n, p, q) ∈ R^4_+. It was shown in [3,5] that the structure of asymptotically minimax tests and the asymptotics of the minimax second kind errors often depend essentially on the parameters θn. This raises the problem of constructing adaptive tests with good minimax properties over large enough regions Θn of parameters θn. This problem is studied here. We describe the sets Θn such that adaptation is possible without loss of efficiency. For other sets we present a wide enough class of asymptotically exact bounds on the adaptive efficiency and construct asymptotically minimax test procedures.
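The dependence on θn = (b, k) can be illustrated by the two classical statistics whose relative power drives the adaptation problem. This is a hedged sketch with assumed names, not the paper's procedure:

```python
import numpy as np

def linear_stat(x):
    # Normalized sum of coordinates: powerful when many channels (large k)
    # carry a weak signal b, since the k contributions accumulate.
    return float(x.sum() / np.sqrt(x.size))

def max_stat(x):
    # Maximum absolute coordinate: powerful when few channels (small k)
    # carry a strong signal b that stands out above the noise level.
    return float(np.max(np.abs(x)))
```

An adaptive procedure in the spirit of the abstract combines statistics of both kinds over a grid of candidate (b, k) values, paying at most a modest efficiency price relative to the test that knows θn.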
Sparse classification boundaries
Given a training sample of size m from a d-dimensional population, we
wish to allocate a new observation to this population or to the
noise. We suppose that the difference between the distribution of the
population and that of the noise is only in a shift, which is a sparse vector.
For Gaussian noise, fixed sample size m, and dimension d that tends
to infinity, we obtain the sharp classification boundary and we propose
classifiers attaining this boundary. We also give extensions of this result to
the case where the sample size m depends on d and satisfies the condition
(log m)/log d → γ, 0 ≤ γ < 1, and to the case of non-Gaussian
noise satisfying the Cramér condition.
Minimax signal detection in ill-posed inverse problems
Ill-posed inverse problems arise in various scientific fields. We consider
the signal detection problem for mildly, severely and extremely ill-posed
inverse problems with l^q-ellipsoids (bodies), 0 < q ≤ 2, for Sobolev,
analytic and generalized analytic classes of functions under the Gaussian white
noise model. We study both rate and sharp asymptotics for the error
probabilities in the minimax setup. By construction, the derived tests are
often nonadaptive. Minimax rate-optimal adaptive tests of rather simple
structure are also constructed.
Comment: Published at http://dx.doi.org/10.1214/12-AOS1011 in the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org).
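In the standard sequence-space formulation of such problems (observations y_k = b_k θ_k + noise, with singular values b_k decaying, e.g., polynomially for mildly ill-posed operators), a weighted chi-square statistic is the natural detection tool. The following is a minimal sketch under unit noise variance; the truncation level K and the normalization are illustrative assumptions, not the paper's calibrated choices:

```python
import numpy as np

def chisq_detection_stat(y, b, K):
    """Centered, normalized chi-square statistic over the first K
    coefficients, inverting the singular values b_k.  Under H0: theta = 0
    with unit noise it is approximately standard normal for large K;
    large positive values indicate the presence of a signal."""
    z = y[:K] / b[:K]          # noisy estimates of theta_k, variance 1/b_k^2 amplified
    stat = np.sum(z ** 2 - 1.0)  # each z_k^2 has mean 1 under H0 with unit noise
    return float(stat / np.sqrt(2.0 * K))
```

The faster b_k decays (more severe ill-posedness), the more the inversion amplifies the noise in high frequencies, which is what pushes the detection boundary toward stronger signals.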
Detection boundary in sparse regression
We study the problem of detection of a p-dimensional sparse vector of
parameters in the linear regression model with Gaussian noise. We establish the
detection boundary, i.e., the necessary and sufficient conditions for the
possibility of successful detection as both the sample size n and the dimension
p tend to infinity. Testing procedures that achieve this boundary are also
exhibited. Our results encompass the high-dimensional setting (p >> n). The main
message is that, under some conditions, the detection boundary phenomenon that
has been proved for the Gaussian sequence model extends to high-dimensional
linear regression. Finally, we establish the detection boundaries when the
variance of the noise is unknown. Interestingly, the detection boundaries
sometimes depend on the knowledge of the variance in a high-dimensional
setting.
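A simple statistic of the type used in such regression detection problems is the maximal correlation between the response and the columns of the design matrix. This is a hedged illustration only, not necessarily the procedure attaining the boundary; the function name is an assumption:

```python
import numpy as np

def max_correlation_stat(X, y):
    """Maximum absolute inner product between the response y and the
    normalized columns of the design matrix X: a natural scan statistic
    for detecting a sparse coefficient vector.  Assumes no zero columns."""
    norms = np.linalg.norm(X, axis=0)
    return float(np.max(np.abs(X.T @ y) / norms))
```

Under the null each normalized inner product is standard Gaussian, so the statistic concentrates near sqrt(2 log p); a single sufficiently strong nonzero coefficient pushes it above that level, which is the one-sparse instance of the detection boundary.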