Large-Scale Kernel Methods for Independence Testing

Author: Filippi Sarah
Gretton Arthur
Sejdinovic Dino
Zhang Qinyi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/06/2016

Representations of probability measures in reproducing kernel Hilbert spaces provide a flexible framework for fully nonparametric hypothesis tests of independence, which can capture any type of departure from independence, including nonlinear associations and multivariate interactions. However, these approaches come with an at least quadratic computational cost in the number of observations, which can be prohibitive in many applications. Arguably, it is exactly in such large-scale datasets that capturing any type of dependence is of interest, so striking a favourable tradeoff between computational efficiency and test performance for kernel independence tests would have a direct impact on their applicability in practice. In this contribution, we provide an extensive study of the use of large-scale kernel approximations in the context of independence testing, contrasting block-based, Nyström and random Fourier feature approaches. Through a variety of synthetic data experiments, it is demonstrated that our novel large-scale methods give comparable performance with existing methods whilst using significantly less computation time and memory. Comment: 29 pages, 6 figures
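To make the random Fourier feature idea in the abstract above concrete, the following is a minimal sketch and not the authors' implementation. It assumes a Gaussian kernel with a fixed bandwidth of 1.0, 50 random features per variable, and a permutation test for calibration; the function names `rff_features`, `rff_hsic` and `rff_independence_test` are hypothetical. The statistic is the squared Frobenius norm of the empirical cross-covariance between the two feature maps, which costs time linear in the number of observations rather than quadratic.

```python
import numpy as np


def rff_features(x, n_features, bandwidth, rng):
    """Random Fourier features approximating a Gaussian kernel (assumed kernel choice)."""
    x = np.asarray(x, dtype=float)
    x = x.reshape(x.shape[0], -1)  # treat 1-D input as n samples of dimension 1
    d = x.shape[1]
    # Frequencies drawn from the spectral density of the Gaussian kernel.
    w = rng.normal(scale=1.0 / bandwidth, size=(d, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(x @ w + b)


def rff_hsic(phi_x, phi_y):
    """Squared Frobenius norm of the empirical cross-covariance of the features."""
    n = phi_x.shape[0]
    cx = phi_x - phi_x.mean(axis=0)
    cy = phi_y - phi_y.mean(axis=0)
    cross_cov = cx.T @ cy / n
    return float(np.sum(cross_cov ** 2))


def rff_independence_test(x, y, n_features=50, bandwidth=1.0, n_perm=200, seed=0):
    """Return (statistic, permutation p-value); all defaults are illustrative assumptions."""
    rng = np.random.default_rng(seed)
    phi_x = rff_features(x, n_features=n_features, bandwidth=bandwidth, rng=rng)
    phi_y = rff_features(y, n_features=n_features, bandwidth=bandwidth, rng=rng)
    stat = rff_hsic(phi_x, phi_y)
    n = phi_x.shape[0]
    # Permuting the rows of one feature matrix simulates the null of independence.
    exceed = 0
    for _ in range(n_perm):
        perm = rng.permutation(n)
        if rff_hsic(phi_x, phi_y[perm]) >= stat:
            exceed += 1
    p_value = (1 + exceed) / (1 + n_perm)
    return stat, p_value
```

A call such as `rff_independence_test(x, x**2 + noise)` would be expected to give a small p-value even though `x` and `x**2` are uncorrelated. In practice the bandwidth would usually be chosen from the data (for example by a median heuristic) rather than fixed.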

arXiv.org e-Print Archive

Springer - Publisher Connector

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Discussion of: Brownian distance covariance

Author: Fukumizu Kenji
Gretton Arthur
Sriperumbudur Bharath K.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 05/10/2010

Discussion on "Brownian distance covariance" by Gábor J. Székely and Maria L. Rizzo [arXiv:1010.0297]. Comment: Published at http://dx.doi.org/10.1214/09-AOAS312E in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

arXiv.org e-Print Archive

CiteSeerX

On a Nonparametric Notion of Residual and its Applications

Author: Patra Rohit Kumar
Sen Bodhisattva
Szekely Gabor
Publication date: 30/09/2015

Let $(X, \mathbf{Z})$ be a continuous random vector in $\mathbb{R} \times \mathbb{R}^d$, $d \ge 1$. In this paper, we define the notion of a nonparametric residual of $X$ on $\mathbf{Z}$ that is always independent of the predictor $\mathbf{Z}$. We study its properties and show that the proposed notion of residual matches with the usual residual (error) in a multivariate normal regression model. Given a random vector $(X, Y, \mathbf{Z})$ in $\mathbb{R} \times \mathbb{R} \times \mathbb{R}^d$, we use this notion of residual to show that the conditional independence between $X$ and $Y$, given $\mathbf{Z}$, is equivalent to the mutual independence of the residuals (of $X$ on $\mathbf{Z}$ and $Y$ on $\mathbf{Z}$) and $\mathbf{Z}$. This result is used to develop a test for conditional independence. We propose a bootstrap scheme to approximate the critical value of this test. We compare the proposed test, which is easily implementable, with some of the existing procedures through a simulation study. Comment: 19 pages, 2 figures
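The abstract notes that the proposed residual agrees with the usual regression residual in a multivariate normal model. The sketch below illustrates only that Gaussian special case, under assumed simulation settings (sample size, dimension and the coefficient vector `beta` are hypothetical); it does not implement the paper's nonparametric residual or its conditional independence test. It simulates a linear Gaussian model, computes the least-squares residual of X on Z, and checks that the residual shows no linear or simple nonlinear association with Z.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 5000, 3  # assumed sample size and predictor dimension

# Hypothetical Gaussian regression model: X = Z @ beta + noise.
z = rng.normal(size=(n, d))
beta = np.array([1.0, -2.0, 0.5])
x = z @ beta + rng.normal(size=n)

# Ordinary least-squares residual of X on Z (with intercept).
design = np.column_stack([np.ones(n), z])
coef, *_ = np.linalg.lstsq(design, x, rcond=None)
residual = x - design @ coef

# X itself is strongly correlated with the components of Z ...
corr_x = np.array([np.corrcoef(x, z[:, j])[0, 1] for j in range(d)])
# ... while the residual is uncorrelated with them (exactly, in-sample).
corr_r = np.array([np.corrcoef(residual, z[:, j])[0, 1] for j in range(d)])
# A simple nonlinear check: squared residual against squared predictors.
corr_sq = np.array(
    [np.corrcoef(residual ** 2, z[:, j] ** 2)[0, 1] for j in range(d)]
)

print("corr(X, Z_j):          ", np.round(corr_x, 3))
print("corr(residual, Z_j):   ", np.round(corr_r, 3))
print("corr(residual^2, Z_j^2):", np.round(corr_sq, 3))
```

Zero correlation of the squared quantities is only a necessary sign of independence, not a proof of it; a full check would use an omnibus independence test.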

arXiv.org e-Print Archive