28,022 research outputs found
A One-Sample Test for Normality with Kernel Methods
We propose a new one-sample test for normality in a Reproducing Kernel
Hilbert Space (RKHS). Namely, we test the null-hypothesis of belonging to a
given family of Gaussian distributions. Hence our procedure may be applied
either to test data for normality or to test parameters (mean and covariance)
if data are assumed Gaussian. Our test is based on the same principle as the
MMD (Maximum Mean Discrepancy) which is usually used for two-sample tests such
as homogeneity or independence testing. Our method makes use of a special kind
of parametric bootstrap (typical of goodness-of-fit tests) which is
computationally more efficient than standard parametric bootstrap. Moreover, an
upper bound for the Type-II error highlights the dependence on influential
quantities. Experiments illustrate the practical improvement allowed by our
test in high-dimensional settings where common normality tests are known to
fail. We also consider an application to covariance rank selection through a
sequential procedure
Estimation of Gini Index within Pre-Specied Error Bound
Gini index is a widely used measure of economic inequality. This article
develops a general theory for constructing a confidence interval for Gini index
with a specified confidence coefficient and a specified width. Fixed sample
size methods cannot simultaneously achieve both the specified confidence
coefficient and specified width.
We develop a purely sequential procedure for interval estimation of Gini
index with a specified confidence coefficient and a fixed margin of error.
Optimality properties of the proposed method, namely first order asymptotic
efficiency and asymptotic consistency are proved. All theoretical results are
derived without assuming any specific distribution of the data
Sequential Data-Adaptive Bandwidth Selection by Cross-Validation for Nonparametric Prediction
We consider the problem of bandwidth selection by cross-validation from a
sequential point of view in a nonparametric regression model. Having in mind
that in applications one often aims at estimation, prediction and change
detection simultaneously, we investigate that approach for sequential kernel
smoothers in order to base these tasks on a single statistic. We provide
uniform weak laws of large numbers and weak consistency results for the
cross-validated bandwidth. Extensions to weakly dependent error terms are
discussed as well. The errors may be {\alpha}-mixing or L2-near epoch
dependent, which guarantees that the uniform convergence of the cross
validation sum and the consistency of the cross-validated bandwidth hold true
for a large class of time series. The method is illustrated by analyzing
photovoltaic data.Comment: 26 page
Testing for Changes in Kendall's Tau
For a bivariate time series we want to detect
whether the correlation between and stays constant for all . We propose a nonparametric change-point test statistic based on
Kendall's tau and derive its asymptotic distribution under the null hypothesis
of no change by means a new U-statistic invariance principle for dependent
processes. The asymptotic distribution depends on the long run variance of
Kendall's tau, for which we propose an estimator and show its consistency.
Furthermore, assuming a single change-point, we show that the location of the
change-point is consistently estimated. Kendall's tau possesses a high
efficiency at the normal distribution, as compared to the normal maximum
likelihood estimator, Pearson's moment correlation coefficient. Contrary to
Pearson's correlation coefficient, it has excellent robustness properties and
shows no loss in efficiency at heavy-tailed distributions. We assume the data
to be stationary and P-near epoch dependent on an
absolutely regular process. The P-near epoch dependence condition constitutes a
generalization of the usually considered -near epoch dependence, , that does not require the existence of any moments. It is therefore very
well suited for our objective to efficiently detect changes in correlation for
arbitrarily heavy-tailed data
- …