28,022 research outputs found

    A One-Sample Test for Normality with Kernel Methods

    Get PDF
    We propose a new one-sample test for normality in a Reproducing Kernel Hilbert Space (RKHS). Namely, we test the null-hypothesis of belonging to a given family of Gaussian distributions. Hence our procedure may be applied either to test data for normality or to test parameters (mean and covariance) if data are assumed Gaussian. Our test is based on the same principle as the MMD (Maximum Mean Discrepancy) which is usually used for two-sample tests such as homogeneity or independence testing. Our method makes use of a special kind of parametric bootstrap (typical of goodness-of-fit tests) which is computationally more efficient than standard parametric bootstrap. Moreover, an upper bound for the Type-II error highlights the dependence on influential quantities. Experiments illustrate the practical improvement allowed by our test in high-dimensional settings where common normality tests are known to fail. We also consider an application to covariance rank selection through a sequential procedure

    Estimation of Gini Index within Pre-Specied Error Bound

    Full text link
    Gini index is a widely used measure of economic inequality. This article develops a general theory for constructing a confidence interval for Gini index with a specified confidence coefficient and a specified width. Fixed sample size methods cannot simultaneously achieve both the specified confidence coefficient and specified width. We develop a purely sequential procedure for interval estimation of Gini index with a specified confidence coefficient and a fixed margin of error. Optimality properties of the proposed method, namely first order asymptotic efficiency and asymptotic consistency are proved. All theoretical results are derived without assuming any specific distribution of the data

    Sequential Data-Adaptive Bandwidth Selection by Cross-Validation for Nonparametric Prediction

    Full text link
    We consider the problem of bandwidth selection by cross-validation from a sequential point of view in a nonparametric regression model. Having in mind that in applications one often aims at estimation, prediction and change detection simultaneously, we investigate that approach for sequential kernel smoothers in order to base these tasks on a single statistic. We provide uniform weak laws of large numbers and weak consistency results for the cross-validated bandwidth. Extensions to weakly dependent error terms are discussed as well. The errors may be {\alpha}-mixing or L2-near epoch dependent, which guarantees that the uniform convergence of the cross validation sum and the consistency of the cross-validated bandwidth hold true for a large class of time series. The method is illustrated by analyzing photovoltaic data.Comment: 26 page

    Testing for Changes in Kendall's Tau

    Get PDF
    For a bivariate time series ((Xi,Yi))i=1,...,n((X_i,Y_i))_{i=1,...,n} we want to detect whether the correlation between XiX_i and YiY_i stays constant for all i=1,...,ni = 1,...,n. We propose a nonparametric change-point test statistic based on Kendall's tau and derive its asymptotic distribution under the null hypothesis of no change by means a new U-statistic invariance principle for dependent processes. The asymptotic distribution depends on the long run variance of Kendall's tau, for which we propose an estimator and show its consistency. Furthermore, assuming a single change-point, we show that the location of the change-point is consistently estimated. Kendall's tau possesses a high efficiency at the normal distribution, as compared to the normal maximum likelihood estimator, Pearson's moment correlation coefficient. Contrary to Pearson's correlation coefficient, it has excellent robustness properties and shows no loss in efficiency at heavy-tailed distributions. We assume the data ((Xi,Yi))i=1,...,n((X_i,Y_i))_{i=1,...,n} to be stationary and P-near epoch dependent on an absolutely regular process. The P-near epoch dependence condition constitutes a generalization of the usually considered LpL_p-near epoch dependence, p≥1p \ge 1, that does not require the existence of any moments. It is therefore very well suited for our objective to efficiently detect changes in correlation for arbitrarily heavy-tailed data
    • …
    corecore