57 research outputs found
Robust nonparametric inference for the median
We consider the problem of constructing robust nonparametric confidence
intervals and tests of hypothesis for the median when the data distribution is
unknown and the data may contain a small fraction of contamination. We propose
a modification of the sign test (and its associated confidence interval) which
attains the nominal significance level (probability coverage) for any
distribution in the contamination neighborhood of a continuous distribution. We
also define some measures of robustness and efficiency under contamination for
confidence intervals and tests. These measures are computed for the proposed
procedures.Comment: Published at http://dx.doi.org/10.1214/009053604000000634 in the
Annals of Statistics (http://www.imstat.org/aos/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Uniform asymptotics for robust location estimates when the scale is unknown
Most asymptotic results for robust estimates rely on regularity conditions
that are difficult to verify in practice. Moreover, these results apply to
fixed distribution functions. In the robustness context the distribution of the
data remains largely unspecified and hence results that hold uniformly over a
set of possible distribution functions are of theoretical and practical
interest. Also, it is desirable to be able to determine the size of the set of
distribution functions where the uniform properties hold. In this paper we
study the problem of obtaining verifiable regularity conditions that suffice to
yield uniform consistency and uniform asymptotic normality for location robust
estimates when the scale of the errors is unknown.
We study M-location estimates calculated with an S-scale and we obtain
uniform asymptotic results over contamination neighborhoods. Moreover, we show
how to calculate the maximum size of the contamination neighborhoods where
these uniform results hold. There is a trade-off between the size of these
neighborhoods and the breakdown point of the scale estimate.Comment: Published by the Institute of Mathematical Statistics
(http://www.imstat.org) in the Annals of Statistics
(http://www.imstat.org/aos/) at http://dx.doi.org/10.1214/00905360400000054
Uniform asymptotics for robust location estimates when the scale is unknown
Most asymptotic results for robust estimates rely on regularity conditions that are difficult to verify and that real data sets rarely satisfy. Moreover, these results apply to fixed distribution functions. In the robustness context the distribution of the data remains largely unspecified and hence results that hold uniformly over a set of possible distribution functions are of theoretical and practical interest. In this paper we study the problem of obtaining verifiable and realistic conditions that suffice to obtain uniform consistency and uniform asymptotic normality for location robust estimates when the scale of the errors is unknown. We study M-location estimates calculated withan S-scale and we obtain uniform asymptotic results over contamination neighbourhoods. There is a trade-off between the size of these neighbourhoods and the breakdown point of the scale estimate. We also show how to calculate the maximum size of the contamination neighbourhoods where these uniform results hold.
Propagation of outliers in multivariate data
We investigate the performance of robust estimates of multivariate location
under nonstandard data contamination models such as componentwise outliers
(i.e., contamination in each variable is independent from the other variables).
This model brings up a possible new source of statistical error that we call
"propagation of outliers." This source of error is unusual in the sense that it
is generated by the data processing itself and takes place after the data has
been collected. We define and derive the influence function of robust
multivariate location estimates under flexible contamination models and use it
to investigate the effect of propagation of outliers. Furthermore, we show that
standard high-breakdown affine equivariant estimators propagate outliers and
therefore show poor breakdown behavior under componentwise contamination when
the dimension is high.Comment: Published in at http://dx.doi.org/10.1214/07-AOS588 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
- …