1,070 research outputs found
Student's -test for scale mixture errors
Generalized t-tests are constructed under weaker than normal conditions. In
the first part of this paper we assume only the symmetry (around zero) of the
error distribution (i). In the second part we assume that the error
distribution is a Gaussian scale mixture (ii). The optimal (smallest) critical
values can be computed from generalizations of Student's cumulative
distribution function (cdf), . The cdf's of the generalized -test
statistics are denoted by (i) and (ii) , resp. As the
sample size we get the counterparts of the standard normal cdf
: (i) , and (ii)
. Explicit formulae are
given for the underlying new cdf's. For example iff . Thus the classical 95% confidence interval for the unknown expected
value of Gaussian distributions covers the center of symmetry with at least 95%
probability for Gaussian scale mixture distributions. On the other hand, the
90% quantile of is .Comment: Published at http://dx.doi.org/10.1214/074921706000000365 in the IMS
Lecture Notes--Monograph Series
(http://www.imstat.org/publications/lecnotes.htm) by the Institute of
Mathematical Statistics (http://www.imstat.org
Brownian distance covariance
Distance correlation is a new class of multivariate dependence coefficients
applicable to random vectors of arbitrary and not necessarily equal dimension.
Distance covariance and distance correlation are analogous to product-moment
covariance and correlation, but generalize and extend these classical bivariate
measures of dependence. Distance correlation characterizes independence: it is
zero if and only if the random vectors are independent. The notion of
covariance with respect to a stochastic process is introduced, and it is shown
that population distance covariance coincides with the covariance with respect
to Brownian motion; thus, both can be called Brownian distance covariance. In
the bivariate case, Brownian covariance is the natural extension of
product-moment covariance, as we obtain Pearson product-moment covariance by
replacing the Brownian motion in the definition with identity. The
corresponding statistic has an elegantly simple computing formula. Advantages
of applying Brownian covariance and correlation vs the classical Pearson
covariance and correlation are discussed and illustrated.Comment: This paper discussed in: [arXiv:0912.3295], [arXiv:1010.0822],
[arXiv:1010.0825], [arXiv:1010.0828], [arXiv:1010.0836], [arXiv:1010.0838],
[arXiv:1010.0839]. Rejoinder at [arXiv:1010.0844]. Published in at
http://dx.doi.org/10.1214/09-AOAS312 the Annals of Applied Statistics
(http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics
(http://www.imstat.org
DISCO analysis: A nonparametric extension of analysis of variance
In classical analysis of variance, dispersion is measured by considering
squared distances of sample elements from the sample mean. We consider a
measure of dispersion for univariate or multivariate response based on all
pairwise distances between-sample elements, and derive an analogous distance
components (DISCO) decomposition for powers of distance in . The ANOVA F
statistic is obtained when the index (exponent) is 2. For each index in
, this decomposition determines a nonparametric test for the
multi-sample hypothesis of equal distributions that is statistically consistent
against general alternatives.Comment: Published in at http://dx.doi.org/10.1214/09-AOAS245 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Assessing stochastic algorithms for large scale nonlinear least squares problems using extremal probabilities of linear combinations of gamma random variables
This article considers stochastic algorithms for efficiently solving a class
of large scale non-linear least squares (NLS) problems which frequently arise
in applications. We propose eight variants of a practical randomized algorithm
where the uncertainties in the major stochastic steps are quantified. Such
stochastic steps involve approximating the NLS objective function using
Monte-Carlo methods, and this is equivalent to the estimation of the trace of
corresponding symmetric positive semi-definite (SPSD) matrices. For the latter,
we prove tight necessary and sufficient conditions on the sample size (which
translates to cost) to satisfy the prescribed probabilistic accuracy. We show
that these conditions are practically computable and yield small sample sizes.
They are then incorporated in our stochastic algorithm to quantify the
uncertainty in each randomized step. The bounds we use are applications of more
general results regarding extremal tail probabilities of linear combinations of
gamma distributed random variables. We derive and prove new results concerning
the maximal and minimal tail probabilities of such linear combinations, which
can be considered independently of the rest of this paper
Measuring and testing dependence by correlation of distances
Distance correlation is a new measure of dependence between random vectors.
Distance covariance and distance correlation are analogous to product-moment
covariance and correlation, but unlike the classical definition of correlation,
distance correlation is zero only if the random vectors are independent. The
empirical distance dependence measures are based on certain Euclidean distances
between sample elements rather than sample moments, yet have a compact
representation analogous to the classical covariance and correlation.
Asymptotic properties and applications in testing independence are discussed.
Implementation of the test and Monte Carlo results are also presented.Comment: Published in at http://dx.doi.org/10.1214/009053607000000505 the
Annals of Statistics (http://www.imstat.org/aos/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Stochastic integration based on simple, symmetric random walks
A new approach to stochastic integration is described, which is based on an
a.s. pathwise approximation of the integrator by simple, symmetric random
walks. Hopefully, this method is didactically more advantageous, more
transparent, and technically less demanding than other existing ones. In a
large part of the theory one has a.s. uniform convergence on compacts. In
particular, it gives a.s. convergence for the stochastic integral of a finite
variation function of the integrator, which is not c\`adl\`ag in general.Comment: 16 pages, some typos correcte
Epidemic spread, parameter sensitivity and vaccination strategies on a random graph with overlapping communities
Our main goal is to examine the role of communities in epidemic spread in a
random graph model. More precisely, we consider a random graph model which
consists of overlapping complete graphs, representing households, workplaces,
school classes, and which also has a simple geometric structure. We study the
model's sensitivity to infection parameters and other tunable parameters of the
model, which might be helpful in finding efficient social distancing
strategies. We also quantitatively compare different vaccination strategies to
see which order is the best to defend the most vulnerable groups or the
population in general, and how important it is to gather and use information on
the position of infected individuals in the network.Comment: 16 pages, 7 figure
- âŠ