1,070 research outputs found

    Student's tt-test for scale mixture errors

    Full text link
    Generalized t-tests are constructed under weaker than normal conditions. In the first part of this paper we assume only the symmetry (around zero) of the error distribution (i). In the second part we assume that the error distribution is a Gaussian scale mixture (ii). The optimal (smallest) critical values can be computed from generalizations of Student's cumulative distribution function (cdf), tn(x)t_n(x). The cdf's of the generalized tt-test statistics are denoted by (i) tnS(x)t_n^S(x) and (ii) tnG(x)t_n^G(x), resp. As the sample size n→∞n\to \infty we get the counterparts of the standard normal cdf Ί(x)\Phi(x): (i) ΊS(x):=lim⁥n→∞tnS(x)\Phi^S(x):=\operatorname {lim}_{n\to \infty}t_n^S(x), and (ii) ΊG(x):=lim⁥n→∞tnG(x)\Phi^G(x):=\operatorname {lim}_{n\to \infty}t_n^G(x). Explicit formulae are given for the underlying new cdf's. For example ΊG(x)=Ί(x)\Phi^G(x)=\Phi(x) iff ∣xâˆŁâ‰„3|x|\ge \sqrt{3}. Thus the classical 95% confidence interval for the unknown expected value of Gaussian distributions covers the center of symmetry with at least 95% probability for Gaussian scale mixture distributions. On the other hand, the 90% quantile of ΊG\Phi^G is 43/5=1.385...>Ω−1(0.9)=1.282...4\sqrt{3}/5=1.385... >\Phi^{-1}(0.9)=1.282....Comment: Published at http://dx.doi.org/10.1214/074921706000000365 in the IMS Lecture Notes--Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org

    Brownian distance covariance

    Full text link
    Distance correlation is a new class of multivariate dependence coefficients applicable to random vectors of arbitrary and not necessarily equal dimension. Distance covariance and distance correlation are analogous to product-moment covariance and correlation, but generalize and extend these classical bivariate measures of dependence. Distance correlation characterizes independence: it is zero if and only if the random vectors are independent. The notion of covariance with respect to a stochastic process is introduced, and it is shown that population distance covariance coincides with the covariance with respect to Brownian motion; thus, both can be called Brownian distance covariance. In the bivariate case, Brownian covariance is the natural extension of product-moment covariance, as we obtain Pearson product-moment covariance by replacing the Brownian motion in the definition with identity. The corresponding statistic has an elegantly simple computing formula. Advantages of applying Brownian covariance and correlation vs the classical Pearson covariance and correlation are discussed and illustrated.Comment: This paper discussed in: [arXiv:0912.3295], [arXiv:1010.0822], [arXiv:1010.0825], [arXiv:1010.0828], [arXiv:1010.0836], [arXiv:1010.0838], [arXiv:1010.0839]. Rejoinder at [arXiv:1010.0844]. Published in at http://dx.doi.org/10.1214/09-AOAS312 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    DISCO analysis: A nonparametric extension of analysis of variance

    Full text link
    In classical analysis of variance, dispersion is measured by considering squared distances of sample elements from the sample mean. We consider a measure of dispersion for univariate or multivariate response based on all pairwise distances between-sample elements, and derive an analogous distance components (DISCO) decomposition for powers of distance in (0,2](0,2]. The ANOVA F statistic is obtained when the index (exponent) is 2. For each index in (0,2)(0,2), this decomposition determines a nonparametric test for the multi-sample hypothesis of equal distributions that is statistically consistent against general alternatives.Comment: Published in at http://dx.doi.org/10.1214/09-AOAS245 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Assessing stochastic algorithms for large scale nonlinear least squares problems using extremal probabilities of linear combinations of gamma random variables

    Get PDF
    This article considers stochastic algorithms for efficiently solving a class of large scale non-linear least squares (NLS) problems which frequently arise in applications. We propose eight variants of a practical randomized algorithm where the uncertainties in the major stochastic steps are quantified. Such stochastic steps involve approximating the NLS objective function using Monte-Carlo methods, and this is equivalent to the estimation of the trace of corresponding symmetric positive semi-definite (SPSD) matrices. For the latter, we prove tight necessary and sufficient conditions on the sample size (which translates to cost) to satisfy the prescribed probabilistic accuracy. We show that these conditions are practically computable and yield small sample sizes. They are then incorporated in our stochastic algorithm to quantify the uncertainty in each randomized step. The bounds we use are applications of more general results regarding extremal tail probabilities of linear combinations of gamma distributed random variables. We derive and prove new results concerning the maximal and minimal tail probabilities of such linear combinations, which can be considered independently of the rest of this paper

    Measuring and testing dependence by correlation of distances

    Full text link
    Distance correlation is a new measure of dependence between random vectors. Distance covariance and distance correlation are analogous to product-moment covariance and correlation, but unlike the classical definition of correlation, distance correlation is zero only if the random vectors are independent. The empirical distance dependence measures are based on certain Euclidean distances between sample elements rather than sample moments, yet have a compact representation analogous to the classical covariance and correlation. Asymptotic properties and applications in testing independence are discussed. Implementation of the test and Monte Carlo results are also presented.Comment: Published in at http://dx.doi.org/10.1214/009053607000000505 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Stochastic integration based on simple, symmetric random walks

    Full text link
    A new approach to stochastic integration is described, which is based on an a.s. pathwise approximation of the integrator by simple, symmetric random walks. Hopefully, this method is didactically more advantageous, more transparent, and technically less demanding than other existing ones. In a large part of the theory one has a.s. uniform convergence on compacts. In particular, it gives a.s. convergence for the stochastic integral of a finite variation function of the integrator, which is not c\`adl\`ag in general.Comment: 16 pages, some typos correcte

    Epidemic spread, parameter sensitivity and vaccination strategies on a random graph with overlapping communities

    Full text link
    Our main goal is to examine the role of communities in epidemic spread in a random graph model. More precisely, we consider a random graph model which consists of overlapping complete graphs, representing households, workplaces, school classes, and which also has a simple geometric structure. We study the model's sensitivity to infection parameters and other tunable parameters of the model, which might be helpful in finding efficient social distancing strategies. We also quantitatively compare different vaccination strategies to see which order is the best to defend the most vulnerable groups or the population in general, and how important it is to gather and use information on the position of infected individuals in the network.Comment: 16 pages, 7 figure
    • 

    corecore