108,873 research outputs found

    Maximum Likelihood for Dual Varieties

    Full text link
    Maximum likelihood estimation (MLE) is a fundamental computational problem in statistics. In this paper, MLE for statistical models with discrete data is studied from an algebraic statistics viewpoint. A reformulation of the MLE problem in terms of dual varieties and conormal varieties will be given. With this description, the dual likelihood equations and the dual MLE problem are defined. We show that solving the dual MLE problem yields solutions to the MLE problem, so we can solve the MLE problem without ever determining the defining equations of the model

    Inconsistency of the MLE for the joint distribution of interval censored survival times and continuous marks

    Full text link
    This paper considers the nonparametric maximum likelihood estimator (MLE) for the joint distribution function of an interval censored survival time and a continuous mark variable. We provide a new explicit formula for the MLE in this problem. We use this formula and the mark specific cumulative hazard function of Huang and Louis (1998) to obtain the almost sure limit of the MLE. This result leads to necessary and sufficient conditions for consistency of the MLE which imply that the MLE is inconsistent in general. We show that the inconsistency can be repaired by discretizing the marks. Our theoretical results are supported by simulations.Comment: 27 pages, 4 figure

    The bivariate current status model

    Full text link
    For the univariate current status and, more generally, the interval censoring model, distribution theory has been developed for the maximum likelihood estimator (MLE) and smoothed maximum likelihood estimator (SMLE) of the unknown distribution function, see, e.g., [12], [7], [4], [5], [6], [10], [11] and [8]. For the bivariate current status and interval censoring models distribution theory of this type is still absent and even the rate at which we can expect reasonable estimators to converge is unknown. We define a purely discrete plug-in estimator of the distribution function which locally converges at rate n^{1/3} and derive its (normal) limit distribution. Unlike the MLE or SMLE, this estimator is not a proper distribution function. Since the estimator is purely discrete, it demonstrates that the n^{1/3} convergence rate is in principle possible for the MLE, but whether this actually holds for the MLE is still an open problem. If the cube root n rate holds for the MLE, this would mean that the local 1-dimensional rate of the MLE continues to hold in dimension 2, a (perhaps) somewhat surprising result. The simulation results do not seem to be in contradiction with this assumption, however. We compare the behavior of the plug-in estimator with the behavior of the MLE on a sieve and the SMLE in a simulation study. This indicates that the plug-in estimator and the SMLE have a smaller variance but a larger bias than the sieved MLE. The SMLE is conjectured to have a n^{1/3}-rate of convergence if we use bandwidths of order n^{-1/6}. We derive its (normal) limit distribution, using this assumption. Finally, we demonstrate the behavior of the MLE and SMLE for the bivariate interval censored data of [1], which have been discussed by many authors, see e.g., [18], [3], [2] and [15].Comment: 18 pages, 7 figures, 4 table

    Fixed-domain asymptotic properties of tapered maximum likelihood estimators

    Full text link
    When the spatial sample size is extremely large, which occurs in many environmental and ecological studies, operations on the large covariance matrix are a numerical challenge. Covariance tapering is a technique to alleviate the numerical challenges. Under the assumption that data are collected along a line in a bounded region, we investigate how the tapering affects the asymptotic efficiency of the maximum likelihood estimator (MLE) for the microergodic parameter in the Mat\'ern covariance function by establishing the fixed-domain asymptotic distribution of the exact MLE and that of the tapered MLE. Our results imply that, under some conditions on the taper, the tapered MLE is asymptotically as efficient as the true MLE for the microergodic parameter in the Mat\'ern model.Comment: Published in at http://dx.doi.org/10.1214/08-AOS676 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Tightness of the maximum likelihood semidefinite relaxation for angular synchronization

    Full text link
    Maximum likelihood estimation problems are, in general, intractable optimization problems. As a result, it is common to approximate the maximum likelihood estimator (MLE) using convex relaxations. In some cases, the relaxation is tight: it recovers the true MLE. Most tightness proofs only apply to situations where the MLE exactly recovers a planted solution (known to the analyst). It is then sufficient to establish that the optimality conditions hold at the planted signal. In this paper, we study an estimation problem (angular synchronization) for which the MLE is not a simple function of the planted solution, yet for which the convex relaxation is tight. To establish tightness in this context, the proof is less direct because the point at which to verify optimality conditions is not known explicitly. Angular synchronization consists in estimating a collection of nn phases, given noisy measurements of the pairwise relative phases. The MLE for angular synchronization is the solution of a (hard) non-bipartite Grothendieck problem over the complex numbers. We consider a stochastic model for the data: a planted signal (that is, a ground truth set of phases) is corrupted with non-adversarial random noise. Even though the MLE does not coincide with the planted signal, we show that the classical semidefinite relaxation for it is tight, with high probability. This holds even for high levels of noise.Comment: 2 figure
    corecore