15 research outputs found

    Modified confidence intervals for the Mahalanobis distance

    Get PDF
    Reiser (2001) proposes a method of forming confidence interval for a Mahalanobis distance that yields intervals which have exactly the nominal coverage, but sometimes the interval is View the MathML source (0,0). We consider the case where Mahalanobis distance quantifies the difference between an individual and a population mean, and suggest a modification that avoids implausible intervals

    On point estimation of the abnormality of a Mahalanobis index

    Get PDF
    Mahalanobis distance may be used as a measure of the disparity between an individual’s profile of scores and the average profile of a population of controls. The degree to which the individual’s profile is unusual can then be equated to the proportion of the population who would have a larger Mahalanobis distance than the individual. Several estimators of this proportion are examined. These include plug-in maximum likelihood estimators, medians, the posterior mean from a Bayesian probability matching prior, an estimator derived from a Taylor expansion, and two forms of polynomial approximation, one based on Bernstein polynomial and one on a quadrature method. Simulations show that some estimators, including the commonly-used plug-in maximum likelihood estimators, can have substantial bias for small or moderate sample sizes. The polynomial approximations yield estimators that have low bias, with the quadrature method marginally to be preferred over Bernstein polynomials. However, the polynomial estimators sometimes yield infeasible estimates that are outside the 0–1 range. While none of the estimators are perfectly unbiased, the median estimators match their definition; in simulations their estimates of the proportion have a median error close to zero. The standard median estimator can give unrealistically small estimates (including 0) and an adjustment is proposed that ensures estimates are always credible. This latter estimator has much to recommend it when unbiasedness is not of paramount importance, while the quadrature method is recommended when bias is the dominant issue

    Eliciting Dirichlet and Gaussian copula prior distributions for multinomial models

    Get PDF
    In this paper, we propose novel methods of quantifying expert opinion about prior distributions for multinomial models. Two different multivariate priors are elicited using median and quartile assessments of the multinomial probabilities. First, we start by eliciting a univariate beta distribution for the probability of each category. Then we elicit the hyperparameters of the Dirichlet distribution, as a tractable conjugate prior, from those of the univariate betas through various forms of reconciliation using least-squares techniques. However, a multivariate copula function will give a more flexible correlation structure between multinomial parameters if it is used as their multivariate prior distribution. So, second, we use beta marginal distributions to construct a Gaussian copula as a multivariate normal distribution function that binds these marginals and expresses the dependence structure between them. The proposed method elicits a positive-definite correlation matrix of this Gaussian copula. The two proposed methods are designed to be used through interactive graphical software written in Java

    Prevalence and Population Attributable Risk for Chronic Airflow Obstruction in a Large Multinational Study

    Get PDF
    Rationale: The Global Burden of Disease programme identified smoking, and ambient and household air pollution as the main drivers of death and disability from Chronic Obstructive Pulmonary Disease (COPD).Objective: To estimate the attributable risk of chronic airflow obstruction (CAO), a quantifiable characteristic of COPD, due to several risk factors.Methods: The Burden of Obstructive Lung Disease study is a cross-sectional study of adults, aged≄40, in a globally distributed sample of 41 urban and rural sites. Based on data from 28,459 participants, we estimated the prevalence of CAO, defined as a post-bronchodilator one-second forced expiratory volume to forced vital capacity ratio Measurements and Main Results: Mean prevalence of CAO was 11.2% in men and 8.6% in women. Mean PAR for smoking was 5.1% in men and 2.2% in women. The next most influential risk factors were poor education levels, working in a dusty job for ≄10 years, low body mass index (BMI), and a history of tuberculosis. The risk of CAO attributable to the different risk factors varied across sites.Conclusions: While smoking remains the most important risk factor for CAO, in some areas poor education, low BMI and passive smoking are of greater importance. Dusty occupations and tuberculosis are important risk factors at some sites

    Eliciting prior distributions for extra parameters in some generalized linear models

    No full text
    To elicit an informative prior distribution for a normal linear model or a gamma generalized linear model (GLM), expert opinion must be quantified about both the regression coefficients and the extra parameters of these models. The latter task has attracted comparatively little attention. In this article, we introduce two elicitation methods that aim to complete the prior structure of the normal and gamma GLMs. First, we develop a method of assessing a conjugate prior distribution for the error variance in normal linear models. The method quantifies an expert's opinions through assessments of a median and conditional medians. Second, we propose a novel method for eliciting a lognormal prior distribution for the scale parameter of gamma GLMs. Given the mean value of a gamma distributed response variable, the method is based on conditional quartile assessments. It can also be used to quantify an expert's opinion about the prior distribution for the shape parameter of any gamma random variable, if the mean of the distribution has been elicited or is assumed to be known. In the context of GLMs, the mean value is determined by the regression coefficients. Interactive graphics is the medium through which assessments for the two proposed methods are elicited. Examples illustrating use of the methods are given. Computer programs that implement both methods are available

    Eliciting Dirichlet and Connor–Mosimann prior distributions for multinomial models

    No full text
    This paper addresses the task of eliciting an informative prior distribution for multinomial models. We first introduce a method of eliciting univariate beta distributions for the probability of each category, conditional on the probabilities of other categories. Two different forms of multivariate prior are derived from the elicited beta distributions. First, we determine the hyperparameters of a Dirichlet distribution by reconciling the assessed parameters of the univariate beta conditional distributions. Although the Dirichlet distribution is the standard conjugate prior distribution for multinomial models, it is not flexible enough to represent a broad range of prior information. Second, we use the beta distributions to determine the parameters of a Connor–Mosimann distribution, which is a generalization of a Dirichlet distribution and is also a conjugate prior for multinomial models. It has a larger number of parameters than the standard Dirichlet distribution and hence a more flexible structure. The elicitation methods are designed to be used with the aid of interactive graphical user-friendly software

    Two-Term Edgeworth Expansions for the Classes of U- and V-statistics

    No full text
    Much effort has been devoted to deriving Edgeworth expansions for various classes of statistics that are asymptotically normally distributed, with derivations tailored to the individual structure of each class. Expansions with smaller error rates are needed for more accurate statistical inference. Two such Edgeworth expansions are derived analytically in this paper. One is a two-term expansion for the standardized U-statistic of order m, m â©Ÿ 3, with an error rate o(n− 1). The other is an expansion with the same error rate for the distribution of the standardized V-statistic of the same order. In deriving the Edgeworth expansion, we made use of the close connection between the V- and U-statistics, which permits to first derive the needed expansion for the related U-statistic, then extend it to the V-statistic, taking into consideration the estimation of all difference terms between the two statistics
    corecore