2,108 research outputs found

    Estimating the Distribution of Dietary Consumption Patterns

    Get PDF
    In the United States the preferred method of obtaining dietary intake data is the 24-hour dietary recall, yet the measure of most interest is usual or long-term average daily intake, which is impossible to measure. Thus, usual dietary intake is assessed with considerable measurement error. We were interested in estimating the population distribution of the Healthy Eating Index-2005 (HEI-2005), a multi-component dietary quality index involving ratios of interrelated dietary components to energy, among children aged 2-8 in the United States, using a national survey and incorporating survey weights. We developed a highly nonlinear, multivariate zero-inflated data model with measurement error to address this question. Standard nonlinear mixed model software such as SAS NLMIXED cannot handle this problem. We found that taking a Bayesian approach, and using MCMC, resolved the computational issues and doing so enabled us to provide a realistic distribution estimate for the HEI-2005 total score. While our computation and thinking in solving this problem was Bayesian, we relied on the well-known close relationship between Bayesian posterior means and maximum likelihood, the latter not computationally feasible, and thus were able to develop standard errors using balanced repeated replication, a survey-sampling approach.Comment: Published in at http://dx.doi.org/10.1214/12-STS413 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: substantial text overlap with arXiv:1107.486

    Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier

    Get PDF
    The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods for prediction or hypothesis testing. However, in the present paper we show that this approach is not effective in classification problems. There a useful rule of thumb is that undersmoothing is often desirable, but there are several surprising qualifications to that approach. First, the effect of smoothing the training data can be more significant than that of smoothing the new data set to be classified; second, undersmoothing is not always the right approach, and in fact in some cases using a relatively large bandwidth can be more effective; and third, these perverse results are the consequence of very unusual properties of error rates, expressed as functions of smoothing parameters. For example, the orders of magnitude of optimal smoothing parameter choices depend on the signs and sizes of terms in an expansion of error rate, and those signs and sizes can vary dramatically from one setting to another, even for the same classifier.Comment: Published in at http://dx.doi.org/10.1214/13-AOS1158 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Variance estimation for the instrumental variables approach to measurement error in generalized linear models

    Get PDF
    This paper derives and gives explicit formulas for a derived sandwich variance estimate. This variance estimate is appropriate for generalized linear additive measurement error models fitted using instrumental variables. We also generalize the known results for linear regression. As such, this article explains the theoretical justification for the sandwich estimate of variance utilized in the software for measurement error developed under the Small Business Innovation Research Grant (SBIR) by StataCorp. The results admit estimation of variance matrices for measurement error models where there is an instrument for the unknown covariate. Copyright 2003 by StataCorp LP.sandwich estimate of variance, measurement error, White's estimator, robust variance, generalized linear models, instrumental variables

    Measurement error, GLMs, and notational conventions

    Get PDF
    This paper introduces additive measurement error in a generalized linear-model context. We discuss the types of measurement error along with their effects on fitted models. In addition, we present the notational conventions to be used in this and the accompanying papers. Copyright 2003 by StataCorp LP.generalized linear models, transportability, measurement error

    Aurora Volume 07

    Get PDF
    College formerly located at Olivet, Illinois and known as Olivet University, 1912-1923 ; Olivet College, 1923-1939 ; Olivet Nazarene College, 1940-1986 ; Olivet Nazarene University, 1986-https://digitalcommons.olivet.edu/arch_yrbks/1006/thumbnail.jp

    Semiparametric Regression During 2003–2007

    Get PDF
    Semiparametric regression is a fusion between parametric regression and nonparametric regression and the title of a book that we published on the topic in early 2003. We review developments in the field during the five year period since the book was written. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application
    • …
    corecore