Search CORE

2,108 research outputs found

Estimating the Distribution of Dietary Consumption Patterns

Author: Carroll Raymond J.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2014
Field of study

In the United States the preferred method of obtaining dietary intake data is the 24-hour dietary recall, yet the measure of most interest is usual or long-term average daily intake, which is impossible to measure. Thus, usual dietary intake is assessed with considerable measurement error. We were interested in estimating the population distribution of the Healthy Eating Index-2005 (HEI-2005), a multi-component dietary quality index involving ratios of interrelated dietary components to energy, among children aged 2-8 in the United States, using a national survey and incorporating survey weights. We developed a highly nonlinear, multivariate zero-inflated data model with measurement error to address this question. Standard nonlinear mixed model software such as SAS NLMIXED cannot handle this problem. We found that taking a Bayesian approach, and using MCMC, resolved the computational issues and doing so enabled us to provide a realistic distribution estimate for the HEI-2005 total score. While our computation and thinking in solving this problem was Bayesian, we relied on the well-known close relationship between Bayesian posterior means and maximum likelihood, the latter not computationally feasible, and thus were able to develop standard errors using balanced repeated replication, a survey-sampling approach.Comment: Published in at http://dx.doi.org/10.1214/12-STS413 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org). arXiv admin note: substantial text overlap with arXiv:1107.486

arXiv.org e-Print Archive

CiteSeerX

Crossref

OAKTrust Digital Repository (Texas A&M Univ)

PubMed Central

Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier

Author: Carroll Raymond J.
Delaigle Aurore
Hall Peter
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2013
Field of study

The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods for prediction or hypothesis testing. However, in the present paper we show that this approach is not effective in classification problems. There a useful rule of thumb is that undersmoothing is often desirable, but there are several surprising qualifications to that approach. First, the effect of smoothing the training data can be more significant than that of smoothing the new data set to be classified; second, undersmoothing is not always the right approach, and in fact in some cases using a relatively large bandwidth can be more effective; and third, these perverse results are the consequence of very unusual properties of error rates, expressed as functions of smoothing parameters. For example, the orders of magnitude of optimal smoothing parameter choices depend on the signs and sizes of terms in an expansion of error rate, and those signs and sizes can vary dramatically from one setting to another, even for the same classifier.Comment: Published in at http://dx.doi.org/10.1214/13-AOS1158 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

OPUS - University of Technology Sydney

OAKTrust Digital Repository (Texas A&M Univ)

PubMed Central

Variance estimation for the instrumental variables approach to measurement error in generalized linear models

Author: James W. Hardin
Raymond J. Carroll
Publication venue
Publication date
Field of study

This paper derives and gives explicit formulas for a derived sandwich variance estimate. This variance estimate is appropriate for generalized linear additive measurement error models fitted using instrumental variables. We also generalize the known results for linear regression. As such, this article explains the theoretical justification for the sandwich estimate of variance utilized in the software for measurement error developed under the Small Business Innovation Research Grant (SBIR) by StataCorp. The results admit estimation of variance matrices for measurement error models where there is an instrument for the unknown covariate. Copyright 2003 by StataCorp LP.sandwich estimate of variance, measurement error, White's estimator, robust variance, generalized linear models, instrumental variables

Research Papers in Economics

Measurement error, GLMs, and notational conventions

Author: James W. Hardin
Raymond J. Carroll
Publication venue
Publication date
Field of study

This paper introduces additive measurement error in a generalized linear-model context. We discuss the types of measurement error along with their effects on fitted models. In addition, we present the notational conventions to be used in this and the accompanying papers. Copyright 2003 by StataCorp LP.generalized linear models, transportability, measurement error

Research Papers in Economics

Aurora Volume 07

Author: Carroll Raymond J., (Editor)
Publication venue: Digital Commons @ Olivet
Publication date: 01/01/1920
Field of study

College formerly located at Olivet, Illinois and known as Olivet University, 1912-1923 ; Olivet College, 1923-1939 ; Olivet Nazarene College, 1940-1986 ; Olivet Nazarene University, 1986-https://digitalcommons.olivet.edu/arch_yrbks/1006/thumbnail.jp

Olivet Nazarene University

Semiparametric Regression During 2003–2007

Author: D. Ruppert
M. P. Wand
Matt P. W
Raymond J. Carroll
Raymond J. Carroll
Publication venue
Publication date: 01/01/2008
Field of study

Semiparametric regression is a fusion between parametric regression and nonparametric regression and the title of a book that we published on the topic in early 2003. We review developments in the field during the five year period since the book was written. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application

CiteSeerX

Crossref

OPUS - University of Technology Sydney

OAKTrust Digital Repository (Texas A&M Univ)

PubMed Central

Research Online