2,108 research outputs found
Estimating the Distribution of Dietary Consumption Patterns
In the United States the preferred method of obtaining dietary intake data is
the 24-hour dietary recall, yet the measure of most interest is usual or
long-term average daily intake, which is impossible to measure. Thus, usual
dietary intake is assessed with considerable measurement error. We were
interested in estimating the population distribution of the Healthy Eating
Index-2005 (HEI-2005), a multi-component dietary quality index involving ratios
of interrelated dietary components to energy, among children aged 2-8 in the
United States, using a national survey and incorporating survey weights. We
developed a highly nonlinear, multivariate zero-inflated data model with
measurement error to address this question. Standard nonlinear mixed model
software such as SAS NLMIXED cannot handle this problem. We found that taking a
Bayesian approach, and using MCMC, resolved the computational issues and doing
so enabled us to provide a realistic distribution estimate for the HEI-2005
total score. While our computation and thinking in solving this problem was
Bayesian, we relied on the well-known close relationship between Bayesian
posterior means and maximum likelihood, the latter not computationally
feasible, and thus were able to develop standard errors using balanced repeated
replication, a survey-sampling approach.Comment: Published in at http://dx.doi.org/10.1214/12-STS413 the Statistical
Science (http://www.imstat.org/sts/) by the Institute of Mathematical
Statistics (http://www.imstat.org). arXiv admin note: substantial text
overlap with arXiv:1107.486
Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier
The data functions that are studied in the course of functional data analysis
are assembled from discrete data, and the level of smoothing that is used is
generally that which is appropriate for accurate approximation of the
conceptually smooth functions that were not actually observed. Existing
literature shows that this approach is effective, and even optimal, when using
functional data methods for prediction or hypothesis testing. However, in the
present paper we show that this approach is not effective in classification
problems. There a useful rule of thumb is that undersmoothing is often
desirable, but there are several surprising qualifications to that approach.
First, the effect of smoothing the training data can be more significant than
that of smoothing the new data set to be classified; second, undersmoothing is
not always the right approach, and in fact in some cases using a relatively
large bandwidth can be more effective; and third, these perverse results are
the consequence of very unusual properties of error rates, expressed as
functions of smoothing parameters. For example, the orders of magnitude of
optimal smoothing parameter choices depend on the signs and sizes of terms in
an expansion of error rate, and those signs and sizes can vary dramatically
from one setting to another, even for the same classifier.Comment: Published in at http://dx.doi.org/10.1214/13-AOS1158 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Variance estimation for the instrumental variables approach to measurement error in generalized linear models
This paper derives and gives explicit formulas for a derived sandwich variance estimate. This variance estimate is appropriate for generalized linear additive measurement error models fitted using instrumental variables. We also generalize the known results for linear regression. As such, this article explains the theoretical justification for the sandwich estimate of variance utilized in the software for measurement error developed under the Small Business Innovation Research Grant (SBIR) by StataCorp. The results admit estimation of variance matrices for measurement error models where there is an instrument for the unknown covariate. Copyright 2003 by StataCorp LP.sandwich estimate of variance, measurement error, White's estimator, robust variance, generalized linear models, instrumental variables
Measurement error, GLMs, and notational conventions
This paper introduces additive measurement error in a generalized linear-model context. We discuss the types of measurement error along with their effects on fitted models. In addition, we present the notational conventions to be used in this and the accompanying papers. Copyright 2003 by StataCorp LP.generalized linear models, transportability, measurement error
Aurora Volume 07
College formerly located at Olivet, Illinois and known as Olivet University, 1912-1923 ; Olivet College, 1923-1939 ; Olivet Nazarene College, 1940-1986 ; Olivet Nazarene University, 1986-https://digitalcommons.olivet.edu/arch_yrbks/1006/thumbnail.jp
Semiparametric Regression During 2003–2007
Semiparametric regression is a fusion between parametric regression and nonparametric regression and the title of a book that we published on the topic in early 2003. We review developments in the field during the five year period since the book was written. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application
- …