901 research outputs found

    The transfer principle: A tool for complete case analysis

    Get PDF
    This paper gives a general method for deriving limiting distributions of complete case statistics for missing data models from corresponding results for the model where all data are observed. This provides a convenient tool for obtaining the asymptotic behavior of complete case versions of established full data methods without lengthy proofs. The methodology is illustrated by analyzing three inference procedures for partially linear regression models with responses missing at random. We first show that complete case versions of asymptotically efficient estimators of the slope parameter for the full model are efficient, thereby solving the problem of constructing efficient estimators of the slope parameter for this model. Second, we derive an asymptotically distribution free test for fitting a normal distribution to the errors. Finally, we obtain an asymptotically distribution free test for linearity, that is, for testing that the nonparametric component of these models is a constant. This test is new both when data are fully observed and when data are missing at random.Comment: Published in at http://dx.doi.org/10.1214/12-AOS1061 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Topics in Partially Linear Single-Index Models for Longitudinal Data

    Get PDF
    The partially linear single-index model is a semiparametric model proposed to the case when some predictors are linearly associated with the response variable, while some other predictors are nonlinearly associated with the response variable. It is widely used for its flexibility in statistical modeling. Furthermore, its generalized version is a generalization of some popular models such as the generalized linear model, the partially linear model and the single-index model. However, the proper estimation in partially linear single-index models for longitudinal data, where multiple measurements are observed for each subject, is still open to discussion. Our main purpose is to establish a unified estimation method for the longitudinal partially linear single-index model and its generalized version. With this question in mind, we propose a new iterative three-stage estimation method in partially linear single-index models and generalized partially linear single-index models for longitudinal data. With the proposed method, the within-subject correlation is properly taken into consideration in the estimation of both the parameters and the nonparametric single-index function. The parameter estimators are shown to be asymptotically semiparametric efficient. The asymptotic variance of the single-index function estimator is shown to be generally less than that of existing estimators. Simulation studies are performed to demonstrate the finite sample performance. Three real data examples are also analyzed to illustrate the methodology

    Structured additive regression for multicategorical space-time data: A mixed model approach

    Get PDF
    In many practical situations, simple regression models suffer from the fact that the dependence of responses on covariates can not be sufficiently described by a purely parametric predictor. For example effects of continuous covariates may be nonlinear or complex interactions between covariates may be present. A specific problem of space-time data is that observations are in general spatially and/or temporally correlated. Moreover, unobserved heterogeneity between individuals or units may be present. While, in recent years, there has been a lot of work in this area dealing with univariate response models, only limited attention has been given to models for multicategorical space-time data. We propose a general class of structured additive regression models (STAR) for multicategorical responses, allowing for a flexible semiparametric predictor. This class includes models for multinomial responses with unordered categories as well as models for ordinal responses. Non-linear effects of continuous covariates, time trends and interactions between continuous covariates are modelled through Bayesian versions of penalized splines and flexible seasonal components. Spatial effects can be estimated based on Markov random fields, stationary Gaussian random fields or two-dimensional penalized splines. We present our approach from a Bayesian perspective, allowing to treat all functions and effects within a unified general framework by assigning appropriate priors with different forms and degrees of smoothness. Inference is performed on the basis of a multicategorical linear mixed model representation. This can be viewed as posterior mode estimation and is closely related to penalized likelihood estimation in a frequentist setting. Variance components, corresponding to inverse smoothing parameters, are then estimated by using restricted maximum likelihood. Numerically efficient algorithms allow computations even for fairly large data sets. As a typical example we present results on an analysis of data from a forest health survey
    corecore