2,100 research outputs found
Penalized variable selection procedure for Cox models with semiparametric relative risk
We study the Cox models with semiparametric relative risk, which can be
partially linear with one nonparametric component, or multiple additive or
nonadditive nonparametric components. A penalized partial likelihood procedure
is proposed to simultaneously estimate the parameters and select variables for
both the parametric and the nonparametric parts. Two penalties are applied
sequentially. The first penalty, governing the smoothness of the multivariate
nonlinear covariate effect function, provides a smoothing spline ANOVA
framework that is exploited to derive an empirical model selection tool for the
nonparametric part. The second penalty, either the
smoothly-clipped-absolute-deviation (SCAD) penalty or the adaptive LASSO
penalty, achieves variable selection in the parametric part. We show that the
resulting estimator of the parametric part possesses the oracle property, and
that the estimator of the nonparametric part achieves the optimal rate of
convergence. The proposed procedures are shown to work well in simulation
experiments, and then applied to a real data example on sexually transmitted
diseases.Comment: Published in at http://dx.doi.org/10.1214/09-AOS780 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Bayesian Regularisation in Structured Additive Regression Models for Survival Data
During recent years, penalized likelihood approaches have attracted a lot of interest both in the area of semiparametric regression and for the regularization of high-dimensional regression models. In this paper, we introduce a Bayesian formulation that allows to combine both aspects into a joint regression model with a focus on hazard regression for survival times. While Bayesian penalized splines form the basis for estimating nonparametric and flexible time-varying effects, regularization of high-dimensional covariate vectors is based on scale mixture of normals priors. This class of priors allows to keep a (conditional) Gaussian prior for regression coefficients on the predictor stage of the model but introduces suitable mixture distributions for the Gaussian variance to achieve regularization. This scale mixture property allows to device general and adaptive Markov chain Monte Carlo simulation algorithms for fitting a variety of hazard regression models. In particular, unifying algorithms based on iteratively weighted least squares proposals can be employed both for regularization and penalized semiparametric function estimation. Since sampling based estimates do no longer have the variable selection property well-known for the Lasso in frequentist analyses, we additionally consider spike and slab priors that introduce a further mixing stage that allows to separate between influential and redundant parameters. We demonstrate the different shrinkage properties with three simulation settings and apply the methods to the PBC Liver dataset
Bayesian Semiparametric Multi-State Models
Multi-state models provide a unified framework for the description of the evolution of discrete phenomena in continuous time. One particular example are Markov processes which can be characterised by a set of time-constant transition intensities between the states. In this paper, we will extend such parametric approaches to semiparametric models with flexible transition intensities based on Bayesian versions of penalised splines. The transition intensities will be modelled as smooth functions of time and can further be related to parametric as well as nonparametric covariate effects. Covariates with time-varying effects and frailty terms can be included in addition. Inference will be conducted either fully Bayesian using Markov chain Monte Carlo simulation techniques or empirically Bayesian based on a mixed model representation. A counting process representation of semiparametric multi-state models provides the likelihood formula and also forms the basis for model validation via martingale residual processes. As an application, we will consider human sleep data with a discrete set of sleep states such as REM and Non-REM phases. In this case, simple parametric approaches are inappropriate since the dynamics underlying human sleep are strongly varying throughout the night and individual-specific variation has to be accounted for using covariate information and frailty terms
- …