77,024 research outputs found
Penalized variable selection procedure for Cox models with semiparametric relative risk
We study the Cox models with semiparametric relative risk, which can be
partially linear with one nonparametric component, or multiple additive or
nonadditive nonparametric components. A penalized partial likelihood procedure
is proposed to simultaneously estimate the parameters and select variables for
both the parametric and the nonparametric parts. Two penalties are applied
sequentially. The first penalty, governing the smoothness of the multivariate
nonlinear covariate effect function, provides a smoothing spline ANOVA
framework that is exploited to derive an empirical model selection tool for the
nonparametric part. The second penalty, either the
smoothly-clipped-absolute-deviation (SCAD) penalty or the adaptive LASSO
penalty, achieves variable selection in the parametric part. We show that the
resulting estimator of the parametric part possesses the oracle property, and
that the estimator of the nonparametric part achieves the optimal rate of
convergence. The proposed procedures are shown to work well in simulation
experiments, and then applied to a real data example on sexually transmitted
diseases.Comment: Published in at http://dx.doi.org/10.1214/09-AOS780 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Penalized Likelihood and Bayesian Function Selection in Regression Models
Challenging research in various fields has driven a wide range of
methodological advances in variable selection for regression models with
high-dimensional predictors. In comparison, selection of nonlinear functions in
models with additive predictors has been considered only more recently. Several
competing suggestions have been developed at about the same time and often do
not refer to each other. This article provides a state-of-the-art review on
function selection, focusing on penalized likelihood and Bayesian concepts,
relating various approaches to each other in a unified framework. In an
empirical comparison, also including boosting, we evaluate several methods
through applications to simulated and real data, thereby providing some
guidance on their performance in practice
A Selective Review of Group Selection in High-Dimensional Models
Grouping structures arise naturally in many statistical modeling problems.
Several methods have been proposed for variable selection that respect grouping
structure in variables. Examples include the group LASSO and several concave
group selection methods. In this article, we give a selective review of group
selection concerning methodological developments, theoretical properties and
computational algorithms. We pay particular attention to group selection
methods involving concave penalties. We address both group selection and
bi-level selection methods. We describe several applications of these methods
in nonparametric additive models, semiparametric regression, seemingly
unrelated regressions, genomic data analysis and genome wide association
studies. We also highlight some issues that require further study.Comment: Published in at http://dx.doi.org/10.1214/12-STS392 the Statistical
Science (http://www.imstat.org/sts/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Marginal integration for nonparametric causal inference
We consider the problem of inferring the total causal effect of a single
variable intervention on a (response) variable of interest. We propose a
certain marginal integration regression technique for a very general class of
potentially nonlinear structural equation models (SEMs) with known structure,
or at least known superset of adjustment variables: we call the procedure
S-mint regression. We easily derive that it achieves the convergence rate as
for nonparametric regression: for example, single variable intervention effects
can be estimated with convergence rate assuming smoothness with
twice differentiable functions. Our result can also be seen as a major
robustness property with respect to model misspecification which goes much
beyond the notion of double robustness. Furthermore, when the structure of the
SEM is not known, we can estimate (the equivalence class of) the directed
acyclic graph corresponding to the SEM, and then proceed by using S-mint based
on these estimates. We empirically compare the S-mint regression method with
more classical approaches and argue that the former is indeed more robust, more
reliable and substantially simpler.Comment: 40 pages, 14 figure
Determinants of the Dinar-Euro Nominal Exchange Rate
This paper studies drivers of daily dynamics of the nominal dinar-euro exchange rate from September 2006 to June 2010. Using a novel semiparametric approach we are able to incorporate the evidence of nonlinearities under very weak assumptions on the underlying data generating process. We identify several factors influencing daily exchange rate returns whose importance varies over time. In the period preceeding the financial crisis, information in past returns, changes in households’ foreign currency savings and banks' net purchases of foreign currency are the most significant factors. From September 2008 onwards other factors related to changes in country's risk and the information processing in the market gain importance. NBS interventions are found to be effective with a time delay.Foreign exchange market, Partially linear model, Kernel estimation
Inference in Additively Separable Models With a High-Dimensional Set of Conditioning Variables
This paper studies nonparametric series estimation and inference for the
effect of a single variable of interest x on an outcome y in the presence of
potentially high-dimensional conditioning variables z. The context is an
additively separable model E[y|x, z] = g0(x) + h0(z). The model is
high-dimensional in the sense that the series of approximating functions for
h0(z) can have more terms than the sample size, thereby allowing z to have
potentially very many measured characteristics. The model is required to be
approximately sparse: h0(z) can be approximated using only a small subset of
series terms whose identities are unknown. This paper proposes an estimation
and inference method for g0(x) called Post-Nonparametric Double Selection which
is a generalization of Post-Double Selection. Standard rates of convergence and
asymptotic normality for the estimator are shown to hold uniformly over a large
class of sparse data generating processes. A simulation study illustrates
finite sample estimation properties of the proposed estimator and coverage
properties of the corresponding confidence intervals. Finally, an empirical
application to college admissions policy demonstrates the practical
implementation of the proposed method
- …