38,873 research outputs found
Nonparametric Estimation of the Link Function Including Variable Selection
Nonparametric methods for the estimation of the link function in generalized linear models are able to avoid bias in the regression parameters. But for the estimation of the link typically the full model, which includes all predictors, has been used. When the number of predictors is large these methods fail since the full model can not be estimated. In the present article a boosting type method is proposed that simultaneously selects predictors and estimates the link function. The method performs quite well in simulations and real data examples
Nonparametric Identification of Multivariate Mixtures
This article analyzes the identifiability of k-variate, M-component finite mixture models in which each component distribution has independent marginals, including models in latent class analysis. Without making parametric assumptions on the component distributions, we investigate how one can identify the number of components and the component distributions from the distribution function of the observed data. We reveal an important link between the number of variables (k), the number of values each variable can take, and the number of identifiable components. A lower bound on the number of components (M) is nonparametrically identifiable if k >= 2, and the maximum identifiable number of components is determined by the number of different values each variable takes. When M is known, the mixing proportions and the component distributions are nonparametrically identified from matrices constructed from the distribution function of the data if (i) k >= 3, (ii) two of k variables take at least M different values, and (iii) these matrices satisfy some rank and eigenvalue conditions. For the unknown M case, we propose an algorithm that possibly identifies M and the component distributions from data. We discuss a condition for nonparametric identi fication and its observable implications. In case M cannot be identified, we use our identification condition to develop a procedure that consistently estimates a lower bound on the number of components by estimating the rank of a matrix constructed from the distribution function of observed variables.finite mixture, latent class analysis, latent class model, model selection, number of components, rank estimation
Markov-switching generalized additive models
We consider Markov-switching regression models, i.e. models for time series
regression analyses where the functional relationship between covariates and
response is subject to regime switching controlled by an unobservable Markov
chain. Building on the powerful hidden Markov model machinery and the methods
for penalized B-splines routinely used in regression analyses, we develop a
framework for nonparametrically estimating the functional form of the effect of
the covariates in such a regression model, assuming an additive structure of
the predictor. The resulting class of Markov-switching generalized additive
models is immensely flexible, and contains as special cases the common
parametric Markov-switching regression models and also generalized additive and
generalized linear models. The feasibility of the suggested maximum penalized
likelihood approach is demonstrated by simulation and further illustrated by
modelling how energy price in Spain depends on the Euro/Dollar exchange rate
Local Adaptive Grouped Regularization and its Oracle Properties for Varying Coefficient Regression
Varying coefficient regression is a flexible technique for modeling data
where the coefficients are functions of some effect-modifying parameter, often
time or location in a certain domain. While there are a number of methods for
variable selection in a varying coefficient regression model, the existing
methods are mostly for global selection, which includes or excludes each
covariate over the entire domain. Presented here is a new local adaptive
grouped regularization (LAGR) method for local variable selection in spatially
varying coefficient linear and generalized linear regression. LAGR selects the
covariates that are associated with the response at any point in space, and
simultaneously estimates the coefficients of those covariates by tailoring the
adaptive group Lasso toward a local regression model with locally linear
coefficient estimates. Oracle properties of the proposed method are established
under local linear regression and local generalized linear regression. The
finite sample properties of LAGR are assessed in a simulation study and for
illustration, the Boston housing price data set is analyzed.Comment: 30 pages, one technical appendix, two figure
- ā¦