37,640 research outputs found
Structured count data regression
Overdispersion in count data regression is often caused by neglection or inappropriate modelling of individual heterogeneity, temporal or spatial correlation, and nonlinear covariate effects. In this paper, we develop and study semiparametric count data models which can deal with these issues by incorporating corresponding components in structured additive form into the predictor. The models are fully Bayesian and inference is carried out by computationally efficient MCMC techniques. In a simulation study, we investigate how well the different components can be identified with the data at hand. The approach is applied to a large data set of claim frequencies from car insurance
Recommended from our members
Self-organizing linear output map (SOLO): An artificial neural network suitable for hydrologic modeling and analysis
Artificial neural networks (ANNs) can be useful in the prediction of hydrologic variables, such as streamflow, particularly when the underlying processes have complex nonlinear interrelationships. However, conventional ANN structures suffer from network training issues that significantly limit their widespread application. This paper presents a multivariate ANN procedure entitled self-organizing linear output map (SOLO), whose structure has been designed for rapid, precise, and inexpensive estimation of network structure/parameters and system outputs. More important, SOLO provides features that facilitate insight into the underlying processes, thereby extending its usefulness beyond forecast applications as a tool for scientific investigations. These characteristics are demonstrated using a classic rainfall-runoff forecasting problem. Various aspects of model performance are evaluated in comparison with other commonly used modeling approaches, including multilayer feedforward ANNs, linear time series modeling, and conceptual rainfall-runoff modeling
Boosted Beta regression.
Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1). Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures
Hierarchical Gaussian process mixtures for regression
As a result of their good performance in practice and their desirable analytical properties, Gaussian process regression models are becoming increasingly of interest in statistics, engineering and other fields. However, two major problems arise when the model is applied to a large data-set with repeated measurements. One stems from the systematic heterogeneity among the different replications, and the other is the requirement to invert a covariance matrix which is involved in the implementation of the model. The dimension of this matrix equals the sample size of the training data-set. In this paper, a Gaussian process mixture model for regression is proposed for dealing with the above two problems, and a hybrid Markov chain Monte Carlo (MCMC) algorithm is used for its implementation. Application to a real data-set is reported
Is the relationship between aid and economic growth nonlinear?:
"There have been intensive debates on the role of aid in promoting economic development in developing countries by using cross-country analyses. Cross-country regression assuming linear relationship between aid and growth and without taking into heterogeneity of countries would produce biased estimates. To correct this, in this paper we investigate the relationship between foreign aid and growth using recently developed sample splitting methods that allow us to simultaneously uncover evidence for the existence of heterogeneity and nonlinearity. We also address model uncertainty in the context of these methods. We find some evidence that aid may have heterogeneous effects on growth across two growth regimes defined by ethnolinguistic fractionalization. However, when we account for model uncertainty, we find no evidence to suggest that the relationship between aid and growth is nonlinear. In fact, our results suggest that the partial effect of aid on growth is likely to be weakly negative. In this sense, our findings suggest that aid is potentially counterproductive to growth with outcomes not meeting the expectations of donors... The methodology developed in this paper can be used to identify typologies on other outcome variables, such as those included in the Millennium Development Goals." from Authors' AbstractEconomic development, Cross-country studies, Foreign aid, Public investment, Nonlinearity, Typology,
Is the Relationship Between Aid and Economic Growth Nonlinear?
In this paper, we investigate the relationship between foreign aid and growth using recently developed sample splitting methods that allow us to uncover evidence for the existence of heterogeneity and nonlinearity simultaneously. We also implement a new methodology that allows us to deal with model uncertainty in the context of these methods. We find some evidence that aid may have heterogeneous effects on growth across two growth regimes defined by ethnic fractionalization. In particular, countries that belong to a growth regime characterized by levels of ethnic fractionalization above a threshold value experience a negative partial relationship between aid and growth, while those in the regime with ethnic fractionalization below the threshold experience no growth effects from aid at all. Nevertheless, there exists substantial model uncertainty so that attempts to pin down the typology of these growth regimes as being decisively characterized by ethnic fractionalization remain inconclusive. When we account for model uncertainty, we find no evidence to suggest that the relationship between aid and growth is nonlinear. Overall, our results suggest that the partial effect of aid on growth is very likely to be negative although we cannot reject the hypothesis that aid has no effect on growth. In this sense, our findings suggest that aid is potentially counterproductive to growth with outcomes not meeting the expectations of donors.
- …