11,260 research outputs found

    Detection of risk factors for obesity in early childhood with quantile regression methods for longitudinal data

    Get PDF
    This article compares and discusses three different statistical methods for investigating risk factors for overweight and obesity in early childhood by means of the LISA study, a recent German birth cohort study with 3097 children. Since the definition of overweight and obesity is typically based on upper quantiles (90% and 97%) of the age specific body mass index (BMI) distribution, our aim was to model the influence of risk factors and age on these quantiles while as far as possible taking the longitudinal data structure into account. The following statistical regression models were chosen: additive mixed models, generalized additive models for location, scale and shape (GAMLSS), and distribution free quantile regression models. The methods were compared empirically by cross-validation and for the data at hand no model could be rated superior. Motivated by previous studies we explored whether there is an age-specific skewness of the BMI distribution. The investigated data does not suggest such an effect, even after adjusting for risk factors. Concerning risk factors, our results mainly confirm results obtained in previous studies. From a methodological point of view, we conclude that GAMLSS and distribution free quantile regression are promising approaches for longitudinal quantile regression, requiring, however, further extensions to fully account for longitudinal data structures

    A Spatial Quantile Regression Hedonic Model of Agricultural Land Prices

    Get PDF
    Abstract Land price studies typically employ hedonic analysis to identify the impact of land characteristics on price. Owing to the spatial fixity of land, however, the question of possible spatial dependence in agricultural land prices arises. The presence of spatial dependence in agricultural land prices can have serious consequences for the hedonic model analysis. Ignoring spatial autocorrelation can lead to biased estimates in land price hedonic models. We propose using a flexible quantile regression-based estimation of the spatial lag hedonic model allowing for varying effects of the characteristics and, more importantly, varying degrees of spatial autocorrelation. In applying this approach to a sample of agricultural land sales in Northern Ireland we find that the market effectively consists of two relatively separate segments. The larger of these two segments conforms to the conventional hedonic model with no spatial lag dependence, while the smaller, much thinner market segment exhibits considerable spatial lag dependence. Un mod�le h�donique � r�gression quantile spatiale des prix des terrains agricoles R�sum� Les �tudes sur le prix des terrains font g�n�ralement usage d'une analyse h�donique pour identifier l'impact des caract�ristiques des terrains sur le prix. Toutefois, du fait de la fixit� spatiale des terrains, la question d'une �ventuelle d�pendance spatiale sur la valeur des terrains agricoles se pose. L'existence d'une d�pendance spatiale dans le prix des terrains agricoles peut avoir des cons�quences importantes sur l'analyse du mod�le h�donique. En ignorant cette corr�lation s�rielle, on s'expose au risque d'�valuations biais�es des mod�les h�doniques du prix des terrains. Nous proposons l'emploi d'une estimation � base de r�gression flexible du mod�le h�donique � d�calage spatial, tenant compte de diff�rents effets des caract�ristiques, et surtout de diff�rents degr�s de corr�lations s�rielles spatiales. En appliquant ce principe � un �chantillon de ventes de terrains agricoles en Irlande du Nord, nous d�couvrons que le march� se compose de deux segments relativement distincts. Le plus important de ces deux segments est conforme au mod�le h�donique traditionnel, sans d�pendance du d�calage spatial, tandis que le deuxi�me segment du march�, plus petit et beaucoup plus �troit, pr�sente une d�pendance consid�rable du d�calage spatial. Un modelo hed�nico de regresi�n cuantil espacial de los precios del terreno agr�cola Resumen T�picamente, los estudios del precio de la tierra emplean un an�lisis hed�nico para identificar el impacto de las caracter�sticas de la tierra sobre el precio. No obstante, debido a la fijeza espacial de la tierra, surge la cuesti�n de una posible dependencia espacial en los precios del terreno agr�cola. La presencia de dependencia espacial en los precios del terreno agr�cola puede tener consecuencias graves para el modelo de an�lisis hed�nico. Ignorar la autocorrelaci�n espacial puede conducir a estimados parciales en los modelos hed�nicos del precio de la tierra. Proponemos el uso de una valoraci�n basada en una regresi�n cuantil flexible del modelo hed�nico del lapso espacial que tenga en cuenta los diversos efectos de las caracter�sticas y, particularmente, los diversos grados de autocorrelaci�n espacial. Al aplicar este planteamiento a una muestra de ventas de terreno agr�cola en Irlanda del Norte, descubrimos que el mercado consiste efectivamente de dos segmento relativamente separados. El m�s grande de estos dos segmentos se ajusta al modelo hed�nico convencional sin dependencia del lapso espacial, mientras que el segmento m�s peque�o, y mucho m�s fino, muestra una dependencia considerable del lapso espacial.Spatial lag, quantile regression, hedonic model, C13, C14, C21, Q24,

    A note on the use of quantile regression in beta convergence analysis

    Get PDF
    We discuss how to interpret conflicting results obtained by the use of quantile regression methods in growth regression tests of β-convergence hypothesis and the results obtained by nonparametric methods. We show that the assumption of linearity may cause the non-rejection of the β-convergence hypothesis by quantile regression. We also show that using a nonparametric form of quantile regression, we can reject the hypothesis of β-convergence and confirm the results of divergence and formation of convergence clubs. We illustrate the discussion by using the conflicting results on convergence found in the dataset of per-capita income of Brazilian municipalities between 1970 and 1996.

    On Weighted Multivariate Sign Functions

    Full text link
    Multivariate sign functions are often used for robust estimation and inference. We propose using data dependent weights in association with such functions. The proposed weighted sign functions retain desirable robustness properties, while significantly improving efficiency in estimation and inference compared to unweighted multivariate sign-based methods. Using weighted signs, we demonstrate methods of robust location estimation and robust principal component analysis. We extend the scope of using robust multivariate methods to include robust sufficient dimension reduction and functional outlier detection. Several numerical studies and real data applications demonstrate the efficacy of the proposed methodology.Comment: Keywords: Multivariate sign, Principal component analysis, Data depth, Sufficient dimension reductio

    Boosted Beta regression.

    Get PDF
    Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1). Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures

    Interpretable statistics for complex modelling: quantile and topological learning

    Get PDF
    As the complexity of our data increased exponentially in the last decades, so has our need for interpretable features. This thesis revolves around two paradigms to approach this quest for insights. In the first part we focus on parametric models, where the problem of interpretability can be seen as a “parametrization selection”. We introduce a quantile-centric parametrization and we show the advantages of our proposal in the context of regression, where it allows to bridge the gap between classical generalized linear (mixed) models and increasingly popular quantile methods. The second part of the thesis, concerned with topological learning, tackles the problem from a non-parametric perspective. As topology can be thought of as a way of characterizing data in terms of their connectivity structure, it allows to represent complex and possibly high dimensional through few features, such as the number of connected components, loops and voids. We illustrate how the emerging branch of statistics devoted to recovering topological structures in the data, Topological Data Analysis, can be exploited both for exploratory and inferential purposes with a special emphasis on kernels that preserve the topological information in the data. Finally, we show with an application how these two approaches can borrow strength from one another in the identification and description of brain activity through fMRI data from the ABIDE project
    corecore