22,831 research outputs found

    A Statistical Toolbox For Mining And Modeling Spatial Data

    Get PDF
    Most data mining projects in spatial economics start with an evaluation of a set of attribute variables on a sample of spatial entities, looking for the existence and strength of spatial autocorrelation, based on the Moran’s and the Geary’s coefficients, the adequacy of which is rarely challenged, despite the fact that when reporting on their properties, many users seem likely to make mistakes and to foster confusion. My paper begins by a critical appraisal of the classical definition and rational of these indices. I argue that while intuitively founded, they are plagued by an inconsistency in their conception. Then, I propose a principled small change leading to corrected spatial autocorrelation coefficients, which strongly simplifies their relationship, and opens the way to an augmented toolbox of statistical methods of dimension reduction and data visualization, also useful for modeling purposes. A second section presents a formal framework, adapted from recent work in statistical learning, which gives theoretical support to our definition of corrected spatial autocorrelation coefficients. More specifically, the multivariate data mining methods presented here, are easily implementable on the existing (free) software, yield methods useful to exploit the proposed corrections in spatial data analysis practice, and, from a mathematical point of view, whose asymptotic behavior, already studied in a series of papers by Belkin & Niyogi, suggests that they own qualities of robustness and a limited sensitivity to the Modifiable Areal Unit Problem (MAUP), valuable in exploratory spatial data analysis

    Modeling Spatial Autocorrelation in Spatial Interaction Data: A Comparison of Spatial Econometric and Spatial Filtering Specifications

    Get PDF
    The need to account for spatial autocorrelation is well known in spatial analysis. Many spatial statistics and spatial econometric texts detail the way spatial autocorrelation can be identified and modelled in the case of object and field data. The literature on spatial autocorrelation is much less developed in the case of spatial interaction data. The focus of interest in this paper is on the problem of spatial autocorrelation in a spatial interaction context. The paper aims to illustrate that eigenfunction-based spatial filtering offers a powerful methodology that can efficiently account for spatial autocorrelation effects within a Poisson spatial interaction model context that serves the purpose to identify and measure spatial separation effects to interregional knowledge spillovers as captured by patent citations among high-technology-firms in Europe.

    Spatial Quantile Regression In Analysis Of Healthy Life Years In The European Union Countries

    Get PDF
    The paper investigates the impact of the selected factors on the healthy life years of men and women in the EU countries. The multiple quantile spatial autoregression models are used in order to account for substantial differences in the healthy life years and life quality across the EU members. Quantile regression allows studying dependencies between variables in different quantiles of the response distribution. Moreover, this statistical tool is robust against violations of the classical regression assumption about the distribution of the error term. Parameters of the models were estimated using instrumental variable method (Kim, Muller 2004), whereas the confidence intervals and p-values were bootstrapped

    MAPS OF CONTINUOUS SPATIAL DEPENDENCE

    Get PDF
    Heterogeneity is one of the distinguishing features in spatial econometric models. It is a frequent problem in applied work and can be very damaging for statistical inference. In this paper, we focus on the problems implied by the existence of instabilities in the mechanism of spatial dependence in a spatial lag model, assuming that the other terms of the specification remain stable. We begin the discussion with the role played by the algorithms of local estimation in detecting the instabilities. Problems appear when one must decide what to do once the existence of heterogeneity has been confirmed. The logical reaction is trying to parameterize this lack of stability. However, the solution is not obvious. Assuming that a set of indicators related to the problem has been identified, we propose a simple technique to deal with the unknown functional form. In the final part of the paper, we present some Monte Carlo evidence and an application to evaluate the instability in the mechanisms of spatial dependence in the convergence process of the European Regions.DEPENDENCE, LOCAL ESTIMATION, MONTE-CARLO, SPATIAL INSTABILITY

    Revisiting Guerry's data: Introducing spatial constraints in multivariate analysis

    Full text link
    Standard multivariate analysis methods aim to identify and summarize the main structures in large data sets containing the description of a number of observations by several variables. In many cases, spatial information is also available for each observation, so that a map can be associated to the multivariate data set. Two main objectives are relevant in the analysis of spatial multivariate data: summarizing covariation structures and identifying spatial patterns. In practice, achieving both goals simultaneously is a statistical challenge, and a range of methods have been developed that offer trade-offs between these two objectives. In an applied context, this methodological question has been and remains a major issue in community ecology, where species assemblages (i.e., covariation between species abundances) are often driven by spatial processes (and thus exhibit spatial patterns). In this paper we review a variety of methods developed in community ecology to investigate multivariate spatial patterns. We present different ways of incorporating spatial constraints in multivariate analysis and illustrate these different approaches using the famous data set on moral statistics in France published by Andr\'{e}-Michel Guerry in 1833. We discuss and compare the properties of these different approaches both from a practical and theoretical viewpoint.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS356 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org
    • 

    corecore