469 research outputs found

    Introducing the GWmodel R and python packages for modelling spatial heterogeneity

    Get PDF
    In the very early developments of quantitative geography, statistical techniques were invariably applied at a ‘global’ level, where moments or relationships were assumed constant across the study region (Fotheringham and Brunsdon, 1999). However, the world is not an “average” space but full of variations and as such, statistical techniques need to account for different forms of spatial heterogeneity or non-stationarity (Goodchild, 2004). Consequently, a number of local methods were developed, many of which model non- stationarity relationships via some regression adaptation. Examples include: the expansion method (Casetti, 1972), random coefficient modelling (Swamy et al., 1988), multilevel modelling (Duncan and Jones, 2000) and space varying parameter models (Assunção, 2003). One such localised regression, geographically weighted regression (GWR) (Brunsdon et al., 1996) has become increasingly popular and has been broadly applied in many disciplines outside of its quantitative geography roots. This includes: regional economics, urban and regional analysis, sociology and ecology. There are several toolkits available for applying GWR, such as GWR3.x (Charlton et al., 2007); GWR 4.0 (Nakaya et al., 2009); the GWR toolkit in ArcGIS (ESRI, 2009); the R packages spgwr (Bivand and Yu, 2006) and gwrr (Wheeler, 2011); and STIS (Arbor, 2010). Most focus on the fundamental functions of GWR or some specific issue - for example, gwrr provides tools to diagnose collinearity. As a major extension, we report in this paper the development an integrated framework for handling spatially varying structures, via a wide range of geographically weighted (GW) models, not just GWR. All functions are included in an R package named GWmodel, which is also mirrored with a set of GW modelling tools for ESRI’s ArcGIS written in Python

    Links, comparisons and extensions of the geographically weighted regression model when used as a spatial predictor

    Get PDF
    In this study, we link and compare the geographically weighted regression (GWR) model with the kriging with an external drift (KED) model of geostatistics. This includes empirical work where models are performance tested with respect to prediction and prediction uncertainty accuracy. In basic forms, GWR and KED (specified with local neighbourhoods) both cater for nonstationary correlations (i.e. the process is heteroskedastic with respect to relationships between the variable of interest and its covariates) and as such, can predict more accurately than models that do not. Furthermore, on specification of an additional heteroskedastic term to the same models (now with respect to a process variance), locallyaccurate measures of prediction uncertainty can result. These heteroskedastic extensions of GWR and KED can be preferred to basic constructions, whose measures of prediction uncertainty are only ever likely to be globallyaccurate. We evaluate both basic and heteroskedastic GWR and KED models using a case study data set, where data relationships are known to vary across space. Here GWR performs well with respect to the more involved KED model and as such, GWR is considered a viable alternative to the more established model in this particular comparison. Our study adds to a growing body of empirical evidence that GWR can be a worthy predictor; complementing its more usual guise as an exploratory technique for investigating relationships in multivariate spatial data sets

    Package ‘GWmodel’

    Get PDF
    In GWmodel, we introduce techniques from a particular branch of spatial statis- tics,termed geographically-weighted (GW) models. GW models suit situa- tions when data are not described well by some global model, but where there are spatial re- gions where a suitably localised calibration provides a better description. GWmodel in- cludes functions to calibrate: GW summary statistics, GW principal components analy- sis, GW discriminant analysis and various forms of GW regression; some of which are pro- vided in basic and robust (outlier resistant) forms

    Package ‘GWmodel’

    Get PDF
    In GWmodel, we introduce techniques from a particular branch of spatial statis- tics,termed geographically-weighted (GW) models. GW models suit situa- tions when data are not described well by some global model, but where there are spatial re- gions where a suitably localised calibration provides a better description. GWmodel in- cludes functions to calibrate: GW summary statistics, GW principal components analy- sis, GW discriminant analysis and various forms of GW regression; some of which are pro- vided in basic and robust (outlier resistant) forms

    Novel approaches to investigating spatial variability in channel bank total phosphorus at the catchment scale

    Get PDF
    Phosphorus (P) is often a limiting nutrient that leads to the eutrophication of aquatic systems. While dissolved P forms are the most bioavailable, the form, mobility, transport and fate of P are directly related to its association with fine-grained riverine sediment. Therefore, to implement successful P catchment management strategies it is important to understand the relative contribution of different sediment sources to P loads across the river continuum. While agricultural topsoil and, to a lesser extent, riverbed sediment are important sources of sediment-associated P, channel banks have been shown to be an important sediment source in some catchments. However, comparatively little is known about the P concentration and corresponding spatial variability in channel bank sediment and the associated implications for catchment management. The present study examines the spatial variability of P associated with channel bank profiles within a series of three nested catchments using both non-spatial and spatial statistical methods, where for the latter, a novel spatial approach was used to estimate the spatial averages and variances of P in channel bank sediment along the stream network. Channel bank P concentrations were compared to factors such as catchment scale, stream order, land use, bank exposure and location along the stream network. Concentrations of TP ranged between 129.6 and 1206.9 mg P kg⁻¹ of which the water extractable P (WEP) content ranged from 0.01 to 0.12%. Stream order was found to influence TP concentrations, while land use and catchment scale provided only a moderate influence. This suggested that focussing channel bank sampling strategies at the largest catchment scale would capture key drivers of TP variability provided stream order is sufficiently represented. Whether the bank was had limited vegetation and was exposed and potentially eroding had a slight influence on TP variability in second-order stream banks in the larger of the two nested catchments. However, the slightly lower TP concentrations measured at these sites indicated that banks that are actually eroding may be contributing less TP than the total channel bank TP values measured across the catchments as a whole. The results of an explicitly spatial analysis demonstrated that local channel bank TP averages and TP variances vary along the stream network. Specifically, the most accurate spatial predictor of TP was local TP means with the use of ‘crow flies’ rather than stream network distances. Local TP variances were used to provide optimal designs for future channel bank TP sampling campaigns, given available resources. Throughout, both standard and outlier-resistant statistical analyses were applied to improve interpretation of the study findings

    gwpcorMapper: an interactive mapping tool for exploring geographically weighted correlation and partial correlation in high-dimensional geospatial datasets

    Full text link
    Exploratory spatial data analysis (ESDA) plays a key role in research that includes geographic data. In ESDA, analysts often want to be able to visualize observations and local relationships on a map. However, software dedicated to visualizing local spatial relations be-tween multiple variables in high dimensional datasets remains undeveloped. This paper introduces gwpcorMapper, a newly developed software application for mapping geographically weighted correlation and partial correlation in large multivariate datasets. gwpcorMap-per facilitates ESDA by giving researchers the ability to interact with map components that describe local correlative relationships. We built gwpcorMapper using the R Shiny framework. The software inherits its core algorithm from GWpcor, an R library for calculating the geographically weighted correlation and partial correlation statistics. We demonstrate the application of gwpcorMapper by using it to explore census data in order to find meaningful relationships that describe the work-life environment in the 23 special wards of Tokyo, Japan. We show that gwpcorMapper is useful in both variable selection and parameter tuning for geographically weighted statistics. gwpcorMapper highlights that there are strong statistically clear local variations in the relationship between the number of commuters and the total number of hours worked when considering the total population in each district across the 23 special wards of Tokyo. Our application demonstrates that the ESDA process with high-dimensional geospatial data using gwpcorMapper has applications across multiple fields.Comment: 18 pages, 8 figures, 2 table

    Investigating spatial error structures in continuous raster data

    Get PDF
    The objective of this study is to investigate spatial structures of error in the assessment of continuous raster data. The use of conventional diagnostics of error often overlooks the possible spatial variation in error because such diagnostics report only average error or deviation between predicted and reference values. In this respect, this work uses a moving window (kernel) approach to generate geographically weighted (GW) versions of the mean signed deviation, the mean absolute error and the root mean squared error and to quantify their spatial variations. Such approach computes local error diagnostics from data weighted by its distance to the centre of a moving kernel and allows to map spatial surfaces of each type of error. In addition, a GW correlation analysis between predicted and reference values provides an alternative view of local error. These diagnostics are applied to two earth observation case studies. The results reveal important spatial structures of error and unusual clusters of error can be identified through Monte Carlo permutation tests. The first case study demonstrates the use of GW diagnostics to fractional impervious surface area datasets generated by four different models for the Jakarta metropolitan area, Indonesia. The GW diagnostics reveal where the models perform differently and similarly, and found areas of under-prediction in the urban core, with larger errors in peri-urban areas. The second case study uses the GW diagnostics to four remotely sensed aboveground biomass datasets for the Yucatan Peninsula, Mexico. The mapping of GW diagnostics provides a means to compare the accuracy of these four continuous raster datasets locally. The discussion considers the relative nature of diagnostics of error, determining moving window size and issues around the interpretation of different error diagnostic measures. Investigating spatial structures of error hidden in conventional diagnostics of error provides informative descriptions of error in continuous raster data

    The GWmodel R package: Further Topics for Exploring Spatial Heterogeneity using Geographically Weighted Models

    Get PDF
    In this study, we present a collection of local models, termed geographically weighted (GW) models, that can be found within the GWmodel R package. A GW model suits situations when spatial data are poorly described by the global form, and for some regions the localised fit provides a better description. The approach uses a moving window weighting technique, where a collection of local models are estimated at target locations. Commonly, model parameters or outputs are mapped so that the nature of spatial heterogeneity can be explored and assessed. In particular, we present case studies using: (i) GW summary statistics and a GW principal components analysis; (ii) advanced GW regression fits and diagnostics; (iii) associated Monte Carlo significance tests for non-stationarity; (iv) a GW discriminant analysis; and (v) enhanced kernel bandwidth selection procedures. General Election data sets from the Republic of Ireland and US are used for demonstration. This study is designed to complement a companion GWmodel study, which focuses on basic and robust GW models

    GWmodel: an R package for exploring spatial heterogeneity using geographically weighted models

    Get PDF
    Spatial statistics is a growing discipline providing important analytical techniques in a wide range of disciplines in the natural and social sciences. In the R package GWmodel we present techniques from a particular branch of spatial statistics, termed geographically weighted (GW) models. GW models suit situations when data are not described well by some global model, but where there are spatial regions where a suitably localized calibration provides a better description. The approach uses a moving window weighting technique, where localized models are found at target locations. Outputs are mapped to provide a useful exploratory tool into the nature of the data spatial heterogeneity. Currently, GWmodel includes functions for: GW summary statistics, GW principal components analysis, GW regression, and GW discriminant analysis; some of which are provided in basic and robust forms
    corecore