418 research outputs found

    Cross-Covariance Functions for Multivariate Geostatistics

    Full text link
    Continuously indexed datasets with multiple variables have become ubiquitous in the geophysical, ecological, environmental and climate sciences, and pose substantial analysis challenges to scientists and statisticians. For many years, scientists developed models that aimed at capturing the spatial behavior for an individual process; only within the last few decades has it become commonplace to model multiple processes jointly. The key difficulty is in specifying the cross-covariance function, that is, the function responsible for the relationship between distinct variables. Indeed, these cross-covariance functions must be chosen to be consistent with marginal covariance functions in such a way that the second-order structure always yields a nonnegative definite covariance matrix. We review the main approaches to building cross-covariance models, including the linear model of coregionalization, convolution methods, the multivariate Mat\'{e}rn and nonstationary and space-time extensions of these among others. We additionally cover specialized constructions, including those designed for asymmetry, compact support and spherical domains, with a review of physics-constrained models. We illustrate select models on a bivariate regional climate model output example for temperature and pressure, along with a bivariate minimum and maximum temperature observational dataset; we compare models by likelihood value as well as via cross-validation co-kriging studies. The article closes with a discussion of unsolved problems.Comment: Published at http://dx.doi.org/10.1214/14-STS487 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Geostatistics as a tool to improve the natural background level definition: An application in groundwater

    Get PDF
    The Natural Background Level (NBL), suggested by UE BRIDGE project, is suited for spatially distributed datasets providing a regional value that could be higher than the Threshold Value (TV) set by every country. In hydrogeochemically dis-homogeneous areas, the use of a unique regional NBL, higher than TV, could arise problems to distinguish between natural occurrences and anthropogenic contaminant sources. Hence, the goal of this study is to improve the NBL definition employing a geostatistical approach, which reconstructs the contaminant spatial structure accounting geochemical and hydrogeological relationships. This integrated mapping is fundamental to evaluate the contaminant's distribution impact on the NBL, giving indications to improve it. We decided to test this method on the Drainage Basin of Venice Lagoon (DBVL, NE Italy), where the existing NBL is seven times higher than the TV. This area is notoriously affected by naturally occurring arsenic contamination. An available geochemical dataset collected by 50 piezometers was used to reconstruct the spatial distribution of arsenic in the densely populated area of the DBVL. A cokriging approachwas applied exploiting the geochemical relationships among As, Fe andNH4+. The obtained spatial predictions of arsenic concentrationswere divided into three different zones: i) areas with an As concentration lower than the TV, ii) areas with an As concentration between the TV and the median of the values higher than the TV, and iii) areas with an As concentration higher than the median. Following the BRIDGE suggestions, where enough samples were available, the 90th percentile for each zone was calculated to obtain a local NBL (LNBL). Differently from the original NBL, this local value gives more detailed water quality information accounting the hydrogeological and geochemical setting, and contaminant spatial variation. Hence, the LNBL could give more indications about the distinction between natural occurrence and anthropogenic contamination

    Use of geostatistical Bayesian updating to integrate airborne radiometrics and soil geochemistry to improve mapping for mineral exploration

    Get PDF
    Mineral exploration programmes around the world use data from remote sensing, geophysics, and direct sampling. On a regional scale, the combination of airborne geophysics and ground-based geochemical sampling can aid geological mapping and mineral exploration. Since airborne geophysical and traditional soil-sampling data are generated at different spatial resolutions, they are not immediately comparable due to their different sampling density. Several geostatistical techniques, including indicator cokriging and collocated cokriging, can be used to integrate different types of data into a geostatistical model. However, with increasing numbers of variables the inference of the crosscovariance model required for cokriging can be demanding in terms of effort and computational time. In this paper a Gaussian-based Bayesian updating approach is applied to integrate airborne radiometric data and ground-sampled geochemical soil data to maximize information generated from the soil survey, enabling more accurate geological interpretation for the exploration and development of natural resources. The Bayesian updating technique decomposes the collocated estimate into two models: prior and likelihood models. The prior model is built from primary information and the likelihood model is built from secondary information. The prior model is then updated with the likelihood model to build the final model. The approach allows multiple secondary variables to be simultaneously integrated into the mapping of the primary variable. The Bayesian updating approach is demonstrated using a case study from Northern Ireland. The geostatistical technique was used to improve the resolution of soil geochemistry, at a density of one sample per 2 km2, by integrating more closely measured airborne geophysical data from the GSNI Tellus Survey, measured over a footprint of 65 x 200 m. The directly measured geochemistry data were considered as primary data and the airborne radiometric data were used as secondary data. The approach produced more detailed updated maps and in particular enhanced information on the mapped distributions of zinc, copper, and lead. The enhanced delineation of an elongated northwest/southeast trending zone in the updated maps strengthened the potential for discovering stratabound base metal deposits

    Using stochastic space-time models to map extreme precipitation in southern Portugal

    Get PDF
    The topographic characteristics and spatial climatic diversity are significant in the South of continental Portugal where the rainfall regime is typically Mediterranean. Direct sequential cosimulation is proposed for mapping an extreme precipitation index in southern Portugal using elevation as auxiliary information. The analysed index (R5D) can be considered a flood indicator because it provides a measure of medium-term precipitation total. The methodology accounts for local data variability and incorporates space-time models that allow capturing long-term trends of extreme precipitation, and local changes in the relationship between elevation and extreme precipitation through time. Annual gridded datasets of the flood indicator are produced from 1940 to 1999 on 800 m×800 m grids by using the space-time relationship between elevation and the index. Uncertainty evaluations of the proposed scenarios are also produced for each year. The results indicate that the relationship between elevation and extreme precipitation varies locally and has decreased through time over the study region. In wetter years the flood indicator exhibits the highest values in mountainous regions of the South, while in drier years the spatial pattern of extreme precipitation has much less variability over the study region. The uncertainty of extreme precipitation estimates also varies in time and space, and in earlier decades is strongly dependent on the density of the monitoring stations network. The produced maps will be useful in regional and local studies related to climate change, desertification, land and water resources management, hydrological modelling, and flood mitigation planning

    Comparing different approaches - data mining, geostatistic, and deterministic pedology - to assess the frequency of WRB Reference Soil Groups in the Italian soil regions

    Get PDF
    Estimating frequency of soil classes in map unit is always affected by some degree of uncertainty, especially at small scales, with a larger generalization. The aim of this study was to compare different possible approaches - data mining, geostatistic, deterministic pedology - to assess the frequency of WRB Reference Soil Groups (RSG) in the major Italian soil regions. In the soil map of Italy (Costantini et al., 2012), a list of the first five RSG was reported in each major 10 soil regions. The soil map was produced using the national soil geodatabase, which stored 22,015 analyzed and classified pedons, 1,413 soil typological unit (STU) and a set of auxiliary variables (lithology, land-use, DEM). Other variables were added, to better consider the influence of soil forming factors (slope, soil aridity index, carbon stock, soil inorganic carbon content, clay, sand, geography of soil regions and soil systems) and a grid at 1 km mesh was set up. The traditional deterministic pedology assessed the STU frequency according to the expert judgment presence in every elementary landscape which formed the mapping unit. Different data mining techniques were firstly compared in their ability to predict RSG through auxiliary variables (neural networks, random forests, boosted tree, supported vector machine (SVM)). We selected SVM according to the result of a testing set. A SVM model is a representation of the examples as points in space, mapped so that examples of separate categories are divided by a clear gap that is as wide as possible. The geostatistic algorithm we used was an indicator collocated cokriging. The class values of the auxiliary variables, available at all the points of the grid, were transformed in indicator variables (values 0, 1). A principal component analysis allowed us to select the variables that were able to explain the largest variability, and to correlate each RSG with the first principal component, which explained the 51% of the total variability. The principal component was used as collocated variable. The results were as many probability maps as the estimated WRB classes. They were summed up in a unique map, with the most probable class at each pixel. The first five more frequent RSG resulting from the three methods were compared. The outcomes were validated with a subset of the 10% of the pedons, kept out before the elaborations. The error estimate was produced for each estimated RSG. The first results, obtained in one of the most widespread soil region (plains and low hills of central and southern Italy) showed that the first two frequency classes were the same for all the three methods. The deterministic method differed from the others at the third position, while the statistical methods inverted the third and fourth position. An advantage of the SVM was the possibility to use in the same elaboration numeric and categorical variable, without any previous transformation, which reduced the processing time. A Bayesian validation indicated that the SVM method was as reliable as the indicator collocated cokriging, and better than the deterministic pedological approach

    Improving rainfall erosivity estimates using merged TRMM and gauge data

    Get PDF
    Soil erosion is a global issue that threatens food security and causes environmental degradation. Management of water erosion requires accurate estimates of the spatial and temporal variations in the erosive power of rainfall (erosivity). Rainfall erosivity can be estimated from rain gauge stations and satellites. However, the time series rainfall data that has a high temporal resolution are often unavailable in many areas of the world. Satellite remote sensing allows provision of the continuous gridded estimates of rainfall, yet it is generally characterized by significant bias. Here we present a methodology that merges daily rain gauge measurements and the Tropical Rainfall Measuring Mission (TRMM) 3B42 data using collocated cokriging (ColCOK) to quantify the spatial distribution of rainfall and thereby to estimate rainfall erosivity across China. This study also used block kriging (BK) and TRMM to estimate rainfall and rainfall erosivity. The methodologies are evaluated based on the individual rain gauge stations. The results from the present study generally indicate that the ColCOK technique, in combination with TRMM and gauge data, provides merged rainfall fields with good agreement with rain gauges and with the best accuracy with rainfall erosivity estimates, when compared with BK gauges and TRMM alone

    Comparing Different Approaches - Data Mining, Geostatistic, and Deterministic Pedology - to Assess the Frequency of WRB Reference Soil Groups in the Italian Soil Regions

    Get PDF
    The assessment of class frequency in soil map legends is affected by uncertainty, especially at small scales, where generalization is larger. The aim of this study was to test the hypothesis that data mining or geostatistic techniques provide better estimation of class frequency than traditional deterministic pedology in a national soil map. In the map of Italian soil regions compiled at 1:5,000,000 reference scale, soil classes were the WRB Reference Soil Groups (RSGs). Different data mining techniques, namely neural networks, random forests, boosted tree, classification and regression tree, supported vector machine (SVM), were tested and the last one gave the best RSGs predictions, using selected auxiliary variables and 22,015 classified soil profiles. Given the categorical target variable, the multi-collocated indicator cokriging was the algorithm chosen for the geostatistic approach. The first five more frequent RSGs resulting from the three methods were compared. The outcomes were validated with a Bayesian approach on a subset of 10% of geographically representative profiles, kept out before the elaborations. The most frequent classes were uniformly predicted by the three methods, which instead differentiated notably for the classes with a lower occurrence. The Bayesian validation indicated that the SVM method was as reliable as the multi-collocated indicator cokriging, and both more than the deterministic pedological approach. An advantage of the SVM was the possibility to use numeric and categorical variable in the same elaboration, without any previous transformation, which notably reduced the processing time

    Estimating rainfall and water balance over the Okavango River Basin for hydrological applications

    Get PDF
    A historical database for use in rainfall-runoff modeling of the Okavango River Basin in Southwest Africa is presented. The work has relevance for similar data-sparse regions. The parameters of main concern are rainfall and catchment water balance which are key variables for subsequent studies of the hydrological impacts of development and climate change. Rainfall estimates are based on a combination of in-situ gauges and satellite sources. Rain gauge measurements are most extensive from 1955 to 1972, after which they are drastically reduced due to the Angolan civil war. The sensitivity of the rainfall fields to spatial interpolation techniques and the density of gauges was evaluated. Satellite based rainfall estimates for the basin are developed for the period from 1991 onwards, based on the Tropical Rainfall Measuring Mission (TRMM) and Special Sensor Microwave Imager (SSM/I) data sets. The consistency between the gauges and satellite estimates was considered. A methodology was developed to allow calibration of the rainfall-runoff hydrological model against rain gauge data from 1960-1972, with the prerequisite that the model should be driven by satellite derived rainfall products for the 1990s onwards. With the rain gauge data, addition of a single rainfall station (Longa) in regions where stations earlier were lacking was more important than the chosen interpolation method. Comparison of satellite and gauge rainfall outside the basin indicated that the satellite overestimates rainfall by 20%. A non-linear correction was derived used by fitting the rainfall frequency characteristics to those of the historical rainfall data. This satellite rainfall dataset was found satisfactory when using the Pitman rainfall-runoff model (Hughes et al., this issue). Intensive monitoring in the region is recommended to increase accuracy of the comprehensive satellite rainfall estimate calibration procedur
    corecore