Search CORE

1,537 research outputs found

High-Dimensional Bayesian Geostatistics

Author: Banerjee Sudipto
Publication venue
Publication date: 01/01/2017
Field of study

With the growing capabilities of Geographic Information Systems (GIS) and user-friendly software, statisticians today routinely encounter geographically referenced data containing observations from a large number of spatial locations and time points. Over the last decade, hierarchical spatiotemporal process models have become widely deployed statistical tools for researchers to better understand the complex nature of spatial and temporal variability. However, fitting hierarchical spatiotemporal models often involves expensive matrix computations with complexity increasing in cubic order for the number of spatial locations and temporal points. This renders such models unfeasible for large data sets. This article offers a focused review of two methods for constructing well-defined highly scalable spatiotemporal stochastic processes. Both these processes can be used as "priors" for spatiotemporal random fields. The first approach constructs a low-rank process operating on a lower-dimensional subspace. The second approach constructs a Nearest-Neighbor Gaussian Process (NNGP) that ensures sparse precision matrices for its finite realizations. Both processes can be exploited as a scalable prior embedded within a rich hierarchical modeling framework to deliver full Bayesian inference. These approaches can be described as model-based solutions for big spatiotemporal datasets. The models ensure that the algorithmic complexity has

\sim n

floating point operations (flops), where

n

the number of spatial locations (per iteration). We compare these methods and provide some insight into their methodological underpinnings

arXiv.org e-Print Archive

Ezid

Crossref

eScholarship - University of California

Latent Gaussian modeling and INLA: A review with focus on space-time applications

Author: Opitz Thomas
Publication venue
Publication date: 10/11/2016
Field of study

Bayesian hierarchical models with latent Gaussian layers have proven very flexible in capturing complex stochastic behavior and hierarchical structures in high-dimensional spatial and spatio-temporal data. Whereas simulation-based Bayesian inference through Markov Chain Monte Carlo may be hampered by slow convergence and numerical instabilities, the inferential framework of Integrated Nested Laplace Approximation (INLA) is capable to provide accurate and relatively fast analytical approximations to posterior quantities of interest. It heavily relies on the use of Gauss-Markov dependence structures to avoid the numerical bottleneck of high-dimensional nonsparse matrix computations. With a view towards space-time applications, we here review the principal theoretical concepts, model classes and inference tools within the INLA framework. Important elements to construct space-time models are certain spatial Mat\'ern-like Gauss-Markov random fields, obtained as approximate solutions to a stochastic partial differential equation. Efficient implementation of statistical inference tools for a large variety of models is available through the INLA package of the R software. To showcase the practical use of R-INLA and to illustrate its principal commands and syntax, a comprehensive simulation experiment is presented using simulated non Gaussian space-time count data with a first-order autoregressive dependence structure in time

arXiv.org e-Print Archive

HAL Descartes

Numérisation de Documents Anciens Mathématiques

Bayesian Nonstationary Spatial Modeling for Very Large Datasets

Author: Banerjee
Banerjee
Berliner
Bevilacqua
Calder
Cressie
Cressie
Cressie
Cressie
Cressie
Curriero
Dyk
Eidsvik
Finley
Furrer
Geman
Gilbert
Gneiting
Gneiting
Green
Guhaniyogi
Haario
Hastings
Henderson
Higdon
Holmes
Holmes
Kang
Kang
Kanter
Kass
Katzfuss
Katzfuss
Kaufman
Knuth
Lemos
Lindgren
Lindsay
Lopes
Mardia
Metropolis
Paciorek
Pracilio
Sang
Sang
Shaby
Sherman
Shi
Stein
Stein
Taylor
Viscarra Rossel
Wikle
Wikle
Wikle
Xu
Publication venue: 'Wiley'
Publication date: 21/12/2012
Field of study

With the proliferation of modern high-resolution measuring instruments mounted on satellites, planes, ground-based vehicles and monitoring stations, a need has arisen for statistical methods suitable for the analysis of large spatial datasets observed on large spatial domains. Statistical analyses of such datasets provide two main challenges: First, traditional spatial-statistical techniques are often unable to handle large numbers of observations in a computationally feasible way. Second, for large and heterogeneous spatial domains, it is often not appropriate to assume that a process of interest is stationary over the entire domain. We address the first challenge by using a model combining a low-rank component, which allows for flexible modeling of medium-to-long-range dependence via a set of spatial basis functions, with a tapered remainder component, which allows for modeling of local dependence using a compactly supported covariance function. Addressing the second challenge, we propose two extensions to this model that result in increased flexibility: First, the model is parameterized based on a nonstationary Matern covariance, where the parameters vary smoothly across space. Second, in our fully Bayesian model, all components and parameters are considered random, including the number, locations, and shapes of the basis functions used in the low-rank component. Using simulated data and a real-world dataset of high-resolution soil measurements, we show that both extensions can result in substantial improvements over the current state-of-the-art.Comment: 16 pages, 2 color figure

arXiv.org e-Print Archive

Crossref