Efficient probabilistic inversion of geophysical data

Abstract

Estimation of uncertainties is critical for subsequent decision making in all applications of geosciences such as geological hazard analysis and risk mitigation, management and exploitation of subsurface resources, and environmental waste disposal. More efficient probabilistic inversion methods in geosciences are vital to making rapid and improved predictions of geological hazards and estimation of subsurface resources from geophysical data, and estimation of associated uncertainties. While this thesis focuses on seismic data inversion for the estimation of geological properties, the methods developed may find a wide variety of applications in all fields of research that involve spatial data analysis. New concepts, models and methods are developed to perform more efficient probabilistic inversion by making use of the latest developments in machine learning and Bayesian inverse theory to solve geophysical inverse problems. The major contribution of this thesis is the development of efficient geostatistical inversion methods for approximate inference for structured inverse problems where probabilistic dependence between unknown model parameters may be expressed as a Markov random field (MRF). These methods are many orders of magnitude faster than the corresponding sampling based methods in such types of inverse problems. Further, some of the commonly used but avoidable assumptions in conventional geostatistical inversion methods are progressively relaxed and finally removed in this research. The faster inversion methods allow more complex models to be evaluated for more accurate predictions and improved estimation of uncertainty for given compute power and time. Most existing geostatistical inversion methods are based on the localized likelihoods assumption, whereby the seismic data at a location are assumed to depend on the geology only at that location. Such an assumption is unrealistic because of imperfect seismic data acquisition and processing, and fundamental limitations of seismic imaging methods. It is also assumed in most such previous research that the data are completely free of any correlated noise or errors. Although these requirements are almost never met in reality, existing methods use these assumptions to make solutions computationally tractable. Both of these assumptions are progressively removed in this thesis while still allowing computationally tractable solutions to be found for suitably structured problems. The class of problems considered here spans a broad range of spatial data analysis and geosciences, where geology at a location is assumed to depend directly only on the geology within some pre-specified neighbourhood of that location – the so called Markovian assumption – which is the core assumption across the entire literature of geostatistics and has been proven to be valid for all practical purposes. Exact Bayesian inference is intractable in most models of practical interest because it requires normalization of the posterior distribution by integrating model parameters over a very high dimensional space. Therefore, approximate inference is used in practice. Stochastic sampling (e.g., by using Markov-chain Monte Carlo – McMC) is the most commonly used approximate inference method but is computationally expensive and detection of its convergence is often based on subjective criteria and hence is unreliable. New Bayesian inversion methods are introduced that estimate the spatial distribution of geological properties from attributes of seismic data, by showing how the usual probabilistic inverse problem can be solved using an optimization framework while still providing full probabilistic results – the so called variational inference approach. The intractable posterior distribution is replaced by a tractable approximation in the variational approach. Inference can then be performed using the approximate distribution in an optimization framework, thus circumventing the need for sampling, while still providing probabilistic results. The methods developed in this thesis infer the post-inversion (posterior) probability density of the unknown model parameters from seismic data and geological prior information. These methods are shown to be robust against weak prior information and correlated noise in the data. The methods are computationally efficient, and are expected to be applicable to 3D models of realistic size on modern computers without incurring any significant computational limitations

    Similar works