5,265 research outputs found
Asymptotic Analysis of MAP Estimation via the Replica Method and Applications to Compressed Sensing
The replica method is a non-rigorous but well-known technique from
statistical physics used in the asymptotic analysis of large, random, nonlinear
problems. This paper applies the replica method, under the assumption of
replica symmetry, to study estimators that are maximum a posteriori (MAP) under
a postulated prior distribution. It is shown that with random linear
measurements and Gaussian noise, the replica-symmetric prediction of the
asymptotic behavior of the postulated MAP estimate of an n-dimensional vector
"decouples" as n scalar postulated MAP estimators. The result is based on
applying a hardening argument to the replica analysis of postulated posterior
mean estimators of Tanaka and of Guo and Verdu.
The replica-symmetric postulated MAP analysis can be readily applied to many
estimators used in compressed sensing, including basis pursuit, lasso, linear
estimation with thresholding, and zero norm-regularized estimation. In the case
of lasso estimation the scalar estimator reduces to a soft-thresholding
operator, and for zero norm-regularized estimation it reduces to a
hard-threshold. Among other benefits, the replica method provides a
computationally-tractable method for precisely predicting various performance
metrics including mean-squared error and sparsity pattern recovery probability.Comment: 22 pages; added details on the replica symmetry assumptio
Pinsker estimators for local helioseismology
A major goal of helioseismology is the three-dimensional reconstruction of
the three velocity components of convective flows in the solar interior from
sets of wave travel-time measurements. For small amplitude flows, the forward
problem is described in good approximation by a large system of convolution
equations. The input observations are highly noisy random vectors with a known
dense covariance matrix. This leads to a large statistical linear inverse
problem.
Whereas for deterministic linear inverse problems several computationally
efficient minimax optimal regularization methods exist, only one
minimax-optimal linear estimator exists for statistical linear inverse
problems: the Pinsker estimator. However, it is often computationally
inefficient because it requires a singular value decomposition of the forward
operator or it is not applicable because of an unknown noise covariance matrix,
so it is rarely used for real-world problems. These limitations do not apply in
helioseismology. We present a simplified proof of the optimality properties of
the Pinsker estimator and show that it yields significantly better
reconstructions than traditional inversion methods used in helioseismology,
i.e.\ Regularized Least Squares (Tikhonov regularization) and SOLA (approximate
inverse) methods.
Moreover, we discuss the incorporation of the mass conservation constraint in
the Pinsker scheme using staggered grids. With this improvement we can
reconstruct not only horizontal, but also vertical velocity components that are
much smaller in amplitude
Joint state-parameter estimation of a nonlinear stochastic energy balance model from sparse noisy data
While nonlinear stochastic partial differential equations arise naturally in
spatiotemporal modeling, inference for such systems often faces two major
challenges: sparse noisy data and ill-posedness of the inverse problem of
parameter estimation. To overcome the challenges, we introduce a strongly
regularized posterior by normalizing the likelihood and by imposing physical
constraints through priors of the parameters and states. We investigate joint
parameter-state estimation by the regularized posterior in a physically
motivated nonlinear stochastic energy balance model (SEBM) for paleoclimate
reconstruction. The high-dimensional posterior is sampled by a particle Gibbs
sampler that combines MCMC with an optimal particle filter exploiting the
structure of the SEBM. In tests using either Gaussian or uniform priors based
on the physical range of parameters, the regularized posteriors overcome the
ill-posedness and lead to samples within physical ranges, quantifying the
uncertainty in estimation. Due to the ill-posedness and the regularization, the
posterior of parameters presents a relatively large uncertainty, and
consequently, the maximum of the posterior, which is the minimizer in a
variational approach, can have a large variation. In contrast, the posterior of
states generally concentrates near the truth, substantially filtering out
observation noise and reducing uncertainty in the unconstrained SEBM
Analysis of overfitting in the regularized Cox model
The Cox proportional hazards model is ubiquitous in the analysis of
time-to-event data. However, when the data dimension p is comparable to the
sample size , maximum likelihood estimates for its regression parameters are
known to be biased or break down entirely due to overfitting. This prompted the
introduction of the so-called regularized Cox model. In this paper we use the
replica method from statistical physics to investigate the relationship between
the true and inferred regression parameters in regularized multivariate Cox
regression with L2 regularization, in the regime where both p and N are large
but with p/N ~ O(1). We thereby generalize a recent study from maximum
likelihood to maximum a posteriori inference. We also establish a relationship
between the optimal regularization parameter and p/N, allowing for
straightforward overfitting corrections in time-to-event analysis
- …