Search CORE

11,292 research outputs found

High-dimensional regression with noisy and missing data: Provable guarantees with nonconvexity

Author: Loh Po-Ling
Wainwright Martin J.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2011
Field of study

Although the standard formulations of prediction problems involve fully-observed and noiseless data drawn in an i.i.d. manner, many applications involve noisy and/or missing data, possibly involving dependence, as well. We study these issues in the context of high-dimensional sparse linear regression, and propose novel estimators for the cases of noisy, missing and/or dependent data. Many standard approaches to noisy or missing data, such as those using the EM algorithm, lead to optimization problems that are inherently nonconvex, and it is difficult to establish theoretical guarantees on practical algorithms. While our approach also involves optimizing nonconvex programs, we are able to both analyze the statistical error associated with any global optimum, and more surprisingly, to prove that a simple algorithm based on projected gradient descent will converge in polynomial time to a small neighborhood of the set of all global minimizers. On the statistical side, we provide nonasymptotic bounds that hold with high probability for the cases of noisy, missing and/or dependent data. On the computational side, we prove that under the same types of conditions required for statistical consistency, the projected gradient descent algorithm is guaranteed to converge at a geometric rate to a near-global minimizer. We illustrate these theoretical predictions with simulations, showing close agreement with the predicted scalings.Comment: Published in at http://dx.doi.org/10.1214/12-AOS1018 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

ScholarlyCommons@Penn

On Distributed Linear Estimation With Observation Model Uncertainties

Author: Sani Alireza
Vosoughi Azadeh
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/09/2017
Field of study

We consider distributed estimation of a Gaussian source in a heterogenous bandwidth constrained sensor network, where the source is corrupted by independent multiplicative and additive observation noises, with incomplete statistical knowledge of the multiplicative noise. For multi-bit quantizers, we derive the closed-form mean-square-error (MSE) expression for the linear minimum MSE (LMMSE) estimator at the FC. For both error-free and erroneous communication channels, we propose several rate allocation methods named as longest root to leaf path, greedy and integer relaxation to (i) minimize the MSE given a network bandwidth constraint, and (ii) minimize the required network bandwidth given a target MSE. We also derive the Bayesian Cramer-Rao lower bound (CRLB) and compare the MSE performance of our proposed methods against the CRLB. Our results corroborate that, for low power multiplicative observation noises and adequate network bandwidth, the gaps between the MSE of our proposed methods and the CRLB are negligible, while the performance of other methods like individual rate allocation and uniform is not satisfactory

arXiv.org e-Print Archive

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Cosmological constraints from the convergence 1-point probability distribution

Author: Blazek Jonathan
Honscheid Klaus
Huff Eric
Melchior Peter
Patton Kenneth
Ross Ashley J.
Suchyta Eric
Publication venue: 'Oxford University Press (OUP)'
Publication date: 04/11/2016
Field of study

We examine the cosmological information available from the 1-point probability distribution (PDF) of the weak-lensing convergence field, utilizing fast L-PICOLA simulations and a Fisher analysis. We find competitive constraints in the

\Omega_m

\sigma_8

plane from the convergence PDF with

188\ arcmin^2

pixels compared to the cosmic shear power spectrum with an equivalent number of modes (

\ell < 886

). The convergence PDF also partially breaks the degeneracy cosmic shear exhibits in that parameter space. A joint analysis of the convergence PDF and shear 2-point function also reduces the impact of shape measurement systematics, to which the PDF is less susceptible, and improves the total figure of merit by a factor of

2-3

, depending on the level of systematics. Finally, we present a correction factor necessary for calculating the unbiased Fisher information from finite differences using a limited number of cosmological simulations.Comment: 10 pages, 5 figure

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Princeton University Open Access Repository

Crossref

Stochastic nonparametric envelopment of data: Cross-sectional frontier estimation subject to shape constraints

Author: Kortelainen Mika
Kuosmanen Timo
Publication venue: University of Joensuu
Publication date
Field of study

UEF Electronic Publications

Estimation of semiparametric stochastic frontiers under shape constraints with application to pollution generating technologies

Author: Kortelainen Mika
Publication venue: University of Joensuu
Publication date
Field of study

UEF Electronic Publications