
    Outlier Detection Using Nonconvex Penalized Regression

    This paper studies the outlier detection problem from the point of view of penalized regressions. Our regression model adds one mean shift parameter for each of the n data points. We then apply a regularization favoring a sparse vector of mean shift parameters. The usual L_1 penalty yields a convex criterion, but we find that it fails to deliver a robust estimator. The L_1 penalty corresponds to soft thresholding. We introduce a thresholding (denoted by Θ) based iterative procedure for outlier detection (Θ-IPOD). A version based on hard thresholding correctly identifies outliers on some hard test problems. We find that Θ-IPOD is much faster than iteratively reweighted least squares for large data because each iteration costs at most O(np) (and sometimes much less), avoiding an O(np^2) least squares estimate. We describe the connection between Θ-IPOD and M-estimators. Our proposed method has one tuning parameter with which to both identify outliers and estimate regression coefficients. A data-dependent choice can be made based on BIC. The tuned Θ-IPOD shows outstanding performance in identifying outliers in various situations in comparison to other existing approaches. This methodology extends to high-dimensional modeling with p ≫ n, if both the coefficient vector and the outlier pattern are sparse.
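
    The abstract describes an alternating scheme: fit the regression with the current outlier estimates removed, then hard-threshold the residuals to update the mean-shift vector. Below is a minimal numpy sketch of that idea, assuming a fixed threshold lam; the function name ipod_hard, the QR-based projection, and the stopping rule are illustrative choices, and the paper's BIC-based choice of the tuning parameter is omitted.

        import numpy as np

        def ipod_hard(X, y, lam, max_iter=100, tol=1e-8):
            # Sketch of a Theta-IPOD-style iteration with hard thresholding.
            n, p = X.shape
            Q, _ = np.linalg.qr(X)      # one-time O(n p^2) factorization
            gamma = np.zeros(n)         # one mean-shift parameter per observation
            for _ in range(max_iter):
                fitted = Q @ (Q.T @ (y - gamma))  # projection onto col(X): O(n p) per step
                resid = y - fitted
                gamma_new = np.where(np.abs(resid) > lam, resid, 0.0)  # hard threshold
                if np.max(np.abs(gamma_new - gamma)) < tol:
                    gamma = gamma_new
                    break
                gamma = gamma_new
            beta, *_ = np.linalg.lstsq(X, y - gamma, rcond=None)
            return beta, gamma, np.flatnonzero(gamma)  # flagged outlier indices

    Nonzero entries of gamma mark the observations flagged as outliers; replacing the hard threshold with a soft threshold in the same loop recovers the L_1-penalized (non-robust) estimator that the abstract warns about.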

    Robust Subspace Learning: Robust PCA, Robust Subspace Tracking, and Robust Subspace Recovery

    PCA is one of the most widely used dimension reduction techniques. A related, easier problem is "subspace learning" or "subspace estimation". Given relatively clean data, both are easily solved via singular value decomposition (SVD). The problem of subspace learning or PCA in the presence of outliers is called robust subspace learning or robust PCA (RPCA). For long data sequences, if one tries to use a single lower-dimensional subspace to represent the data, the required subspace dimension may end up being quite large. For such data, a better model is to assume that the data lie in a low-dimensional subspace that can change over time, albeit gradually. The problem of tracking such data (and the subspaces) while being robust to outliers is called robust subspace tracking (RST). This article provides a magazine-style overview of the entire field of robust subspace learning and tracking. In particular, solutions for three problems are discussed in detail: RPCA via sparse+low-rank matrix decomposition (S+LR), RST via S+LR, and "robust subspace recovery (RSR)". RSR assumes that an entire data vector is either an outlier or an inlier. The S+LR formulation instead assumes that outliers occur on only a few data vector indices and hence are well modeled as sparse corruptions.
    Comment: To appear, IEEE Signal Processing Magazine, July 201
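
    For the S+LR formulation mentioned above, one standard solver is principal component pursuit (PCP) via an inexact augmented Lagrangian method, which alternates singular value thresholding for the low-rank part with entrywise soft thresholding for the sparse part. The sketch below is a generic textbook version, not any specific algorithm from the article; the weight lam = 1/sqrt(max(m, n)) is the usual PCP default, and the mu heuristic and function names are assumptions.

        import numpy as np

        def soft(X, tau):
            # Entrywise soft thresholding.
            return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)

        def svt(X, tau):
            # Singular value thresholding: soft-threshold the singular values of X.
            U, s, Vt = np.linalg.svd(X, full_matrices=False)
            return (U * np.maximum(s - tau, 0.0)) @ Vt

        def rpca_pcp(M, lam=None, max_iter=500, tol=1e-7):
            # Split M into L (low rank) + S (sparse) by inexact ALM on the PCP objective.
            m, n = M.shape
            if lam is None:
                lam = 1.0 / np.sqrt(max(m, n))            # standard PCP weight
            mu = m * n / (4.0 * np.abs(M).sum() + 1e-12)  # heuristic penalty parameter
            S = np.zeros_like(M)
            Y = np.zeros_like(M)                          # dual variable for M = L + S
            norm_M = np.linalg.norm(M, 'fro')
            for _ in range(max_iter):
                L = svt(M - S + Y / mu, 1.0 / mu)         # low-rank update
                S = soft(M - L + Y / mu, lam / mu)        # sparse-corruption update
                R = M - L - S                             # constraint residual
                Y = Y + mu * R                            # dual ascent
                if np.linalg.norm(R, 'fro') <= tol * norm_M:
                    break
            return L, S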

    Computational Methods for Sparse Solution of Linear Inverse Problems

    The goal of the sparse approximation problem is to approximate a target signal using a linear combination of a few elementary signals drawn from a fixed collection. This paper surveys the major practical algorithms for sparse approximation. Specific attention is paid to computational issues, to the circumstances in which individual methods tend to perform well, and to the theoretical guarantees available. Many fundamental questions in electrical engineering, statistics, and applied mathematics can be posed as sparse approximation problems, making these algorithms versatile and relevant to a plethora of applications.
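
    Orthogonal Matching Pursuit (OMP) is one of the major greedy algorithms in this family, and it makes the "few elementary signals from a fixed collection" idea concrete: repeatedly pick the dictionary atom most correlated with the residual, then re-fit by least squares on the selected atoms. A minimal numpy sketch, assuming unit-norm dictionary columns and a known sparsity level k:

        import numpy as np

        def omp(D, y, k):
            # Greedily select k columns (atoms) of D to approximate y.
            residual = y.copy()
            support = []
            coef = np.zeros(0)
            for _ in range(k):
                j = int(np.argmax(np.abs(D.T @ residual)))  # most correlated atom
                if j not in support:
                    support.append(j)
                # Re-fit coefficients by least squares on the chosen atoms.
                coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
                residual = y - D[:, support] @ coef
            x = np.zeros(D.shape[1])
            x[support] = coef
            return x, support    # sparse coefficient vector and its support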