
    Optimistic Robust Optimization With Applications To Machine Learning

    Robust Optimization has traditionally taken a pessimistic, or worst-case, view of uncertainty, motivated by a desire to find sets of optimal policies that maintain feasibility under a variety of operating conditions. In this paper, we explore an optimistic, or best-case, view of uncertainty and show that it can be a fruitful approach. We show that these techniques can be used to address a wide variety of problems. First, we apply our methods in the context of robust linear programming, providing a method for reducing conservatism in intuitive ways that encode economically realistic modeling assumptions. Second, we look at problems in machine learning and find that this approach is strongly connected to the existing literature. Specifically, we provide a new interpretation for popular sparsity-inducing non-convex regularization schemes. Additionally, we show that successful approaches for dealing with outliers and noise can be interpreted as optimistic robust optimization problems. Although many of the problems resulting from our approach are non-convex, we find that DCA (difference-of-convex algorithm) or DCA-like optimization approaches can be intuitive and efficient.
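
    The non-convex problems mentioned above are reported to be handled well by DCA-like methods. As a minimal illustrative sketch (not the authors' implementation), the following applies a generic DCA loop to least squares with a capped-L1 penalty, one sparsity-inducing non-convex regularizer that admits a difference-of-convex splitting; all parameter names and values are assumptions.

```python
import numpy as np

def soft_threshold(v, t):
    # Proximal operator of t * ||.||_1
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def dca_capped_l1(A, b, lam=0.1, theta=1.0, outer_iters=20, inner_iters=200):
    """DCA sketch for  min_x ||Ax - b||^2 + lam * sum_i min(|x_i|, theta).

    The capped-L1 penalty is split as a difference of convex functions,
        min(|x_i|, theta) = |x_i| - max(|x_i| - theta, 0),
    so each outer step linearizes the concave part and solves a
    lasso-like convex subproblem (here with plain ISTA).
    """
    n = A.shape[1]
    x = np.zeros(n)
    L = 2 * np.linalg.norm(A, 2) ** 2 + 1e-12   # Lipschitz constant of grad ||Ax-b||^2
    for _ in range(outer_iters):
        # Subgradient of the concave part h(x) = lam * sum(max(|x_i| - theta, 0))
        y = lam * np.sign(x) * (np.abs(x) > theta)
        # Convex subproblem: min ||Ax-b||^2 + lam*||x||_1 - y^T x
        for _ in range(inner_iters):
            grad = 2 * A.T @ (A @ x - b) - y
            x = soft_threshold(x - grad / L, lam / L)
    return x
```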

    Regularizing Portfolio Optimization

    The optimization of large portfolios displays an inherent instability to estimation error. This poses a fundamental problem, because solutions that are not stable under sample fluctuations may look optimal for a given sample but are, in effect, very far from optimal with respect to the average risk. In this paper, we approach the problem from the point of view of statistical learning theory. The occurrence of the instability is intimately related to over-fitting, which can be avoided using known regularization methods. We show how regularized portfolio optimization with the expected shortfall as a risk measure is related to support vector regression. The budget constraint dictates a modification. We present the resulting optimization problem and discuss the solution. The L2 norm of the weight vector is used as a regularizer, which corresponds to a diversification "pressure". This means that diversification, besides counteracting downward fluctuations in some assets by upward fluctuations in others, is also crucial because it improves the stability of the solution. The approach we provide here allows for the simultaneous treatment of optimization and diversification in one framework, enabling the investor to trade off between the two depending on the size of the available data set.
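
    As an illustration of the kind of problem described above, the following sketch minimizes sampled expected shortfall in the standard Rockafellar-Uryasev form with an L2 penalty on the weights and a budget constraint; the hyperparameters and the use of cvxpy are assumptions, not the paper's setup.

```python
import cvxpy as cp
import numpy as np

def regularized_es_portfolio(returns, alpha=0.95, lam=0.1):
    """Expected-shortfall minimization with an L2 regularizer (sketch).

    `returns` is an (N, d) array of sampled asset returns.  The L2 term plays
    the role of the diversification "pressure" discussed in the abstract.
    """
    N, d = returns.shape
    w = cp.Variable(d)          # portfolio weights
    t = cp.Variable()           # VaR-like auxiliary variable
    losses = -returns @ w       # per-sample portfolio losses
    es = t + cp.sum(cp.pos(losses - t)) / ((1 - alpha) * N)
    objective = cp.Minimize(es + lam * cp.sum_squares(w))
    constraints = [cp.sum(w) == 1]          # budget constraint
    cp.Problem(objective, constraints).solve()
    return w.value
```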

    Computing Optimal Designs of Multiresponse Experiments Reduces to Second-Order Cone Programming

    Elfving's Theorem is a major result in the theory of optimal experimental design, which gives a geometrical characterization of c-optimality. In this paper, we extend this theorem to the case of multiresponse experiments, and we show that when the number of experiments is finite, c-, A-, T- and D-optimal designs of multiresponse experiments can be computed by Second-Order Cone Programming (SOCP). Moreover, our SOCP approach can deal with design problems in which the variable is subject to several linear constraints. We give two proofs of this generalization of Elfving's theorem. One is based on Lagrangian dualization techniques and relies on the fact that the semidefinite programming (SDP) formulation of the multiresponse c-optimal design always has a solution which is a matrix of rank 1. Therefore, the complexity of this problem fades. We also investigate a model robust generalization of c-optimality, for which an Elfving-type theorem was established by Dette (1993). We show with the same Lagrangian approach that these model robust designs can be computed efficiently by minimizing a geometric mean under some norm constraints. Moreover, we show that the optimality conditions of this geometric programming problem yield an extension of Dette's theorem to the case of multiresponse experiments. When the number of unknown parameters is small, or when the number of linear functions of the parameters to be estimated is small, we show by numerical examples that our approach can be between 10 and 1000 times faster than the classic, state-of-the-art algorithms.
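
    A minimal sketch of the SOCP idea, assuming an Elfving-type reformulation in which c-optimal weights are recovered from a sum-of-norms problem; the matrices, solver, and recovery rule below are illustrative rather than the paper's implementation.

```python
import cvxpy as cp
import numpy as np

def c_optimal_weights(A_list, c):
    """Sketch of an SOCP reformulation of multiresponse c-optimal design.

    Each A_i is the (p x k_i) observation matrix of experiment i and c is the
    vector of interest.  We solve
        min  sum_i ||z_i||_2   s.t.   sum_i A_i z_i = c,
    and read design weights proportional to ||z_i||_2 from the solution.
    """
    zs = [cp.Variable(A.shape[1]) for A in A_list]
    objective = cp.Minimize(sum(cp.norm(z, 2) for z in zs))
    constraints = [sum(A @ z for A, z in zip(A_list, zs)) == c]
    cp.Problem(objective, constraints).solve()
    vals = np.array([np.linalg.norm(z.value) for z in zs])
    return vals / vals.sum()   # normalized design weights
```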

    Voxel selection in fMRI data analysis based on sparse representation

    Multivariate pattern analysis approaches to the detection of brain regions from fMRI data have been gaining attention recently. In this study, we introduce an iterative sparse-representation-based algorithm for detecting voxels in functional MRI (fMRI) data that carry task-relevant information. In each iteration of the algorithm, a linear programming problem is solved and a sparse weight vector is subsequently obtained. The final weight vector is the mean of those obtained in all iterations. The characteristics of our algorithm are as follows: 1) the weight vector (output) is sparse; 2) the magnitude of each entry of the weight vector represents the significance of its corresponding variable or feature in a classification or regression problem; and 3) due to the convergence of this algorithm, a stable weight vector is obtained. To demonstrate the validity of our algorithm and illustrate its application, we apply it to the Pittsburgh Brain Activity Interpretation Competition 2007 fMRI dataset to select the voxels most relevant to the subjects' tasks. Based on this dataset, the aforementioned characteristics of our algorithm are analyzed, and a comparison is made between our method and univariate general-linear-model-based statistical parametric mapping. Using our method, a combination of voxels is selected based on the principle of effective/sparse representation of a task. The data analysis results in this paper show that this combination of voxels is suitable for decoding tasks and demonstrate the effectiveness of our method.
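
    The abstract does not spell out the per-iteration linear program, so the sketch below assumes an L1-minimizing margin LP on random subsamples and averages the resulting sparse weight vectors; the subsampling scheme, the margin formulation, and all parameters are assumptions rather than the paper's exact algorithm.

```python
import numpy as np
from scipy.optimize import linprog

def sparse_lp_weights(X, y):
    """One L1-minimizing LP:  min ||w||_1  s.t.  y_i (x_i . w + b) >= 1.

    Variables are stacked as [w (d), b (1), u (d)] with u >= |w|.
    Assumes the (sub)sample is linearly separable; otherwise the LP is infeasible.
    """
    n, d = X.shape
    c = np.concatenate([np.zeros(d + 1), np.ones(d)])
    I, Z = np.eye(d), np.zeros((d, 1))
    # |w_i| <= u_i encoded as  w - u <= 0  and  -w - u <= 0
    A_abs = np.vstack([np.hstack([I, Z, -I]), np.hstack([-I, Z, -I])])
    # margin constraints  -y_i (x_i . w + b) <= -1
    A_margin = np.hstack([-y[:, None] * X, -y[:, None], np.zeros((n, d))])
    A_ub = np.vstack([A_abs, A_margin])
    b_ub = np.concatenate([np.zeros(2 * d), -np.ones(n)])
    bounds = [(None, None)] * (d + 1) + [(0, None)] * d
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    return res.x[:d] if res.success else None

def iterative_voxel_selection(X, y, n_iters=50, frac=0.8, seed=0):
    """Average sparse weight vectors over random subsamples (iterate-then-average)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    ws = []
    for _ in range(n_iters):
        idx = rng.choice(n, size=int(frac * n), replace=False)
        w = sparse_lp_weights(X[idx], y[idx])
        if w is not None:
            ws.append(w)
    return np.mean(ws, axis=0)   # entry magnitudes rank voxel relevance
```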

    Sparse kernel density estimation technique based on zero-norm constraint

    A sparse kernel density estimator is derived based on the zero-norm constraint, in which the zero-norm of the kernel weights is incorporated to enhance model sparsity. The classical Parzen window estimate is adopted as the desired response for density estimation, and an approximate function of the zero-norm is used to achieve mathematical tractability and algorithmic efficiency. Under the mild condition of a positive definite design matrix, the kernel weights of the proposed density estimator based on the zero-norm approximation can be obtained using the multiplicative nonnegative quadratic programming algorithm. Using the D-optimality based selection algorithm as a preprocessing step to select a small significant subset design matrix, the proposed zero-norm based approach offers an effective means for constructing very sparse kernel density estimates with excellent generalisation performance.
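
    As a rough sketch of the multiplicative nonnegative quadratic programming step, the following fits kernel weights to a Parzen-window target under nonnegativity and unit-sum constraints; the simplified update with renormalization is an assumption, and the zero-norm reweighting itself is omitted here.

```python
import numpy as np

def gaussian_kernel(X, Y, h):
    """Gaussian kernel matrix between rows of X and Y with bandwidth h."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * h ** 2)) / ((2 * np.pi * h ** 2) ** (X.shape[1] / 2))

def mnqp_kde_weights(X, h=0.5, n_iters=200, eps=1e-12):
    """Fit kernel weights to the Parzen-window target by a multiplicative update.

    We minimize 0.5 * w^T B w - c^T w with B = K^T K and c = K^T d, where d is
    the Parzen estimate at the training points, subject to w >= 0, sum(w) = 1.
    """
    K = gaussian_kernel(X, X, h)
    d = K.mean(axis=1)             # Parzen-window density at the training points
    B, c = K.T @ K, K.T @ d        # both have nonnegative entries
    n = X.shape[0]
    w = np.full(n, 1.0 / n)
    for _ in range(n_iters):
        w = w * c / (B @ w + eps)  # multiplicative nonnegative update
        w /= w.sum()               # enforce the unit-sum constraint
    return w
```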

    Structured Sparsity: Discrete and Convex Approaches

    Compressive sensing (CS) exploits sparsity to recover sparse or compressible signals from dimensionality-reducing, non-adaptive sensing mechanisms. Sparsity is also used to enhance interpretability in machine learning and statistics applications: while the ambient dimension is vast in modern data analysis problems, the relevant information therein typically resides in a much lower dimensional space. However, many solutions proposed nowadays do not leverage the true underlying structure. Recent results in CS extend the simple sparsity idea to more sophisticated structured sparsity models, which describe the interdependency between the nonzero components of a signal, allowing us to increase the interpretability of the results and to obtain better recovery performance. In order to better understand the impact of structured sparsity, in this chapter we analyze the connections between the discrete models and their convex relaxations, highlighting their relative advantages. We start with the general group sparse model and then elaborate on two important special cases: the dispersive and the hierarchical models. For each, we present the models in their discrete nature, discuss how to solve the ensuing discrete problems, and then describe convex relaxations. We also consider more general structures as defined by set functions and present their convex proxies. Further, we discuss efficient optimization solutions for structured sparsity problems and illustrate structured sparsity in action via three applications.
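
    To make the convex relaxation of the group sparse model concrete, here is a minimal proximal-gradient sketch for the non-overlapping group lasso; the grouping, step size, and regularization level are illustrative assumptions, not the chapter's specific algorithms.

```python
import numpy as np

def group_soft_threshold(x, groups, t):
    """Proximal operator of the group-lasso penalty t * sum_g ||x_g||_2.

    `groups` is a list of index arrays partitioning the coordinates; block
    soft-thresholding is the convex counterpart of "keep at most k groups".
    """
    out = x.copy()
    for g in groups:
        norm = np.linalg.norm(x[g])
        out[g] = 0.0 if norm <= t else (1 - t / norm) * x[g]
    return out

def group_lasso_ista(A, b, groups, lam=0.1, n_iters=300):
    """Proximal gradient for  min 0.5 * ||Ax - b||^2 + lam * sum_g ||x_g||_2."""
    x = np.zeros(A.shape[1])
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the smooth part
    for _ in range(n_iters):
        grad = A.T @ (A @ x - b)
        x = group_soft_threshold(x - grad / L, groups, lam / L)
    return x
```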