3,780 research outputs found
Data-driven Distributionally Robust Optimization Using the Wasserstein Metric: Performance Guarantees and Tractable Reformulations
We consider stochastic programs where the distribution of the uncertain
parameters is only observable through a finite training dataset. Using the
Wasserstein metric, we construct a ball in the space of (multivariate and
non-discrete) probability distributions centered at the uniform distribution on
the training samples, and we seek decisions that perform best in view of the
worst-case distribution within this Wasserstein ball. The state-of-the-art
methods for solving the resulting distributionally robust optimization problems
rely on global optimization techniques, which quickly become computationally
excruciating. In this paper we demonstrate that, under mild assumptions, the
distributionally robust optimization problems over Wasserstein balls can in
fact be reformulated as finite convex programs---in many interesting cases even
as tractable linear programs. Leveraging recent measure concentration results,
we also show that their solutions enjoy powerful finite-sample performance
guarantees. Our theoretical results are exemplified in mean-risk portfolio
optimization as well as uncertainty quantification.Comment: 42 pages, 10 figure
A Practical Guide to Robust Optimization
Robust optimization is a young and active research field that has been mainly
developed in the last 15 years. Robust optimization is very useful for
practice, since it is tailored to the information at hand, and it leads to
computationally tractable formulations. It is therefore remarkable that
real-life applications of robust optimization are still lagging behind; there
is much more potential for real-life applications than has been exploited
hitherto. The aim of this paper is to help practitioners to understand robust
optimization and to successfully apply it in practice. We provide a brief
introduction to robust optimization, and also describe important do's and
don'ts for using it in practice. We use many small examples to illustrate our
discussions
Towards Machine Wald
The past century has seen a steady increase in the need of estimating and
predicting complex systems and making (possibly critical) decisions with
limited information. Although computers have made possible the numerical
evaluation of sophisticated statistical models, these models are still designed
\emph{by humans} because there is currently no known recipe or algorithm for
dividing the design of a statistical model into a sequence of arithmetic
operations. Indeed enabling computers to \emph{think} as \emph{humans} have the
ability to do when faced with uncertainty is challenging in several major ways:
(1) Finding optimal statistical models remains to be formulated as a well posed
problem when information on the system of interest is incomplete and comes in
the form of a complex combination of sample data, partial knowledge of
constitutive relations and a limited description of the distribution of input
random variables. (2) The space of admissible scenarios along with the space of
relevant information, assumptions, and/or beliefs, tend to be infinite
dimensional, whereas calculus on a computer is necessarily discrete and finite.
With this purpose, this paper explores the foundations of a rigorous framework
for the scientific computation of optimal statistical estimators/models and
reviews their connections with Decision Theory, Machine Learning, Bayesian
Inference, Stochastic Optimization, Robust Optimization, Optimal Uncertainty
Quantification and Information Based Complexity.Comment: 37 page
Tropical polyhedra are equivalent to mean payoff games
We show that several decision problems originating from max-plus or tropical
convexity are equivalent to zero-sum two player game problems. In particular,
we set up an equivalence between the external representation of tropical convex
sets and zero-sum stochastic games, in which tropical polyhedra correspond to
deterministic games with finite action spaces. Then, we show that the winning
initial positions can be determined from the associated tropical polyhedron. We
obtain as a corollary a game theoretical proof of the fact that the tropical
rank of a matrix, defined as the maximal size of a submatrix for which the
optimal assignment problem has a unique solution, coincides with the maximal
number of rows (or columns) of the matrix which are linearly independent in the
tropical sense. Our proofs rely on techniques from non-linear Perron-Frobenius
theory.Comment: 28 pages, 5 figures; v2: updated references, added background
materials and illustrations; v3: minor improvements, references update
Semi-supervised Learning based on Distributionally Robust Optimization
We propose a novel method for semi-supervised learning (SSL) based on
data-driven distributionally robust optimization (DRO) using optimal transport
metrics. Our proposed method enhances generalization error by using the
unlabeled data to restrict the support of the worst case distribution in our
DRO formulation. We enable the implementation of our DRO formulation by
proposing a stochastic gradient descent algorithm which allows to easily
implement the training procedure. We demonstrate that our Semi-supervised DRO
method is able to improve the generalization error over natural supervised
procedures and state-of-the-art SSL estimators. Finally, we include a
discussion on the large sample behavior of the optimal uncertainty region in
the DRO formulation. Our discussion exposes important aspects such as the role
of dimension reduction in SSL
SOME GUIDING PRINCIPLES FOR EMPIRICAL PRODUCTION RESEARCH IN AGRICULTURE
Constraints on production economic research are examined in three dimensions: problem focus, methodology, and data availability. Data availability has played a large role in the choice of problem focus and explains some misdirected focus. A proposal is made to address the data availability constraint. The greatest self-imposed constraints are methodological. Production economics has focused on flexible representations of technology at the expense of specificity in preferences. Yet some of the major problems faced by decision makers relate to long-term problems, e.g., the commodity boom and ensuring debt crisis of the 1970s and 1980s where standard short-term profit maximization models are unlikely to capture the essence of decision maker concerns.Production Economics,
Matrices of forests, analysis of networks, and ranking problems
The matrices of spanning rooted forests are studied as a tool for analysing
the structure of networks and measuring their properties. The problems of
revealing the basic bicomponents, measuring vertex proximity, and ranking from
preference relations / sports competitions are considered. It is shown that the
vertex accessibility measure based on spanning forests has a number of
desirable properties. An interpretation for the stochastic matrix of
out-forests in terms of information dissemination is given.Comment: 8 pages. This article draws heavily from arXiv:math/0508171.
Published in Proceedings of the First International Conference on Information
Technology and Quantitative Management (ITQM 2013). This version contains
some corrections and addition
- âŠ