423 research outputs found
Outlier detection using distributionally robust optimization under the Wasserstein metric
We present a Distributionally Robust Optimization (DRO) approach to outlier detection in a linear regression setting, where the closeness of probability distributions is measured using the Wasserstein metric. Training samples contaminated with outliers skew the regression plane computed by least squares and thus impede outlier detection. Classical approaches, such as robust regression, remedy this problem by downweighting the contribution of atypical data points. In contrast, our Wasserstein DRO approach hedges against a family of distributions that are close to the empirical distribution. We show that the resulting formulation encompasses a class of models, which include the regularized Least Absolute Deviation (LAD) as a special case. We provide new insights into the regularization term and give guidance on the selection of the regularization coefficient from the standpoint of a confidence region. We establish two types of performance guarantees for the solution to our formulation under mild conditions. One is related to its out-of-sample behavior, and the other concerns the discrepancy between the estimated and true regression planes. Extensive numerical results demonstrate the superiority of our approach to both robust regression and the regularized LAD in terms of estimation accuracy and outlier detection rates
Stochastic Optimal Power Flow Based on Data-Driven Distributionally Robust Optimization
We propose a data-driven method to solve a stochastic optimal power flow
(OPF) problem based on limited information about forecast error distributions.
The objective is to determine power schedules for controllable devices in a
power network to balance operation cost and conditional value-at-risk (CVaR) of
device and network constraint violations. These decisions include scheduled
power output adjustments and reserve policies, which specify planned reactions
to forecast errors in order to accommodate fluctuating renewable energy
sources. Instead of assuming the uncertainties across the networks follow
prescribed probability distributions, we assume the distributions are only
observable through a finite training dataset. By utilizing the Wasserstein
metric to quantify differences between the empirical data-based distribution
and the real data-generating distribution, we formulate a distributionally
robust optimization OPF problem to search for power schedules and reserve
policies that are robust to sampling errors inherent in the dataset. A simple
numerical example illustrates inherent tradeoffs between operation cost and
risk of constraint violation, and we show how our proposed method offers a
data-driven framework to balance these objectives
Data-driven linear decision rule approach for distributionally robust optimization of on-line signal control
We propose a two-stage, on-line signal control strategy for dynamic networks using a linear decision rule (LDR) approach and a distributionally robust optimization (DRO) technique. The first (off-line) stage formulates a LDR that maps real-time traffic data to optimal signal control policies. A DRO problem is solved to optimize the on-line performance of the LDR in the presence of uncertainties associated with the observed traffic states and ambiguity in their underlying distribution functions. We employ a data-driven calibration of the uncertainty set, which takes into account historical traffic data. The second (on-line) stage implements a very efficient linear decision rule whose performance is guaranteed by the off-line computation. We test the proposed signal control procedure in a simulation environment that is informed by actual traffic data obtained in Glasgow, and demonstrate its full potential in on-line operation and deployability on realistic networks, as well as its effectiveness in improving traffic
Regret Models and Preprocessing Techniques for Combinatorial Optimization under Uncertainty
Ph.DDOCTOR OF PHILOSOPH
- …