Search CORE

21,767 research outputs found

Global Optimization for Value Function Approximation

Author: Petrik Marek
Zilberstein Shlomo
Publication venue
Publication date: 14/06/2010
Field of study

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bilinear programming formulation of value function approximation, which employs global optimization. The formulation provides strong a priori guarantees on both robust and expected policy loss by minimizing specific norms of the Bellman residual. Solving a bilinear program optimally is NP-hard, but this is unavoidable because the Bellman-residual minimization itself is NP-hard. We describe and analyze both optimal and approximate algorithms for solving bilinear programs. The analysis shows that this algorithm offers a convergent generalization of approximate policy iteration. We also briefly analyze the behavior of bilinear programming algorithms under incomplete samples. Finally, we demonstrate that the proposed approach can consistently minimize the Bellman residual on simple benchmark problems

arXiv.org e-Print Archive

ScholarWorks@UMass Amherst

A reducibility method for the weak linear bilevel programming problems and a case study in principal-agent

Author: Lu J
Zhang G
Zhang Z
Zheng Y
Publication venue: 'Elsevier BV'
Publication date: 01/07/2018
Field of study

© 2018 A weak linear bilevel programming (WLBP) problem often models problems involving hierarchy structure in expert and intelligent systems under the pessimistic point. In the paper, we deal with such a problem. Using the duality theory of linear programming, the WLBP problem is first equivalently transformed into a jointly constrained bilinear programming problem. Then, we show that the resolution of the jointly constrained bilinear programming problem is equivalent to the resolution of a disjoint bilinear programming problem under appropriate assumptions. This may give a possibility to solve the WLBP problem via a single-level disjoint bilinear programming problem. Furthermore, some examples illustrate the solution process and feasibility of the proposed method. Finally, the WLBP problem models a principal-agent problem under the pessimistic point that is also compared with a principal-agent problem under the optimistic point

OPUS - University of Technology Sydney

Optimal Rotational Load Shedding via Bilinear Integer Programming

Author: arya
aziz
billinton
constable
fahim
ferguson
hali
holdren
kiani
kwame
mcgreal
Publication venue
Publication date: 01/01/2018
Field of study

This paper addresses the problem of managing rotational load shedding schedules for a power distribution network with multiple load zones. An integer optimization problem is formulated to find the optimal number and duration of planned power outages. Various types of damage costs are proposed to capture the heterogeneous load shedding preferences of different zones. The McCormick relaxation along with an effective procedure feasibility recovery is developed to solve the resulting bilinear integer program, which yields a high-quality suboptimal solution. Extensive simulation results corroborate the merit of the proposed approach, which has a substantial edge over existing load shedding schemes.Comment: 6 pages, 11 figures. To appear at the conference of APSIPA ASC 201

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

A Global Optimization Algorithm for Sum of Linear Ratios Problem

Author: Siqiao Jin
Yuelin Gao
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2013
Field of study

We equivalently transform the sum of linear ratios programming problem into bilinear programming problem, then by using the linear characteristics of convex envelope and concave envelope of double variables product function, linear relaxation programming of the bilinear programming problem is given, which can determine the lower bound of the optimal value of original problem. Therefore, a branch and bound algorithm for solving sum of linear ratios programming problem is put forward, and the convergence of the algorithm is proved. Numerical experiments are reported to show the effectiveness of the proposed algorithm

Crossref

Directory of Open Access Journals

Recommended from our members

Reformulations of mathematical programming problems as linear complementarity problems

Author: Judice JJ
Mitra G
Publication venue: Brunel University
Publication date: 01/01/1982
Field of study

A family of complementarity problems are defined as extensions of the well known Linear Complementarity Problem (LCP). These are (i.) Second Linear Complementarity Problem (SLCP) which is an LCP extended by introducing further equality restrictions and unrestricted variables, (ii.) Minimum Linear Complementarity Problem (MLCP) which is an LCP with additional variables not required to be complementary and with a linear objective function which is to be minimized, (iii.) Second Minimum Linear Complementarity Problem (SMLCP) which is an MLCP but the nonnegative restriction on one of each pair of complementary variables is relaxed so that it is allowed to be unrestricted in value. A number of well known mathematical programming problems, namely quadratic programming (convex, nonconvex, pseudoconvex nonconvex), bilinear programming, game theory, zero-one integer programming, the fixed charge problem, absolute value programming, variable separable programming are reformulated as members of this family of four complementarity problems

Brunel University Research Archive

Identification of Structured LTI MIMO State-Space Models

Author: Basri Ronen
Kovalsky Shahar
Verhaegen Michel
Yu Chengpu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

The identification of structured state-space model has been intensively studied for a long time but still has not been adequately addressed. The main challenge is that the involved estimation problem is a non-convex (or bilinear) optimization problem. This paper is devoted to developing an identification method which aims to find the global optimal solution under mild computational burden. Key to the developed identification algorithm is to transform a bilinear estimation to a rank constrained optimization problem and further a difference of convex programming (DCP) problem. The initial condition for the DCP problem is obtained by solving its convex part of the optimization problem which happens to be a nuclear norm regularized optimization problem. Since the nuclear norm regularized optimization is the closest convex form of the low-rank constrained estimation problem, the obtained initial condition is always of high quality which provides the DCP problem a good starting point. The DCP problem is then solved by the sequential convex programming method. Finally, numerical examples are included to show the effectiveness of the developed identification algorithm.Comment: Accepted to IEEE Conference on Decision and Control (CDC) 201

arXiv.org e-Print Archive

TU Delft Repository

Bilinear Programming and Protein Structure Alignment

Author: Cain J.
Kamenetsky D.
Lavine N.
Publication venue: Rose-Hulman Scholar
Publication date: 02/08/2010
Field of study

Proteins are a primary functional component of organic life, and understanding their function is integral to many areas of research in biochemistry. The three-dimensional structure of a protein largely determines this function. Protein structure alignment compares the structure of a protein with known function to that of a protein with unknown function. A protein’s three-dimensional structure can be transformed through a smooth piecewise-linear sigmoid function to a real symmetric contact matrix that represents the functional significance of certain parts of the protein. We address the protein alignment problem as a minimization of the 2-norm difference of two proteins’ contact matrices. The minimization is presented as a bilinear program, and spectral bounds for best- and worst-case alignments are provided, which are continuous with respect to small changes in the protein’s structure. Further conditions for a perfect alignment and heuristics for finding quality solutions are given

Rose-Hulman Institute of Technology: Rose-Hulman Scholar