21,767 research outputs found
Global Optimization for Value Function Approximation
Existing value function approximation methods have been successfully used in
many applications, but they often lack useful a priori error bounds. We propose
a new approximate bilinear programming formulation of value function
approximation, which employs global optimization. The formulation provides
strong a priori guarantees on both robust and expected policy loss by
minimizing specific norms of the Bellman residual. Solving a bilinear program
optimally is NP-hard, but this is unavoidable because the Bellman-residual
minimization itself is NP-hard. We describe and analyze both optimal and
approximate algorithms for solving bilinear programs. The analysis shows that
this algorithm offers a convergent generalization of approximate policy
iteration. We also briefly analyze the behavior of bilinear programming
algorithms under incomplete samples. Finally, we demonstrate that the proposed
approach can consistently minimize the Bellman residual on simple benchmark
problems
A reducibility method for the weak linear bilevel programming problems and a case study in principal-agent
© 2018 A weak linear bilevel programming (WLBP) problem often models problems involving hierarchy structure in expert and intelligent systems under the pessimistic point. In the paper, we deal with such a problem. Using the duality theory of linear programming, the WLBP problem is first equivalently transformed into a jointly constrained bilinear programming problem. Then, we show that the resolution of the jointly constrained bilinear programming problem is equivalent to the resolution of a disjoint bilinear programming problem under appropriate assumptions. This may give a possibility to solve the WLBP problem via a single-level disjoint bilinear programming problem. Furthermore, some examples illustrate the solution process and feasibility of the proposed method. Finally, the WLBP problem models a principal-agent problem under the pessimistic point that is also compared with a principal-agent problem under the optimistic point
Optimal Rotational Load Shedding via Bilinear Integer Programming
This paper addresses the problem of managing rotational load shedding
schedules for a power distribution network with multiple load zones. An integer
optimization problem is formulated to find the optimal number and duration of
planned power outages. Various types of damage costs are proposed to capture
the heterogeneous load shedding preferences of different zones. The McCormick
relaxation along with an effective procedure feasibility recovery is developed
to solve the resulting bilinear integer program, which yields a high-quality
suboptimal solution. Extensive simulation results corroborate the merit of the
proposed approach, which has a substantial edge over existing load shedding
schemes.Comment: 6 pages, 11 figures. To appear at the conference of APSIPA ASC 201
A Global Optimization Algorithm for Sum of Linear Ratios Problem
We equivalently transform the sum of linear ratios programming problem into bilinear programming problem, then by using the linear characteristics of convex envelope and concave envelope of double variables product function, linear relaxation programming of the bilinear programming problem is given, which can determine the lower bound of the optimal value of original problem. Therefore, a branch and bound algorithm for solving sum of linear ratios programming problem is put forward, and the convergence of the algorithm is proved. Numerical experiments are reported to show the effectiveness of the proposed algorithm
Recommended from our members
Reformulations of mathematical programming problems as linear complementarity problems
A family of complementarity problems are defined as extensions of the well known Linear Complementarity Problem (LCP). These are
(i.) Second Linear Complementarity Problem (SLCP) which is an LCP extended by introducing further equality restrictions and unrestricted variables,
(ii.) Minimum Linear Complementarity Problem (MLCP) which is an
LCP with additional variables not required to be complementary and with a linear objective function which is to be minimized,
(iii.) Second Minimum Linear Complementarity Problem (SMLCP) which is an MLCP but the nonnegative restriction on one of each pair of complementary variables is relaxed so that it is allowed to be unrestricted in value.
A number of well known mathematical programming problems, namely quadratic programming (convex, nonconvex, pseudoconvex nonconvex), bilinear programming, game theory, zero-one integer programming, the fixed charge problem, absolute value programming, variable separable programming are reformulated as members of this family of four complementarity problems
Identification of Structured LTI MIMO State-Space Models
The identification of structured state-space model has been intensively
studied for a long time but still has not been adequately addressed. The main
challenge is that the involved estimation problem is a non-convex (or bilinear)
optimization problem. This paper is devoted to developing an identification
method which aims to find the global optimal solution under mild computational
burden. Key to the developed identification algorithm is to transform a
bilinear estimation to a rank constrained optimization problem and further a
difference of convex programming (DCP) problem. The initial condition for the
DCP problem is obtained by solving its convex part of the optimization problem
which happens to be a nuclear norm regularized optimization problem. Since the
nuclear norm regularized optimization is the closest convex form of the
low-rank constrained estimation problem, the obtained initial condition is
always of high quality which provides the DCP problem a good starting point.
The DCP problem is then solved by the sequential convex programming method.
Finally, numerical examples are included to show the effectiveness of the
developed identification algorithm.Comment: Accepted to IEEE Conference on Decision and Control (CDC) 201
Bilinear Programming and Protein Structure Alignment
Proteins are a primary functional component of organic life, and understanding their function is integral to many areas of research in biochemistry. The three-dimensional structure of a protein largely determines this function. Protein structure alignment compares the structure of a protein with known function to that of a protein with unknown function. A protein’s three-dimensional structure can be transformed through a smooth piecewise-linear sigmoid function to a real symmetric contact matrix that represents the functional significance of certain parts of the protein. We address the protein alignment problem as a minimization of the 2-norm difference of two proteins’ contact matrices. The minimization is presented as a bilinear program, and spectral bounds for best- and worst-case alignments are provided, which are continuous with respect to small changes in the protein’s structure. Further conditions for a perfect alignment and heuristics for finding quality solutions are given
- …