Search CORE

102,063 research outputs found

Discrete time McKean-Vlasov control problem: a dynamic programming approach

Author: Pham Huyên
Wei Xiaoli
Publication venue
Publication date: 29/11/2015
Field of study

We consider the stochastic optimal control problem of nonlinear mean-field systems in discrete time. We reformulate the problem into a deterministic control problem with marginal distribution as controlled state variable, and prove that dynamic programming principle holds in its general form. We apply our method for solving explicitly the mean-variance portfolio selection and the multivariate linear-quadratic McKean-Vlasov control problem

arXiv.org e-Print Archive

Hal-Diderot

HAL-Polytechnique

A probabilistic indirect adaptive control for systems with input-dependent noise

Author: Herzallah Randa
Publication venue: 'Wiley'
Publication date: 01/01/2011
Field of study

A probabilistic indirect adaptive controller is proposed for the general nonlinear multivariate class of discrete time system. The proposed probabilistic framework incorporates input–dependent noise prediction parameters in the derivation of the optimal control law. Moreover, because noise can be nonstationary in practice, the proposed adaptive control algorithm provides an elegant method for estimating and tracking the noise. For illustration purposes, the developed method is applied to the affine class of nonlinear multivariate discrete time systems and the desired result is obtained: the optimal control law is determined by solving a cubic equation and the distribution of the tracking error is shown to be Gaussian with zero mean. The efficiency of the proposed scheme is demonstrated numerically through the simulation of an affine nonlinear system

Aston Publications Explorer

Mean-Variance Policy for Discrete-time Cone Constrained Markets: The Consistency in Efficiency and Minimum-Variance Signed Supermartingale Measure

Author: Cui Xiangyu
Li Duan
Li Xun
Publication venue
Publication date: 04/03/2014
Field of study

The discrete-time mean-variance portfolio selection formulation, a representative of general dynamic mean-risk portfolio selection problems, does not satisfy time consistency in efficiency (TCIE) in general, i.e., a truncated pre-committed efficient policy may become inefficient when considering the corresponding truncated problem, thus stimulating investors' irrational investment behavior. We investigate analytically effects of portfolio constraints on time consistency of efficiency for convex cone constrained markets. More specifically, we derive the semi-analytical expressions for the pre-committed efficient mean-variance policy and the minimum-variance signed supermartingale measure (VSSM) and reveal their close relationship. Our analysis shows that the pre-committed discrete-time efficient mean-variance policy satisfies TCIE if and only if the conditional expectation of VSSM's density (with respect to the original probability measure) is nonnegative, or once the conditional expectation becomes negative, it remains at the same negative value until the terminal time. Our findings indicate that the property of time consistency in efficiency only depends on the basic market setting, including portfolio constraints, and this fact motivates us to establish a general solution framework in constructing TCIE dynamic portfolio selection problem formulations by introducing suitable portfolio constraints

arXiv.org e-Print Archive

The Hong Kong Polytechnic University Pao Yue-kong Library

PolyU Institutional Repository

A continuous analogue of the tensor-train decomposition

Author: Gorodetsky Alex A.
Karaman Sertac
Marzouk Youssef M.
Publication venue
Publication date: 11/12/2018
Field of study

We develop new approximation algorithms and data structures for representing and computing with multivariate functions using the functional tensor-train (FT), a continuous extension of the tensor-train (TT) decomposition. The FT represents functions using a tensor-train ansatz by replacing the three-dimensional TT cores with univariate matrix-valued functions. The main contribution of this paper is a framework to compute the FT that employs adaptive approximations of univariate fibers, and that is not tied to any tensorized discretization. The algorithm can be coupled with any univariate linear or nonlinear approximation procedure. We demonstrate that this approach can generate multivariate function approximations that are several orders of magnitude more accurate, for the same cost, than those based on the conventional approach of compressing the coefficient tensor of a tensor-product basis. Our approach is in the spirit of other continuous computation packages such as Chebfun, and yields an algorithm which requires the computation of "continuous" matrix factorizations such as the LU and QR decompositions of vector-valued functions. To support these developments, we describe continuous versions of an approximate maximum-volume cross approximation algorithm and of a rounding algorithm that re-approximates an FT by one of lower ranks. We demonstrate that our technique improves accuracy and robustness, compared to TT and quantics-TT approaches with fixed parameterizations, of high-dimensional integration, differentiation, and approximation of functions with local features such as discontinuities and other nonlinearities

arXiv.org e-Print Archive

DSpace@MIT

Convex inner approximations of nonconvex semialgebraic sets applied to fixed-order controller design

Author: Henrion Didier
Louembet Christophe
Publication venue
Publication date: 10/01/2012
Field of study

We describe an elementary algorithm to build convex inner approximations of nonconvex sets. Both input and output sets are basic semialgebraic sets given as lists of defining multivariate polynomials. Even though no optimality guarantees can be given (e.g. in terms of volume maximization for bounded sets), the algorithm is designed to preserve convex boundaries as much as possible, while removing regions with concave boundaries. In particular, the algorithm leaves invariant a given convex set. The algorithm is based on Gloptipoly 3, a public-domain Matlab package solving nonconvex polynomial optimization problems with the help of convex semidefinite programming (optimization over linear matrix inequalities, or LMIs). We illustrate how the algorithm can be used to design fixed-order controllers for linear systems, following a polynomial approach

arXiv.org e-Print Archive

Scientific Publications of the University of Toulouse II Le Mirail

HAL-INSA Toulouse