Approximate Dynamic Programming via Sum of Squares Programming

Kamgarpour, Maryam; Kariotoglou, Nikolaos; Kunz, Konstantin; Lygeros, John; Summers, Sean; Summers, Tyler H.

Authors: Maryam Kamgarpour, Nikolaos Kariotoglou, Konstantin Kunz, John Lygeros, Sean Summers, Tyler H. Summers
Publication date: 6 December 2012

Abstract

We describe an approximate dynamic programming method for stochastic control problems on infinite state and input spaces. The optimal value function is approximated by a linear combination of basis functions whose coefficients are decision variables. Relaxing the Bellman equation to an inequality yields a linear program in the basis coefficients with infinitely many constraints. We show that a recently introduced method, which obtains convex quadratic value function approximations, can be extended to higher-order polynomial approximations via sum of squares programming techniques. An approximate value function can then be computed offline by solving a semidefinite program, without having to sample the infinite constraint set. The policy is evaluated online by solving a polynomial optimization problem, which is also convex in some cases. We experimentally validate the method on an autonomous helicopter testbed using a 10-dimensional helicopter model.

Comment: 7 pages, 5 figures. Submitted to the 2013 European Control Conference, Zurich, Switzerland.
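The relaxed Bellman inequality can be illustrated on a toy problem small enough to solve by hand. The sketch below is not the paper's helicopter setup or its solver. It assumes a scalar linear system x+ = a*x + b*u + w with zero-mean noise of variance sigma2, stage cost q*x^2 + r*u^2, discount factor gamma, and a quadratic candidate V(x) = p*x^2 + s; all of these names and numbers are hypothetical. For this candidate, the Bellman gap (stage cost plus discounted expected next value, minus V) is a quadratic polynomial in (x, u), so the sum of squares condition reduces to a 2x2 Gram matrix being positive semidefinite. Higher-order polynomial candidates would instead need a semidefinite programming solver.

```python
# Toy instance (assumed, not from the paper): scalar dynamics and quadratic cost.
A, B, Q, R, GAMMA, SIGMA2 = 1.0, 1.0, 1.0, 1.0, 0.9, 0.1


def gap_gram(p, a, b, q, r, gamma):
    """Entries of the 2x2 Gram matrix of the Bellman gap's quadratic part.

    With V(x) = p*x^2 + s, the gap
        q*x^2 + r*u^2 + gamma*E[V(a*x + b*u + w)] - V(x)
    equals z^T M z + const for z = (x, u).
    """
    m11 = q + gamma * a * a * p - p
    m12 = gamma * a * b * p
    m22 = r + gamma * b * b * p
    return m11, m12, m22


def is_sos(p, a, b, q, r, gamma):
    """The quadratic part is a sum of squares iff M is positive semidefinite."""
    m11, m12, m22 = gap_gram(p, a, b, q, r, gamma)
    return m11 >= 0 and m22 >= 0 and m11 * m22 - m12 * m12 >= 0


def largest_feasible_p(a, b, q, r, gamma, p_hi=100.0, iters=80):
    """Bisection for the largest p >= 0 that keeps the gap a sum of squares.

    Assumes p_hi is infeasible, which holds for the toy numbers above.
    """
    lo, hi = 0.0, p_hi
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if is_sos(mid, a, b, q, r, gamma):
            lo = mid
        else:
            hi = mid
    return lo


def largest_feasible_s(p, gamma, sigma2):
    """Largest offset s keeping the constant term gamma*(p*sigma2 + s) - s >= 0."""
    return gamma * p * sigma2 / (1.0 - gamma)


def riccati_p(a, b, q, r, gamma, iters=2000):
    """Reference value: fixed point of the discounted scalar Riccati recursion."""
    p = 0.0
    for _ in range(iters):
        p = q + gamma * a * a * p - (gamma * a * b * p) ** 2 / (r + gamma * b * b * p)
    return p


p_lp = largest_feasible_p(A, B, Q, R, GAMMA)
s_lp = largest_feasible_s(p_lp, GAMMA, SIGMA2)
p_ric = riccati_p(A, B, Q, R, GAMMA)
print(p_lp, s_lp, p_ric)
```

In this special case the objective is increasing in both p and s, so pushing them to the boundary of the feasible set recovers the exact quadratic value function; `p_lp` and `p_ric` should agree to numerical precision. With richer polynomial bases the relaxation generally yields only a lower bound on the optimal value function.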
