Asymptotic properties of optimal trajectories in dynamic programming

Abstract

We prove in a dynamic programming framework that uniform convergence of the finite horizon values implies that asymptotically the average accumulated payoff is constant on optimal trajectories. We analyze and discuss several possible extensions to two-person games

    Similar works