Online algorithms for POMDPs with continuous state, action, and
  observation spaces

Kochenderfer, Mykel; Sunberg, Zachary

research

Online algorithms for POMDPs with continuous state, action, and observation spaces

Authors: Mykel Kochenderfer
Zachary Sunberg
Publication date: 15 June 2018
Publisher
Doi

Abstract

Online solvers for partially observable Markov decision processes have been applied to problems with large discrete state spaces, but continuous state, action, and observation spaces remain a challenge. This paper begins by investigating double progressive widening (DPW) as a solution to this challenge. However, we prove that this modification alone is not sufficient because the belief representations in the search tree collapse to a single particle causing the algorithm to converge to a policy that is suboptimal regardless of the computation time. This paper proposes and evaluates two new algorithms, POMCPOW and PFT-DPW, that overcome this deficiency by using weighted particle filtering. Simulation results show that these modifications allow the algorithms to be successful where previous approaches fail.Comment: Added Multilane sectio

Similar works

Full text

Available Versions

Crossref

info:doi/10.1609%2Ficaps.v28i1...

Last time updated on 03/03/2024

Association for the Advancement of Artificial Intelligence: AAAI Publications

oai:ojs.aaai.org:article/13882

Last time updated on 20/02/2021