64 research outputs found
A survey of solution techniques for the partially observed Markov decision process
We survey several computational procedures for the partially observed Markov decision process (POMDP) that have been developed since the Monahan survey was published in 1982. The POMDP generalizes the standard, completely observed Markov decision process by permitting the possibility that state observations may be noise-corrupted and/or costly. Several computational procedures presented are convergence accelerating variants of, or approximations to, the Smallwood-Sondik algorithm. Finite-memory suboptimal design results are reported, and new research directions involving heuristic search are discussed.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/44198/1/10479_2005_Article_BF02204836.pd
- …