Information-Theoretic Policy Extraction from Partial Observations

Lefebvre, Tom

Authors: Tom Lefebvre
Publication date: 1 January 2022

Abstract

We investigate the problem of extracting a control policy from one or more partial observation sequences. To this end, we cast the problem as a Controlled Hidden Markov Model. We then sketch two information-theoretic approaches to extracting a policy, which we refer to as A Posterior Control Distributions. The performance of both methods is investigated and compared empirically on a linear tracking problem.

Available Versions

Ghent University Academic Bibliography

oai:archive.ugent.be:8771690

Last time updated on 13/11/2022

arXiv.org e-Print Archive

oai:arXiv.org:2204.02350

Last time updated on 26/04/2022