
3 research outputs found

Predictive-State Decoders: Encoding the Future into Recurrent Networks

Author: Venkatraman Arun; Rhinehart Nicholas; Sun Wen; Pinto Lerrel; Hebert Martial; Boots Byron; Kitani Kris M.; Bagnell J. Andrew
Publication date: 05/07/2017

Recurrent neural networks (RNNs) are a vital modeling technique that relies on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are characterized by underlying latent states whose form is often unknown, precluding its analytic representation inside an RNN. In the Predictive-State Representation (PSR) literature, latent state processes are modeled by an internal state representation that directly models the distribution of future observations, and most recent work in this area has relied on explicitly representing and targeting sufficient statistics of this probability distribution. We seek to combine the advantages of RNNs and PSRs by augmenting existing state-of-the-art recurrent neural networks with Predictive-State Decoders (PSDs), which add supervision to the network's internal state representation to target predicting future observations. Predictive-State Decoders are simple to implement and easily incorporated into existing training pipelines via additional loss regularization. We demonstrate the effectiveness of PSDs with experimental results in three different domains: probabilistic filtering, Imitation Learning, and Reinforcement Learning. In each, our method improves statistical performance of state-of-the-art recurrent baselines and does so with fewer iterations and less data. Comment: NIPS 201
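The abstract's idea of "additional loss regularization" can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the linear decoder `W`, the mean-squared-error penalty, the window length `k`, the weight `lam`, and all function names are assumptions chosen for illustration. The sketch decodes each hidden state into a prediction of the next `k` observations and adds the prediction error to an arbitrary task loss.

```python
import numpy as np


def future_windows(obs, k):
    """Stack the k observations that follow each time step.

    obs has shape (T, d). Row t of the result is the concatenation of
    obs[t+1], ..., obs[t+k], so the result has shape (T - k, k * d).
    """
    T, _ = obs.shape
    return np.stack([obs[t + 1 : t + 1 + k].reshape(-1) for t in range(T - k)])


def psd_loss(hidden, obs, W, k):
    """Mean squared error between decoded hidden states and future observations.

    hidden has shape (T, h); W is a hypothetical linear decoder of shape
    (h, k * d). Only the first T - k hidden states have a full future window.
    """
    targets = future_windows(obs, k)
    preds = hidden[: len(targets)] @ W
    return float(np.mean((preds - targets) ** 2))


def total_loss(task_loss, hidden, obs, W, k, lam):
    """Task loss plus the weighted predictive-state penalty (lam is assumed)."""
    return task_loss + lam * psd_loss(hidden, obs, W, k)
```

In a real training pipeline the hidden states and the decoder would be trained jointly by gradient descent; here they are plain arrays so that the loss arithmetic can be checked by hand.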

arXiv.org e-Print Archive

Dryad Digital Repository (Duke University)

FigShare

Learning to Smooth with Bidirectional Predictive State Inference Machines

Author: Bagnell James; Boots Byron; Capobianco Roberto; Gordon Geoff; Sun Wen
Publication venue: AUAI Press
Publication date: 01/01/2016

We present the Smoothing Machine (SMACH, pronounced "smash"), a dynamical system learning algorithm based on chain Conditional Random Fields (CRFs) with latent states. Unlike previous methods, SMACH is designed to optimize prediction performance when we have information from both past and future observations. By leveraging Predictive State Representations (PSRs), we model beliefs about latent states through predictive states, an alternative but equivalent representation that depends directly on observable quantities. Predictive states enable the use of well-developed supervised learning approaches in place of local-optimum-prone methods like EM: we learn regressors or classifiers that can approximate message passing and marginalization in the space of predictive states. We provide theoretical guarantees on smoothing performance and we empirically verify the efficacy of SMACH on several dynamical system benchmarks.

Archivio della ricerca- Università di Roma La Sapienza