Search CORE

1,505 research outputs found

Predictive-State Decoders: Encoding the Future into Recurrent Networks

Author: Venkatraman Arun
Rhinehart Nicholas
Sun Wen
Pinto Lerrel
Hebert Martial
Boots Byron
Kitani Kris M.
Bagnell J. Andrew
Publication venue
Publication date: 05/07/2017
Field of study

Recurrent neural networks (RNNs) are a vital modeling technique that rely on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are characterized by underlying latent states whose form is often unknown, precluding its analytic representation inside an RNN. In the Predictive-State Representation (PSR) literature, latent state processes are modeled by an internal state representation that directly models the distribution of future observations, and most recent work in this area has relied on explicitly representing and targeting sufficient statistics of this probability distribution. We seek to combine the advantages of RNNs and PSRs by augmenting existing state-of-the-art recurrent neural networks with Predictive-State Decoders (PSDs), which add supervision to the network's internal state representation to target predicting future observations. Predictive-State Decoders are simple to implement and easily incorporated into existing training pipelines via additional loss regularization. We demonstrate the effectiveness of PSDs with experimental results in three different domains: probabilistic filtering, Imitation Learning, and Reinforcement Learning. In each, our method improves statistical performance of state-of-the-art recurrent baselines and does so with fewer iterations and less data.Comment: NIPS 201

arXiv.org e-Print Archive

Dryad Digital Repository (Duke University)

FigShare

Recognising facial expressions in video sequences

Author: A Gee
A Lanitis
B Fasel
B Raducanu
D DeCarlo
D DeCarlo
D Terzopoulos
Enrique Muñoz
FM Alkoot
G Hager
G Zhao
H Rowley
I Cohen
I Essa
I Matthews
J Ohya
JB Tenenbaum
JJ Lien
JM Buenaposada
JN Basili
José M. Buenaposada
K Mase
Luis Baumela
M Panti
M Pantic
M Rosenblum
M Turk
MF McTear
MJ Black
MJ Black
MJ Lyons
MKH Leung
N Oliver
P Ekman
P Ekman
P Rani
P Viola
PN Belhumeur
R Cowie
RO Duda
RW Picard
S Baker
S Roweis
T Cootes
T Ojala
Y Tian
Y Xu
Y Yacoob
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

We introduce a system that processes a sequence of images of a front-facing human face and recognises a set of facial expressions. We use an efficient appearance-based face tracker to locate the face in the image sequence and estimate the deformation of its non-rigid components. The tracker works in real-time. It is robust to strong illumination changes and factors out changes in appearance caused by illumination from changes due to face deformation. We adopt a model-based approach for facial expression recognition. In our model, an image of a face is represented by a point in a deformation space. The variability of the classes of images associated to facial expressions are represented by a set of samples which model a low-dimensional manifold in the space of deformations. We introduce a probabilistic procedure based on a nearest-neighbour approach to combine the information provided by the incoming image sequence with the prior information stored in the expression manifold in order to compute a posterior probability associated to a facial expression. In the experiments conducted we show that this system is able to work in an unconstrained environment with strong changes in illumination and face location. It achieves an 89\% recognition rate in a set of 333 sequences from the Cohn-Kanade data base

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Efficient and Stable Acoustic Tomography Using Sparse Reconstruction Methods

Author: Hormati Ali
Jovanovic Ivana
Sbaiz Luciano
Vetterli Martin
Publication venue
Publication date: 22/05/2007
Field of study

We study an acoustic tomography problem and propose a new inversion technique based on sparsity. Acoustic tomography observes the parameters of the medium that influence the speed of sound propagation. In the human body, the parameters that mostly influence the sound speed are temperature and density, in the ocean - temperature and current, in the atmosphere - temperature and wind. In this study, we focus on estimating temperature in the atmosphere using the information on the average sound speed along the propagation path. The latter is practically obtained from travel time measurements. We propose a reconstruction algorithm that exploits the concept of sparsity. Namely, the temperature is assumed to be a linear combination of some functions (e.g. bases or set of different bases) where many of the coefficients are known to be zero. The goal is to find the non-zero coefficients. To this end, we apply an algorithm based on linear programming that under some constrains finds the solution with minimum l0 norm. This is actually equivalent to the fact that many of the unknown coefficients are zeros. Finally, we perform numerical simulations to assess the effectiveness of our approach. The simulation results confirm the applicability of the method and demonstrate high reconstruction quality and robustness to noise

Infoscience - École polytechnique fédérale de Lausanne

MIMiC: Multimodal Interactive Motion Controller

Author: Dumebi Okwechime
Eng-Jon Ong
Richard Bowden
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Sparsity-Based Algorithms for Line Spectral Estimation

Author: Hansen Thomas Lundgaard
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2018
Field of study

VBN