Search CORE

68,949 research outputs found

Multiscale Global Adaptive Attention Graph Neural Network

Author: GOU Ruru YANG Wenzhu, LUO Zifei, YUAN Yunfeng
Publication venue: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Publication date: 01/12/2023
Field of study

Dynamic multiscale graph neural networks have high motion prediction errors due to the low correlation between the internal joints of body parts and the limited perceptual fields. A multiscale global adaptive attention graph neural network for human motion prediction is proposed to reduce motion prediction errors. Firstly, a multi-distance partitioning strategy for dividing skeleton joint is proposed to improve the degree of temporal and spatial correlation of body joint information. Secondly, a global adaptive attention spatial temporal graph convolutional network is designed to dynamically enhance the network??s attention to the spatial temporal joints contributing to a motion in combination with global adaptive attention. Finally, this paper integrates the above two improvements into the graph convolutional neural network gate recurrent unit to enhance the state propagation performance of the decoding network and reduce prediction errors. Experimental results show that the prediction error of the proposed method is decreased on Human 3.6M dataset, CMU Mocap dataset and 3DPW dataset compared with state-of-the-art methods

Directory of Open Access Journals

A Quadruple Diffusion Convolutional Recurrent Network for Human Motion Prediction

Author: Ho Edmond S.L.
Leung Howard
Men Qianhui
Shum Hubert P.H.
Publication venue: Institute of Electrical and Electronics Engineers
Publication date: 16/11/2020
Field of study

Recurrent neural network (RNN) has become popular for human motion prediction thanks to its ability to capture temporal dependencies. However, it has limited capacity in modeling the complex spatial relationship in the human skeletal structure. In this work, we present a novel diffusion convolutional recurrent predictor for spatial and temporal movement forecasting, with multi-step random walks traversing bidirectionally along an adaptive graph to model interdependency among body joints. In the temporal domain, existing methods rely on a single forward predictor with the produced motion deflecting to the drift route, which leads to error accumulations over time. We propose to supplement the forward predictor with a forward discriminator to alleviate such motion drift in the long term under adversarial training. The solution is further enhanced by a backward predictor and a backward discriminator to effectively reduce the error, such that the system can also look into the past to improve the prediction at early frames. The two-way spatial diffusion convolutions and two-way temporal predictors together form a quadruple network. Furthermore, we train our framework by modeling the velocity from observed motion dynamics instead of static poses to predict future movements that effectively reduces the discontinuity problem at early prediction. Our method outperforms the state of the arts on both 3D and 2D datasets, including the Human3.6M, CMU Motion Capture and Penn Action datasets. The results also show that our method correctly predicts both high-dynamic and low-dynamic moving trends with less motion drift

Durham Research Online

Northumbria Research Link

Enlighten

Im2Flow: Motion Hallucination from Static Images for Action Recognition

Author: Gao Ruohan
Grauman Kristen
Xiong Bo
Publication venue
Publication date: 30/05/2018
Field of study

Existing methods to recognize actions in static images take the images at their face value, learning the appearances---objects, scenes, and body poses---that distinguish each action class. However, such models are deprived of the rich dynamic structure and motions that also define human activity. We propose an approach that hallucinates the unobserved future motion implied by a single snapshot to help static-image action recognition. The key idea is to learn a prior over short-term dynamics from thousands of unlabeled videos, infer the anticipated optical flow on novel static images, and then train discriminative models that exploit both streams of information. Our main contributions are twofold. First, we devise an encoder-decoder convolutional neural network and a novel optical flow encoding that can translate a static image into an accurate flow map. Second, we show the power of hallucinated flow for recognition, successfully transferring the learned motion into a standard two-stream network for activity recognition. On seven datasets, we demonstrate the power of the approach. It not only achieves state-of-the-art accuracy for dense optical flow prediction, but also consistently enhances recognition of actions and dynamic scenes.Comment: Published in CVPR 2018, project page: http://vision.cs.utexas.edu/projects/im2flow

arXiv.org e-Print Archive

Crossref

Recognizing recurrent neural networks (rRNN): Bayesian inference for recurrent neural networks

Author: A Doucet
A Lazar
A Parlos
AH Jazwinski
B Cessac
C Archambeau
C Summerfield
D Debanne
D Mottet
D Perdikis
D Perdikis
D Verstraeten
DV Buonomano
EA Wan
EA Wan
EK Miller
EK Pissadaki
FH Hamker
G Schöner
GE Hinton
H Jaeger
J Daunizeau
J Ting-Ho Lo
JAS Kelso
JL Elman
JT Connor
K Friston
K Friston
K Narendra
K Sidiropoulou
KJ Friston
KJ Friston
KJ Friston
M Bar
M Boerlin
MI Rabinovich
N Spruston
R Blake
R Legenstein
R Wilson
RC Sotero
RP Rao
RP Rao
RPN Rao
S Denève
S Denève
S Rodrigues
S Roweis
Sebastian Bitzer
SJ Kiebel
SJ Kiebel
SJ Kiebel
SJ Kiebel
Stefan J. Kiebel
TB Schön
V Wassenhove van
VK Jirsa
W Maass
Z Ghahramani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Recurrent neural networks (RNNs) are widely used in computational neuroscience and machine learning applications. In an RNN, each neuron computes its output as a nonlinear function of its integrated input. While the importance of RNNs, especially as models of brain processing, is undisputed, it is also widely acknowledged that the computations in standard RNN models may be an over-simplification of what real neuronal networks compute. Here, we suggest that the RNN approach may be made both neurobiologically more plausible and computationally more powerful by its fusion with Bayesian inference techniques for nonlinear dynamical systems. In this scheme, we use an RNN as a generative model of dynamic input caused by the environment, e.g. of speech or kinematics. Given this generative RNN model, we derive Bayesian update equations that can decode its output. Critically, these updates define a 'recognizing RNN' (rRNN), in which neurons compute and exchange prediction and prediction error messages. The rRNN has several desirable features that a conventional RNN does not have, for example, fast decoding of dynamic stimuli and robustness to initial conditions and noise. Furthermore, it implements a predictive coding scheme for dynamic inputs. We suggest that the Bayesian inversion of recurrent neural networks may be useful both as a model of brain function and as a machine learning tool. We illustrate the use of the rRNN by an application to the online decoding (i.e. recognition) of human kinematics

arXiv.org e-Print Archive

Crossref

Springer - Publisher Connector

MPG.PuRe