399 research outputs found

A syntactic approach to robot imitation learning using probabilistic activity grammars

Author: Demiris Y
Kim T-K
Lee K
Su Y
Publication venue: Elsevier BV
Publication date: 01/12/2013

Spiral - Imperial College Digital Repository

Learning the Semantics of Manipulation Action

Author: Aksoy Eren Erdal
Aloimonos Yiannis
Fermuller Cornelia
Yang Yezhou
Publication date: 01/01/2015

In this paper we present a formal computational framework for modeling manipulation actions. The introduced formalism leads to semantics of manipulation action and has applications to both observing and understanding human manipulation actions as well as executing them with a robotic mechanism (e.g. a humanoid robot). It is based on a Combinatory Categorial Grammar. The goal of the introduced framework is to: (1) represent manipulation actions with both syntax and semantic parts, where the semantic part employs λ-calculus; (2) enable a probabilistic semantic parsing schema to learn the λ-calculus representation of manipulation action from an annotated action corpus of videos; (3) use (1) and (2) to develop a system that visually observes manipulation actions and understands their meaning while it can reason beyond observations using propositional logic and axiom schemata. The experiments conducted on a publicly available large manipulation action dataset validate the theoretical framework and our implementation

arXiv.org e-Print Archive

Crossref

The Meaning of Action: a review on action recognition and mapping

Author: Geib Christopher
Kragic Danica
Krüger Volker
Ude Ales
Publication venue: Aalborg Universitet, Copenhagen Institute of Technology (CVMI)
Publication date: 01/01/2007

In this paper, we analyze the different approaches taken to date within the computer vision, robotics and artificial intelligence communities for the representation, recognition, synthesis and understanding of action. We deal with action at different levels of complexity and provide the reader with the necessary related literature references. We put the literature references further into context and outline a possible interpretation of action by taking into account the different aspects of action recognition, action synthesis and task-level planning

Lund University Publications

VBN

STARE: Spatio-Temporal Attention Relocation for multiple structured activities detection

Author: Chang H
Demiris Y
Kim T-K
Lee K
Ognibene D
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 14/09/2015

We present a spatio-temporal attention relocation (STARE) method, an information-theoretic approach for efficient detection of simultaneously occurring structured activities. Given multiple human activities in a scene, our method dynamically focuses on the currently most informative activity. Each activity can be detected without complete observation, as the structure of sequential actions plays an important role in making the system robust to unattended observations. For such systems, the ability to decide where and when to focus is crucial to achieving high detection performance under resource-bounded conditions. Our main contributions can be summarized as follows: 1) an information-theoretic dynamic attention relocation framework that allows the detection of multiple activities efficiently by exploiting the activity structure information and 2) a new high-resolution data set of temporally-structured concurrent activities. Our experiments on applications show that the STARE method performs efficiently while maintaining a reasonable level of accuracy

University of Birmingham Research Portal

Spiral - Imperial College Digital Repository

Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation

Author: Abbeel Pieter
Agrawal Pulkit
Chen Dian
Isola Phillip
Levine Sergey
Malik Jitendra
Nair Ashvin
Publication date: 06/03/2017

Manipulation of deformable objects, such as ropes and cloth, is an important but challenging problem in robotics. We present a learning-based system where a robot takes as input a sequence of images of a human manipulating a rope from an initial to goal configuration, and outputs a sequence of actions that can reproduce the human demonstration, using only monocular images as input. To perform this task, the robot learns a pixel-level inverse dynamics model of rope manipulation directly from images in a self-supervised manner, using about 60K interactions with the rope collected autonomously by the robot. The human demonstration provides a high-level plan of what to do and the low-level inverse model is used to execute the plan. We show that by combining the high and low-level plans, the robot can successfully manipulate a rope into a variety of target shapes using only a sequence of human-provided images for direction. Comment: 8 pages, accepted to International Conference on Robotics and Automation (ICRA) 201

arXiv.org e-Print Archive

Crossref

Learning Social Affordance Grammar from Videos: Transferring Human Interactions to Human-Robot Interactions

Author: Gao Xiaofeng
Ryoo Michael S.
Shu Tianmin
Zhu Song-Chun
Publication date: 01/03/2017

In this paper, we present a general framework for learning social affordance grammar as a spatiotemporal AND-OR graph (ST-AOG) from RGB-D videos of human interactions, and transfer the grammar to humanoids to enable a real-time motion inference for human-robot interaction (HRI). Based on Gibbs sampling, our weakly supervised grammar learning can automatically construct a hierarchical representation of an interaction with long-term joint sub-tasks of both agents and short term atomic actions of individual agents. Based on a new RGB-D video dataset with rich instances of human interactions, our experiments of Baxter simulation, human evaluation, and real Baxter test demonstrate that the model learned from limited training data successfully generates human-like behaviors in unseen scenarios and outperforms both baselines. Comment: The 2017 IEEE International Conference on Robotics and Automation (ICRA

arXiv.org e-Print Archive

Crossref