99,663 research outputs found

3D Point Capsule Networks

Authors: Birdal Tolga; Deng Haowen; Tombari Federico; Zhao Yongheng
Publication date: 01/01/2018

In this paper, we propose 3D point-capsule networks, an auto-encoder designed to process sparse 3D point clouds while preserving spatial arrangements of the input data. 3D capsule networks arise as a direct consequence of our novel unified 3D auto-encoder formulation. Their dynamic routing scheme and the peculiar 2D latent space deployed by our approach bring improvements for several common point cloud-related tasks, such as object classification, object reconstruction and part segmentation, as substantiated by our extensive evaluations. Moreover, it enables new applications such as part interpolation and replacement.

Comment: As published in CVPR 2019 (camera-ready version), with supplementary material

arXiv.org e-Print Archive

Crossref

Archivio istituzionale della ricerca - Università di Padova

Object Action Complexes as an Interface for Planning and Robot Control

Authors: Geib Christopher; Krüger Norbert; Mourao Kira; Petrick Ron; Pugeault Nico; Steedman Mark; Wörgötter Florentin
Publication date: 01/01/2006

Abstract: Much prior work in integrating high-level artificial intelligence planning technology with low-level robotic control has foundered on the significant representational differences between these two areas of research. We discuss a proposed solution to this representational discontinuity in the form of object-action complexes (OACs). The pairing of actions and objects in a single interface representation captures the needs of both reasoning levels, and will enable machine learning of high-level action representations from low-level control representations.

I. Introduction and Background

The different representations that are effective for continuous control of robotic systems and for discrete symbolic AI present a significant challenge for integrating AI planning research and robotics. These areas of research should be abl

CiteSeerX

Edinburgh Research Explorer

Automatic semantic video annotation in wide domain videos based on similarity and commonsense knowledgebases

Authors: Ahmed Amr; Altadmri Amjad
Publication date: 01/01/2009

In this paper, we introduce a novel framework for automatic Semantic Video Annotation. As this framework detects possible events occurring in video clips, it forms the annotating base of a video search engine. To achieve this purpose, the system has to be able to operate on uncontrolled wide-domain videos. Thus, all layers have to be based on generic features. This framework aims to bridge the "semantic gap", which is the difference between the low-level visual features and the human's perception, by finding videos with similar visual events, analyzing their free-text annotations to find a common area, and then deciding on the best description for the new video using commonsense knowledgebases. Experiments were performed on wide-domain video clips from the TRECVID 2005 BBC rush standard database. Results from these experiments show promising integrity between those two layers in order to find expressive annotations for the input video. These results were evaluated based on retrieval performance

University of Lincoln Institutional Repository

Crossref

Edge Hill University Research Information Repository

Kent Academic Repository