Search CORE

29,232 research outputs found

Holographic and 3D teleconferencing and visualization: implications for terabit networked applications

Author: Gharai L.
Perkins C.S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

Abstract not available

Crossref

Enlighten

Online Visual Robot Tracking and Identification using Deep LSTM Networks

Author: Behnke Sven
Farazi Hafez
Publication venue
Publication date: 16/10/2018
Field of study

Collaborative robots working on a common task are necessary for many applications. One of the challenges for achieving collaboration in a team of robots is mutual tracking and identification. We present a novel pipeline for online visionbased detection, tracking and identification of robots with a known and identical appearance. Our method runs in realtime on the limited hardware of the observer robot. Unlike previous works addressing robot tracking and identification, we use a data-driven approach based on recurrent neural networks to learn relations between sequential inputs and outputs. We formulate the data association problem as multiple classification problems. A deep LSTM network was trained on a simulated dataset and fine-tuned on small set of real data. Experiments on two challenging datasets, one synthetic and one real, which include long-term occlusions, show promising results.Comment: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, 2017. IROS RoboCup Best Paper Awar

arXiv.org e-Print Archive

Crossref

Robust Deep Multi-Modal Sensor Fusion using Fusion Weight Regularization and Target Learning

Author: Li Peng
Li Yang
Shim Myung Seok
Zhang Wenrui
Zhang Xuchong
Zhao Chenye
Publication venue
Publication date: 01/01/2019
Field of study

Sensor fusion has wide applications in many domains including health care and autonomous systems. While the advent of deep learning has enabled promising multi-modal fusion of high-level features and end-to-end sensor fusion solutions, existing deep learning based sensor fusion techniques including deep gating architectures are not always resilient, leading to the issue of fusion weight inconsistency. We propose deep multi-modal sensor fusion architectures with enhanced robustness particularly under the presence of sensor failures. At the core of our gating architectures are fusion weight regularization and fusion target learning operating on auxiliary unimodal sensing networks appended to the main fusion model. The proposed regularized gating architectures outperform the existing deep learning architectures with and without gating under both clean and corrupted sensory inputs resulted from sensor failures. The demonstrated improvements are particularly pronounced when one or more multiple sensory modalities are corrupted.Comment: 8 page

arXiv.org e-Print Archive

eScholarship - University of California

Dynamic Reconfiguration in Camera Networks: A Short Survey

Author: Esterle Lukas
Foresti Gian Luca
Khan Asif
Piciarelli Claudio
Rinner Bernhard
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

There is a clear trend in camera networks towards enhanced functionality and flexibility, and a fixed static deployment is typically not sufficient to fulfill these increased requirements. Dynamic network reconfiguration helps to optimize the network performance to the currently required specific tasks while considering the available resources. Although several reconfiguration methods have been recently proposed, e.g., for maximizing the global scene coverage or maximizing the image quality of specific targets, there is a lack of a general framework highlighting the key components shared by all these systems. In this paper we propose a reference framework for network reconfiguration and present a short survey of some of the most relevant state-of-the-art works in this field, showing how they can be reformulated in our framework. Finally we discuss the main open research challenges in camera network reconfiguration

Archivio istituzionale della ricerca - Università degli Studi di Udine

Scenic: A Language for Scenario Specification and Scene Generation

Author: Dosovitskiy Alexey
Fremont Daniel J.
Gupta Ankush
Jiang Chenfanfu
Kulkarni Tejas
Liebelt Joerg
Milch Brian
Naveh Yehuda
Nori Aditya V
Ritchie Daniel
Ros Germán
Russell Stuart
Saheb-Djahromi Nasser
Sutton Michael
Wood Frank
Wu Bichen
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/06/2019
Field of study

We propose a new probabilistic programming language for the design and analysis of perception systems, especially those based on machine learning. Specifically, we consider the problems of training a perception system to handle rare events, testing its performance under different conditions, and debugging failures. We show how a probabilistic programming language can help address these problems by specifying distributions encoding interesting types of inputs and sampling these to generate specialized training and test sets. More generally, such languages can be used for cyber-physical systems and robotics to write environment models, an essential prerequisite to any formal analysis. In this paper, we focus on systems like autonomous cars and robots, whose environment is a "scene", a configuration of physical objects and agents. We design a domain-specific language, Scenic, for describing "scenarios" that are distributions over scenes. As a probabilistic programming language, Scenic allows assigning distributions to features of the scene, as well as declaratively imposing hard and soft constraints over the scene. We develop specialized techniques for sampling from the resulting distribution, taking advantage of the structure provided by Scenic's domain-specific syntax. Finally, we apply Scenic in a case study on a convolutional neural network designed to detect cars in road images, improving its performance beyond that achieved by state-of-the-art synthetic data generation methods.Comment: 41 pages, 36 figures. Full version of a PLDI 2019 paper (extending UC Berkeley EECS Department Tech Report No. UCB/EECS-2018-8

arXiv.org e-Print Archive

Crossref