
73,480 research outputs found

Visual Odometry using Convolutional Neural Networks

Author: Fagan Thomas
Graves Alec
Lim Steffen
McFall Kevin, PhD.
Publication venue: DigitalCommons@Kennesaw State University
Publication date: 21/12/2017

Visual odometry is the process of tracking an agent's motion over time using a visual sensor. The visual odometry problem has only recently been solved using traditional, non-machine-learning techniques. Despite the success of neural networks at many related problems such as object recognition, feature detection, and optical flow, visual odometry still has not been solved with a deep learning technique. This paper attempts to implement several Convolutional Neural Networks to solve the visual odometry problem and compare slight variations in data preprocessing. The work presented is a step toward reaching a legitimate neural network solution.

DigitalCommons@Kennesaw State University

Multi-camera object tracking using surprisal observations in visual sensor networks

Author: Bhuvana V. P.
Huemer M.
Regazzoni C. S.
Rinner B.
Schranz M.
Tonello A. M.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Archivio istituzionale della ricerca - Università degli Studi di Udine

Simulation tool implementing centralized and distributed algorithms for tracking acoustic targets

Author: Serna Martínez Álvaro de la
Publication date: 11/07/2014

The goal of this document is the implementation of a software tool for the simulation of the acoustic tracking problem over a wireless sensor network working in a centralized or distributed manner. Its Graphical User Interface (GUI) allows the user to configure the parameters associated with the diffusion adaptive algorithms implemented in the simulation tool, in order to offer a visual representation of the behavior of a real sensor network working with those settings. For illustration we ran several simulations, which allowed us to visualize the performance of different network configurations. The results obtained with the implemented simulation tool show it can be very helpful for studying the audio target tracking problem and ultimately for the design of sensor networks that can guarantee certain performance criteria. Moreover, we have developed the code for the implementation of a real acoustic tracking sensor network working in a centralized manner, using ©Libelium’s Waspmote™ sensor boards as the network nodes and using ©Libelium’s Meshlium-Xtreme™ as the central node. Ingeniería de Sistemas Audiovisuale

Universidad Carlos III de Madrid e-Archivo

Human mobility monitoring in very low resolution visual sensor network

Author: Aghajan Hamid
Deboeverie Francis
Eldib Mohamed
Guan Junzhi
Kleihorst Richard
Niño Castañeda Jorge
Nyan Bo Bo
Philips Wilfried
Slembrouck Maarten
Steendam Heidi
Van de Velde Samuel
Van Haerenborgh Dirk
Veelaert Peter
Xie Xingzhe
Publication venue: MDPI AG
Publication date: 01/01/2014

This paper proposes an automated system for monitoring mobility patterns using a network of very low resolution visual sensors (30 × 30 pixels). The use of very low resolution sensors reduces privacy concerns, cost, computational requirements and power consumption. The core of our proposed system is a robust people tracker that uses low resolution videos provided by the visual sensor network. The distributed processing architecture of our tracking system allows all image processing tasks to be done on the digital signal controller in each visual sensor. In this paper, we experimentally show that reliable tracking of people is possible using very low resolution imagery. We also compare the performance of our tracker against a state-of-the-art tracking method and show that our method outperforms it. Moreover, the mobility statistics of tracks, such as total distance traveled and average speed derived from trajectories, are compared with those derived from ground truth given by Ultra-Wide Band sensors. The results of this comparison show that the trajectories from our system are accurate enough to obtain useful mobility statistics.
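The two mobility statistics named in this abstract, total distance traveled and average speed, can be illustrated with a minimal sketch. It assumes a trajectory is a time-ordered list of (time in seconds, x in metres, y in metres) samples; the function name `mobility_stats` and the input layout are hypothetical and not taken from the paper.

```python
import math

def mobility_stats(track):
    """Return (total_distance_m, average_speed_m_per_s) for one trajectory.

    `track` is assumed to be a time-ordered list of (t_seconds, x_m, y_m)
    tuples; this layout is an illustrative assumption, not the paper's format.
    """
    total = 0.0
    # Sum the straight-line distance between consecutive samples.
    for (_, x0, y0), (_, x1, y1) in zip(track, track[1:]):
        total += math.hypot(x1 - x0, y1 - y0)
    # Duration is the time between the first and last sample.
    duration = track[-1][0] - track[0][0] if len(track) > 1 else 0.0
    avg_speed = total / duration if duration > 0 else 0.0
    return total, avg_speed
```

The same function could be applied to a tracker's trajectory and to a ground-truth trajectory, and the two result pairs compared directly.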

Multidisciplinary Digital Publishing Institute

Ghent University Academic Bibliography

Directory of Open Access Journals

PubMed Central

Event-based Vision: A Survey

Author: Bartolozzi Chiara
Censi Andrea
Conradt Joerg
Daniilidis Kostas
Davison Andrew
Delbruck Tobi
Gallego Guillermo
Leutenegger Stefan
Orchard Garrick
Scaramuzza Davide
Taba Brian
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2019

Event cameras are bio-inspired sensors that differ from conventional frame cameras: instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (on the order of microseconds), very high dynamic range (140 dB vs. 60 dB), low power consumption, and high pixel bandwidth (on the order of kHz) resulting in reduced motion blur. Hence, event cameras have a large potential for robotics and computer vision in challenging scenarios for traditional cameras, such as low-latency, high speed, and high dynamic range. However, novel methods are required to process the unconventional output of these sensors in order to unlock their potential. This paper provides a comprehensive overview of the emerging field of event-based vision, with a focus on the applications and the algorithms developed to unlock the outstanding properties of event cameras. We present event cameras from their working principle, the actual sensors that are available and the tasks that they have been used for, from low-level vision (feature detection and tracking, optic flow, etc.) to high-level vision (reconstruction, segmentation, recognition). We also discuss the techniques developed to process events, including learning-based techniques, as well as specialized processors for these novel sensors, such as spiking neural networks. Additionally, we highlight the challenges that remain to be tackled and the opportunities that lie ahead in the search for a more efficient, bio-inspired way for machines to perceive and interact with the world.
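The event format described in this abstract (time, location and sign of a brightness change) can be sketched as follows. This is a simplified, frame-based approximation: a real event camera fires each pixel asynchronously, whereas the sketch compares two brightness grids and emits one event per pixel whose change reaches a threshold. The names `Event` and `events_between` and the `threshold` value are illustrative assumptions, not part of the survey.

```python
from typing import List, NamedTuple, Sequence

class Event(NamedTuple):
    t: float       # timestamp assigned to the event
    x: int         # pixel column
    y: int         # pixel row
    polarity: int  # +1 if brightness increased, -1 if it decreased

def events_between(prev: Sequence[Sequence[float]],
                   curr: Sequence[Sequence[float]],
                   t: float,
                   threshold: float = 0.2) -> List[Event]:
    """Emit one event per pixel whose brightness change reaches `threshold`.

    `prev` and `curr` are equally sized 2D grids of brightness values.
    The threshold is a hypothetical contrast sensitivity, chosen only
    for illustration.
    """
    events = []
    for y, (row_prev, row_curr) in enumerate(zip(prev, curr)):
        for x, (a, b) in enumerate(zip(row_prev, row_curr)):
            diff = b - a
            if abs(diff) >= threshold:
                events.append(Event(t, x, y, 1 if diff > 0 else -1))
    return events
```

Pixels whose brightness does not change produce no output, which is why a static scene yields an empty event list in this sketch.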

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

ZORA

Deep Forward and Inverse Perceptual Models for Tracking and Prediction

Author: Boots Byron
Lambert Alexander
Liu Zhen
Raj Amit
Shaban Amirreza
Publication date: 19/05/2018

We consider the problems of learning forward models that map state to high-dimensional images and inverse models that map high-dimensional images to state in robotics. Specifically, we present a perceptual model for generating video frames from state with deep networks, and provide a framework for its use in tracking and prediction tasks. We show that our proposed model greatly outperforms standard deconvolutional methods and GANs for image generation, producing clear, photo-realistic images. We also develop a convolutional neural network model for state estimation and compare the result to an Extended Kalman Filter to estimate robot trajectories. We validate all models on a real robotic system. Comment: 8 pages, International Conference on Robotics and Automation (ICRA) 201

arXiv.org e-Print Archive

Crossref

A sparsity-driven approach to multi-camera tracking in visual sensor networks

Author: Çetin Müjdat
Coşar Serhan
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/08/2013

In this paper, a sparsity-driven approach is presented for multi-camera tracking in visual sensor networks (VSNs). VSNs consist of image sensors, embedded processors and wireless transceivers, all of which are powered by batteries. Since the energy and bandwidth resources are limited, setting up a tracking system in VSNs is a challenging problem. Motivated by the goal of tracking in a bandwidth-constrained environment, we present a sparsity-driven method to compress the features extracted by the camera nodes, which are then transmitted across the network for distributed inference. We have designed special overcomplete dictionaries that match the structure of the features, leading to very parsimonious yet accurate representations. We have tested our method in indoor and outdoor people tracking scenarios. Our experimental results demonstrate how our approach leads to communication savings without significant loss in tracking performance.

Crossref

INRIA a CCSD electronic archive server

Sabanci University Research Database

HAL-Rennes 1

SALSA: A Novel Dataset for Multimodal Group Behavior Analysis

Author: Alameda-Pineda Xavier
Batrinca Ligia
Lanz Oswald
Lepri Bruno
Ricci Elisa
Sebe Nicu
Staiano Jacopo
Subramanian Ramanathan
Publication date: 23/06/2015

Studying free-standing conversational groups (FCGs) in unstructured social settings (e.g., cocktail party) is gratifying due to the wealth of information available at the group (mining social networks) and individual (recognizing native behavioral and personality traits) levels. However, analyzing social scenes involving FCGs is also highly challenging due to the difficulty in extracting behavioral cues such as target locations, their speaking activity and head/body pose due to crowdedness and presence of extreme occlusions. To this end, we propose SALSA, a novel dataset facilitating multimodal and Synergetic sociAL Scene Analysis, and make two main contributions to research on automated social interaction analysis: (1) SALSA records social interactions among 18 participants in a natural, indoor environment for over 60 minutes, under the poster presentation and cocktail party contexts presenting difficulties in the form of low-resolution images, lighting variations, numerous occlusions, reverberations and interfering sound sources; (2) To alleviate these problems we facilitate multimodal analysis by recording the social interplay using four static surveillance cameras and sociometric badges worn by each participant, comprising the microphone, accelerometer, Bluetooth and infrared sensors. In addition to raw data, we also provide annotations concerning individuals' personality as well as their position, head, body orientation and F-formation information over the entire event duration. Through extensive experiments with state-of-the-art approaches, we show (a) the limitations of current methods and (b) how the recorded multiple cues synergetically aid automatic analysis of social interactions. SALSA is available at http://tev.fbk.eu/salsa. Comment: 14 pages, 11 figure

arXiv.org e-Print Archive

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

University of Canberra Research Repository