62 research outputs found
Object Tracking
Object tracking consists in estimation of trajectory of moving objects in the sequence of images. Automation of the computer object tracking is a difficult task. Dynamics of multiple parameters changes representing features and motion of the objects, and temporary partial or full occlusion of the tracked objects have to be considered. This monograph presents the development of object tracking algorithms, methods and systems. Both, state of the art of object tracking methods and also the new trends in research are described in this book. Fourteen chapters are split into two sections. Section 1 presents new theoretical ideas whereas Section 2 presents real-life applications. Despite the variety of topics contained in this monograph it constitutes a consisted knowledge in the field of computer object tracking. The intention of editor was to follow up the very quick progress in the developing of methods as well as extension of the application
Robust and affordable localization and mapping for 3D reconstruction. Application to architecture and construction
La localización y mapeado simultáneo a partir de una sola cámara en movimiento se conoce como Monocular
SLAM. En esta tesis se aborda este problema con cámaras de bajo coste cuyo principal reto consiste en ser
robustos al ruido, blurring y otros artefactos que afectan a la imagen. La aproximación al problema es discreta,
utilizando solo puntos de la imagen significativos para localizar la cámara y mapear el entorno. La principal
contribución es una simplificación del grafo de poses que permite mejorar la precisión en las escenas más
habituales, evaluada de forma exhaustiva en 4 datasets. Los resultados del mapeado permiten obtener una
reconstrucción 3D de la escena que puede ser utilizada en arquitectura y construcción para Modelar la Información
del Edificio (BIM). En la segunda parte de la tesis proponemos incorporar dicha información en un sistema de
visualización avanzada usando WebGL que ayude a simplificar la implantación de la metodología BIM.Departamento de Informática (Arquitectura y Tecnología de Computadores, Ciencias de la Computación e Inteligencia Artificial, Lenguajes y Sistemas Informáticos)Doctorado en Informátic
Neuromorphic deep convolutional neural network learning systems for FPGA in real time
Deep Learning algorithms have become one of the best approaches for pattern recognition in several fields, including computer vision, speech recognition, natural language processing, and audio recognition, among others. In image vision, convolutional neural networks stand out, due to their relatively simple supervised training and their efficiency extracting features from a scene. Nowadays, there exist several implementations of convolutional neural networks accelerators that manage to perform these networks in real time. However, the number of operations and power consumption of these implementations can be reduced using a different processing paradigm as neuromorphic engineering.
Neuromorphic engineering field studies the behavior of biological and inner systems of the human neural processing with the purpose of design analog, digital or mixed-signal systems to solve problems inspired in how human brain performs complex tasks, replicating the behavior and properties of biological neurons. Neuromorphic engineering tries to give an answer to how our brain is capable to learn and perform complex tasks with high efficiency under the paradigm of spike-based computation.
This thesis explores both frame-based and spike-based processing paradigms for the development of hardware architectures for visual pattern recognition based on convolutional neural networks. In this work, two FPGA implementations of convolutional neural networks accelerator architectures for frame-based using OpenCL and SoC technologies are presented. Followed by a novel neuromorphic convolution processor for spike-based processing paradigm, which implements the same behaviour of leaky integrate-and-fire neuron model. Furthermore, it reads the data in rows being able to perform multiple layers in the same chip. Finally, a novel FPGA implementation of Hierarchy of Time Surfaces algorithm and a new memory model for spike-based systems are proposed
Remote Assessment of the Cardiovascular Function Using Camera-Based Photoplethysmography
Camera-based photoplethysmography (cbPPG) is a novel measurement technique that allows the continuous monitoring of vital signs by using common video cameras. In the last decade, the technology has attracted a lot of attention as it is easy to set up, operates remotely, and offers new diagnostic opportunities. Despite the growing interest, cbPPG is not completely established yet and is still primarily the object of research. There are a variety of reasons for this lack of development including that reliable and autonomous hardware setups are missing, that robust processing algorithms are needed, that application fields are still limited, and that it is not completely understood which physiological factors impact the captured signal. In this thesis, these issues will be addressed.
A new and innovative measuring system for cbPPG was developed. In the course of three large studies conducted in clinical and non-clinical environments, the system’s great flexibility, autonomy, user-friendliness, and integrability could be successfully proven.
Furthermore, it was investigated what value optical polarization filtration adds to cbPPG. The results show that a perpendicular filter setting can significantly enhance the signal quality. In addition, the performed analyses were used to draw conclusions about the origin of cbPPG signals: Blood volume changes are most likely the defining element for the signal's modulation.
Besides the hardware-related topics, the software topic was addressed. A new method for the selection of regions of interest (ROIs) in cbPPG videos was developed. Choosing valid ROIs is one of the most important steps in the processing chain of cbPPG software. The new method has the advantage of being fully automated, more independent, and universally applicable. Moreover, it suppresses ballistocardiographic artifacts by utilizing a level-set-based approach. The suitability of the ROI selection method was demonstrated on a large and challenging data set.
In the last part of the work, a potentially new application field for cbPPG was explored. It was investigated how cbPPG can be used to assess autonomic reactions of the nervous system at the cutaneous vasculature. The results show that changes in the vasomotor tone, i.e. vasodilation and vasoconstriction, reflect
in the pulsation strength of cbPPG signals. These characteristics also shed more light on the origin problem. Similar to the polarization analyses, they support the classic blood volume theory.
In conclusion, this thesis tackles relevant issues regarding the application of cbPPG. The proposed solutions pave the way for cbPPG to become an established and widely accepted technology
Intelligent Circuits and Systems
ICICS-2020 is the third conference initiated by the School of Electronics and Electrical Engineering at Lovely Professional University that explored recent innovations of researchers working for the development of smart and green technologies in the fields of Energy, Electronics, Communications, Computers, and Control. ICICS provides innovators to identify new opportunities for the social and economic benefits of society. This conference bridges the gap between academics and R&D institutions, social visionaries, and experts from all strata of society to present their ongoing research activities and foster research relations between them. It provides opportunities for the exchange of new ideas, applications, and experiences in the field of smart technologies and finding global partners for future collaboration. The ICICS-2020 was conducted in two broad categories, Intelligent Circuits & Intelligent Systems and Emerging Technologies in Electrical Engineering
An Exploration into Model-Free Online Visual Object Tracking
This thesis presents a thorough investigation of model-free
visual object tracking, a fundamental computer vision task that
is essential for practical video analytics applications. Given
the states of the object in the rst frame, e.g., the position and
size of the target, the computational methods developed and
advanced in this thesis aim at determining target states in
consecutive video frames automatically. In contrast to the
tracking schemes that depend strictly on specic object detectors,
model-free tracking provides conveniently flexible and
competently general solutions where object representations are
initiated in the first frame and adapted in an online manner at
each frame.
We first articulate our motivations and intuitions in Chapter 1,
formulate model-free online visual tracking, illustrate outcomes
on two representative object tracking applications; drone control
and sports video broadcasting analysis, and elaborate other
relevant problems.
In Chapter 2, we review various tracking methodologies employed
by state-ofthe-art trackers and further review related background
knowledge, including several important dataset benchmarks and
workshop challenges, which are widely used for evaluating the
performance of trackers, as well as commonly applied evaluation
protocols in this chapter.
In Chapter 3 through Chapter 6, we then explore the model-free
online visual tracking problem in four different dimensions: 1)
learning a more discriminative classier with a two-layer
classication hierarchy and background contextual clusters; 2)
overcoming the limit of conventionally used local-search scheme
with a global object tracking framework based on instance-specic
object proposals; 3) tracking object affine motion with a
Structured Support Vector Machine (SSVM) framework incorporated
with motion manifold structure; 4) an efficient multiple object
model-free online tracking approach based on a shared pool of
object proposals.
Lastly, as a conclusion and future work outlook, we highlight and
summarize the contribution of this thesis and discuss several
promising research directions in Chapter 7, based on latest work
and their drawbacks of current state-of-the-art trackers
Recommended from our members
1995 BRAC Commission
NAWC Point Mugu, and China Lake - Data Obtained During Site Visits, Santa Cruz Island. (Box 277
An Energy-Efficient and Reliable Data Transmission Scheme for Transmitter-based Energy Harvesting Networks
Energy harvesting technology has been studied to overcome a limited power resource problem for a sensor network. This paper proposes a new data transmission period control and reliable data transmission algorithm for energy harvesting based sensor networks. Although previous studies proposed a communication protocol for energy harvesting based sensor networks, it still needs additional discussion. Proposed algorithm control a data transmission period and the number of data transmission dynamically based on environment information. Through this, energy consumption is reduced and transmission reliability is improved. The simulation result shows that the proposed algorithm is more efficient when compared with previous energy harvesting based communication standard, Enocean in terms of transmission success rate and residual energy.This research was supported by Basic Science Research Program through the National Research Foundation by Korea (NRF) funded by the Ministry of Education, Science and Technology(2012R1A1A3012227)
Gaze-Based Human-Robot Interaction by the Brunswick Model
We present a new paradigm for human-robot interaction based on social signal processing, and in particular on the Brunswick model. Originally, the Brunswick model copes with face-to-face dyadic interaction, assuming that the interactants are communicating through a continuous exchange of non verbal social signals, in addition to the spoken messages. Social signals have to be interpreted, thanks to a proper recognition phase that considers visual and audio information. The Brunswick model allows to quantitatively evaluate the quality of the interaction using statistical tools which measure how effective is the recognition phase. In this paper we cast this theory when one of the interactants is a robot; in this case, the recognition phase performed by the robot and the human have to be revised w.r.t. the original model. The model is applied to Berrick, a recent open-source low-cost robotic head platform, where the gazing is the social signal to be considered
- …