Search CORE

1,039 research outputs found

Video analytics system for surveillance videos

Author: Bai Yannan
Publication venue
Publication date: 03/07/2018
Field of study

Developing an intelligent inspection system that can enhance the public safety is challenging. An efficient video analytics system can help monitor unusual events and mitigate possible damage or loss. This thesis aims to analyze surveillance video data, report abnormal activities and retrieve corresponding video clips. The surveillance video dataset used in this thesis is derived from ALERT Dataset, a collection of surveillance videos at airport security checkpoints. The video analytics system in this thesis can be thought as a pipelined process. The system takes the surveillance video as input, and passes it through a series of processing such as object detection, multi-object tracking, person-bin association and re-identification. In the end, we can obtain trajectories of passengers and baggage in the surveillance videos. Abnormal events like taking away other's belongings will be detected and trigger the alarm automatically. The system could also retrieve the corresponding video clips based on user-defined query

Boston University Institutional Repository (OpenBU)

Tracking by Prediction: A Deep Generative Model for Mutli-Person localisation and Tracking

Author: Denman Simon
Fernando Tharindu
Fookes Clinton
Sridharan Sridha
Publication venue
Publication date: 01/01/2018
Field of study

Current multi-person localisation and tracking systems have an over reliance on the use of appearance models for target re-identification and almost no approaches employ a complete deep learning solution for both objectives. We present a novel, complete deep learning framework for multi-person localisation and tracking. In this context we first introduce a light weight sequential Generative Adversarial Network architecture for person localisation, which overcomes issues related to occlusions and noisy detections, typically found in a multi person environment. In the proposed tracking framework we build upon recent advances in pedestrian trajectory prediction approaches and propose a novel data association scheme based on predicted trajectories. This removes the need for computationally expensive person re-identification systems based on appearance features and generates human like trajectories with minimal fragmentation. The proposed method is evaluated on multiple public benchmarks including both static and dynamic cameras and is capable of generating outstanding performance, especially among other recently proposed deep neural network based approaches.Comment: To appear in IEEE Winter Conference on Applications of Computer Vision (WACV), 201

arXiv.org e-Print Archive

Crossref

Queensland University of Technology ePrints Archive

A follow-me algorithm for AR.Drone using MobileNet-SSD and PID control

Author: Garriga Ferrer Júlia
Publication venue
Publication date: 27/06/2018
Field of study

Treballs Finals de Grau d'Enginyeria Informàtica, Facultat de Matemàtiques, Universitat de Barcelona, Any: 2018, Director: Lluís Garrido Ostermann[en] In recent years the industry of quadcopters has experimented a boost. The appearance of inexpensive drones has led to the growth of the recreational use of this vehicles, which opens the door to the creation of new applications and technologies. This thesis presents a vision-based autonomous control system for an AR.Drone 2.0. A tracking algorithm is developed using onboard vision systems without relying on additional external inputs. In particular, the tracking algorithm is the combination of a trained MobileNet-SSD object detector and a KCF tracker. The noise induced by the tracker is decreased with a Kalman filter. Furthermore, PID controllers are implemented for the motion control of the quadcopter, which process the output of the tracking algorithm to move the drone to the desired position. The final implementation was tested indoors and the system yields acceptable results

Diposit Digital de la Universitat de Barcelona

Improving a real-time object detector with compact temporal information

Author: Ahrnbom Martin
Ardö Håkan
Jensen Morten Bornø
Moeslund Thomas B.
Nilsson Mikael
Åström Kalle
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Neural networks designed for real-time object detectionhave recently improved significantly, but in practice, look-ing at only a single RGB image at the time may not be ideal.For example, when detecting objects in videos, a foregrounddetection algorithm can be used to obtain compact temporaldata, which can be fed into a neural network alongside RGBimages. We propose an approach for doing this, based onan existing object detector, that re-uses pretrained weightsfor the processing of RGB images. The neural network wastested on the VIRAT dataset with annotations for object de-tection, a problem this approach is well suited for. The ac-curacy was found to improve significantly (up to 66%), witha roughly 40% increase in computational time

Lund University Publications

VBN

Dual-View Single-Shot Multibox Detector at Urban Intersections: Settings and Performance Evaluation

Author: Alessia Paglialonga
Marta Lenatti
Maurizio Mongelli
Sara Narteni
Vittorio Rampa
Publication venue: 'MDPI AG'
Publication date: 01/01/2023
Field of study

The explosion of artificial intelligence methods has paved the way for more sophisticated smart mobility solutions. In this work, we present a multi-camera video content analysis (VCA) system that exploits a single-shot multibox detector (SSD) network to detect vehicles, riders, and pedestrians and triggers alerts to drivers of public transportation vehicles approaching the surveilled area. The evaluation of the VCA system will address both detection and alert generation performance by combining visual and quantitative approaches. Starting from a SSD model trained for a single camera, we added a second one, under a different field of view (FOV) to improve the accuracy and reliability of the system. Due to real-time constraints, the complexity of the VCA system must be limited, thus calling for a simple multi-view fusion method. According to the experimental test-bed, the use of two cameras achieves a better balance between precision (68%) and recall (84%) with respect to the use of a single camera (i.e., 62% precision and 86% recall). In addition, a system evaluation in temporal terms is provided, showing that missed alerts (false negatives) and wrong alerts (false positives) are typically transitory events. Therefore, adding spatial and temporal redundancyincreases the overall reliability of the VCA system

Archivio istituzionale della ricerca - Politecnico di Milano

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)