Search CORE

4 research outputs found

Foreground-Background Segmentation Based on Codebook and Edge Detector

Author: Ezin Eugène C.
Motamed Cina
Mousse Mikaël A.
Publication venue
Publication date: 23/10/2014
Field of study

Background modeling techniques are used for moving object detection in video. Many algorithms exist in the field of object detection with different purposes. In this paper, we propose an improvement of moving object detection based on codebook segmentation. We associate the original codebook algorithm with an edge detection algorithm. Our goal is to prove the efficiency of using an edge detection algorithm with a background modeling algorithm. Throughout our study, we compared the quality of the moving object detection when codebook segmentation algorithm is associated with some standard edge detectors. In each case, we use frame-based metrics for the evaluation of the detection. The different results are presented and analyzed.Comment: to appear in the 10th International Conference on Signal Image Technology & Internet Based Systems, 201

arXiv.org e-Print Archive

Crossref

CVABS: Moving Object Segmentation with Common Vector Approach for Videos

Author: Gerek Ömer Nezih
Işık Şahin
Özkan Kemal
Publication venue
Publication date: 19/10/2018
Field of study

Background modelling is a fundamental step for several real-time computer vision applications that requires security systems and monitoring. An accurate background model helps detecting activity of moving objects in the video. In this work, we have developed a new subspace based background modelling algorithm using the concept of Common Vector Approach with Gram-Schmidt orthogonalization. Once the background model that involves the common characteristic of different views corresponding to the same scene is acquired, a smart foreground detection and background updating procedure is applied based on dynamic control parameters. A variety of experiments is conducted on different problem types related to dynamic backgrounds. Several types of metrics are utilized as objective measures and the obtained visual results are judged subjectively. It was observed that the proposed method stands successfully for all problem types reported on CDNet2014 dataset by updating the background frames with a self-learning feedback mechanism.Comment: 12 Pages, 4 Figures, 1 Tabl

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Security Threats from UAS and Its Mitigation Through Detection and Tracking

Author: Ganti Sai Ram
Publication venue: Digital Scholarship@UNLV
Publication date: 01/05/2016
Field of study

Unmanned Aerial Systems (UAS) are being used commonly for surveillance, providing valuable video data and reducing risk for humans wherever applicable. The cost of small UAS can range from as low as

30 to high as

5000, which makes it affordable by everyone. Most of the UAS are equipped with a camera which results in activities like disruption of privacy or capturing sensitive data. This research is aimed at developing a system which can detect and identify a drone and apply some counter measures to stop its functions or make it go away. The air traffic will increase significantly in next 20 years according to FAA (Federal Aviation Administration) and is doubling every year which increases the happening of all risks due to them. This system will detect and identify a drone through various methods combined into a single algorithm which includes image processing and audio detection. This research also proposes a new method where two algorithms run in sequence namely motion detection and SURF (Speed up Robust features). This system as a whole will warn a person when a drone is nearby and apply all the above mentioned techniques to keep the privacy intact and will also help keep the malicious or harmful drones away from the restricted/residential areas in the coming years

University of Nevada, Las Vegas Repository

VIDEO FOREGROUND LOCALIZATION FROM TRADITIONAL METHODS TO DEEP LEARNING

Author: Akilan Thangarajah
Publication venue: 'University of Windsor Leddy Library'
Publication date: 30/04/2018
Field of study

These days, detection of Visual Attention Regions (VAR), such as moving objects has become an integral part of many Computer Vision applications, viz. pattern recognition, object detection and classification, video surveillance, autonomous driving, human-machine interaction (HMI), and so forth. The moving object identification using bounding boxes has matured to the level of localizing the objects along their rigid borders and the process is called foreground localization (FGL). Over the decades, many image segmentation methodologies have been well studied, devised, and extended to suit the video FGL. Despite that, still, the problem of video foreground (FG) segmentation remains an intriguing task yet appealing due to its ill-posed nature and myriad of applications. Maintaining spatial and temporal coherence, particularly at object boundaries, persists challenging, and computationally burdensome. It even gets harder when the background possesses dynamic nature, like swaying tree branches or shimmering water body, and illumination variations, shadows cast by the moving objects, or when the video sequences have jittery frames caused by vibrating or unstable camera mounts on a surveillance post or moving robot. At the same time, in the analysis of traffic flow or human activity, the performance of an intelligent system substantially depends on its robustness of localizing the VAR, i.e., the FG. To this end, the natural question arises as what is the best way to deal with these challenges? Thus, the goal of this thesis is to investigate plausible real-time performant implementations from traditional approaches to modern-day deep learning (DL) models for FGL that can be applicable to many video content-aware applications (VCAA). It focuses mainly on improving existing methodologies through harnessing multimodal spatial and temporal cues for a delineated FGL. The first part of the dissertation is dedicated for enhancing conventional sample-based and Gaussian mixture model (GMM)-based video FGL using probability mass function (PMF), temporal median filtering, and fusing CIEDE2000 color similarity, color distortion, and illumination measures, and picking an appropriate adaptive threshold to extract the FG pixels. The subjective and objective evaluations are done to show the improvements over a number of similar conventional methods. The second part of the thesis focuses on exploiting and improving deep convolutional neural networks (DCNN) for the problem as mentioned earlier. Consequently, three models akin to encoder-decoder (EnDec) network are implemented with various innovative strategies to improve the quality of the FG segmentation. The strategies are not limited to double encoding - slow decoding feature learning, multi-view receptive field feature fusion, and incorporating spatiotemporal cues through long-shortterm memory (LSTM) units both in the subsampling and upsampling subnetworks. Experimental studies are carried out thoroughly on all conditions from baselines to challenging video sequences to prove the effectiveness of the proposed DCNNs. The analysis demonstrates that the architectural efficiency over other methods while quantitative and qualitative experiments show the competitive performance of the proposed models compared to the state-of-the-art

Scholarship at UWindsor