Search CORE

36,818 research outputs found

Anti-social behavior detection in audio-visual surveillance systems

Author: Kelly Philip
Kuklyte Jogile
O'Connor Noel E.
Xu Li-Qun
Ó Conaire Ciarán
Publication venue
Publication date: 01/12/2009
Field of study

In this paper we propose a general purpose framework for detection of unusual events. The proposed system is based on the unsupervised method for unusual scene detection in web{cam images that was introduced in [1]. We extend their algorithm to accommodate data from different modalities and introduce the concept of time-space blocks. In addition, we evaluate early and late fusion techniques for our audio-visual data features. The experimental results on 192 hours of data show that data fusion of audio and video outperforms using a single modality

CiteSeerX

Irish Universities

DCU Online Research Access Service

User-interface to a CCTV video search system

Author: Lee Hyowon
Murphy Noel
O'Connor Noel E.
Smeaton Alan F.
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/01/2005
Field of study

The proliferation of CCTV surveillance systems creates a problem of how to effectively navigate and search the resulting video archive, in a variety of security scenarios. We are concerned here with a situation where a searcher must locate all occurrences of a given person or object within a specified timeframe and with constraints on which camera(s) footage is valid to search. Conventional approaches based on browsing time/camera based combinations are inadequate. We advocate using automatically detected video objects as a basis for search, linking and browsing. In this paper we present a system under development based on users interacting with detected video objects. We outline the suite of technologies needed to achieve such a system and for each we describe where we are in terms of realizing those technologies. We also present a system interface to this system, designed with user needs and user tasks in mind

Crossref

Irish Universities

DCU Online Research Access Service

Attentive monitoring of multiple video streams driven by a Bayesian foraging strategy

Author: Boccignone Giuseppe
Napoletano Paolo
Tisato Francesco
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/04/2015
Field of study

In this paper we shall consider the problem of deploying attention to subsets of the video streams for collating the most relevant data and information of interest related to a given task. We formalize this monitoring problem as a foraging problem. We propose a probabilistic framework to model observer's attentive behavior as the behavior of a forager. The forager, moment to moment, focuses its attention on the most informative stream/camera, detects interesting objects or activities, or switches to a more profitable stream. The approach proposed here is suitable to be exploited for multi-stream video summarization. Meanwhile, it can serve as a preliminary step for more sophisticated video surveillance, e.g. activity and behavior analysis. Experimental results achieved on the UCR Videoweb Activities Dataset, a publicly available dataset, are presented to illustrate the utility of the proposed technique.Comment: Accepted to IEEE Transactions on Image Processin

arXiv.org e-Print Archive

AIR Universita degli studi di Milano

Learning Deep Representations of Appearance and Motion for Anomalous Event Detection

Author: Ricci Elisa
Sebe Nicu
Song Jingkuan
Xu Dan
Yan Yan
Publication venue
Publication date: 01/01/2015
Field of study

We present a novel unsupervised deep learning framework for anomalous event detection in complex video scenes. While most existing works merely use hand-crafted appearance and motion features, we propose Appearance and Motion DeepNet (AMDN) which utilizes deep neural networks to automatically learn feature representations. To exploit the complementary information of both appearance and motion patterns, we introduce a novel double fusion framework, combining both the benefits of traditional early fusion and late fusion strategies. Specifically, stacked denoising autoencoders are proposed to separately learn both appearance and motion features as well as a joint representation (early fusion). Based on the learned representations, multiple one-class SVM models are used to predict the anomaly scores of each input, which are then integrated with a late fusion strategy for final anomaly detection. We evaluate the proposed method on two publicly available video surveillance datasets, showing competitive performance with respect to state of the art approaches.Comment: Oral paper in BMVC 201

arXiv.org e-Print Archive

Crossref