
    Multiple path prediction for traffic scenes using LSTMs and mixture density models

    This work presents an analysis of predicting multiple future paths for moving objects in traffic scenes by leveraging Long Short-Term Memory architectures (LSTMs) and Mixture Density Networks (MDNs) in a single-shot manner. Path prediction estimates the future positions of objects, which is useful in applications such as security monitoring systems, Autonomous Driver Assistance Systems and assistive technologies. Typical approaches use the observed positions (tracklets) of objects in video frames to predict their future paths as a sequence of position values, which can be treated as a time series. LSTMs have achieved good performance on time series, but they have the limitation of predicting only a single path per tracklet. Path prediction is not a deterministic task and requires predicting with a level of uncertainty; predicting multiple paths rather than a single one is therefore a more realistic way of approaching the task. In this work, predicting a set of future paths with associated uncertainty was achieved by combining LSTMs and MDNs. The evaluation was carried out on the KITTI and CityFlow datasets across three types of objects, four prediction horizons and two different points of view (image coordinates and birds-eye view)
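
    To make the LSTM-plus-MDN combination concrete, here is a minimal PyTorch sketch (not the authors' code) of a single-shot model that encodes a tracklet and emits mixture weights, means and standard deviations for every future step; all layer sizes, the mixture count and the diagonal Gaussian form are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LSTMMDN(nn.Module):
    def __init__(self, obs_dim=2, pred_len=12, n_mix=5, hidden=64):
        super().__init__()
        self.pred_len, self.k = pred_len, n_mix
        self.encoder = nn.LSTM(input_size=obs_dim, hidden_size=hidden, batch_first=True)
        # Single shot: all mixture parameters for every future step come from one pass.
        self.pi = nn.Linear(hidden, pred_len * n_mix)         # mixture weights
        self.mu = nn.Linear(hidden, pred_len * n_mix * 2)     # 2-D position means
        self.sigma = nn.Linear(hidden, pred_len * n_mix * 2)  # 2-D std deviations

    def forward(self, tracklet):                  # tracklet: (B, obs_len, 2)
        _, (h, _) = self.encoder(tracklet)
        h = h[-1]                                 # final hidden state, (B, hidden)
        pi = F.softmax(self.pi(h).view(-1, self.pred_len, self.k), dim=-1)
        mu = self.mu(h).view(-1, self.pred_len, self.k, 2)
        sigma = torch.exp(self.sigma(h)).view(-1, self.pred_len, self.k, 2)
        return pi, mu, sigma

def mdn_nll(pi, mu, sigma, future):               # future: (B, pred_len, 2)
    # Negative log-likelihood of the observed future path under the mixture.
    comp = torch.distributions.Normal(mu, sigma)
    log_p = comp.log_prob(future.unsqueeze(2)).sum(-1)        # (B, T, K)
    return -torch.logsumexp(torch.log(pi) + log_p, dim=-1).mean()
```

    At test time, sampling from (or taking the modes of) the K mixture components yields multiple candidate paths per tracklet rather than a single deterministic one.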

    People, Penguins and Petri Dishes: Adapting Object Counting Models To New Visual Domains And Object Types Without Forgetting

    In this paper we propose a technique to adapt a convolutional neural network (CNN) based object counter to additional visual domains and object types while still preserving the original counting function. Domain-specific normalisation and scaling operators are trained to allow the model to adjust to the statistical distributions of the various visual domains. The developed adaptation technique is used to produce a single patch-based counting regressor capable of counting various object types, including people, vehicles, cell nuclei and wildlife. As part of this study, a challenging new cell counting dataset in the context of tissue culture and patient diagnosis is constructed. This new collection, referred to as the Dublin Cell Counting (DCC) dataset, is the first of its kind to be made available to the wider computer vision community. State-of-the-art object counting performance is achieved on both the ShanghaiTech (parts A and B) and Penguins datasets, while competitive performance is observed on the TRANCOS and Modified Bone Marrow (MBM) datasets, all using a shared counting model
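
    The core adaptation idea, shared convolution weights with per-domain normalisation and scaling, can be sketched roughly as follows in PyTorch; the block structure and parameter shapes are assumptions, not the paper's actual layers.

```python
import torch
import torch.nn as nn

class DomainAdaptiveBlock(nn.Module):
    """One conv block whose weights are shared, with normalisation and
    scaling parameters selected per visual domain."""
    def __init__(self, in_ch, out_ch, n_domains):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, 3, padding=1)  # shared across domains
        self.norms = nn.ModuleList(nn.BatchNorm2d(out_ch) for _ in range(n_domains))
        self.scales = nn.Parameter(torch.ones(n_domains, out_ch, 1, 1))

    def forward(self, x, domain):
        x = self.conv(x)
        return torch.relu(self.norms[domain](x) * self.scales[domain])
```

    Adapting to a new domain then means training only that domain's normalisation and scale parameters, so counting behaviour learned for earlier domains is left untouched.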

    ResnetCrowd: a residual deep learning architecture for crowd counting, violent behaviour detection and crowd density level classification

    In this paper we propose ResnetCrowd, a deep residual architecture for simultaneous crowd counting, violent behaviour detection and crowd density level classification. To train and evaluate the proposed multi-objective technique, a new 100-image dataset referred to as Multi Task Crowd is constructed. This new dataset is the first computer vision dataset fully annotated for crowd counting, violent behaviour detection and density level classification. Our experiments show that a multi-task approach boosts individual task performance for all tasks, most notably violent behaviour detection, which receives a 9% boost in ROC AUC (area under the curve). The trained ResnetCrowd model is also evaluated on several additional benchmarks, highlighting the superior generalisation of crowd analysis models trained for multiple objectives
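
    A rough sketch of the multi-task arrangement, a shared residual trunk feeding three task heads, is shown below; torchvision's ResNet-18 stands in for the actual ResnetCrowd trunk, and the head sizes and number of density levels are assumptions.

```python
import torch.nn as nn
from torchvision.models import resnet18

class MultiTaskCrowd(nn.Module):
    def __init__(self, n_density_levels=5):
        super().__init__()
        trunk = resnet18(weights=None)
        # Keep everything up to the global average pool; drop the classifier.
        self.backbone = nn.Sequential(*list(trunk.children())[:-1])
        self.count = nn.Linear(512, 1)                   # crowd count regression
        self.violent = nn.Linear(512, 1)                 # violent behaviour logit
        self.density = nn.Linear(512, n_density_levels)  # density level classes

    def forward(self, x):
        f = self.backbone(x).flatten(1)                  # shared features, (B, 512)
        return self.count(f), self.violent(f), self.density(f)
```

    Because all three heads share one trunk, gradients from each task regularise the shared features, which is the mechanism behind the reported per-task boosts.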

    An evaluation of local action descriptors for human action classification in the presence of occlusion

    This paper examines the impact that the choice of local descriptor has on human action classifier performance in the presence of static occlusion. This question is important when applying human action classification to surveillance video that is noisy, crowded, complex and incomplete. In real-world scenarios, a human can naturally be occluded by an object while carrying out different actions, yet it is unclear how the performance of action descriptors is affected by the associated loss of information. In this paper, we evaluate and compare the classification performance of state-of-the-art local action descriptors in the presence of varying degrees of static occlusion. We consider four local action descriptors: Trajectory (TRAJ), Histogram of Oriented Gradients (HOG), Histogram of Optical Flow (HOF) and Motion Boundary Histogram (MBH). These descriptors are combined with a standard bag-of-features representation and a Support Vector Machine classifier for action recognition. We investigate the performance of these descriptors and their possible combinations with respect to varying amounts of artificial occlusion in the KTH action dataset. This preliminary investigation shows that the combination of MBH and TRAJ performs best in the case of partial occlusion and also achieves the best results in the presence of heavy occlusion
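
    The evaluation protocol implied above can be sketched as follows; hist_a and hist_b stand for pre-computed bag-of-features histograms from two descriptors (e.g. TRAJ and MBH), and the RBF kernel choice is an assumption.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score

def evaluate_descriptor_combo(hist_a, hist_b, labels, train_idx, test_idx):
    # Concatenating the per-video histograms fuses the two descriptors.
    X = np.hstack([hist_a, hist_b])
    clf = SVC(kernel="rbf").fit(X[train_idx], labels[train_idx])
    return accuracy_score(labels[test_idx], clf.predict(X[test_idx]))
```

    Running this once per descriptor combination and per artificial-occlusion level yields the accuracy-versus-occlusion comparison the study rests on.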

    A novel shape descriptor based on salient keypoints detection for binary image matching and retrieval

    We introduce a shape descriptor that extracts keypoints from binary images and automatically detects the salient ones among them. The proposed descriptor operates as follows: first, the contours of the image are detected and an image transformation is used to generate background information. Next, pixels of the transformed image that have specific characteristics in their local areas are used to extract keypoints. The most salient keypoints are then automatically detected by filtering out redundant and sensitive ones. Finally, a feature vector is calculated for each keypoint from the distribution of contour points in its local area. The proposed descriptor is evaluated on public datasets of silhouette images, handwritten math expressions, hand-drawn diagram sketches and noisy scanned logos. Experimental results show that the proposed descriptor compares strongly against state-of-the-art methods, and that it is reliable when applied to challenging images such as fluctuating handwriting and noisy scans. Furthermore, we integrate our descriptor
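
    A loose OpenCV sketch of the pipeline stages is given below; the distance transform stands in for the unspecified "image transformation", and the local-maximum saliency filter is a placeholder assumption rather than the paper's actual criterion.

```python
import cv2
import numpy as np

def salient_keypoints(binary_img, saliency_thresh=0.6):
    # Stage 1: contour points of the binary shape.
    contours, _ = cv2.findContours(binary_img, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_NONE)
    contour_pts = np.vstack([c.reshape(-1, 2) for c in contours])
    # Stage 2: a transform that encodes background/context information.
    dist = cv2.distanceTransform(binary_img, cv2.DIST_L2, 5)
    dist = dist / (dist.max() + 1e-8)
    # Stages 3-4: candidate keypoints at local maxima, weak ones filtered out.
    dilated = cv2.dilate(dist, np.ones((5, 5), np.uint8))
    keypoints = np.argwhere((dist == dilated) & (dist > saliency_thresh))
    # A per-keypoint feature vector would histogram contour_pts in its local area.
    return keypoints, contour_pts
```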

    Performance of video processing at the edge for crowd-monitoring applications

    Video analytics has a key role to play in smart cities and connected-community applications such as crowd counting, activity detection, event classification and traffic counting. A cloud-centric approach, where data is funnelled to a central processor, presents a number of key problems, including available bandwidth, real-time responsiveness and personal data privacy. With the development of edge computing, a new paradigm for smart data management is emerging: raw video feeds can be pre-processed at the point of capture, while integration and deeper analytics are performed in the cloud. In this paper we explore the capacity of video processing at the edge and show that basic image processing can be achieved in near real-time on low-powered gateway devices. We also investigate the capabilities of deep learning models for crowd counting in this context, showing that their performance is highly dependent on input size and that re-scaling video frames can optimise processing and performance. Increased edge processing resolves a number of issues in video analytics for crowd-monitoring applications
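
    As a flavour of the kind of basic image processing that fits on a gateway device, here is a minimal Python/OpenCV sketch: each frame is down-scaled before a cheap foreground measure is computed, reflecting the observation that performance depends heavily on input size. The target resolution and the MOG2 background subtractor are illustrative choices, not the paper's setup.

```python
import cv2

subtractor = cv2.createBackgroundSubtractorMOG2()

def process_frame(frame, target_width=320):
    # Down-scale first: smaller inputs keep processing near real-time
    # on low-powered edge hardware.
    scale = target_width / frame.shape[1]
    small = cv2.resize(frame, None, fx=scale, fy=scale)
    mask = subtractor.apply(small)
    return cv2.countNonZero(mask) / mask.size   # crude crowd-activity measure
```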

    Action recognition based on sparse motion trajectories

    We present a method that extracts effective features from videos for human action recognition. The proposed method analyses the 3D volumes along the sparse motion trajectories of a set of interest points in the video scene. To represent human actions, we generate a Bag-of-Features (BoF) model from the extracted features, and finally a Support Vector Machine is used to classify human activities. Evaluation shows that the proposed features are discriminative and computationally efficient. Our method achieves state-of-the-art performance on the standard human action recognition benchmarks, namely the KTH and Weizmann datasets
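
    The trajectory-extraction step might look roughly like the following OpenCV sketch (interest points tracked with pyramidal Lucas-Kanade); the 3D-volume feature computation, BoF quantisation and SVM stages are omitted, and all parameter values are assumptions.

```python
import cv2

def sparse_trajectories(frames, max_corners=200):
    prev = cv2.cvtColor(frames[0], cv2.COLOR_BGR2GRAY)
    pts = cv2.goodFeaturesToTrack(prev, maxCorners=max_corners,
                                  qualityLevel=0.01, minDistance=7)
    trajs = [[p.ravel()] for p in pts]
    for frame in frames[1:]:
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        nxt, status, _ = cv2.calcOpticalFlowPyrLK(prev, gray, pts, None)
        for traj, p, ok in zip(trajs, nxt, status.ravel()):
            if ok:                        # point tracked successfully
                traj.append(p.ravel())
        prev, pts = gray, nxt
    return trajs                          # one point sequence per interest point
```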

    Action recognition in video using a spatial-temporal graph-based feature representation

    We propose a video-graph-based human action recognition framework. Given an input video sequence, we extract spatio-temporal local features and construct a video graph that incorporates appearance and motion constraints to reflect the spatio-temporal dependencies among the features. In particular, we extend the popular DBSCAN density-based clustering algorithm to form an intuitive video graph. During training, we estimate a linear SVM classifier using the standard Bag-of-Words method. During classification, we apply Graph-Cut optimisation to find the most frequent action label in the constructed graph and assign this label to the test video sequence. The proposed approach achieves state-of-the-art performance on standard human action recognition benchmarks, namely the KTH and UCF Sports datasets, and competitive results on the Hollywood (HOHA) dataset
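
    The node-construction step, grouping spatio-temporal features with a density-based clustering pass, can be sketched as follows; standard DBSCAN from scikit-learn stands in for the paper's extended variant, and the eps/min_samples values are illustrative.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def video_graph_nodes(feature_locations):
    """feature_locations: (N, 3) array of (x, y, t) positions of local features."""
    labels = DBSCAN(eps=15.0, min_samples=5).fit_predict(feature_locations)
    # Each dense cluster becomes one node of the video graph; -1 marks noise.
    return [feature_locations[labels == k] for k in set(labels) if k != -1]
```

    Edges between these nodes would then carry the appearance and motion constraints, over which the Graph-Cut labelling operates.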

    Holistic features for real-time crowd behaviour anomaly detection

    This paper presents a new approach to crowd behaviour anomaly detection that uses a set of efficiently computed, easily interpretable, scene-level holistic features. This low-dimensional descriptor combines two features from the literature, crowd collectiveness [1] and crowd conflict [2], with two newly developed crowd features: mean motion speed and a new formulation of crowd density. Two different anomaly detection approaches are investigated using these features. When only normal training data is available, we use a Gaussian Mixture Model (GMM) for outlier detection; when both normal and abnormal training data are available, we use a Support Vector Machine (SVM) for binary classification. We evaluate on two crowd behaviour anomaly detection datasets, achieving state-of-the-art classification performance on the violent-flows dataset [3] as well as better-than-real-time processing (40 frames per second)
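
    The two detection regimes map naturally onto scikit-learn; below is a hedged sketch over the 4-D holistic descriptor (collectiveness, conflict, mean speed, density), with the GMM component count and the 5th-percentile likelihood threshold as assumptions.

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.svm import SVC

def gmm_outlier_detector(normal_feats, n_components=3):
    # Regime 1: only normal data available, so model it and flag
    # low-likelihood frames as anomalous.
    gmm = GaussianMixture(n_components=n_components).fit(normal_feats)
    thresh = np.percentile(gmm.score_samples(normal_feats), 5)
    return lambda feats: gmm.score_samples(feats) < thresh  # True => anomaly

def svm_classifier(feats, labels):
    # Regime 2: labelled normal/abnormal data, so train a binary SVM.
    return SVC(kernel="rbf").fit(feats, labels)
```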

    SAVASA project @ TRECVID 2012: interactive surveillance event detection

    In this paper we describe our participation in the interactive surveillance event detection task at TRECVid 2012. The system we developed comprised individual classifiers brought together behind a simple video search interface that enabled users to select relevant segments based on down-sampled animated GIFs. Two types of user, 'experts' and 'end users', performed the evaluations. Due to time constraints we focussed on three events (ObjectPut, PersonRuns and Pointing) and two of the five available cameras (1 and 3). Results from the interactive runs, together with a discussion of the performance of the underlying retrospective classifiers, are presented
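
    Generating the down-sampled animated-GIF previews the interface relied on could be done along these lines; the use of OpenCV plus imageio, and all sizes and rates, are assumptions rather than the SAVASA implementation.

```python
import cv2
import imageio

def segment_to_gif(video_path, start_frame, n_frames, out_path,
                   frame_step=5, width=160):
    cap = cv2.VideoCapture(video_path)
    cap.set(cv2.CAP_PROP_POS_FRAMES, start_frame)
    frames = []
    for i in range(n_frames):
        ok, frame = cap.read()
        if not ok:
            break
        if i % frame_step == 0:                    # temporal down-sampling
            h = int(frame.shape[0] * width / frame.shape[1])
            small = cv2.resize(frame, (width, h))  # spatial down-sampling
            frames.append(cv2.cvtColor(small, cv2.COLOR_BGR2RGB))
    cap.release()
    imageio.mimsave(out_path, frames, duration=0.2)
```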