10 research outputs found
Agent-based framework for person re-identification
In computer based human object re-identification, a detected human is recognised to a
level sufficient to re-identify a tracked person in either a different camera capturing the
same individual, often at a different angle, or the same camera at a different time and/or
the person approaching the camera at a different angle. Instead of relying on face
recognition technology such systems study the clothing of the individuals being monitored
and/or objects being carried to establish correspondence and hence re-identify the human
object.
Unfortunately present human-object re-identification systems consider the entire human
object as one connected region in making the decisions about similarity of two objects
being matched. This assumption has a major drawback in that when a person is partially
occluded, a part of the occluding foreground will be picked up and used in matching. Our
research revealed that when a human observer carries out a manual human-object re-identification
task, the attention is often taken over by some parts of the human
figure/body, more than the others, e.g. face, brightly colour shirt, presence of texture
patterns in clothing etc., and occluding parts are ignored.
In this thesis, a novel multi-agent based framework is proposed for the design of a human
object re-identification system. Initially a HOG based feature extraction is used in a SVM
based classification of a human object as a human of a full-body or of half body nature.
Subsequently the relative visual significance of the top and the bottom parts of the human,
in re-identification is quantified by the analysis of Gray Level Co-occurrence based
texture features and colour histograms obtained in the HSV colour space. Accordingly
different weights are assigned to the top and bottom of the human body using a novel
probabilistic approach. The weights are then used to modify the Hybrid Spatiogram and
Covariance Descriptor (HSCD) feature based re-identification algorithm adopted.
A significant novelty of the human object re-identification systems proposed in this thesis
is the agent based design procedure adopted that separates the use of computer vision
algorithms for feature extraction, comparison etc., from the decision making process of re-identification. Multiple agents are assigned to execute different algorithmic tasks and
the agents communicate to make the required logical decisions.
Detailed experimental results are provided to prove that the proposed multi agent based
framework for human object re-identification performs significantly better than the state of-the-art algorithms. Further it is shown that the design flexibilities and scalabilities of
the proposed system allows it to be effectively utilised in more complex computer vision
based video analytic/forensic tasks often conducted within distributed, multi-camera
systems
Video foreground extraction for mobile camera platforms
Foreground object detection is a fundamental task in computer vision with many applications in areas such as object tracking, event identification, and behavior analysis. Most conventional foreground object detection methods work only in a stable illumination environments using fixed cameras. In real-world applications, however, it is often the case that the algorithm needs to operate under the following challenging conditions: drastic lighting changes, object shape complexity, moving cameras, low frame capture rates, and low resolution images. This thesis presents four novel approaches for foreground object detection on real-world datasets using cameras deployed on moving vehicles.The first problem addresses passenger detection and tracking tasks for public transport buses investigating the problem of changing illumination conditions and low frame capture rates. Our approach integrates a stable SIFT (Scale Invariant Feature Transform) background seat modelling method with a human shape model into a weighted Bayesian framework to detect passengers. To deal with the problem of tracking multiple targets, we employ the Reversible Jump Monte Carlo Markov Chain tracking algorithm. Using the SVM classifier, the appearance transformation models capture changes in the appearance of the foreground objects across two consecutives frames under low frame rate conditions. In the second problem, we present a system for pedestrian detection involving scenes captured by a mobile bus surveillance system. It integrates scene localization, foreground-background separation, and pedestrian detection modules into a unified detection framework. The scene localization module performs a two stage clustering of the video data.In the first stage, SIFT Homography is applied to cluster frames in terms of their structural similarity, and the second stage further clusters these aligned frames according to consistency in illumination. This produces clusters of images that are differential in viewpoint and lighting. A kernel density estimation (KDE) technique for colour and gradient is then used to construct background models for each image cluster, which is further used to detect candidate foreground pixels. Finally, using a hierarchical template matching approach, pedestrians can be detected.In addition to the second problem, we present three direct pedestrian detection methods that extend the HOG (Histogram of Oriented Gradient) techniques (Dalal and Triggs, 2005) and provide a comparative evaluation of these approaches. The three approaches include: a) a new histogram feature, that is formed by the weighted sum of both the gradient magnitude and the filter responses from a set of elongated Gaussian filters (Leung and Malik, 2001) corresponding to the quantised orientation, which we refer to as the Histogram of Oriented Gradient Banks (HOGB) approach; b) the codebook based HOG feature with branch-and-bound (efficient subwindow search) algorithm (Lampert et al., 2008) and; c) the codebook based HOGB approach.In the third problem, a unified framework that combines 3D and 2D background modelling is proposed to detect scene changes using a camera mounted on a moving vehicle. The 3D scene is first reconstructed from a set of videos taken at different times. The 3D background modelling identifies inconsistent scene structures as foreground objects. For the 2D approach, foreground objects are detected using the spatio-temporal MRF algorithm. Finally, the 3D and 2D results are combined using morphological operations.The significance of these research is that it provides basic frameworks for automatic large-scale mobile surveillance applications and facilitates many higher-level applications such as object tracking and behaviour analysis
Computer vision methods applied to person tracking and identification
2013 - 2014Computer vision methods for tracking and identification of people in constrained
and unconstrained environments have been widely explored in the last decades. De-
spite of the active research on these topics, they are still open problems for which
standards and/or common guidelines have not been defined yet. Application fields
of computer vision-based tracking systems are almost infinite. Nowadays, the Aug-
mented Reality is a very active field of the research that can benefit from vision-based
userâs tracking to work. Being defined as the fusion of real with virtual worlds, the
success of an augmented reality application is completely dependant on the efficiency
of the exploited tracking method. This work of thesis covers the issues related to
tracking systems in augmented reality applications proposing a comprehensive and
adaptable framework for marker-based tracking and a deep formal analysis. The
provided analysis makes possible to objectively assess and quantify the advantages
of using augmented reality principles in heterogeneous operative contexts. Two case
studies have been considered, that are the support to maintenance in an industrial
environment and to electrocardiography in a typical telemedicine scenario. Advan-
tages and drawback are provided as well as future directions of the proposed study.
The second topic covered in this thesis relates to the vision-based tracking solution
for unconstrained outdoor environments. In video surveillance domain, a tracker
is asked to handle variations in illumination, cope with appearance changes of the
tracked objects and, possibly, predict motion to better anticipate future positions. ... [edited by Author]XIII n.s
Recommended from our members
Video big data: an agile architecture for systematic exploration and analytics
Video is currently at the forefront of most business and natural environments. In surveillance, it is the most important technology as surveillance systems reveal information and patterns for solving many security problems including crime prevention. This research investigates technologies that currently drive video surveillance systems with a view to optimization and automated decision support.
The investigation reveals some features and properties that can be optimised to improve performance and derive further benefits from surveillance systems. These aspects include system-wide architecture, meta-data generation, meta-data persistence, object identification, object tagging, object tracking, search and querying sub-systems. The current less-than-optimum performance is attributable to many factors, which include massive volume, variety, and velocity (the speed at which streaming video transmit to storage) of video data in surveillance systems.
Research contributions are 2-fold. First, we propose a system-wide architecture for designing and implementing surveillance systems, based on the authorsâ system architecture for generating meta-data. Secondly, we design a simulation model of a multi-view surveillance system from which the researchers generate simulated video streams in large volumes. From each video sequence in the model, the authors extract meta-data and apply a novel algorithm for predicting the location of identifiable objects across a well-connected camera cluster.
This research provide evidence that independent surveillance systems (for example, security cameras) can be unified across a geographical location such as a smart city, where each network is administratively owned and managed independently. Our investigation involved 2 experiments - first, the implementation of a web-based solution where we developed a directory service for managing, cataloguing, and persisting metadata generated by the surveillance networks. The second experiment focused on the set up, configuration and the architecture of the surveillance system. These experiments involved the investigation and demonstration of 3 loosely coupled service-oriented APIs â these services provided the capability to generate the query-able metadata.
The results of our investigations provided answers to our research questions - the main question being âto what degree of accuracy can we predict the location of an object in a connected surveillance networkâ. Our experiment also provided evidence in support of our hypothesis â âit is feasible to âexploreâ unified surveillance data generated from independent surveillance networksâ
A framework for automated landmark recognition in community contributed image corpora
Any large library of information requires efficient ways to organise it and methods that allow people to access information efficiently and collections of digital images are no exception. Automatically creating high-level semantic tags based on image content is difficult, if not impossible to achieve accurately. In this thesis a framework is presented that allows for the automatic creation of rich and accurate tags for images with landmarks as the main object. This framework uses state of the art computer vision techniques fused with the wide range of contextual information that is available with community contributed imagery.
Images are organised into clusters based on image content and spatial data associated with each image. Based on these clusters different types of classifiers are* trained to recognise landmarks contained within the images in each cluster. A novel hybrid approach is proposed combining these classifiers with an hierarchical matching approach to allow near real-time classification and captioning of images containing landmarks
Object Tracking
Object tracking consists in estimation of trajectory of moving objects in the sequence of images. Automation of the computer object tracking is a difficult task. Dynamics of multiple parameters changes representing features and motion of the objects, and temporary partial or full occlusion of the tracked objects have to be considered. This monograph presents the development of object tracking algorithms, methods and systems. Both, state of the art of object tracking methods and also the new trends in research are described in this book. Fourteen chapters are split into two sections. Section 1 presents new theoretical ideas whereas Section 2 presents real-life applications. Despite the variety of topics contained in this monograph it constitutes a consisted knowledge in the field of computer object tracking. The intention of editor was to follow up the very quick progress in the developing of methods as well as extension of the application
Principal Component Analysis
This book is aimed at raising awareness of researchers, scientists and engineers on the benefits of Principal Component Analysis (PCA) in data analysis. In this book, the reader will find the applications of PCA in fields such as image processing, biometric, face recognition and speech processing. It also includes the core concepts and the state-of-the-art methods in data analysis and feature extraction