Search CORE

47,085 research outputs found

CIAGAN: Conditional Identity Anonymization Generative Adversarial Networks

Author: Elezi Ismail
Leal-Taixé Laura
Maximov Maxim
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/11/2020
Field of study

The unprecedented increase in the usage of computer vision technology in society goes hand in hand with an increased concern in data privacy. In many real-world scenarios like people tracking or action recognition, it is important to be able to process the data while taking careful consideration in protecting people's identity. We propose and develop CIAGAN, a model for image and video anonymization based on conditional generative adversarial networks. Our model is able to remove the identifying characteristics of faces and bodies while producing high-quality images and videos that can be used for any computer vision task, such as detection or tracking. Unlike previous methods, we have full control over the de-identification (anonymization) procedure, ensuring both anonymization as well as diversity. We compare our method to several baselines and achieve state-of-the-art results.Comment: CVPR 202

arXiv.org e-Print Archive

Crossref

Cognitive visual tracking and camera control

Author: Arens
Ayers
Bellotto
Ben Benfold
Bimbo
Binford
Boult
Brémond
Chuan Zhao
Eric Sommerlade
Haag
Hall
Hanno Harland
Hans-Hellmut Nagel
Hartley
Ian Reid
Kojima
Nagel
Nicola Bellotto
Nicola Pirlo
Robertson
Tsomko
Viola
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

Cognitive visual tracking is the process of observing and understanding the behaviour of a moving person. This paper presents an efficient solution to extract, in real-time, high-level information from an observed scene, and generate the most appropriate commands for a set of pan-tilt-zoom (PTZ) cameras in a surveillance scenario. Such a high-level feedback control loop, which is the main novelty of our work, will serve to reduce uncertainties in the observed scene and to maximize the amount of information extracted from it. It is implemented with a distributed camera system using SQL tables as virtual communication channels, and Situation Graph Trees for knowledge representation, inference and high-level camera control. A set of experiments in a surveillance scenario show the effectiveness of our approach and its potential for real applications of cognitive vision

University of Lincoln Institutional Repository

Crossref

Adelaide Research & Scholarship

Learning Human Pose Estimation Features with Convolutional Networks

Author: Andriluka Mykhaylo
Bregler Christoph
Jain Arjun
Taylor Graham W.
Tompson Jonathan
Publication venue
Publication date: 01/01/2014
Field of study

This paper introduces a new architecture for human pose estimation using a multi- layer convolutional network architecture and a modified learning technique that learns low-level features and higher-level weak spatial models. Unconstrained human pose estimation is one of the hardest problems in computer vision, and our new architecture and learning schema shows significant improvement over the current state-of-the-art results. The main contribution of this paper is showing, for the first time, that a specific variation of deep learning is able to outperform all existing traditional architectures on this task. The paper also discusses several lessons learned while researching alternatives, most notably, that it is possible to learn strong low-level feature detectors on features that might even just cover a few pixels in the image. Higher-level spatial models improve somewhat the overall result, but to a much lesser extent then expected. Many researchers previously argued that the kinematic structure and top-down information is crucial for this domain, but with our purely bottom up, and weak spatial model, we could improve other more complicated architectures that currently produce the best results. This mirrors what many other researchers, like those in the speech recognition, object recognition, and other domains have experienced

arXiv.org e-Print Archive

CiteSeerX

MPG.PuRe

Learning to Find Eye Region Landmarks for Remote Gaze Estimation in Unconstrained Settings

Author: Honari Sina
Ioffe Sergey
Krafka Kyle
Shrivastava Ashish
Tompson Jonathan
Xu Pingmei
Zafeiriou Stefanos
Zhang Xucong
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Conventional feature-based and model-based gaze estimation methods have proven to perform well in settings with controlled illumination and specialized cameras. In unconstrained real-world settings, however, such methods are surpassed by recent appearance-based methods due to difficulties in modeling factors such as illumination changes and other visual artifacts. We present a novel learning-based method for eye region landmark localization that enables conventional methods to be competitive to latest appearance-based methods. Despite having been trained exclusively on synthetic data, our method exceeds the state of the art for iris localization and eye shape registration on real-world imagery. We then use the detected landmarks as input to iterative model-fitting and lightweight learning-based gaze estimation methods. Our approach outperforms existing model-fitting and appearance-based methods in the context of person-independent and personalized gaze estimation

arXiv.org e-Print Archive

Crossref

MPG.PuRe

A distributed camera system for multi-resolution surveillance

Author: Bellotto Nicola
Benfold Ben
Bibby Charles
Fenandez Carles
Gonzales Jordi
Reid Ian
Roth Daniel
Sommerlade Eric
Van Gool Luc
Publication venue
Publication date: 01/01/2009
Field of study

We describe an architecture for a multi-camera, multi-resolution surveillance system. The aim is to support a set of distributed static and pan-tilt-zoom (PTZ) cameras and visual tracking algorithms, together with a central supervisor unit. Each camera (and possibly pan-tilt device) has a dedicated process and processor. Asynchronous interprocess communications and archiving of data are achieved in a simple and effective way via a central repository, implemented using an SQL database. Visual tracking data from static views are stored dynamically into tables in the database via client calls to the SQL server. A supervisor process running on the SQL server determines if active zoom cameras should be dispatched to observe a particular target, and this message is effected via writing demands into another database table. We show results from a real implementation of the system comprising one static camera overviewing the environment under consideration and a PTZ camera operating under closed-loop velocity control, which uses a fast and robust level-set-based region tracker. Experiments demonstrate the effectiveness of our approach and its feasibility to multi-camera systems for intelligent surveillance

University of Lincoln Institutional Repository

Crossref

Adelaide Research & Scholarship