
2,291 research outputs found

Ensemble of Different Approaches for a Reliable Person Re-identification System

Authors: Braham Sheryl; Ghidoni Stefano; Menegatti Emanuele; Munaro Matteo; Nanni Loris
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015

An ensemble of approaches for reliable person re-identification is proposed in this paper. The ensemble is built by combining widely used person re-identification systems using different color spaces and some variants of state-of-the-art approaches that are proposed in this paper. Different descriptors are tested, and both texture and color features are extracted from the images; the descriptors are then compared using different distance measures (e.g., the Euclidean distance, angle, and the Jeffrey distance). To improve performance, a method based on skeleton detection, extracted from the depth map, is also applied when the depth map is available. The proposed ensemble is validated on three widely used datasets (CAVIAR4REID, IAS, and VIPeR), keeping the parameter set of each approach constant across all tests to avoid overfitting and to demonstrate that the proposed system can be considered a general-purpose person re-identification system. Our experimental results show that the proposed system offers significant improvements over baseline approaches. The source code used for the approaches tested in this paper will be available at https://www.dei.unipd.it/node/2357 and http://robotics.dei.unipd.it/reid/
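The three distance measures named in the abstract above can be sketched as follows. This is only an illustration, not the authors' code: the function names are hypothetical, descriptors are assumed to be plain lists of non-negative numbers (e.g., normalized histograms), and the Jeffrey distance is assumed to be the symmetrized form commonly used for histogram comparison, which may differ from the exact definition used in the paper.

```python
import math

def euclidean_distance(h, k):
    """Euclidean (L2) distance between two descriptors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(h, k)))

def angle_distance(h, k):
    """Angle (in radians) between two non-zero descriptors."""
    dot = sum(a * b for a, b in zip(h, k))
    norm = math.sqrt(sum(a * a for a in h)) * math.sqrt(sum(b * b for b in k))
    # Clamp to [-1, 1] to guard against rounding error before acos.
    return math.acos(max(-1.0, min(1.0, dot / norm)))

def jeffrey_distance(h, k):
    """Assumed form: sum of h*log(h/m) + k*log(k/m), with m = (h + k) / 2."""
    total = 0.0
    for a, b in zip(h, k):
        m = (a + b) / 2.0
        if a > 0:
            total += a * math.log(a / m)
        if b > 0:
            total += b * math.log(b / m)
    return total
```

In an ensemble of this kind, each descriptor and distance pair would yield one score per gallery image, and the scores would then be fused; the fusion rule is not specified in the abstract and is not sketched here.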

Elsevier - Publisher Connector

Crossref

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università di Padova

Missouri State University: BearWorks

A statistical reduced-reference method for color image quality assessment

Authors: Abdelouahad Abdelkaher Ait; Cherifi Hocine; Hassouni Mohammed El; Omari Mounir
Publication date: 15/11/2014

Although color is a fundamental feature of human visual perception, it has been largely unexplored in reduced-reference (RR) image quality assessment (IQA) schemes. In this paper, we propose a natural scene statistic (NSS) method, which efficiently uses this information. It is based on the statistical deviation between the steerable pyramid coefficients of the reference color image and those of the degraded one. We propose and analyze the multivariate generalized Gaussian distribution (MGGD) to model the underlying statistics. In order to quantify the degradation, we develop and evaluate two measures based respectively on the geodesic distance between two MGGDs and on the closed form of the Kullback-Leibler divergence (KLD). We performed an extensive evaluation of both metrics in various color spaces (RGB, HSV, CIELAB and YCrCb) using the TID 2008 benchmark and the FRTV Phase I validation process. Experimental results demonstrate that the proposed framework achieves good consistency with human visual perception. Furthermore, the best configuration is obtained with the CIELAB color space combined with the KLD deviation measure.

arXiv.org e-Print Archive

HAL-uB

Crossref

Manifold-valued Image Generation with Wasserstein Generative Adversarial Nets

Authors: Huang Zhiwu; Van Gool Luc; Wu Jiqing
Publication date: 03/01/2019

Generative modeling over natural images is one of the most fundamental machine learning problems. However, few modern generative models, including Wasserstein Generative Adversarial Nets (WGANs), are studied on manifold-valued images that are frequently encountered in real-world applications. To fill the gap, this paper first formulates the problem of generating manifold-valued images and exploits three typical instances: hue-saturation-value (HSV) color image generation, chromaticity-brightness (CB) color image generation, and diffusion-tensor (DT) image generation. For the proposed generative modeling problem, we then introduce a theorem of optimal transport to derive a new Wasserstein distance of data distributions on complete manifolds, enabling us to achieve a tractable objective under the WGAN framework. In addition, we recommend three benchmark datasets: CIFAR-10 HSV/CB color images, ImageNet HSV/CB color images, and UCL DT image datasets. On the three datasets, we experimentally demonstrate that the proposed manifold-aware WGAN model can generate more plausible manifold-valued images than its competitors. Comment: Accepted by AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A Second Order TV-type Approach for Inpainting and Denoising Higher Dimensional Combined Cyclic and Vector Space Data

Authors: Bergmann Ronny; Weinmann Andreas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/09/2015

In this paper we consider denoising and inpainting problems for higher dimensional combined cyclic and linear space valued data. This kind of data appears when dealing with nonlinear color spaces such as HSV, and it can be obtained by changing the space domain of, e.g., an optical flow field to polar coordinates. For such nonlinear data spaces, we develop algorithms for the solution of the corresponding second order total variation (TV) type problems for denoising, inpainting as well as the combination of both. We provide a convergence analysis and we apply the algorithms to concrete problems. Comment: revised submitted version
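To illustrate why HSV data are "combined cyclic and linear", the sketch below computes a distance in which the hue channel is treated as a point on a circle and saturation and value are treated as ordinary real numbers. It is a minimal sketch under stated assumptions, not the paper's algorithm: the function names are hypothetical, hue is assumed to be given in radians, and the combined distance is assumed to be the usual product-space (root of summed squares) distance.

```python
import math

TWO_PI = 2.0 * math.pi

def hue_distance(h1, h2):
    """Shortest arc length between two hue angles (radians) on the circle."""
    d = abs(h1 - h2) % TWO_PI
    return min(d, TWO_PI - d)

def hsv_distance(p, q):
    """Distance between two (hue, saturation, value) triples.

    Hue is compared on the circle; saturation and value are compared
    linearly. The root-sum-of-squares combination is an assumption.
    """
    dh = hue_distance(p[0], q[0])
    ds = p[1] - q[1]
    dv = p[2] - q[2]
    return math.sqrt(dh * dh + ds * ds + dv * dv)
```

For example, hues of 0.1 and 2π − 0.1 radians are about 0.2 apart on the circle, whereas a naive linear difference would report almost a full turn; this wrap-around is what prevents standard vector-space TV methods from being applied directly.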

arXiv.org e-Print Archive

PuSH

Tongue Image Analysis for Diabetes Mellitus Diagnosis Based on SOM Kohonen

Authors: Haris Fuad; Purnama Ketut; Purnomo Mauridhi
Publication date: 26/10/2011

Tongue diagnosis is an important diagnostic method for evaluating the condition of internal organs by looking at an image of the tongue. However, due to its qualitative, subjective and experience-based nature, traditional tongue diagnosis has very limited application in clinical medicine. Moreover, traditional tongue diagnosis is always concerned with the identification of syndromes rather than with the connection between abnormal tongue appearances and diseases. This is not well understood in Western medicine, which greatly obstructs its wider use in the world. In this paper, we present a novel computerized tongue inspection method aiming to address these problems. First, two kinds of quantitative features, chromatic and textural measures, are extracted from tongue images by using popular digital image processing techniques. Then, SOM Kohonen classifiers are employed to model the relationship between these quantitative features and diseases. The effectiveness of the method is tested on 35 patients affected by Diabetes Mellitus as well as 30 healthy volunteers, and the diagnostic results predicted by the previously trained SOM Kohonen classifiers are compared with the HOMA-B

EEPIS Repository

Geodesics on the manifold of multivariate generalized Gaussian distributions with an application to multicomponent texture discrimination

Authors: Scheunders Paul; Verdoolaege Geert
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011

We consider the Rao geodesic distance (GD) based on the Fisher information as a similarity measure on the manifold of zero-mean multivariate generalized Gaussian distributions (MGGD). The MGGD is shown to be an adequate model for the heavy-tailed wavelet statistics in multicomponent images, such as color or multispectral images. We discuss the estimation of MGGD parameters using various methods. We apply the GD between MGGDs to color texture discrimination in several classification experiments, taking into account the correlation structure between the spectral bands in the wavelet domain. We compare the performance, both in terms of texture discrimination capability and computational load, of the GD and the Kullback-Leibler divergence (KLD). Likewise, both uni- and multivariate generalized Gaussian models are evaluated, characterized by a fixed or a variable shape parameter. The modeling of the interband correlation significantly improves classification efficiency, while the GD is shown to consistently outperform the KLD as a similarity measure.
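For the univariate case mentioned in the abstract above, the KLD between two zero-mean generalized Gaussian densities has a well-known closed form, sketched below. This covers only the univariate model, not the multivariate one or the geodesic distance studied in the paper. The function name is hypothetical, and the density is assumed to be parameterized as p(x) = β / (2αΓ(1/β)) · exp(−(|x|/α)^β), with scale α > 0 and shape β > 0.

```python
import math

def ggd_kld(alpha1, beta1, alpha2, beta2):
    """KL divergence KL(p1 || p2) between two zero-mean univariate
    generalized Gaussians with scales alpha and shapes beta.

    Log-gamma is used instead of gamma for numerical stability.
    """
    log_term = (
        math.log(beta1 * alpha2)
        - math.log(beta2 * alpha1)
        + math.lgamma(1.0 / beta2)
        - math.lgamma(1.0 / beta1)
    )
    # Gamma((beta2 + 1) / beta1) / Gamma(1 / beta1), computed in log space.
    gamma_ratio = math.exp(
        math.lgamma((beta2 + 1.0) / beta1) - math.lgamma(1.0 / beta1)
    )
    return log_term + (alpha1 / alpha2) ** beta2 * gamma_ratio - 1.0 / beta1
```

With β = 2 and α = σ√2 the model reduces to a Gaussian with standard deviation σ, and the expression reduces to the familiar Gaussian KLD, which gives a quick sanity check. Because the KLD is not symmetric, texture-retrieval settings often use the sum of the two directed divergences as the similarity score.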

Ghent University Academic Bibliography

Institutional Repository Universiteit Antwerpen

Unsupervised Understanding of Location and Illumination Changes in Egocentric Videos

Authors: Barakova Emilia; Betancourt Alejandro; Díaz-Rodríguez Natalia; Marcenaro Lucio; Rauterberg Matthias; Regazzoni Carlo
Publication date: 01/01/2017

Wearable cameras stand out as one of the most promising devices for the upcoming years, and as a consequence, the demand for computer algorithms that automatically understand the videos recorded with them is increasing quickly. Automatic understanding of these videos is not an easy task, and their mobile nature poses important challenges, such as changing light conditions and unrestricted recording locations. This paper proposes an unsupervised strategy based on global features and manifold learning to endow wearable cameras with contextual information regarding the light conditions and the location captured. Results show that non-linear manifold methods can capture contextual patterns from global features without requiring large computational resources. The proposed strategy is used, as an application case, as a switching mechanism to improve hand detection in egocentric videos. Comment: Submitted for publication

arXiv.org e-Print Archive

Repository TU/e

Crossref

Pure OAI Repository

Repositorio Institucional Universidad de Granada

Archivio istituzionale della ricerca - Università di Genova