A very simple framework for 3D human poses estimation using a single 2D image: Comparison of geometric moments descriptors.

Author: Atrevi Dieudonné Fabrice
Duculty Florent
Emile Bruno
Vivet Damien
Publication venue: 'Elsevier BV'
Publication date: 01/11/2017

In this paper, we propose a framework to automatically extract the 3D pose of an individual from a single silhouette image obtained with a classical low-cost camera without any depth information. By pose, we mean the configuration of human bones needed to reconstruct a 3D skeleton representing the 3D posture of the detected human. Our approach relies on previously learned correspondences between silhouettes and skeletons extracted from simulated 3D human models publicly available on the internet. The main advantages of this approach are that silhouettes can be very easily extracted from video, and 3D human models can be animated using motion capture data to quickly build training data for any movement. To match detected silhouettes with simulated silhouettes, we compared geometric invariant moments. Our results show that the proposed method is very promising, with a very low processing time.
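The silhouette-matching step described in this abstract can be illustrated with a minimal sketch. The abstract does not say which moment descriptors were compared, so the code below uses the first two Hu invariants purely as a stand-in; the function names (`hu_first_two`, `nearest_silhouette`) and the nearest-neighbour lookup are assumptions made for illustration, not the authors' implementation.

```python
def raw_moment(img, p, q):
    """Raw geometric moment m_pq of a binary image given as a list of rows."""
    return sum((x ** p) * (y ** q) * v
               for y, row in enumerate(img)
               for x, v in enumerate(row))


def hu_first_two(img):
    """First two Hu invariants (translation- and scale-normalised)."""
    m00 = raw_moment(img, 0, 0)
    if m00 == 0:
        raise ValueError("empty silhouette")
    cx = raw_moment(img, 1, 0) / m00  # centroid x
    cy = raw_moment(img, 0, 1) / m00  # centroid y

    def eta(p, q):
        # Central moment, normalised by m00 ** (1 + (p + q) / 2).
        mu = sum(((x - cx) ** p) * ((y - cy) ** q) * v
                 for y, row in enumerate(img)
                 for x, v in enumerate(row))
        return mu / m00 ** (1 + (p + q) / 2)

    h1 = eta(2, 0) + eta(0, 2)
    h2 = (eta(2, 0) - eta(0, 2)) ** 2 + 4 * eta(1, 1) ** 2
    return (h1, h2)


def nearest_silhouette(query, database):
    """Hypothetical lookup: name of the stored silhouette closest to the query."""
    q = hu_first_two(query)

    def sq_dist(name):
        return sum((a - b) ** 2 for a, b in zip(q, hu_first_two(database[name])))

    return min(database, key=sq_dist)
```

In a pipeline like the one the abstract outlines, each stored silhouette would be paired with the 3D skeleton of the simulated model that produced it, so the name returned by the lookup would select the pose.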

Crossref

Open Archive Toulouse Archive Ouverte

HAL Descartes

A novel framework for retrieval and interactive visualization of multimodal data

Author: Kalamaras Ilias
Mademlis Athanasios
Malassiotis Sotiris
Tzovaras Dimitrios
Publication venue: Universitat Autònoma de Barcelona
Publication date: 01/01/2013

With the abundance of multimedia in web databases and the increasing user need for content of many modalities, such as images, sounds, etc., new methods for retrieval and visualization of multimodal media are required. In this paper, novel techniques for retrieval and visualization of multimodal data, i.e. documents consisting of many modalities, are proposed. A novel cross-modal retrieval framework is presented, in which the results of several unimodal retrieval systems are fused into a single multimodal list by the introduction of a cross-modal distance. For the presentation of the retrieved results, a multimodal visualization framework is also proposed, which extends existing unimodal similarity-based visualization methods to multimodal data. The similarity measure between two multimodal objects is defined as the weighted sum of unimodal similarities, with the weights determined via an interactive user feedback scheme. Experimental results show that the cross-modal framework outperforms unimodal and other multimodal approaches, while the visualization framework enhances existing visualization methods by efficiently exploiting multimodality and user feedback.
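The weighted-sum similarity mentioned in this abstract can be sketched as follows. The abstract gives only the general form, so normalising by the total weight and restricting the sum to modalities present in both inputs are assumptions of this sketch, as is the function name `multimodal_similarity`.

```python
def multimodal_similarity(unimodal_sims, weights):
    """Weighted sum of per-modality similarities.

    unimodal_sims: dict mapping modality name -> similarity of two objects.
    weights: dict mapping modality name -> non-negative weight, e.g. as
             adjusted by user feedback.
    Assumption: the result is divided by the total weight so that it stays
    on the same scale as the unimodal similarities.
    """
    shared = unimodal_sims.keys() & weights.keys()
    total = sum(weights[m] for m in shared)
    if total <= 0:
        raise ValueError("at least one shared modality needs a positive weight")
    return sum(weights[m] * unimodal_sims[m] for m in shared) / total
```

For example, with similarities `{"image": 0.8, "sound": 0.4}` and weights `{"image": 3, "sound": 1}`, the image modality dominates and the combined score is pulled towards 0.8.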

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

Revistes Catalanes amb Accés Obert

Electronic Letters on Computer Vision and Image Analysis (ELCVIA - Universitat Autònoma de Barcelona)

Diposit Digital de Documents de la UAB

Efficient Local Comparison Of Images Using Krawtchouk Descriptors

Author: DeVille Julian
Publication venue: 'Eastern Kentucky University'
Publication date: 01/01/2019

It is known that image comparison can prove cumbersome in both computational complexity and runtime, due to factors such as the rotation, scaling, and translation of the object in question. Due to the locality of Krawtchouk polynomials, relatively few descriptors are necessary to describe a given image, and this can be achieved with minimal memory usage. Using this method, not only can images be described efficiently as a whole, but specific regions of images can be described as well without cropping. Due to this property, queries can be found within a single large image, or collection of large images, which serve as a database for search. Krawtchouk descriptors can also describe collections of patches of 3D objects, which is explored in this paper, as well as a theoretical methodology of describing nD hyperobjects. Test results for an implementation of 3D Krawtchouk descriptors in GNU Octave, as well as statistics regarding effectiveness and runtime, are included, and the code used for testing will be published open source in the near future.
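As background for the Krawtchouk polynomials this abstract relies on, here is a minimal sketch of the classical definition as a terminating hypergeometric sum, together with the binomial weight under which the polynomials are orthogonal. It is written in Python rather than the GNU Octave used by the author, it follows the textbook convention with x ranging over 0..N, and it is not the descriptor code from the thesis; the function names are chosen for illustration.

```python
from math import comb, factorial


def pochhammer(a, k):
    """Rising factorial (a)_k = a (a + 1) ... (a + k - 1)."""
    result = 1.0
    for i in range(k):
        result *= a + i
    return result


def krawtchouk(n, x, p, N):
    """Classical Krawtchouk polynomial K_n(x; p, N) = 2F1(-n, -x; -N; 1/p)."""
    if not 0 <= n <= N:
        raise ValueError("order n must lie in 0..N")
    return sum(
        pochhammer(-n, k) * pochhammer(-x, k)
        / (pochhammer(-N, k) * factorial(k)) * (1.0 / p) ** k
        for k in range(n + 1)
    )


def binomial_weight(x, p, N):
    """Weight w(x) = C(N, x) p^x (1 - p)^(N - x) used in the orthogonality relation."""
    return comb(N, x) * p ** x * (1 - p) ** (N - x)
```

The weight peaks near x = pN, so changing p moves the region of the signal that the low-order polynomials emphasise. This is the usual explanation for the locality property the abstract refers to.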

Eastern Kentucky University

Gait recognition based on shape and motion analysis of silhouette contours

Author: Sruti Das Choudhury
Tardi Tjahjadi
Publication venue: 'Elsevier BV'
Publication date: 01/12/2013

This paper presents a three-phase gait recognition method that analyses the spatio-temporal shape and dynamic motion (STS-DM) characteristics of a human subject’s silhouettes to identify the subject in the presence of most of the challenging factors that affect existing gait recognition systems. In phase 1, phase-weighted magnitude spectra of the Fourier descriptor of the silhouette contours at ten phases of a gait period are used to analyse the spatio-temporal changes of the subject’s shape. A component-based Fourier descriptor based on anatomical studies of the human body is used to achieve robustness against shape variations caused by all common types of small carrying conditions with folded hands, at the subject’s back and in upright position. In phase 2, a full-body shape and motion analysis is performed by fitting ellipses to contour segments of ten phases of a gait period and using a histogram matching with Bhattacharyya distance of parameters of the ellipses as dissimilarity scores. In phase 3, dynamic time warping is used to analyse the angular rotation pattern of the subject’s leading knee with a consideration of arm-swing over a gait period to achieve identification that is invariant to walking speed, limited clothing variations, hair style changes and shadows under feet. The match scores generated in the three phases are fused using weight-based score-level fusion for robust identification in the presence of missing and distorted frames, and occlusion in the scene. Experimental analyses on various publicly available data sets show that STS-DM outperforms several state-of-the-art gait recognition methods.
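Phase 3 of this abstract relies on dynamic time warping to compare knee-angle patterns regardless of walking speed. The sketch below is the textbook DTW recurrence on two 1D sequences with an absolute-difference local cost. The paper's actual features, cost function and any windowing constraints are not given in the abstract, so those choices are assumptions here.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping cost between two numeric sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays on the same sample
                cost[i][j - 1],      # b advances, a stays on the same sample
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]
```

A sequence and a copy of it played at half speed (each sample repeated) align at zero cost, which is the sense in which DTW-based matching tolerates changes in walking speed.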

Crossref

Warwick Research Archives Portal Repository

An overview of view-based 2D-3D indexing methods

Author: Petre Raluca Diana
Preteux Francoise
Zaharia Titus
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/08/2010

This paper proposes a comprehensive overview of state-of-the-art 2D/3D, view-based indexing methods. The principle of 2D/3D indexing methods consists of describing 3D models by means of a set of 2D shape descriptors, associated with a set of corresponding 2D views (under the assumption of a given projection model). Notably, such an approach makes it possible to identify 3D objects of interest from 2D images/videos. An experimental evaluation is also proposed, in order to examine the influence of the number of views and of the associated viewing angle selection strategies on the retrieval results. Experiments concern both 3D model retrieval and image recognition from a single view. Results obtained show promising performances, with recognition rates from a single view higher than 66%, which opens interesting perspectives in terms of semantic metadata extraction from still images/videos.
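The view-based indexing principle in this abstract (each 3D model represented by descriptors of several 2D views, queried with a single view) can be sketched as below. Scoring a model by its best-matching view and using Euclidean distance between descriptor vectors are assumptions of this sketch, as are the names `view_distance` and `rank_models`. The surveyed methods differ in both descriptor and matching rule.

```python
from math import dist


def view_distance(query, model_views):
    """Distance from a query descriptor to the closest stored view of one model."""
    return min(dist(query, view) for view in model_views)


def rank_models(query, index):
    """Rank model names by best-view distance to the query descriptor.

    index: dict mapping model name -> list of 2D-view descriptor vectors.
    """
    return sorted(index, key=lambda name: view_distance(query, index[name]))
```

Storing more views per model makes it more likely that some stored view resembles the query, at the price of a larger index. This is the trade-off the experimental evaluation in the abstract examines.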

Crossref

HAL Descartes

Automatic recognition of military vehicles with Krawtchouk moments

Author: Clemente Carmine
De Maio Antonio
Gaglione Domenico
Pallotta Luca
Soraghan John J.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/07/2016

The challenge of Automatic Target Recognition (ATR) of military targets within a Synthetic Aperture Radar (SAR) scene is addressed in this paper. The proposed approach exploits the discretely defined Krawtchouk moments, which are able to represent a detected extended target with few features, allowing its characterization. The proposed algorithm provides robust performance for target recognition, identification and characterization, with high reliability in the presence of noise and reduced sensitivity to discretization errors. The effectiveness of the proposed approach is demonstrated using the MSTAR dataset.

Archivio della ricerca - Università degli studi di Napoli Federico II

University of Strathclyde Institutional Repository

Archivio della Ricerca - Università della Basilicata

Archivio della Ricerca - Università di Roma 3

On The Potential of Image Moments for Medical Diagnosis

Author: Di Ruberto C.
Loddo A.
Putzu L.
Publication date: 01/01/2023

Medical imaging is widely used for diagnosis and postoperative or post-therapy monitoring. The ever-increasing number of images produced has encouraged the introduction of automated methods to assist doctors or pathologists. In recent years, especially after the advent of convolutional neural networks, many researchers have focused on this approach, considering it to be the only method for diagnosis since it can perform a direct classification of images. However, many diagnostic systems still rely on handcrafted features to improve interpretability and limit resource consumption. In this work, we focused our efforts on orthogonal moments, first by providing an overview and taxonomy of their macrocategories and then by analysing their classification performance on very different medical tasks represented by four public benchmark data sets. The results confirmed that convolutional neural networks achieved excellent performance on all tasks. Despite being composed of far fewer features than those extracted by the networks, orthogonal moments proved to be competitive with them, showing comparable and, in some cases, better performance. In addition, Cartesian and harmonic categories provided a very low standard deviation, proving their robustness in medical diagnostic tasks. We strongly believe that the integration of the studied orthogonal moments can lead to more robust and reliable diagnostic systems, considering the performance obtained and the low variation of the results. Finally, since they have been shown to be effective on both magnetic resonance and computed tomography images, they can be easily extended to other imaging techniques.

Archivio istituzionale della ricerca - Università di Cagliari

A Survey of 2D and 3D Shape Descriptors

Author: Kazmi Ismail Khalid
You Lihua
Zhang Jian Jun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/08/2013

Crossref

Teesside University's Research Repository