
    From 3D Point Clouds to Pose-Normalised Depth Maps

    We consider the problem of generating either pairwise-aligned or pose-normalised depth maps from noisy 3D point clouds in relatively unrestricted poses. Our system is deployed in a 3D face alignment application and consists of the following four stages: (i) data filtering, (ii) nose tip identification and sub-vertex localisation, (iii) computation of the (relative) face orientation, and (iv) generation of either a pose-aligned or a pose-normalised depth map. We generate an implicit radial basis function (RBF) model of the facial surface and this is employed within all four stages of the process. For example, in stage (ii), construction of novel invariant features is based on sampling this RBF over a set of concentric spheres to give a spherically-sampled RBF (SSR) shape histogram. In stage (iii), a second novel descriptor, called an isoradius contour curvature signal, is defined, which allows rotational alignment to be determined using a simple process of 1D correlation. We test our system on both the University of York (UoY) 3D face dataset and the Face Recognition Grand Challenge (FRGC) 3D data. For the more challenging UoY data, our SSR descriptors significantly outperform three variants of spin images, successfully identifying nose vertices at a rate of 99.6%. Nose localisation performance on the higher quality FRGC data, which has only small pose variations, is 99.9%. Our best system successfully normalises the pose of 3D faces at rates of 99.1% (UoY data) and 99.6% (FRGC data).
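The 1D correlation step mentioned for stage (iii) can be illustrated with a minimal sketch. It assumes each face yields a contour signal sampled at n equally spaced angles, and it searches for the circular shift that best aligns two such signals. The function names and the brute-force search are illustrative assumptions, not the authors' implementation.

```python
def circular_correlation_shift(reference, signal):
    """Return the circular shift s that maximises
    sum_i reference[i] * signal[(i + s) % n].

    Both inputs are hypothetical 1D contour signals sampled at the
    same n equally spaced angles (an assumption of this sketch).
    """
    n = len(reference)
    if n == 0 or len(signal) != n:
        raise ValueError("signals must be non-empty and of equal length")
    best_shift, best_score = 0, float("-inf")
    for shift in range(n):
        # Correlation score for this candidate circular shift.
        score = sum(reference[i] * signal[(i + shift) % n] for i in range(n))
        if score > best_score:
            best_shift, best_score = shift, score
    return best_shift


def shift_to_angle(shift, n):
    """Convert a shift in samples to a rotation angle in degrees."""
    return 360.0 * shift / n


# Usage: a signal that is the reference rotated by three samples.
reference = [0, 1, 3, 1, 0, 0, 0, 0]
rotated = [reference[(j - 3) % 8] for j in range(8)]
shift = circular_correlation_shift(reference, rotated)
angle = shift_to_angle(shift, len(reference))
```

The brute-force search costs O(n²) operations; for long signals an FFT-based correlation would give the same result faster.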

    A multi-sensor approach for volcanic ash cloud retrieval and eruption characterization: the 23 November 2013 Etna lava fountain

    Volcanic activity is observed worldwide with a variety of ground and space-based remote sensing instruments, each with advantages and drawbacks. No single system can give a comprehensive description of eruptive activity, so a multi-sensor approach is required. This work integrates infrared and microwave volcanic ash retrievals obtained from the geostationary Meteosat Second Generation (MSG)-Spinning Enhanced Visible and Infrared Imager (SEVIRI), the polar-orbiting Aqua-MODIS and ground-based weather radar. The expected outcomes are improvements in satellite volcanic ash cloud retrieval (altitude, mass, aerosol optical depth and effective radius), the generation of new satellite products (ash concentration and particle number density in the thermal infrared) and better characterization of volcanic eruptions (plume altitude, total ash mass erupted and particle number density from thermal infrared to microwave). This approach is the core of the multi-platform volcanic ash cloud estimation procedure being developed within the European FP7-APhoRISM project. The Mt. Etna (Sicily, Italy) volcano lava fountaining event of 23 November 2013 was considered as a test case. The results of the integration show the presence of two volcanic cloud layers at different altitudes. The improvement of the volcanic ash cloud altitude leads to a mean difference between the SEVIRI ash mass estimations, before and after the integration, of about 30%. Moreover, the percentage of the airborne “fine” ash retrieved from the satellite is estimated to be about 1%–2% of the total ash emitted during the eruption. Finally, all of the estimated parameters (volcanic ash cloud altitude, thickness and total mass) were also validated with ground-based visible camera measurements, HYSPLIT forward trajectories, Infrared Atmospheric Sounding Interferometer (IASI) satellite data and tephra deposits.

    Geodesics on the manifold of multivariate generalized Gaussian distributions with an application to multicomponent texture discrimination

    We consider the Rao geodesic distance (GD) based on the Fisher information as a similarity measure on the manifold of zero-mean multivariate generalized Gaussian distributions (MGGD). The MGGD is shown to be an adequate model for the heavy-tailed wavelet statistics in multicomponent images, such as color or multispectral images. We discuss the estimation of MGGD parameters using various methods. We apply the GD between MGGDs to color texture discrimination in several classification experiments, taking into account the correlation structure between the spectral bands in the wavelet domain. We compare the performance, both in terms of texture discrimination capability and computational load, of the GD and the Kullback-Leibler divergence (KLD). Likewise, both uni- and multivariate generalized Gaussian models are evaluated, characterized by a fixed or a variable shape parameter. The modeling of the interband correlation significantly improves classification efficiency, while the GD is shown to consistently outperform the KLD as a similarity measure.
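To make the two similarity measures concrete, the sketch below evaluates both for the simplest case only: univariate zero-mean Gaussians, which are one member of the generalized Gaussian family. This restriction is an assumption made for illustration; the closed forms used here do not cover the general MGGD treated in the paper, and the function names are hypothetical.

```python
import math


def kld_zero_mean_gaussian(sigma1, sigma2):
    """Kullback-Leibler divergence KL(N(0, sigma1^2) || N(0, sigma2^2))."""
    return math.log(sigma2 / sigma1) + sigma1 ** 2 / (2.0 * sigma2 ** 2) - 0.5


def rao_gd_zero_mean_gaussian(sigma1, sigma2):
    """Rao geodesic distance between N(0, sigma1^2) and N(0, sigma2^2).

    The Fisher information for sigma is 2 / sigma^2, so integrating
    sqrt(2) / sigma between the two values gives sqrt(2) * |ln(sigma2 / sigma1)|.
    """
    return math.sqrt(2.0) * abs(math.log(sigma2 / sigma1))


# Usage: the GD is symmetric in its arguments, the KLD is not.
gd_forward = rao_gd_zero_mean_gaussian(1.0, 2.0)
gd_backward = rao_gd_zero_mean_gaussian(2.0, 1.0)
kld_forward = kld_zero_mean_gaussian(1.0, 2.0)
kld_backward = kld_zero_mean_gaussian(2.0, 1.0)
```

The GD is a true metric, whereas the KLD is asymmetric and is often symmetrised before being used as a distance.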

    Cross-Paced Representation Learning with Partial Curricula for Sketch-based Image Retrieval

    In this paper we address the problem of learning robust cross-domain representations for sketch-based image retrieval (SBIR). While most SBIR approaches focus on extracting low- and mid-level descriptors for direct feature matching, recent works have shown the benefit of learning coupled feature representations to describe data from two related sources. However, cross-domain representation learning methods are typically cast into non-convex minimization problems that are difficult to optimize, leading to unsatisfactory performance. Inspired by self-paced learning, a learning methodology designed to overcome convergence issues related to local optima by exploiting the samples in a meaningful order (i.e., easy to hard), we introduce the cross-paced partial curriculum learning (CPPCL) framework. Compared with existing self-paced learning methods, which only consider a single modality and cannot deal with prior knowledge, CPPCL is specifically designed to assess the learning pace by jointly handling data from dual sources and modality-specific prior information provided in the form of partial curricula. Additionally, thanks to the learned dictionaries, we demonstrate that the proposed CPPCL embeds robust coupled representations for SBIR. Our approach is extensively evaluated on four publicly available datasets (i.e., CUFS, Flickr15K, QueenMary SBIR and TU-Berlin Extension datasets), showing superior performance over competing SBIR methods.
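The easy-to-hard ordering behind self-paced learning can be sketched with the standard hard-threshold weighting, in which a sample takes part in training only when its current loss is below a pace threshold that grows over rounds. This is plain single-modality self-paced learning, not the CPPCL framework itself; the function names, the fixed loss values and the multiplicative schedule are assumptions made for illustration.

```python
def self_paced_weights(losses, threshold):
    """Select samples whose loss is below the pace threshold (1 = use, 0 = skip)."""
    return [1 if loss < threshold else 0 for loss in losses]


def pace_schedule(losses, start, growth, rounds):
    """Return the sample selections over several rounds.

    The threshold is multiplied by `growth` after each round, so harder
    samples are admitted progressively. Losses are held fixed here for
    simplicity; in real training they would be recomputed after each
    model update.
    """
    threshold = start
    history = []
    for _ in range(rounds):
        history.append(self_paced_weights(losses, threshold))
        threshold *= growth
    return history


# Usage: four samples ordered from easy (low loss) to hard (high loss).
losses = [0.1, 0.5, 0.9, 2.0]
history = pace_schedule(losses, start=0.3, growth=2.0, rounds=3)
```

With these values the selection grows from one sample to three over the three rounds, while the hardest sample stays excluded.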