2,034 research outputs found

    From 3D Point Clouds to Pose-Normalised Depth Maps

    Get PDF
    We consider the problem of generating either pairwise-aligned or pose-normalised depth maps from noisy 3D point clouds in a relatively unrestricted poses. Our system is deployed in a 3D face alignment application and consists of the following four stages: (i) data filtering, (ii) nose tip identification and sub-vertex localisation, (iii) computation of the (relative) face orientation, (iv) generation of either a pose aligned or a pose normalised depth map. We generate an implicit radial basis function (RBF) model of the facial surface and this is employed within all four stages of the process. For example, in stage (ii), construction of novel invariant features is based on sampling this RBF over a set of concentric spheres to give a spherically-sampled RBF (SSR) shape histogram. In stage (iii), a second novel descriptor, called an isoradius contour curvature signal, is defined, which allows rotational alignment to be determined using a simple process of 1D correlation. We test our system on both the University of York (UoY) 3D face dataset and the Face Recognition Grand Challenge (FRGC) 3D data. For the more challenging UoY data, our SSR descriptors significantly outperform three variants of spin images, successfully identifying nose vertices at a rate of 99.6%. Nose localisation performance on the higher quality FRGC data, which has only small pose variations, is 99.9%. Our best system successfully normalises the pose of 3D faces at rates of 99.1% (UoY data) and 99.6% (FRGC data)

    An Experimental and Numerical Investigation of Nitrogen Dioxide Emissions Characteristics of Compression Ignition Dual Fuel Engines

    Get PDF
    Detailed experimental research was conducted to explore the impact of the addition of gaseous fuels, including H2 and natural gas (NG), and engine load on the emissions of NO2, NO, and NOx from dual fuel engines. The addition of less than 2% of H2 or NG was shown to dramatically increase the emissions of NO2 until a maximum level of NO2 emissions was reached. The increased NO 2 emissions were due to the conversion of NO to NO2. The maximum NO2/NOx ratio obtained with the addition of H2 was 3.2 to 5.0 times that of diesel operation. The maximum NO 2/NOx ratio obtained with the addition of NG was 3.4 to 4.3 times that of diesel operation. Further increasing the amount of gaseous fuel beyond the point of maximum NO2 emissions resulted in a reduction of NO2 emissions. Detailed examination of factors having the potential to affect the formation of NOx and NO2 in compression ignition engines reported a firm correlation between the emissions of NO 2 and emissions of unburned H2 and methane (CH4), and their relative emissions. The presence of unburned gaseous fuels that survived the main combustion process appears to be one of the main factors contributing to the enhanced conversion of NO to NO2. This was supported by the experimental data reported in the literature. The presence of fumigation fuels outside the diesel spray plume might be the main factor contributing to the increased emissions of NO2 from dual fuel engines. The spontaneous combustion of fumigation fuels that are entrained into the diesel spray plume may not contribute to the increased emissions of NO 2. In comparison, the correlations between the increased emissions of NO2 and the variation in bulk mixture temperature and heat release process including maximum heat release rate, and combustion duration were weak.;A single zone, zero-dimensional, constant volume numerical model with detailed chemistry was used to simulate the oxidization process of the gaseous fuel, as well as its effect on the conversion of NO to NO2 after the post-combustion mixing of the gaseous fuel surviving the main combustion process with the NOx-containing combustion products. The gaseous fuel examined included CH4, H2, and carbon monoxide (CO). The simulation results revealed the significant effects of the fuel mixed, its initial concentration in the mixture, and the initial temperature on the oxidization of gaseous fuel, the conversion of NO to NO2, and the destruction of NO2 to NO after the completion of the oxidation process.;The single zone zero-dimensional model was further modified to a variable volume model with the volume of the combustion chamber calculated using the geometry of the 1999 Cummins engine and engine speed. The modified variable volume model with detailed chemistry was used to improve the simulation of the effect on the conversion of NO to NO2 of the post-combustion mixing of surviving gaseous fuel with NOx-containing combustion products. The spatial variation of the local bulk mixture temperature with the progress of the combustion process and the variation of cylinder volume during the expansion process was taken into account by a pseudo temperature at the top dead center (TDC) noted as Tpseudo TDC defined in this research. The simulation identified the importance of the phasing of postcombustion mixing on the oxidation of gaseous fuel and its effect on the conversion of NO to NO2.;A preliminary sensitivity analysis was also conducted to identify the reactions having significant effect on the conversion of NO to NO2 and its destruction to NO. Among the four reactions associated with the formation and destruction of NO2, R186 was identified as the main reaction to the formation of NO2 during the oxidation process of H 2 and CO. This was due to the high concentration of HO2 formed during the oxidation process of H2 and CO in the combustion product. The destruction of NO2 to NO occurred through R187 and R189. (Abstract shortened by UMI.)

    Face Image Modality Recognition and Photo-Sketch Matching

    Get PDF
    Face is an important physical characteristic of human body, and is widely used in many crucial applications, such as video surveillance, criminal investigation, and security access system. Based on realistic demand, such as useful face images in dark environment and criminal profile, different modalities of face images appeared, e.g. three-dimensional (3D), near infrared (NIR), and thermal infrared (TIR) face images. Thus, researches with various face image modalities become a hot area. Most of them are set on knowing the modality of face images in advance, which contains a few limitations. In this thesis, we present approaches for face image modality recognition to extend the possibility of cross-modality researches as well as handle new modality-mixed face images. Furthermore, a large facial image database is assembled with five commonly used modalities such as 3D, NIR, TIR, sketch, and visible light spectrum (VIS). Based on the analysis of results, a feature descriptor based on convolutional neural network with linear kernel SVM did an optimal performance.;As we mentioned above, face images are widely used in crucial applications, and one of them is using the sketch of suspect\u27s face, which based on the witness\u27 description, to assist law enforcement. Since it is difficult to capture face photos of the suspect during a criminal activity, automatic retrieving photos based on the suspect\u27s facial sketch is used for locating potential suspects. In this thesis, we perform photo-sketch matching by synthesizing the corresponding pseudo sketch from a given photo. There are three methods applied in this thesis, which are respectively based on style transfer, DualGAN, and cycle-consistent adversarial networks. Among the results of these methods, style transfer based method did a poor performance in photo-sketch matching, since it is an unsupervised one which is not purposeful in photo to sketch synthesis problem while the others need to train pointed models in synthesis stage

    Highly Parallel Geometric Characterization and Visualization of Volumetric Data Sets

    Get PDF
    Volumetric 3D data sets are being generated in many different application areas. Some examples are CAT scans and MRI data, 3D models of protein molecules represented by implicit surfaces, multi-dimensional numeric simulations of plasma turbulence, and stacks of confocal microscopy images of cells. The size of these data sets has been increasing, requiring the speed of analysis and visualization techniques to also increase to keep up. Recent advances in processor technology have stopped increasing clock speed and instead begun increasing parallelism, resulting in multi-core CPUS and many-core GPUs. To take advantage of these new parallel architectures, algorithms must be explicitly written to exploit parallelism. In this thesis we describe several algorithms and techniques for volumetric data set analysis and visualization that are amenable to these modern parallel architectures. We first discuss modeling volumetric data with Gaussian Radial Basis Functions (RBFs). RBF representation of a data set has several advantages, including lossy compression, analytic differentiability, and analytic application of Gaussian blur. We also describe a parallel volume rendering algorithm that can create images of the data directly from the RBF representation. Next we discuss a parallel, stochastic algorithm for measuring the surface area of volumetric representations of molecules. The algorithm is suitable for implementation on a GPU and is also progressive, allowing it to return a rough answer almost immediately and refine the answer over time to the desired level of accuracy. After this we discuss the concept of Confluent Visualization, which allows the visualization of the interaction between a pair of volumetric data sets. The interaction is visualized through volume rendering, which is well suited to implementation on parallel architectures. Finally we discuss a parallel, stochastic algorithm for classifying stem cells as having been grown on a surface that induces differentiation or on a surface that does not induce differentiation. The algorithm takes as input 3D volumetric models of the cells generated from confocal microscopy. This algorithm builds on our algorithm for surface area measurement and, like that algorithm, this algorithm is also suitable for implementation on a GPU and is progressive

    Log-Euclidean Bag of Words for Human Action Recognition

    Full text link
    Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods

    Micro-expression Recognition using Spatiotemporal Texture Map and Motion Magnification

    Get PDF
    Micro-expressions are short-lived, rapid facial expressions that are exhibited by individuals when they are in high stakes situations. Studying these micro-expressions is important as these cannot be modified by an individual and hence offer us a peek into what the individual is actually feeling and thinking as opposed to what he/she is trying to portray. The spotting and recognition of micro-expressions has applications in the fields of criminal investigation, psychotherapy, education etc. However due to micro-expressions’ short-lived and rapid nature; spotting, recognizing and classifying them is a major challenge. In this paper, we design a hybrid approach for spotting and recognizing micro-expressions by utilizing motion magnification using Eulerian Video Magnification and Spatiotemporal Texture Map (STTM). The validation of this approach was done on the spontaneous micro-expression dataset, CASMEII in comparison with the baseline. This approach achieved an accuracy of 80% viz. an increase by 5% as compared to the existing baseline by utilizing 10-fold cross validation using Support Vector Machines (SVM) with a linear kernel

    3-D Content-Based Retrieval and Classification with Applications to Museum Data

    Get PDF
    There is an increasing number of multimedia collections arising in areas once only the domain of text and 2-D images. Richer types of multimedia such as audio, video and 3-D objects are becoming more and more common place. However, current retrieval techniques in these areas are not as sophisticated as textual and 2-D image techniques and in many cases rely upon textual searching through associated keywords. This thesis is concerned with the retrieval of 3-D objects and with the application of these techniques to the problem of 3-D object annotation. The majority of the work in this thesis has been driven by the European project, SCULPTEUR. This thesis provides an in-depth analysis of a range of 3-D shape descriptors for their suitability for general purpose and specific retrieval tasks using a publicly available data set, the Princeton Shape Benchmark, and using real world museum objects evaluated using a variety of performance metrics. This thesis also investigates the use of 3-D shape descriptors as inputs to popular classification algorithms and a novel classifier agent for use with the SCULPTEUR system is designed and developed and its performance analysed. Several techniques are investigated to improve individual classifier performance. One set of techniques combines several classifiers whereas the other set of techniques aim to find the optimal training parameters for a classifier. The final chapter of this thesis explores a possible application of these techniques to the problem of 3-D object annotation

    Facial Action Recognition Combining Heterogeneous Features via Multi-Kernel Learning

    Get PDF
    International audienceThis paper presents our response to the first interna- tional challenge on Facial Emotion Recognition and Analysis. We propose to combine different types of features to automatically detect Action Units in facial images. We use one multi-kernel SVM for each Action Unit we want to detect. The first kernel matrix is computed using Local Gabor Binary Pattern histograms and a histogram intersection kernel. The second kernel matrix is computed from AAM coefficients and an RBF kernel. During the training step, we combine these two types of features using the recently proposed SimpleMKL algorithm. SVM outputs are then averaged to exploit temporal information in the sequence. To eval- uate our system, we perform deep experimentations on several key issues: influence of features and kernel function in histogram- based SVM approaches, influence of spatially-independent in- formation versus geometric local appearance information and benefits of combining both, sensitivity to training data and interest of temporal context adaptation. We also compare our results to those of the other participants and try to explain why our method had the best performance during the FERA challenge
    • …
    corecore