2,350 research outputs found
Global and local characterization of rock classification by Gabor and DCT filters with a color texture descriptor
In the automatic classification of colored natural textures, the idea of
proposing methods that reflect human perception arouses the enthusiasm of
researchers in the field of image processing and computer vision. Therefore,
the color space and the methods of analysis of color and texture, must be
discriminating to correspond to the human vision. Rock images are a typical
example of natural images and their analysis is of major importance in the rock
industry. In this paper, we combine the statistical (Local Binary Pattern (LBP)
with Hue Saturation Value (HSV) and Red Green Blue (RGB) color spaces fusion)
and frequency (Gabor filter and Discrete Cosine Transform (DCT)) descriptors
named respectively Gabor Adjacent Local Binary Pattern Color Space Fusion
(G-ALBPCSF) and DCT Adjacent Local Binary Pattern Color Space Fusion
(D-ALBPCSF) for the extraction of visual textural and colorimetric features
from direct view images of rocks. The textural images from the two G-ALBPCSF
and D-ALBPCSF approaches are evaluated through similarity metrics such as Chi2
and the intersection of histograms that we have adapted to color histograms.
The results obtained allowed us to highlight the discrimination of the rock
classes. The proposed extraction method provides better classification results
for various direct view rock texture images. Then it is validated by a
confusion matrix giving a low error rate of 0.8% of classification
Face Recognition Technique Using Gabor Wavelets And Singular Value Decomposition
Gabor Wavelets (GWs) (also known as Gabor filter) and Singular Value Decomposition (SVD) have been studied extensively in the area of face recognition. In this project, face recognition system is developed using combination of GWs and SVD. Both techniques are used to extract facial features from the human facial image and presented in the form of feature vector. For GWs, only 12 out of 40 GWs are selected to extract facial features from the facial images. This offers the advantage of reducing computational time of feature extraction. As for SVD, only the first five singular values are selected and its associated right singular vectors are used as the facial feature vectors. The use of SVD in addition to the GWs increases the reliability of the face recognition system. In the face verification and matching stage, the similarity level between facial images is determined by computing the distance between the resulting facial feature vectors obtained from GWs and SVD respectively. Overall, the Gabor-SVD based face recognition technique showed constructive and promising result in recognizing the valid user and rejecting invalid users on the JAFFE database
Face recognition using multiple features in different color spaces
Face recognition as a particular problem of pattern recognition has been attracting substantial attention from researchers in computer vision, pattern recognition, and machine learning. The recent Face Recognition Grand Challenge (FRGC) program reveals that uncontrolled illumination conditions pose grand challenges to face recognition performance. Most of the existing face recognition methods use gray-scale face images, which have been shown insufficient to tackle these challenges. To overcome this challenging problem in face recognition, this dissertation applies multiple features derived from the color images instead of the intensity images only.
First, this dissertation presents two face recognition methods, which operate in different color spaces, using frequency features by means of Discrete Fourier Transform (DFT) and spatial features by means of Local Binary Patterns (LBP), respectively. The DFT frequency domain consists of the real part, the imaginary part, the magnitude, and the phase components, which provide the different interpretations of the input face images. The advantage of LBP in face recognition is attributed to its robustness in terms of intensity-level monotonic transformation, as well as its operation in the various scale image spaces. By fusing the frequency components or the multi-resolution LBP histograms, the complementary feature sets can be generated to enhance the capability of facial texture description. This dissertation thus uses the fused DFT and LBP features in two hybrid color spaces, the RIQ and the VIQ color spaces, respectively, for improving face recognition performance.
Second, a method that extracts multiple features in the CID color space is presented for face recognition. As different color component images in the CID color space display different characteristics, three different image encoding methods, namely, the patch-based Gabor image representation, the multi-resolution LBP feature fusion, and the DCT-based multiple face encodings, are presented to effectively extract features from the component images for enhancing pattern recognition performance. To further improve classification performance, the similarity scores due to the three color component images are fused for the final decision making.
Finally, a novel image representation is also discussed in this dissertation. Unlike a traditional intensity image that is directly derived from a linear combination of the R, G, and B color components, the novel image representation adapted to class separability is generated through a PCA plus FLD learning framework from the hybrid color space instead of the RGB color space. Based upon the novel image representation, a multiple feature fusion method is proposed to address the problem of face recognition under the severe illumination conditions.
The aforementioned methods have been evaluated using two large-scale databases, namely, the Face Recognition Grand Challenge (FRGC) version 2 database and the FERET face database. Experimental results have shown that the proposed methods improve face recognition performance upon the traditional methods using the intensity images by large margins and outperform some state-of-the-art methods
EMPATH: A Neural Network that Categorizes Facial Expressions
There are two competing theories of facial expression recognition. Some researchers have suggested that it is an example of "categorical perception." In this view, expression categories are considered to be discrete entities with sharp boundaries, and discrimination of nearby pairs of expressive faces is enhanced near those boundaries. Other researchers, however, suggest that facial expression perception is more graded and that facial expressions are best thought of as points in a continuous, low-dimensional space, where, for instance, "surprise" expressions lie between "happiness" and "fear" expressions due to their perceptual similarity. In this article, we show that a simple yet biologically plausible neural network model, trained to classify facial expressions into six basic emotions, predicts data used to support both of these theories. Without any parameter tuning, the model matches a variety of psychological data on categorization, similarity, reaction times, discrimination, and recognition difficulty, both qualitatively and quantitatively. We thus explain many of the seemingly complex psychological phenomena related to facial expression perception as natural consequences of the tasks' implementations in the brain
- …