Search CORE

38,147 research outputs found

Local Deep Neural Networks for gender recognition

Author: Albiol Colomer Alberto
Mansanet Sandín Jorge
Paredes Palacios Roberto
Publication venue: 'Elsevier BV'
Publication date: 15/01/2016
Field of study

Deep learning methods are able to automatically discover better representations of the data to improve the performance of the classifiers. However, in computer vision tasks, such as the gender recognition problem, sometimes it is difficult to directly learn from the entire image. In this work we propose a new model called Local Deep Neural Network (Local-DNN), which is based on two key concepts: local features and deep architectures. The model learns from small overlapping regions in the visual field using discriminative feed forward networks with several layers. We evaluate our approach on two well-known gender benchmarks, showing that our Local-DNN outperforms other deep learning methods also evaluated and obtains state-of-the-art results in both benchmarks. (C) 2015 Elsevier B.V. All rights reserved.This work was financially supported by the Ministerio de Ciencia e Innovacin (Spain), Plan Nacional de I-D+i, TEC2009-09146, and the FPI grant BES-2010-032945.Mansanet Sandín, J.; Albiol Colomer, A.; Paredes Palacios, R. (2016). Local Deep Neural Networks for gender recognition. Pattern Recognition Letters. 70:80-86. https://doi.org/10.1016/j.patrec.2015.11.015S80867

RiuNet

Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data

Author: Keren Gil
Schuller Björn
Publication venue
Publication date: 01/01/2016
Field of study

Traditional convolutional layers extract features from patches of data by applying a non-linearity on an affine function of the input. We propose a model that enhances this feature extraction process for the case of sequential data, by feeding patches of the data into a recurrent neural network and using the outputs or hidden states of the recurrent units to compute the extracted features. By doing so, we exploit the fact that a window containing a few frames of the sequential data is a sequence itself and this additional structure might encapsulate valuable information. In addition, we allow for more steps of computation in the feature extraction process, which is potentially beneficial as an affine function followed by a non-linearity can result in too simple features. Using our convolutional recurrent layers we obtain an improvement in performance in two audio classification tasks, compared to traditional convolutional layers. Tensorflow code for the convolutional recurrent layers is publicly available in https://github.com/cruvadom/Convolutional-RNN

arXiv.org e-Print Archive

OPUS Augsburg

Crossref

Understanding and Comparing Deep Neural Networks for Age and Gender Classification

Author: Binder Alexander
Lapuschkin Sebastian
Müller Klaus-Robert
Samek Wojciech
Publication venue
Publication date: 25/08/2017
Field of study

Recently, deep neural networks have demonstrated excellent performances in recognizing the age and gender on human face images. However, these models were applied in a black-box manner with no information provided about which facial features are actually used for prediction and how these features depend on image preprocessing, model initialization and architecture choice. We present a study investigating these different effects. In detail, our work compares four popular neural network architectures, studies the effect of pretraining, evaluates the robustness of the considered alignment preprocessings via cross-method test set swapping and intuitively visualizes the model's prediction strategies in given preprocessing conditions using the recent Layer-wise Relevance Propagation (LRP) algorithm. Our evaluations on the challenging Adience benchmark show that suitable parameter initialization leads to a holistic perception of the input, compensating artefactual data representations. With a combination of simple preprocessing steps, we reach state of the art performance in gender recognition.Comment: 8 pages, 5 figures, 5 tables. Presented at ICCV 2017 Workshop: 7th IEEE International Workshop on Analysis and Modeling of Faces and Gesture

arXiv.org e-Print Archive

Fraunhofer-ePrints