Search CORE

1,857 research outputs found

Methods for Interpreting and Understanding Deep Neural Networks

Author: Montavon Grégoire
Müller Klaus-Robert
Samek Wojciech
Publication venue: 'Elsevier BV'
Publication date: 24/06/2017
Field of study

This paper provides an entry point to the problem of interpreting a deep neural network model and explaining its predictions. It is based on a tutorial given at ICASSP 2017. It introduces some recently proposed techniques of interpretation, along with theory, tricks and recommendations, to make most efficient use of these techniques on real data. It also discusses a number of practical applications.Comment: 14 pages, 10 figure

arXiv.org e-Print Archive

Fraunhofer-ePrints

MPG.PuRe

Skip-Thought Vectors

Author: Fidler Sanja
Kiros Ryan
Salakhutdinov Ruslan
Torralba Antonio
Urtasun Raquel
Zemel Richard S.
Zhu Yukun
Publication venue
Publication date: 22/06/2015
Field of study

We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.Comment: 11 page

arXiv.org e-Print Archive

CiteSeerX

DSpace@MIT

Maxmin convolutional neural networks for image classification

Author: Blot Michael
Cord Matthieu
Thome Nicolas
Publication venue
Publication date: 25/09/2016
Field of study

Convolutional neural networks (CNN) are widely used in computer vision, especially in image classification. However, the way in which information and invariance properties are encoded through in deep CNN architectures is still an open question. In this paper, we propose to modify the standard convo- lutional block of CNN in order to transfer more information layer after layer while keeping some invariance within the net- work. Our main idea is to exploit both positive and negative high scores obtained in the convolution maps. This behav- ior is obtained by modifying the traditional activation func- tion step before pooling. We are doubling the maps with spe- cific activations functions, called MaxMin strategy, in order to achieve our pipeline. Extensive experiments on two classical datasets, MNIST and CIFAR-10, show that our deep MaxMin convolutional net outperforms standard CNN

arXiv.org e-Print Archive

Crossref

Classification of Time-Series Images Using Deep Convolutional Neural Networks

Author: Abdel-Hamid
Armano
Bouvrie
Chen
Cui
Dalto
Deng
Eads
Graves
Hatami
Hatami
Krizhevsky
Krizhevsky
LeCun
LeCun
Lee
Nanopoulos
Rakthanmanon
Rodriguez
Senin
Simonyan
Souza
Souza
Wang
Wang
Wang
Xing
Yang
Zheng
Publication venue
Publication date: 07/10/2017
Field of study

Convolutional Neural Networks (CNN) has achieved a great success in image recognition task by automatically learning a hierarchical feature representation from raw data. While the majority of Time-Series Classification (TSC) literature is focused on 1D signals, this paper uses Recurrence Plots (RP) to transform time-series into 2D texture images and then take advantage of the deep CNN classifier. Image representation of time-series introduces different feature types that are not available for 1D signals, and therefore TSC can be treated as texture image recognition task. CNN model also allows learning different levels of representations together with a classifier, jointly and automatically. Therefore, using RP and CNN in a unified framework is expected to boost the recognition rate of TSC. Experimental results on the UCR time-series classification archive demonstrate competitive accuracy of the proposed approach, compared not only to the existing deep architectures, but also to the state-of-the art TSC algorithms.Comment: The 10th International Conference on Machine Vision (ICMV 2017

arXiv.org e-Print Archive

Crossref