Search CORE

17,850 research outputs found

Neural Networks with Recurrent Generative Feedback

Author: Anandkumar Anima
Dai Sihui
Gornet James
Huang Yujia
Nguyen Tan
Tsao Doris Y.
Yu Zhiding
Publication venue: Advances in Neural Information Processing Systems
Publication date: 01/12/2020
Field of study

Neural networks are vulnerable to input perturbations such as additive noise and adversarial attacks. In contrast, human perception is much more robust to such perturbations. The Bayesian brain hypothesis states that human brains use an internal generative model to update the posterior beliefs of the sensory input. This mechanism can be interpreted as a form of self-consistency between the maximum a posteriori (MAP) estimation of an internal generative model and the external environment. Inspired by such hypothesis, we enforce self-consistency in neural networks by incorporating generative recurrent feedback. We instantiate this design on convolutional neural networks (CNNs). The proposed framework, termed Convolutional Neural Networks with Feedback (CNN-F), introduces a generative feedback with latent variables to existing CNN architectures, where consistent predictions are made through alternating MAP inference under a Bayesian framework. In the experiments, CNN-F shows considerably improved adversarial robustness over conventional feedforward CNNs on standard benchmarks

Neural Networks with Recurrent Generative Feedback

Author: Anandkumar Anima
Dai Sihui
Gornet James
Huang Yujia
Nguyen Tan
Tsao Doris Y.
Yu Zhiding
Publication venue
Publication date: 10/11/2020
Field of study

arXiv.org e-Print Archive

Caltech Authors

Interpreting Adversarially Trained Convolutional Neural Networks

Author: Zhang Tianyuan
Zhu Zhanxing
Publication venue
Publication date: 01/01/2019
Field of study

We attempt to interpret how adversarially trained convolutional neural networks (AT-CNNs) recognize objects. We design systematic approaches to interpret AT-CNNs in both qualitative and quantitative ways and compare them with normally trained models. Surprisingly, we find that adversarial training alleviates the texture bias of standard CNNs when trained on object recognition tasks, and helps CNNs learn a more shape-biased representation. We validate our hypothesis from two aspects. First, we compare the salience maps of AT-CNNs and standard CNNs on clean images and images under different transformations. The comparison could visually show that the prediction of the two types of CNNs is sensitive to dramatically different types of features. Second, to achieve quantitative verification, we construct additional test datasets that destroy either textures or shapes, such as style-transferred version of clean data, saturated images and patch-shuffled ones, and then evaluate the classification accuracy of AT-CNNs and normal CNNs on these datasets. Our findings shed some light on why AT-CNNs are more robust than those normally trained ones and contribute to a better understanding of adversarial training over CNNs from an interpretation perspective.Comment: To apper in ICML1

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Learning Robust Representations of Text

Author: Baldwin Timothy
Cohn Trevor
Li Yitong
Publication venue
Publication date: 01/01/2016
Field of study

Deep neural networks have achieved remarkable results across many language processing tasks, however these methods are highly sensitive to noise and adversarial attacks. We present a regularization based method for limiting network sensitivity to its inputs, inspired by ideas from computer vision, thus learning models that are more robust. Empirical evaluation over a range of sentiment datasets with a convolutional neural network shows that, compared to a baseline model and the dropout method, our method achieves superior performance over noisy inputs and out-of-domain data.Comment: 5 pages with 2 pages reference, 2 tables, 1 figur

arXiv.org e-Print Archive

Crossref