Differentiable Sparsification for Deep Neural Networks
Deep neural networks have relieved human experts of much of the burden of
feature engineering. However, comparable effort is now required to determine
effective architectures. Moreover, as networks have grown very large,
considerable resources are also invested in reducing their size. Sparsifying
an over-complete model addresses these problems by removing redundant
components and connections. In this study, we propose a fully differentiable
sparsification method for deep neural networks that allows parameters to
become exactly zero during training via stochastic gradient descent. The
proposed method can thus learn both the sparsified structure and the weights
of a network in an end-to-end manner. It is directly applicable to various
modern deep neural networks and requires only minimal modification to
existing models. To the best of our knowledge, this is the first fully
[sub-]differentiable sparsification method that zeroes out parameters. It
provides a foundation for future structure learning and model compression
methods.
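
To make the core idea concrete, below is a minimal PyTorch sketch of one way a sub-differentiable gate can drive parameters to exact zero under plain SGD; it is an illustrative assumption, not the authors' exact formulation, and all names (`SparseLinear`, `s`, `sparsity_penalty`) are hypothetical. A ReLU-clamped gate scales each output channel, and an L1 penalty on the gates pushes them onto the clamp, where they stay at exactly zero.

```python
# Illustrative sketch only: a ReLU gate is sub-differentiable and can reach
# an exact zero, after which both the data gradient and the penalty gradient
# through the gate vanish, so the channel stays pruned.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseLinear(nn.Module):
    """Linear layer whose output channels are scaled by trainable gates."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        self.s = nn.Parameter(torch.ones(out_features))  # free gate params

    def forward(self, x):
        gate = F.relu(self.s)          # exact zeros are reachable
        return self.linear(x) * gate   # zero gate => channel removed

    def sparsity_penalty(self):
        return F.relu(self.s).sum()    # L1 on gates drives them to zero

# Toy usage: fit random data while penalizing the gates.
torch.manual_seed(0)
model = SparseLinear(16, 32)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.randn(64, 16), torch.randn(64, 32)
for _ in range(200):
    opt.zero_grad()
    loss = F.mse_loss(model(x), y) + 0.05 * model.sparsity_penalty()
    loss.backward()
    opt.step()
print("channels zeroed:", int((F.relu(model.s) == 0).sum()))
```

Because the gate and its penalty are optimized jointly with the task loss by ordinary SGD, structure (which channels survive) and weights are learned end-to-end, which is the property the abstract highlights.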