
    Empirical Bounds on Linear Regions of Deep Rectifier Networks

    We can compare the expressiveness of neural networks that use rectified linear units (ReLUs) by the number of linear regions, which reflects the number of pieces of the piecewise linear functions modeled by such networks. However, enumerating these regions is prohibitive and the known analytical bounds are identical for networks with the same dimensions. In this work, we approximate the number of linear regions through empirical bounds based on features of the trained network and probabilistic inference. Our first contribution is a method to sample the activation patterns defined by ReLUs using universal hash functions. This method is based on a Mixed-Integer Linear Programming (MILP) formulation of the network and an algorithm for probabilistic lower bounds of MILP solution sets that we call MIPBound, which is considerably faster than exact counting and reaches values of a similar order of magnitude. Our second contribution is a tighter activation-based bound for the maximum number of linear regions, which is particularly strong in networks with narrow layers. Combined, these bounds yield a fast proxy for the number of linear regions of a deep neural network.
    Comment: AAAI 202
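    The activation-pattern view can be made concrete with a small experiment: every distinct on/off pattern of the ReLUs corresponds to at most one linear region, so counting the distinct patterns reached by sampled inputs gives an empirical lower bound on the region count. The Python sketch below illustrates only this sampling idea on a hypothetical toy network with made-up weights; it is not the MILP formulation or the MIPBound hashing procedure described in the abstract.

    # Sketch: empirical lower bound on the number of linear regions of a small
    # ReLU network, obtained by counting distinct activation patterns over random
    # inputs (toy network with made-up weights; not the paper's MIPBound method).
    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical toy network: 2 inputs -> 8 hidden -> 8 hidden.
    W1, b1 = rng.standard_normal((8, 2)), rng.standard_normal(8)
    W2, b2 = rng.standard_normal((8, 8)), rng.standard_normal(8)

    def activation_pattern(x):
        """Return the on/off state of every ReLU for input x."""
        z1 = W1 @ x + b1
        z2 = W2 @ np.maximum(z1, 0.0) + b2
        return tuple((z1 > 0).astype(int)) + tuple((z2 > 0).astype(int))

    # Each distinct pattern corresponds to at most one linear region, so the
    # number of distinct patterns seen over sampled inputs is an empirical
    # lower bound on the number of regions within the sampled box.
    patterns = {activation_pattern(rng.uniform(-1, 1, size=2)) for _ in range(100_000)}
    print("distinct activation patterns (empirical lower bound):", len(patterns))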

    Algebraic Methods in the Congested Clique

    In this work, we use algebraic methods for studying distance computation and subgraph detection tasks in the congested clique model. Specifically, we adapt parallel matrix multiplication implementations to the congested clique, obtaining an $O(n^{1-2/\omega})$-round matrix multiplication algorithm, where $\omega < 2.3728639$ is the exponent of matrix multiplication. In conjunction with known techniques from centralised algorithmics, this gives significant improvements over the previous best upper bounds in the congested clique model. The highlight results include:
    -- triangle and 4-cycle counting in $O(n^{0.158})$ rounds, improving upon the $O(n^{1/3})$ triangle detection algorithm of Dolev et al. [DISC 2012],
    -- a $(1 + o(1))$-approximation of all-pairs shortest paths in $O(n^{0.158})$ rounds, improving upon the $\tilde{O}(n^{1/2})$-round $(2 + o(1))$-approximation algorithm of Nanongkai [STOC 2014], and
    -- computing the girth in $O(n^{0.158})$ rounds, which is the first non-trivial solution in this model.
    In addition, we present a novel constant-round combinatorial algorithm for detecting 4-cycles.
    Comment: This work is a merger of arxiv:1412.2109 and arxiv:1412.266
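    For intuition, the centralised algebraic idea these results build on is that subgraph counts can be read off from powers of the adjacency matrix; for instance, the number of triangles in a simple undirected graph is trace(A^3)/6. The sketch below shows only this centralised computation in Python with NumPy; the distributed scheduling of the multiplication across the n machines of the congested clique, which is the paper's actual contribution, is not modeled here.

    # Sketch: centralised triangle counting via matrix multiplication.
    # The number of triangles in a simple undirected graph equals trace(A^3)/6,
    # where A is the 0/1 adjacency matrix; each triangle contributes 6 closed
    # length-3 walks that start and end at one of its vertices.
    import numpy as np

    def count_triangles(adj: np.ndarray) -> int:
        """Count the triangles of an undirected graph from its adjacency matrix."""
        a3 = adj @ adj @ adj               # the matrix multiplications do all the work
        return int(np.trace(a3)) // 6

    # Toy example: the 4-cycle 0-1-2-3 plus the chord 1-3, which has exactly 2 triangles.
    A = np.array([[0, 1, 0, 1],
                  [1, 0, 1, 1],
                  [0, 1, 0, 1],
                  [1, 1, 1, 0]])
    print(count_triangles(A))  # -> 2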

    Subtraction-free complexity, cluster transformations, and spanning trees


    Random Weighting, Asymptotic Counting, and Inverse Isoperimetry

    For a family X of k-subsets of the set 1,...,n, let |X| be the cardinality of X and let Gamma(X,mu) be the expected maximum weight of a subset from X when the weights of 1,...,n are chosen independently at random from a symmetric probability distribution mu on R. We consider the inverse isoperimetric problem of finding mu for which Gamma(X,mu) gives the best estimate of ln|X|. We prove that the optimal choice of mu is the logistic distribution, in which case Gamma(X,mu) provides an asymptotically tight estimate of ln|X| as k^{-1}ln|X| grows. Since in many important cases Gamma(X,mu) can be easily computed, we obtain computationally efficient approximation algorithms for a variety of counting problems. Given mu, we describe families X of a given cardinality with the minimum value of Gamma(X,mu), thus extending and sharpening various isoperimetric inequalities in the Boolean cube.
    Comment: The revision contains a new isoperimetric theorem, some other improvements and extensions; 29 pages, 1 figure
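    A rough illustration of how Gamma(X,mu) can be estimated in practice: draw i.i.d. logistic weights for the ground set, take the maximum subset weight over X, and average over many draws. The Python sketch below does this by Monte Carlo for a small, hypothetical family (all k-subsets of {1,...,n}) and compares the result to ln|X|; since the estimate is only asymptotically tight as k^{-1}ln|X| grows, the two numbers agree in order of magnitude rather than closely on such a tiny example.

    # Sketch: Monte Carlo estimate of Gamma(X, mu) for logistic mu, compared to ln|X|.
    # The family X is all k-subsets of {1,...,n}, a hypothetical choice made only
    # to keep the example small and enumerable.
    import math
    from itertools import combinations
    import numpy as np

    rng = np.random.default_rng(1)

    n, k = 12, 4
    X = np.array(list(combinations(range(n), k)))   # index matrix, one row per subset
    log_card = math.log(len(X))                     # ln|X|

    # Draw i.i.d. standard logistic weights for 1..n, take the maximum subset
    # weight over X, and average over many draws.
    trials = 5000
    w = rng.logistic(size=(trials, n))              # shape (trials, n)
    subset_sums = w[:, X].sum(axis=2)               # shape (trials, |X|)
    gamma = subset_sums.max(axis=1).mean()

    print(f"ln|X| = {log_card:.3f}, Gamma(X, logistic) ~= {gamma:.3f}")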

    32nd International Symposium on Theoretical Aspects of Computer Science: STACS '15, March 4 - 7, 2015, Garching, Germany


    Reduction of dynamical biochemical reaction networks in computational biology

    Biochemical networks are used in computational biology to model the static and dynamical details of systems involved in cell signaling, metabolism, and regulation of gene expression. Parametric and structural uncertainty, as well as combinatorial explosion, are strong obstacles to analyzing the dynamics of large models of this type. Multi-scaleness is another property of these networks, and one that can be used to overcome some of these obstacles. Networks with many well-separated time scales can be reduced to simpler networks in a way that depends only on the orders of magnitude, and not on the exact values, of the kinetic parameters. The main idea behind such robust simplifications is the concept of dominance among model elements, which allows a hierarchical organization of these elements according to their effects on the network dynamics. This concept finds a natural formulation in tropical geometry. We revisit, in the light of these new ideas, the main approaches to model reduction of reaction networks, such as the quasi-steady-state and quasi-equilibrium approximations, and provide practical recipes for model reduction of linear and nonlinear networks. We also discuss the application of model reduction to backward-pruning machine learning techniques.
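    As a concrete illustration of reduction by time-scale separation, the sketch below applies the classical quasi-steady-state approximation to Michaelis-Menten enzyme kinetics in Python with SciPy: when complex formation and dissociation are fast relative to catalysis, the four-species mass-action model collapses to a single slow equation for the substrate. The rate constants are hypothetical and chosen only to create a clear separation of time scales; this is textbook QSSA, not the tropical-geometry machinery discussed in the abstract.

    # Sketch: quasi-steady-state reduction of a fast/slow reaction network,
    # illustrated on Michaelis-Menten kinetics E + S <-> C -> E + P.
    # Rate constants are hypothetical, chosen so that binding/unbinding is fast
    # and catalysis is slow.
    import numpy as np
    from scipy.integrate import solve_ivp

    k1, k_1, k2 = 100.0, 100.0, 1.0      # fast binding/unbinding, slow catalysis
    E0, S0 = 1.0, 10.0                   # total enzyme and initial substrate

    def full_model(t, y):
        """Full mass-action model with state y = (S, E, C, P)."""
        S, E, C, P = y
        v_bind, v_unbind, v_cat = k1 * E * S, k_1 * C, k2 * C
        return [-v_bind + v_unbind,              # dS/dt
                -v_bind + v_unbind + v_cat,      # dE/dt
                 v_bind - v_unbind - v_cat,      # dC/dt
                 v_cat]                          # dP/dt

    def reduced_model(t, y):
        """QSS reduction: only the slow substrate variable S remains."""
        S = y[0]
        Km, Vmax = (k_1 + k2) / k1, k2 * E0
        return [-Vmax * S / (Km + S)]

    t_eval = np.linspace(0.0, 20.0, 200)
    full = solve_ivp(full_model, (0.0, 20.0), [S0, E0, 0.0, 0.0], t_eval=t_eval, method="LSODA")
    red = solve_ivp(reduced_model, (0.0, 20.0), [S0], t_eval=t_eval, method="LSODA")

    # After the fast initial transient, the one-equation model tracks the full
    # substrate trajectory; the discrepancy is of the order E0/(S0 + Km).
    print("max |S_full - S_reduced| =", float(np.abs(full.y[0] - red.y[0]).max()))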