Simultaneous Optimization of Both Node and Edge Conservation in Network Alignment via WAVE
Network alignment can be used to transfer functional knowledge between
conserved regions of different networks. Typically, existing methods use a node
cost function (NCF) to compute similarity between nodes in different networks
and an alignment strategy (AS) to find high-scoring alignments with respect to
the total NCF over all aligned nodes (or node conservation). However, they then
evaluate the quality of their alignments via some other measure, different from
the node conservation measure used to guide the alignment construction
process. Typically, one measures the number of conserved edges, but only after
alignments are produced. Hence, a recent method aimed to directly maximize the
number of conserved edges while constructing alignments, which improved
alignment accuracy. Here, we aim to directly maximize both node and edge
conservation during alignment construction to further improve alignment
accuracy. For this, we design a novel measure of edge conservation that (unlike
existing measures that treat each conserved edge the same) weighs each
conserved edge so that edges with highly NCF-similar end nodes are favored. As
a result, we introduce a novel AS, Weighted Alignment VotEr (WAVE), which can
optimize any measure of node and edge conservation, and which can be used with
any NCF or combination of multiple NCFs. Using WAVE on top of established
state-of-the-art NCFs leads to superior alignments compared to the existing
methods that optimize only node conservation or only edge conservation or that
treat each conserved edge the same. While we evaluate WAVE in the
computational biology domain, it is easily applicable in any domain.
Comment: 12 pages, 4 figures
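The weighting idea described above can be sketched in a few lines. This is a hypothetical illustration, not WAVE's actual objective: `weighted_edge_conservation`, its arguments, and the averaging of endpoint similarities are all assumptions made for the example; the abstract only states that conserved edges with highly NCF-similar end nodes are favored.

```python
def weighted_edge_conservation(edges1, edges2, alignment, ncf_sim):
    """Score an alignment by summing, over conserved edges, the NCF
    similarity of the aligned endpoint pairs (here: their mean).

    edges1   -- iterable of (u, v) edges in the first network
    edges2   -- set of frozenset({u2, v2}) edges in the second network
    alignment -- dict mapping nodes of network 1 to nodes of network 2
    ncf_sim  -- dict mapping (node1, node2) pairs to a similarity in [0, 1]
    """
    score = 0.0
    for u, v in edges1:
        u2, v2 = alignment.get(u), alignment.get(v)
        if u2 is not None and v2 is not None and frozenset((u2, v2)) in edges2:
            # weigh the conserved edge by the mean similarity of its endpoints,
            # instead of counting every conserved edge as 1
            score += (ncf_sim[(u, u2)] + ncf_sim[(v, v2)]) / 2.0
    return score
```

Setting every similarity to 1 recovers the plain conserved-edge count, which makes explicit how this measure generalizes the unweighted one.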
Stronger generalization bounds for deep nets via a compression approach
Deep nets generalize well despite having more parameters than the number of training samples. Recent works try to give an explanation using PAC-Bayes and margin-based analyses, but do not yet yield sample complexity bounds better than naive parameter counting. The current paper shows generalization bounds that are orders of magnitude better in practice. These rely upon new succinct reparametrizations of the trained net - a compression that is explicit and efficient. These yield generalization bounds via a simple compression-based framework introduced here. Our results also provide some theoretical justification for widespread empirical success in compressing deep nets. Analysis of correctness of our compression relies upon some newly identified "noise stability" properties of trained deep nets, which are also experimentally verified. The study of these properties and resulting generalization bounds are also extended to convolutional nets, which had eluded earlier attempts at proving generalization.
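To make the compression framing concrete, here is a minimal, generic sketch of compressing one trained layer via truncated SVD. This is not the paper's specific reparametrization scheme; it only illustrates the premise that a compressed net with far fewer parameters can approximate the original, so bounds can be stated in terms of the compressed parameter count.

```python
import numpy as np

def low_rank_compress(W, rank):
    """Compress W into two factors A, B with W approximately equal to A @ B."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    return U[:, :rank] * s[:rank], Vt[:rank, :]

rng = np.random.default_rng(0)
# A 256x256 stand-in for a trained weight matrix that is approximately rank 16
# (a structured signal plus small noise).
W = rng.normal(size=(256, 16)) @ rng.normal(size=(16, 256)) \
    + 0.01 * rng.normal(size=(256, 256))

A, B = low_rank_compress(W, 16)
params_before = W.size              # 65536 parameters
params_after = A.size + B.size      # 8192 parameters: an 8x compression
rel_err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
```

The point of the construction is that `params_after`, not `params_before`, is what a compression-based bound charges for, provided the approximation error stays small.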
On the Complexity of Inner Product Similarity Join
A number of tasks in classification, information retrieval, recommendation
systems, and record linkage reduce to the core problem of inner product
similarity join (IPS join): identifying pairs of vectors in a collection that
have a sufficiently large inner product. IPS join is well understood when
vectors are normalized and some approximation of inner products is allowed.
However, the general case where vectors may have any length appears much more
challenging. Recently, new upper bounds based on asymmetric locality-sensitive
hashing (ALSH) and asymmetric embeddings have emerged, but little has been
known on the lower bound side. In this paper we initiate a systematic study of
inner product similarity join, showing new lower and upper bounds. Our main
results are:
* Approximation hardness of IPS join in subquadratic time, assuming the
strong exponential time hypothesis.
* New upper and lower bounds for (A)LSH-based algorithms. In particular, we
show that asymmetry can be avoided by relaxing the LSH definition to only
consider the collision probability of distinct elements.
* A new indexing method for IPS based on linear sketches, implying that our
hardness results are not far from being tight.
Our technical contributions include new asymmetric embeddings that may be of
independent interest. At the conceptual level we strive to provide greater
clarity, for example by distinguishing between signed and unsigned variants of
IPS join and shedding new light on the effect of asymmetry.
Comment: in Proc. 35th ACM Symposium on Principles of Database Systems, 201
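One well-known asymmetric embedding in this line of work reduces inner-product search to cosine similarity search: database vectors are scaled into the unit ball and padded so they all have unit norm, while queries are padded with a zero. The function names below are illustrative, not from the paper.

```python
import numpy as np

def embed_item(x, max_norm):
    """Database-side embedding: scale into the unit ball, then append a
    coordinate that pads the vector up to exactly unit norm."""
    x = np.asarray(x, dtype=float) / max_norm
    return np.append(x, np.sqrt(max(0.0, 1.0 - x @ x)))

def embed_query(q):
    """Query-side embedding: normalize and append a zero coordinate, so the
    padding coordinate of items never contributes to the inner product."""
    q = np.asarray(q, dtype=float)
    return np.append(q / np.linalg.norm(q), 0.0)
```

For a fixed query q, the embedded inner product equals the original inner product divided by the constant `max_norm * ||q||`, so the ranking of items by inner product is preserved while all embedded items lie on the unit sphere. The asymmetry (different maps for items and queries) is exactly what the abstract refers to; one of the paper's results is that it can sometimes be avoided by relaxing the LSH definition.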
Triad-based comparison and signatures of directed networks
We introduce two methods for comparing directed networks based on triad counts, called TriadEuclid and TriadEMD. TriadEuclid clusters networks by the Euclidean distance between their triad counts, whereas TriadEMD is an adaptation of NetEMD for directed networks. We apply both methods to cluster synthetic networks; a set of web networks including Google, Twitter, peer-to-peer, Amazon, Slashdot, and citation networks; and world trade networks from 1962 to 2000. Furthermore, we find signature triads and signature orbits for each type of network in our data, which show the main triad and orbit contributions of the networks when comparing them to the other networks in the respective data set.
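The TriadEuclid idea can be sketched as follows. This is a simplified, hypothetical illustration: it counts only two triad types (feed-forward and 3-cycle) rather than the full 16-type directed triad census, and the function names are assumptions made for the example.

```python
from itertools import combinations, permutations

def triad_profile(nodes, edges):
    """Return (feed-forward count, 3-cycle count) for a directed edge set.
    Feed-forward: u->v, v->w, u->w.  3-cycle: u->v, v->w, w->u."""
    E = set(edges)
    ffl = cyc = 0
    for trio in combinations(nodes, 3):
        for a, b, c in permutations(trio):
            if (a, b) in E and (b, c) in E and (a, c) in E:
                ffl += 1
            # fix the starting node to the smallest so each cycle counts once
            if a == min(trio) and (a, b) in E and (b, c) in E and (c, a) in E:
                cyc += 1
    return ffl, cyc

def triad_euclid(p, q):
    """Euclidean distance between two triad-count vectors."""
    return sum((x - y) ** 2 for x, y in zip(p, q)) ** 0.5
```

Comparing networks then amounts to computing `triad_euclid` between their profiles; the methods in the abstract additionally normalize counts and, in TriadEMD, compare full count distributions rather than single vectors.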
Continuously Adaptive Similarity Search
Similarity search is the basis for many data analytics techniques, including k-nearest neighbor classification and outlier detection. Similarity search over large data sets relies on i) a distance metric learned from input examples and ii) an index to speed up search based on the learned distance metric. In interactive systems, input to guide the learning of the distance metric may be provided over time. As this new input changes the learned distance metric, a naive approach would adopt the costly process of re-indexing all items after each metric change. In this paper, we propose the first solution, called OASIS, to instantaneously adapt the index to conform to a changing distance metric without this prohibitive re-indexing process. To achieve this, we prove that locality-sensitive hashing (LSH) provides an invariance property, meaning that an LSH index built on the original distance metric is equally effective at supporting similarity search using an updated distance metric as long as the transform matrix learned for the new distance metric satisfies certain properties. This observation allows OASIS to avoid recomputing the index from scratch in most cases. Further, for the rare cases when an adaptation of the LSH index is shown to be necessary, we design an efficient incremental LSH update strategy that re-hashes only a small subset of the items in the index. In addition, we develop an efficient distance metric learning strategy that incrementally learns the new metric as inputs are received. Our experimental study using real world public datasets confirms the effectiveness of OASIS at improving the accuracy of various similarity search-based data analytics tasks by instantaneously adapting the distance metric and its associated index in tandem, while achieving a speedup of up to three orders of magnitude over state-of-the-art techniques.
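As background for the setup above, here is a minimal random-hyperplane LSH index. This is not OASIS itself: a learned Mahalanobis-style metric d(x, y) = ||G(x - y)|| can be served by an ordinary LSH index built over the transformed points G @ x, and OASIS's contribution is characterizing when such an index remains valid after G changes, so full re-indexing can usually be skipped. The class and method names are assumptions for illustration.

```python
import numpy as np

class HyperplaneLSH:
    """Random-hyperplane LSH: the hash of v is the sign pattern of its
    projections onto n_bits random hyperplanes (a locality-sensitive
    family for angular/cosine similarity)."""

    def __init__(self, dim, n_bits, seed=0):
        rng = np.random.default_rng(seed)
        self.planes = rng.normal(size=(n_bits, dim))
        self.buckets = {}

    def _key(self, v):
        # one bit per hyperplane: which side of it v falls on
        return tuple(int(b) for b in (self.planes @ v > 0))

    def insert(self, item_id, v):
        self.buckets.setdefault(self._key(v), []).append(item_id)

    def query(self, v):
        """Candidate items sharing v's bucket (checked exactly afterwards)."""
        return self.buckets.get(self._key(v), [])
```

To support a learned metric with transform G, one would insert `G @ x` for each item and query with `G @ q`; re-indexing from scratch after every update to G is exactly the cost OASIS shows can usually be avoided.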