38 research outputs found

Using Regular Languages to Explore the Representational Capacity of Recurrent Neural Architectures

Author: AS Reber
AW Smith
B Yoshua
G Jager
I Simon
J Rogers
JL Elman
M Casey
MP Marcus
N Chomsky
N Chomsky
S Hochreiter
WT Fitch
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

The presence of Long Distance Dependencies (LDDs) in sequential data poses significant challenges for computational models. Various recurrent neural architectures have been designed to mitigate this issue. In order to test these state-of-the-art architectures, there is a growing need for rich benchmarking datasets. However, one of the drawbacks of existing datasets is the lack of experimental control with regard to the presence and/or degree of LDDs. This lack of control limits the analysis of model performance in relation to the specific challenge posed by LDDs. One way to address this is to use synthetic data having the properties of subregular languages. The degree of LDDs within the generated data can be controlled through the k parameter, the length of the generated strings, and the choice of forbidden strings. In this paper, we explore the capacity of different RNN extensions to model LDDs by evaluating these models on a sequence of SPk synthesized datasets, where each subsequent dataset exhibits a longer degree of LDD. Even though SPk are simple languages, the presence of LDDs has a significant impact on the performance of recurrent neural architectures, making them prime candidates for benchmarking tasks. Comment: International Conference of Artificial Neural Networks (ICANN) 201
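
The role of forbidden strings in an SPk dataset can be illustrated with a minimal sketch. It assumes the standard definition of a strictly piecewise (SP) language, in which a string is accepted only if it contains none of the forbidden strings as a (possibly non-contiguous) subsequence. The function names and the toy alphabet are hypothetical and are not taken from the paper.

```python
from itertools import product


def contains_subsequence(string, pattern):
    """Return True if `pattern` occurs in `string` as a subsequence
    (symbols in order, but not necessarily adjacent)."""
    remaining = iter(string)
    # `symbol in remaining` advances the iterator until the symbol is found,
    # so later symbols must appear after earlier ones.
    return all(symbol in remaining for symbol in pattern)


def in_sp_language(string, forbidden):
    """Assumed SP membership test: accept the string only if no forbidden
    pattern occurs in it as a subsequence."""
    return not any(contains_subsequence(string, f) for f in forbidden)


def generate_sp_strings(alphabet, length, forbidden):
    """Enumerate all strings of the given length that avoid every forbidden
    subsequence (exhaustive, so only practical for small lengths)."""
    return [
        "".join(chars)
        for chars in product(alphabet, repeat=length)
        if in_sp_language(chars, forbidden)
    ]


if __name__ == "__main__":
    # With forbidden subsequence "ab" (k = 2), an "a" anywhere rules out
    # a later "b", no matter how many symbols separate them.
    print(in_sp_language("acccb", ["ab"]))
    print(generate_sp_strings("ab", 2, ["ab"]))
```

In this sketch the dependency length grows with the distance between the symbols of a forbidden pattern, which is one way the string length and the choice of forbidden strings could control the degree of LDD.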

arXiv.org e-Print Archive

Crossref

Arrow@TUDublin

Hybrid Models for Learning to Branch

Author: Bengio Yoshua
Gasse Maxime
Gupta Prateek
Khalil Elias B.
Kumar M. Pawan
Lodi Andrea
Publication date: 01/01/2020

A recent Graph Neural Network (GNN) approach for learning to branch has been shown to successfully reduce the running time of branch-and-bound algorithms for Mixed Integer Linear Programming (MILP). While the GNN relies on a GPU for inference, MILP solvers are purely CPU-based. This severely limits its application, as many practitioners may not have access to high-end GPUs. In this work, we ask two key questions. First, in a more realistic setting where only a CPU is available, is the GNN model still competitive? Second, can we devise an alternate computationally inexpensive model that retains the predictive power of the GNN architecture? We answer the first question in the negative, and address the second question by proposing a new hybrid architecture for efficient branching on CPU machines. The proposed architecture combines the expressive power of GNNs with computationally inexpensive multi-layer perceptrons (MLP) for branching. We evaluate our methods on four classes of MILP problems, and show that they lead to up to 26% reduction in solver running time compared to state-of-the-art methods without a GPU, while extrapolating to harder problems than they were trained on. Comment: Preprint. Under revie

arXiv.org e-Print Archive

Oxford University Research Archive

PolyPublie

Enhancing network embedding with implicit clustering

Author: A Zhang
B Yoshua
J-H Li
M Belkin
Q Li
S Wang
ST Roweis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

Network embedding aims at learning low-dimensional representations of nodes. These representations can be widely used for network mining tasks, such as link prediction, anomaly detection, and classification. Recently, a great deal of meaningful research work has been carried out on this emerging network analysis paradigm. Real-world networks contain clusters of different sizes because their edges have different relationship types. These clusters also reflect some features of nodes, which can contribute to the optimization of the feature representation of nodes. However, existing network embedding methods do not distinguish these relationship types. In this paper, we propose an unsupervised network representation learning model that can encode edge relationship information. First, an objective function is defined, which can learn the edge vectors by implicit clustering. Then, a biased random walk is designed to generate a series of node sequences, which are put into Skip-Gram to learn the low-dimensional node representations. Extensive experiments are conducted on several network datasets. Compared with the state-of-the-art baselines, the proposed method is able to achieve favorable and stable results in multi-label classification and link prediction tasks.

Crossref

OPUS - University of Technology Sydney

University of Tasmania Open Access Repository

BigBrain 3D atlas of cortical layers: Cortical and laminar thickness gradients diverge in sensory and motor cortices.

Author: Amunts Katrin
Bengio Yoshua
Bludau Sebastian
Cohen Joseph Paul
Cucurull Guillem
Dickscheid Timo
Evans Alan C
Fletcher Paul C
Funck Thomas
Larocque Stéphanie
Lepage Claude
Lewis Lindsay B
Palomero-Gallagher Nicola
Romero Adriana
Spitzer Hannah
Wagstyl Konrad
Zilles Karl
Publication venue: PLoS Biol
Publication date: 01/01/2020

Histological atlases of the cerebral cortex, such as those made famous by Brodmann and von Economo, are invaluable for understanding human brain microstructure and its relationship with functional organization in the brain. However, these existing atlases are limited to small numbers of manually annotated samples from a single cerebral hemisphere, measured from 2D histological sections. We present the first whole-brain quantitative 3D laminar atlas of the human cerebral cortex. It was derived from a 3D histological atlas of the human brain at 20-micrometer isotropic resolution (BigBrain), using a convolutional neural network to automatically segment the cortical layers in both hemispheres. Our approach overcomes many of the historical challenges with measurement of histological thickness in 2D, and the resultant laminar atlas provides an unprecedented level of precision and detail. We utilized this BigBrain cortical atlas to test whether previously reported thickness gradients, as measured by MRI in sensory and motor processing cortices, were present in a histological atlas of cortical thickness and which cortical layers were contributing to these gradients. Cortical thickness increased across sensory processing hierarchies, primarily driven by layers III, V, and VI. In contrast, motor-frontal cortices showed the opposite pattern, with decreases in total and pyramidal layer thickness from motor to frontal association cortices. These findings illustrate how this laminar atlas will provide a link between single-neuron morphology, mesoscale cortical layering, macroscopic cortical thickness, and, ultimately, functional neuroanatomy.

Directory of Open Access Journals

UCL Discovery

Juelich Shared Electronic Resources

Apollo (Cambridge)

Integration host factor bends and bridges DNA in a multiplicity of binding modes with varying specificity

Author: Howard Jamieson Anthony Leyland
Leake Mark Christian
Noy Agnes
Velasco Berrelleza Victor
Watson George Daniel
Yoshua Samuel B

White Rose Research Online

A community effort in SARS-CoV-2 drug discovery.

Author: Albani Simone
Allen William J
Antonopoulos Nick
Arthanari Haribabu
Athanasiou Christina
Barnsley Kelly
Beccari Andrea
Benabderrahmane Mohammed
Bengio Emmanuel
Bengio Yoshua
Berenger Francois
Blaschitz Klara
Bosko Ivan P
Bousquet-Melou Patrick
Brooks Charles L
Bureau Ronan
Carloni Paolo
Casini Arturo
Cespugli Marco
Charton Beatrice
Chen Kuang-Yu
Cirou Bertrand
Copeland Conner
D'Arrigo Giulia
Das Krishna M Padmanabha
De Rosa Maria
Druzhilovskiy Dmitry S
Durmaz Vedat
Eghbal-Zadeh Hamid
Elez Katarina
Embree Amanda
Epitropakis Nikolaos
Fackeldey Konstantin
Falsafi Babak
Fearon Daren
Filimonov Dmitry
Fischer Patrick D
Ford Bryan
Furs Konstantin V
Gaiser Jeremiah
García-Sastre Adolfo
Gianquinto Eleonora
Gil Gérard
Gilles Marcous
GLAAB Enrico
Gokcan Hatice
Gordon D Benjamin
Gorgulla Christoph
Goßen Jonas
Gruber Christian
Gruber Karl
Gulotta Maria R
Gusev Filipp
Gutkin Evgeny M
Halmich Christina
Hanke Anton
Haupt V Joachim
Hempel Tim
Hermans Thomas M
Hetmann Michael
Hochreiter Sepp
Horvath Dragos
Isayev Olexandr
Iyengar Suhasini M
Jacob Yves
Jain Moksh
Joseph Benjamin P
Kaiser Florian
Karpenko Anna D
Kinney Jamie E
Klambauer Guenter
Kokh Daria B
Korablyov Maksym
Kornoushenko Yury V
Kovachka Sandra
Kozlovskii Igor
Krasoulis Agamemnon
Krischuns Tim
Kumar Ashutosh
Kurnikova Maria G
Lafaye Pierre
Le Tuan
Lee Sang Yup
Lei David
Liu Cheng-Hao
Lombino Jessica
Maliutin Anton
Manelfi Candida
Mayr Andreas
Medvedev Alexander
Mekni Nedra
Mukherjee Goutam
Musiani Francesco
Muñiz-Chicharro Abraham
Naffakh Nadia
Narangoda Chamali H
Noé Frank
Nunes-Alves Ariane
Olson Daniel R
Olsson Simon
Ondrechen Mary Jo
Paiardi Giulia
Pandita Shreya
Perricone Ugo
Pitsikalis Vassilis
Pogodin Pavel V
Popov Petr
Poroikov Vladimir
Pratt Katelin
Pugliese Luisa
Raich Lluís
Rodríguez M Luis
Rossetti Giulia
Roy Amitava
Ruch Peter
Rudik Anastassia V
Sadiq S Kashif
Schimunek Johannes
Schroeder Michael
Seidl Philipp
Shuldau Mikita
Simone Giada De
Singh Amit
Sirimulla Suman
Spyrakis Francesca
Steinkellner Georg
Stolbov Leonid A
Talarico Carmine
Tesseyre Guilhem
Theodorakis Stavros
Tsengenes Alexandros
Varnek Alexandre
Venkatraman Vishwesh
Veselovsky Alexander V
Voigt Christopher A
von Delft Frank
Wade Rebecca
Wagner Gerhard
Walsh Martin A
Wang Zi-Fu
Watowich Stanley
Wheeler Travis J
White Kris M
Widrich Michael
Winter Robin
Yamanishi Yoshihiro
Yushkevich Artsemi
Yust Ryan J
Zaretckii Mark
Zettor Agnès
Zhang Kam
Zubatyuk Roman
Publication venue: Wiley
Publication date: 13/10/2023

Peer reviewed. The COVID-19 pandemic continues to pose a substantial threat to human lives and is likely to do so for years to come. Despite the availability of vaccines, searching for efficient small-molecule drugs that are widely available, including in low- and middle-income countries, is an ongoing challenge. In this work, we report the results of an open science community effort, the "Billion molecules against Covid-19 challenge", to identify small-molecule inhibitors against SARS-CoV-2 or relevant human receptors. Participating teams used a wide variety of computational methods to screen a minimum of 1 billion virtual molecules against 6 protein targets. Overall, 31 teams participated, and they suggested a total of 639,024 molecules, which were subsequently ranked to find 'consensus compounds'. The organizing team coordinated with various contract research organizations (CROs) and collaborating institutions to synthesize and test 878 compounds for biological activity against proteases (Nsp5, Nsp3, TMPRSS2), nucleocapsid N, RdRP (only the Nsp12 domain), and (alpha) spike protein S. Overall, 27 compounds with weak inhibition/binding were experimentally identified by binding, cleavage, and/or viral suppression assays and are presented here. Open science approaches such as the one presented here contribute to the knowledge base of future drug discovery efforts in finding better SARS-CoV-2 treatments. R-AGR-3826 - COVID19-14715687-CovScreen (01/06/2020 - 31/01/2021) - GLAAB Enric

HAL - Normandie Université

Open Repository and Bibliography - Luxembourg

HAL-Pasteur