Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation
Representing patterns as labeled graphs is becoming increasingly common in
the broad field of computational intelligence. Accordingly, a wide repertoire
of pattern recognition tools, such as classifiers and knowledge discovery
procedures, are nowadays available and tested for various datasets of labeled
graphs. However, the design of effective learning procedures operating in the
space of labeled graphs is still a challenging problem, especially from the
computational complexity viewpoint. In this paper, we present a major
improvement of a general-purpose classifier for graphs, which is conceived on
an interplay between dissimilarity representation, clustering,
information-theoretic techniques, and evolutionary optimization algorithms. The
improvement focuses on a specific key subroutine devised to compress the input
data. We prove different theorems which are fundamental to the setting of the
parameters controlling such a compression operation. We demonstrate the
effectiveness of the resulting classifier by benchmarking the developed
variants on well-known datasets of labeled graphs, considering as distinct
performance indicators the classification accuracy, computing time, and
parsimony in terms of structural complexity of the synthesized classification
models. The results show state-of-the-art test set accuracy and a considerable speed-up in computing time.
Comment: Revised version
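As a rough illustration of the entropy machinery this abstract refers to, the sketch below estimates the Rényi entropy of a dissimilarity representation from a histogram of prototype distances. The histogram estimator, the `dist` callable, and the prototype set are illustrative assumptions, not the paper's actual compression subroutine.

```python
import numpy as np

def renyi_entropy(probs, alpha=2.0):
    # Rényi entropy of order alpha: H_a = log(sum_i p_i^a) / (1 - a),
    # converging to the Shannon entropy as alpha -> 1.
    p = np.asarray(probs, dtype=float)
    p = p[p > 0]
    if abs(alpha - 1.0) < 1e-12:
        return float(-np.sum(p * np.log(p)))  # Shannon limit
    return float(np.log(np.sum(p ** alpha)) / (1.0 - alpha))

def dissimilarity_entropy(X, prototypes, dist, alpha=2.0, bins=10):
    # Dissimilarity representation: each pattern becomes a vector of
    # distances to a set of prototypes; the entropy is then estimated
    # from a histogram of those distances (an illustrative choice).
    D = np.array([[dist(x, p) for p in prototypes] for x in X])
    hist, _ = np.histogram(D.ravel(), bins=bins)
    return renyi_entropy(hist / hist.sum(), alpha)
```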
Non-linear regression models for Approximate Bayesian Computation
Approximate Bayesian inference on the basis of summary statistics is
well-suited to complex problems for which the likelihood is either
mathematically or computationally intractable. However, the methods that use
rejection suffer from the curse of dimensionality when the number of summary
statistics is increased. Here we propose a machine-learning approach to the
estimation of the posterior density by introducing two innovations. The new
method fits a nonlinear conditional heteroscedastic regression of the parameter
on the summary statistics, and then adaptively improves estimation using
importance sampling. The new algorithm is compared to the state-of-the-art
approximate Bayesian methods, and achieves considerable reduction of the
computational burden in two examples of inference in statistical genetics and
in a queueing model.
Comment: 4 figures; version 3 minor changes; to appear in Statistics and Computing
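The regression-adjustment idea described above can be sketched as follows; a polynomial least-squares fit stands in for the paper's nonlinear (neural-network) conditional model, and the heteroscedastic variance correction and importance-sampling stage are omitted for brevity.

```python
import numpy as np

def regression_abc_adjust(theta, S, s_obs, degree=2):
    # Regression adjustment for ABC: fit a conditional-mean model m(s)
    # of the parameter given the summary statistic, then shift each
    # accepted draw toward the observed summary:
    #   theta_adj = m(s_obs) + (theta - m(s)).
    # A polynomial fit on a single summary is an illustrative stand-in
    # for the nonlinear regression models used in the paper.
    s = np.asarray(S, dtype=float)
    V = np.vander(s, degree + 1)            # design matrix [s^d ... 1]
    coef, *_ = np.linalg.lstsq(V, theta, rcond=None)
    m_s = V @ coef                          # fitted means at simulated s
    m_obs = np.vander(np.array([s_obs]), degree + 1) @ coef
    return m_obs[0] + (theta - m_s)         # adjusted posterior sample
```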
Evolutionary cellular configurations for designing feed-forward neural networks architectures
Proceedings of: 6th International Work-Conference on Artificial and Natural Neural Networks, IWANN 2001, Granada, Spain, June 13–15, 2001
In recent years, interest in automatic methods for determining appropriate architectures of feed-forward neural networks has increased. Most of these methods are based on evolutionary computation paradigms. Some of them rely on direct representations of the network's parameters; such representations do not scale, so representing large architectures requires very large structures. A more interesting alternative is indirect schemes, which encode a compact representation of the neural network. In this work, an indirect constructive encoding scheme is presented. The scheme is based on cellular automata representations, in order to increase the scalability of the method.
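A toy version of such a cellular encoding might look like the following, where an elementary cellular automaton grown from a small seed is decoded into hidden-layer widths. The specific rule, seed, and decoding convention are illustrative assumptions, not the scheme proposed in the paper.

```python
import numpy as np

def ca_step(state, rule=110):
    # One synchronous step of an elementary cellular automaton with
    # wrap-around boundaries; `rule` is the usual Wolfram rule number.
    table = [(rule >> i) & 1 for i in range(8)]
    left, right = np.roll(state, 1), np.roll(state, -1)
    idx = 4 * left + 2 * state + right
    return np.array([table[i] for i in idx])

def decode_architecture(seed, steps=3, rule=110):
    # Indirect encoding: iterate the CA from a compact seed and read one
    # hidden-layer width per step (the number of active cells). The
    # genome stays small however large the decoded network becomes.
    state = np.asarray(seed, dtype=int)
    widths = []
    for _ in range(steps):
        state = ca_step(state, rule)
        widths.append(int(state.sum()))
    return [w for w in widths if w > 0]   # drop empty layers
```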
Computational strategies for dissecting the high-dimensional complexity of adaptive immune repertoires
The adaptive immune system recognizes antigens via an immense array of
antigen-binding antibodies and T-cell receptors, the immune repertoire. The
interrogation of immune repertoires is of high relevance for understanding the
adaptive immune response in disease and infection (e.g., autoimmunity, cancer,
HIV). Adaptive immune receptor repertoire sequencing (AIRR-seq) has driven the
quantitative and molecular-level profiling of immune repertoires thereby
revealing the high-dimensional complexity of the immune receptor sequence
landscape. Several methods for the computational and statistical analysis of
large-scale AIRR-seq data have been developed to resolve immune repertoire
complexity in order to understand the dynamics of adaptive immunity. Here, we
review the current research on (i) diversity, (ii) clustering and network,
(iii) phylogenetic and (iv) machine learning methods applied to dissect,
quantify and compare the architecture, evolution, and specificity of immune
repertoires. We summarize outstanding questions in computational immunology and
propose future directions for systems immunology towards coupling AIRR-seq with
the computational discovery of immunotherapeutics, vaccines, and
immunodiagnostics.
Comment: 27 pages, 2 figures
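Repertoire diversity, the first family of methods reviewed above, is commonly summarized with Hill numbers; a minimal sketch, assuming clone counts have already been extracted from AIRR-seq data:

```python
import math

def hill_diversity(clone_counts, q):
    # Hill number of order q for a clonal repertoire:
    #   D_q = (sum_i p_i^q)^(1/(1-q)),  with D_1 = exp(Shannon entropy).
    # q = 0 is species richness (clone count); larger q progressively
    # down-weights rare clones.
    total = sum(clone_counts)
    p = [c / total for c in clone_counts if c > 0]
    if q == 1:
        return math.exp(-sum(pi * math.log(pi) for pi in p))
    return sum(pi ** q for pi in p) ** (1.0 / (1.0 - q))
```

A perfectly even repertoire gives the same Hill number at every order, while clonal expansion pulls the higher-order numbers down.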
Detection of regulator genes and eQTLs in gene networks
Genetic differences between individuals associated to quantitative phenotypic
traits, including disease states, are usually found in non-coding genomic
regions. These genetic variants are often also associated to differences in
expression levels of nearby genes (they are "expression quantitative trait
loci" or eQTLs for short) and presumably play a gene regulatory role, affecting
the status of molecular networks of interacting genes, proteins and
metabolites. Computational systems biology approaches to reconstruct causal
gene networks from large-scale omics data have therefore become essential to
understand the structure of networks controlled by eQTLs together with other
regulatory genes, and to generate detailed hypotheses about the molecular
mechanisms that lead from genotype to phenotype. Here we review the main
analytical methods and software tools to identify eQTLs and their associated genes,
to reconstruct co-expression networks and modules, to reconstruct causal
Bayesian gene and module networks, and to validate predicted networks in
silico.
Comment: minor revision with typos corrected; review article; 24 pages, 2 figures
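The first step reviewed above, identifying eQTLs, amounts at its simplest to regressing expression on genotype dosage variant by variant; a minimal single-gene sketch (real pipelines add covariates, permutations, and multiple-testing correction):

```python
import numpy as np

def eqtl_scan(genotypes, expression):
    # Single-gene eQTL scan: for each variant, regress expression on
    # genotype dosage (0/1/2) and report the slope and squared
    # correlation, strongest association first.
    y = expression - expression.mean()
    hits = []
    for j in range(genotypes.shape[1]):
        g = genotypes[:, j] - genotypes[:, j].mean()
        denom = (g ** 2).sum()
        if denom == 0:              # monomorphic variant: skip
            continue
        beta = (g * y).sum() / denom
        r2 = beta ** 2 * denom / (y ** 2).sum()
        hits.append((j, beta, r2))
    return sorted(hits, key=lambda t: -t[2])
```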
Evolving artificial cell signaling networks using molecular classifier systems
Nature is a source of inspiration for computational techniques that have been successfully applied to a wide variety of complex application domains. In keeping with this, we examine Cell Signaling Networks (CSNs), chemical networks responsible for coordinating cell activities within their environment. Through evolution they have become highly effective at governing critical control processes such as immunological responses, cell cycle control and homeostasis. Realising (and evolving) Artificial Cell Signaling Networks (ACSNs) may provide new computational paradigms for a variety of application areas. Our abstraction of Cell Signaling Networks focuses on four characteristic properties: Computation, Evolution, Crosstalk and Robustness. These properties are also desirable for potential applications in the control systems, computation and signal processing fields, and they guide the development of an ACSN evolutionary simulation platform. In this paper we present a novel evolutionary approach, the Molecular Classifier System (MCS), to simulate such ACSNs. The MCS we have designed is derived from Holland's Learning Classifier System. This research is part of the multidisciplinary European-funded project ESIGNET, whose central aim is to study the computational properties of CSNs by evolving them with methods from evolutionary computation, and to re-apply this understanding in developing new ways to model and predict real CSNs.
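The rule-matching core of a Holland-style classifier system, reinterpreted chemically as in the ACSN setting, might be sketched as below; the ternary {0, 1, #} condition alphabet and the single reaction step are illustrative assumptions, not the authors' exact MCS.

```python
def matches(condition, molecule):
    # A classifier condition is a string over {0, 1, #}; '#' is a
    # wildcard, so '1#0' matches both '100' and '110'.
    return len(condition) == len(molecule) and all(
        c == '#' or c == m for c, m in zip(condition, molecule))

def react(rules, soup):
    # One reaction step of a toy molecular classifier system: every
    # rule whose condition matches a molecule in the soup emits its
    # product, mimicking an enzyme acting on a substrate.
    products = []
    for condition, product in rules:
        for mol in soup:
            if matches(condition, mol):
                products.append(product)
    return products
```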
Environmental boundary conditions for the origin of life converge to an organo-sulfur metabolism
Published in final edited form as: Nat Ecol Evol. 2019 December; 3(12): 1715–1724. doi:10.1038/s41559-019-1018-8.
It has been suggested that a deep memory of early life is hidden in the architecture of metabolic networks, whose reactions could have been catalyzed by small molecules or minerals before genetically encoded enzymes. A major challenge in unravelling these early steps is assessing the plausibility of a connected, thermodynamically consistent proto-metabolism under different geochemical conditions, which are still surrounded by high uncertainty. Here we combine network-based algorithms with physico-chemical constraints on chemical reaction networks to systematically show how different combinations of parameters (temperature, pH, redox potential and availability of molecular precursors) could have affected the evolution of a proto-metabolism. Our analysis of possible trajectories indicates that a subset of boundary conditions converges to an organo-sulfur-based proto-metabolic network fuelled by a thioester- and redox-driven variant of the reductive tricarboxylic acid cycle that is capable of producing lipids and keto acids. Surprisingly, environmental sources of fixed nitrogen and low-potential electron donors are not necessary for the earliest phases of biochemical evolution. We use one of these networks to build a steady-state dynamical metabolic model of a protocell, and find that different combinations of carbon sources and electron donors can support the continuous production of a minimal ancient 'biomass' composed of putative early biopolymers and fatty acids.
Funding: 80NSSC17K0295 - Intramural NASA; 80NSSC17K0296 - Intramural NASA; T32 GM100842 - NIGMS NIH HHS. Accepted manuscript.
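The network-based algorithms mentioned above are typically variants of network expansion; a minimal sketch, in which the seed compounds, reaction list, and convergence criterion are illustrative placeholders rather than the paper's actual dataset:

```python
def network_expansion(seed_compounds, reactions):
    # Network expansion: starting from a set of seed metabolites,
    # repeatedly fire any reaction whose substrates are all reachable,
    # adding its products, until no new compound appears. Each reaction
    # is a (substrates, products) pair of compound sets.
    reachable = set(seed_compounds)
    changed = True
    while changed:
        changed = False
        for substrates, products in reactions:
            if set(substrates) <= reachable and not set(products) <= reachable:
                reachable |= set(products)
                changed = True
    return reachable
```

Thermodynamic and physico-chemical constraints, as in the paper, would enter as filters on which reactions are allowed to fire under a given environment.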