Search CORE

1,569 research outputs found

Novel topological descriptors for analyzing biological networks

Author: Armin A Graber
Dehmer Matthias M
Kurt K Varmuza
Matthias M Dehmer
Nicola N Barbarini
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Topological descriptors, other graph measures, and in a broader sense, graph-theoretical methods, have been proven as powerful tools to perform biological network analysis. However, the majority of the developed descriptors and graph-theoretical methods does not have the ability to take vertex- and edge-labels into account, e.g., atom- and bond-types when considering molecular graphs. Indeed, this feature is important to characterize biological networks more meaningfully instead of only considering pure topological information. Results In this paper, we put the emphasis on analyzing a special type of biological networks, namely bio-chemical structures. First, we derive entropic measures to calculate the information content of vertex- and edge-labeled graphs and investigate some useful properties thereof. Second, we apply the mentioned measures combined with other well-known descriptors to supervised machine learning methods for predicting Ames mutagenicity. Moreover, we investigate the influence of our topological descriptors - measures for only unlabeled vs. measures for labeled graphs - on the prediction performance of the underlying graph classification problem. Conclusions Our study demonstrates that the application of entropic measures to molecules representing graphs is useful to characterize such structures meaningfully. For instance, we have found that if one extends the measures for determining the structural information content of unlabeled graphs to labeled graphs, the uniqueness of the resulting indices is higher. Because measures to structurally characterize labeled graphs are clearly underrepresented so far, the further development of such methods might be valuable and fruitful for solving problems within biological network analysis.</p

CiteSeerX

Crossref

Springer

Directory of Open Access Journals

PubMed Central

New Polynomial-Based Molecular Descriptors with Low Degeneracy

Author: A Mowshowitz
A Robles-Kelly
A Schwaighofer
Armin Graber
B Jackson
CE Shannon
D Bonchev
D Bonchev
D Bonchev
D Bonchev
D Woodall
DM Cvetkovic
E Estrada
EV Konstantinova
EV Konstantinova
F Emmert-Streib
Fabio Rapallo
FM Dong
H Hosoya
H Scsibrany
H Wiener
I Gutman
J Bajorath
J Bang-Jensen
J Devillers
J Gasteiger
JA Ellis-Monaghan
K Hansen
L Lovász
L Pachter
Laurin A. J. Mueller
M Dehmer
M Dehmer
M Fukui
M Kovše
M Mignotte
M Randić
M Randić
Matthias Dehmer
MV Diudea
N Trinajstić
O Bretscher
O Ivanciuc
O Ivanciuc
O Mason
R Kamae
R Todeschini
R Todeschini
R Todeschini
S Wasserman
SC Basak
SC Basak
SE Stein
SJ Chen
TM Cover
Publication venue: Public Library of Science
Publication date: 30/07/2010
Field of study

In this paper, we introduce a novel graph polynomial called the ‘information polynomial’ of a graph. This graph polynomial can be derived by using a probability distribution of the vertex set. By using the zeros of the obtained polynomial, we additionally define some novel spectral descriptors. Compared with those based on computing the ordinary characteristic polynomial of a graph, we perform a numerical study using real chemical databases. We obtain that the novel descriptors do have a high discrimination power

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Rapid Quantification of Molecular Diversity for Selective Database Acquisition

Author: Ash J. E.
Ashton M. J.
Barnard J. M.
Basak S. C.
Basak S. C.
Boyd S. M.
David B. Turner
Downs G. M.
Downs G. M.
Holliday J. D.
Hu C.-Y.
Kearsley S. K.
Martin E. J.
Martin Y. C.
Nilakantan R.
Peter Willett
Shemetulskis N. E.
Simon M. Tyrrell
Voorhees E. M
Willett P.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/1997
Field of study

There is an increasing need to expand the structural diversity of the molecules investigated in lead-discovery programs. One way in which this can be achieved is by acquiring external datasets that will enhance an existing database. This paper describes a rapid procedure for the selection of external datasets using a measure of structural diversity that is calculated from sums of pairwise intermolecular structural similarities

CiteSeerX

Crossref

University of East Anglia digital repository

Evolutionary Computation and QSAR Research

Author: Aguiar-Pulido Vanessa
Cruz-Monteagudo Maykel
Dorado Julián
Gestal M.
Munteanu Cristian-Robert
Rabuñal Juan R.
Publication venue: 'Bentham Science Publishers Ltd.'
Publication date: 01/01/2013
Field of study

[Abstract] The successful high throughput screening of molecule libraries for a specific biological property is one of the main improvements in drug discovery. The virtual molecular filtering and screening relies greatly on quantitative structure-activity relationship (QSAR) analysis, a mathematical model that correlates the activity of a molecule with molecular descriptors. QSAR models have the potential to reduce the costly failure of drug candidates in advanced (clinical) stages by filtering combinatorial libraries, eliminating candidates with a predicted toxic effect and poor pharmacokinetic profiles, and reducing the number of experiments. To obtain a predictive and reliable QSAR model, scientists use methods from various fields such as molecular modeling, pattern recognition, machine learning or artificial intelligence. QSAR modeling relies on three main steps: molecular structure codification into molecular descriptors, selection of relevant variables in the context of the analyzed activity, and search of the optimal mathematical model that correlates the molecular descriptors with a specific activity. Since a variety of techniques from statistics and artificial intelligence can aid variable selection and model building steps, this review focuses on the evolutionary computation methods supporting these tasks. Thus, this review explains the basic of the genetic algorithms and genetic programming as evolutionary computation approaches, the selection methods for high-dimensional data in QSAR, the methods to build QSAR models, the current evolutionary feature selection methods and applications in QSAR and the future trend on the joint or multi-task feature selection methods.Instituto de Salud Carlos III, PIO52048Instituto de Salud Carlos III, RD07/0067/0005Ministerio de Industria, Comercio y Turismo; TSI-020110-2009-53)Galicia. Consellería de Economía e Industria; 10SIN105004P

Repositorio da Universidade da Coruña

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Recent Developments in Quantitative Graph Theory: Information Inequalities for Networks

Author: Dehmer Matthias
Sivakumar Lavanya
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

In this article, we tackle a challenging problem in quantitative graph theory. We establish relations between graph entropy measures representing the structural information content of networks. In particular, we prove formal relations between quantitative network measures based on Shannon's entropy to study the relatedness of those measures. In order to establish such information inequalities for graphs, we focus on graph entropy measures based on information functionals. To prove such relations, we use known graph classes whose instances have been proven useful in various scientific areas. Our results extend the foregoing work on information inequalities for graphs

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

Use of Statistical and Neural Net Approaches in Predicting Toxicity of Chemicals

Author: Brian D Gute
David Opitz
Gregory D Grunwald
Krishnan Balasubramanian
Subhash C Basak
Publication venue
Publication date: 03/04/2020
Field of study

Hierarchical quantitative structure-activity relationships (H-QSAR) have been developed as a new approach in constructing models for estimating physicochemical, biomedicinal, and toxicological properties of interest. This approach uses increasingly more complex molecular descriptors in a graduated approach to model building. In this study, statistical and neural network methods have been applied to the development of H-QSAR models for estimating the acute aquatic toxicity (LC 50 ) of 69 benzene derivatives to Pimephales promelas (fathead minnow). Topostructural, topochemical, geometrical, and quantum chemical indices were used as the four levels of the hierarchical method. It is clear from both the statistical and neural network models that topostructural indices alone cannot adequately model this set of congeneric chemicals. Not surprisingly, topochemical indices greatly increase the predictive power of both statistical and neural network models. Quantum chemical indices also add significantly to the modeling of this set of acute aquatic toxicity data

CiteSeerX