11,076 research outputs found
Modeling and Analyzing Biomolecular Networks
The authors argue for the need to model and analyze biological networks at molecular and cellular levels. They propose a computational toolbox for biologists. Central to their approach is the paradigm of hybrid models in which discrete events are combined with continuous differential equations to capture switching behavior
Mathematics at the eve of a historic transition in biology
A century ago physicists and mathematicians worked in tandem and established
quantum mechanism. Indeed, algebras, partial differential equations, group
theory, and functional analysis underpin the foundation of quantum mechanism.
Currently, biology is undergoing a historic transition from qualitative,
phenomenological and descriptive to quantitative, analytical and predictive.
Mathematics, again, becomes a driving force behind this new transition in
biology.Comment: 5 pages, 2 figure
TopologyNet: Topology based deep convolutional neural networks for biomolecular property predictions
Although deep learning approaches have had tremendous success in image, video
and audio processing, computer vision, and speech recognition, their
applications to three-dimensional (3D) biomolecular structural data sets have
been hindered by the entangled geometric complexity and biological complexity.
We introduce topology, i.e., element specific persistent homology (ESPH), to
untangle geometric complexity and biological complexity. ESPH represents 3D
complex geometry by one-dimensional (1D) topological invariants and retains
crucial biological information via a multichannel image representation. It is
able to reveal hidden structure-function relationships in biomolecules. We
further integrate ESPH and convolutional neural networks to construct a
multichannel topological neural network (TopologyNet) for the predictions of
protein-ligand binding affinities and protein stability changes upon mutation.
To overcome the limitations to deep learning arising from small and noisy
training sets, we present a multitask topological convolutional neural network
(MT-TCNN). We demonstrate that the present TopologyNet architectures outperform
other state-of-the-art methods in the predictions of protein-ligand binding
affinities, globular protein mutation impacts, and membrane protein mutation
impacts.Comment: 20 pages, 8 figures, 5 table
Genes2Networks: Connecting Lists of Proteins by Using Background Literature-based Mammalian Networks
In recent years, in-silico literature-based mammalian protein-protein interaction network datasets have been developed. These datasets contain binary interactions extracted manually from legacy experimental biomedical research literature. Placing lists of genes or proteins identified as significantly changing in multivariate experiments, in the context of background knowledge about binary interactions, can be used to place these genes or proteins in the context of pathways and protein complexes.
Genes2Networks is a software system that integrates the content of ten mammalian literature-based interaction network datasets. Filtering to prune low-confidence interactions was implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from “seed” lists of human Entrez gene names. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is available at http://actin.pharm.mssm.edu/genes2networks.
Genes2Network is a powerful web-based software application tool that can help experimental biologists to interpret high-throughput experimental results used in genomics and proteomics studies where the output of these experiments is a list of significantly changing genes or proteins. The system can be used to find relationships between nodes from the seed list, and predict novel nodes that play a key role in a common function
Genes2Networks: Connecting Lists of Proteins by Using Background Literature-based Mammalian Networks
In recent years, in-silico literature-based mammalian protein-protein interaction network datasets have been developed. These datasets contain binary interactions extracted manually from legacy experimental biomedical research literature. Placing lists of genes or proteins identified as significantly changing in multivariate experiments, in the context of background knowledge about binary interactions, can be used to place these genes or proteins in the context of pathways and protein complexes.
Genes2Networks is a software system that integrates the content of ten mammalian literature-based interaction network datasets. Filtering to prune low-confidence interactions was implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from “seed” lists of human Entrez gene names. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Genes2Networks is available at http://actin.pharm.mssm.edu/genes2networks.
Genes2Network is a powerful web-based software application tool that can help experimental biologists to interpret high-throughput experimental results used in genomics and proteomics studies where the output of these experiments is a list of significantly changing genes or proteins. The system can be used to find relationships between nodes from the seed list, and predict novel nodes that play a key role in a common function
Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening
This work introduces a number of algebraic topology approaches, such as
multicomponent persistent homology, multi-level persistent homology and
electrostatic persistence for the representation, characterization, and
description of small molecules and biomolecular complexes. Multicomponent
persistent homology retains critical chemical and biological information during
the topological simplification of biomolecular geometric complexity.
Multi-level persistent homology enables a tailored topological description of
inter- and/or intra-molecular interactions of interest. Electrostatic
persistence incorporates partial charge information into topological
invariants. These topological methods are paired with Wasserstein distance to
characterize similarities between molecules and are further integrated with a
variety of machine learning algorithms, including k-nearest neighbors, ensemble
of trees, and deep convolutional neural networks, to manifest their descriptive
and predictive powers for chemical and biological problems. Extensive numerical
experiments involving more than 4,000 protein-ligand complexes from the PDBBind
database and near 100,000 ligands and decoys in the DUD database are performed
to test respectively the scoring power and the virtual screening power of the
proposed topological approaches. It is demonstrated that the present approaches
outperform the modern machine learning based methods in protein-ligand binding
affinity predictions and ligand-decoy discrimination
Multiscale virtual particle based elastic network model (MVP-ENM) for biomolecular normal mode analysis
In this paper, a multiscale virtual particle based elastic network model
(MVP-ENM) is proposed for biomolecular normal mode analysis. The multiscale
virtual particle model is proposed for the discretization of biomolecular
density data in different scales. Essentially, the model works as the
coarse-graining of the biomolecular structure, so that a delicate balance
between biomolecular geometric representation and computational cost can be
achieved. To form "connections" between these multiscale virtual particles, a
new harmonic potential function, which considers the influence from both mass
distributions and distance relations, is adopted between any two virtual
particles. Unlike the previous ENMs that use a constant spring constant, a
particle-dependent spring parameter is used in MVP-ENM. Two independent models,
i.e., multiscale virtual particle based Gaussian network model (MVP-GNM) and
multiscale virtual particle based anisotropic network model (MVP-ANM), are
proposed. Even with a rather coarse grid and a low resolution, the MVP-GNM is
able to predict the Debye-Waller factors (B-factors) with considerable good
accuracy. Similar properties have also been observed in MVP-ANM. More
importantly, in B-factor predictions, the mismatch between the predicted
results and experimental ones is predominantly from higher fluctuation regions.
Further, it is found that MVP-ANM can deliver a very consistent low-frequency
eigenmodes in various scales. This demonstrates the great potential of MVP-ANM
in the deformation analysis of low resolution data. With the multiscale
rigidity function, the MVP-ENM can be applied to biomolecular data represented
in density distribution and atomic coordinates. Further, the great advantage of
my MVP-ENM model in computational cost has been demonstrated by using two
poliovirus virus structures. Finally, the paper ends with a conclusion.Comment: 15 figures; 25 page
- …