Search CORE

17,345 research outputs found

Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

Author: Cang Zixuan
Mu Lin
Wei Guowei
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/08/2017
Field of study

This work introduces a number of algebraic topology approaches, such as multicomponent persistent homology, multi-level persistent homology and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. Multicomponent persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for chemical and biological problems. Extensive numerical experiments involving more than 4,000 protein-ligand complexes from the PDBBind database and near 100,000 ligands and decoys in the DUD database are performed to test respectively the scoring power and the virtual screening power of the proposed topological approaches. It is demonstrated that the present approaches outperform the modern machine learning based methods in protein-ligand binding affinity predictions and ligand-decoy discrimination

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare

7th German Conference on Chemoinformatics: 25 CIC-Workshop : Goslar, Germany, 6 - 8 November 2011 ; meeting abstracts / Edited by Frank Oellien, Uli Fechner and Thomas Engel

Author: Engel Thomas
Fechner Uli
Oellien Frank
Publication venue
Publication date: 01/05/2012
Field of study

Hochschulschriftenserver - Universität Frankfurt am Main

A topological approach for protein classification

Author: Cang Zixuan
Mu Lin
Opron Kristopher
Wei Guo-Wei
Wu Kedi
Xia Kelin
Publication venue
Publication date: 01/01/2015
Field of study

Protein function and dynamics are closely related to its sequence and structure. However prediction of protein function and dynamics from its sequence and structure is still a fundamental challenge in molecular biology. Protein classification, which is typically done through measuring the similarity be- tween proteins based on protein sequence or physical information, serves as a crucial step toward the understanding of protein function and dynamics. Persistent homology is a new branch of algebraic topology that has found its success in the topological data analysis in a variety of disciplines, including molecular biology. The present work explores the potential of using persistent homology as an indepen- dent tool for protein classification. To this end, we propose a molecular topological fingerprint based support vector machine (MTF-SVM) classifier. Specifically, we construct machine learning feature vectors solely from protein topological fingerprints, which are topological invariants generated during the filtration process. To validate the present MTF-SVM approach, we consider four types of problems. First, we study protein-drug binding by using the M2 channel protein of influenza A virus. We achieve 96% accuracy in discriminating drug bound and unbound M2 channels. Additionally, we examine the use of MTF-SVM for the classification of hemoglobin molecules in their relaxed and taut forms and obtain about 80% accuracy. The identification of all alpha, all beta, and alpha-beta protein domains is carried out in our next study using 900 proteins. We have found a 85% success in this identifica- tion. Finally, we apply the present technique to 55 classification tasks of protein superfamilies over 1357 samples. An average accuracy of 82% is attained. The present study establishes computational topology as an independent and effective alternative for protein classification

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Molecular architecture of human polycomb repressive complex 2.

Author: Aebersold Ruedi
Ciferri Claudio
Herzog Franz
Lander Gabriel C
Maiolica Alessio
Nogales Eva
Publication venue: eScholarship, University of California
Publication date: 01/10/2012
Field of study

Polycomb Repressive Complex 2 (PRC2) is essential for gene silencing, establishing transcriptional repression of specific genes by tri-methylating Lysine 27 of histone H3, a process mediated by cofactors such as AEBP2. In spite of its biological importance, little is known about PRC2 architecture and subunit organization. Here, we present the first three-dimensional electron microscopy structure of the human PRC2 complex bound to its cofactor AEBP2. Using a novel internal protein tagging-method, in combination with isotopic chemical cross-linking and mass spectrometry, we have localized all the PRC2 subunits and their functional domains and generated a detailed map of interactions. The position and stabilization effect of AEBP2 suggests an allosteric role of this cofactor in regulating gene silencing. Regions in PRC2 that interact with modified histone tails are localized near the methyltransferase site, suggesting a molecular mechanism for the chromatin-based regulation of PRC2 activity.DOI:http://dx.doi.org/10.7554/eLife.00005.001

PubMed Central

eScholarship - University of California

Three-dimensional architecture and biogenesis of membrane structures associated with hepatitis C virus replication

Author: Antony Claude
Bartenschlager Ralf
Chiramel Abhilash
Chlanda Petr
Habermann Anja
Haselman Uta
Hoppe Simone
Kallis Stephanie
Krijnse-Locker Jacomine
Lee Ji-Young
Merz Andreas
Romero-Brey Inés
Santarella-Mellwig Rachel
Walther Paul
Publication venue
Publication date: 01/01/2012
Field of study

All positive strand RNA viruses are known to replicate their genomes in close association with intracellular membranes. In case of the hepatitis C virus (HCV), a member of the family Flaviviridae, infected cells contain accumulations of vesicles forming a membranous web (MW) that is thought to be the site of viral RNA replication. However, little is known about the biogenesis and three-dimensional structure of the MW. In this study we used a combination of immunofluorescence- and electron microscopy (EM)-based methods to analyze the membranous structures induced by HCV in infected cells. We found that the MW is derived primarily from the endoplasmic reticulum (ER) and contains markers of rough ER as well as markers of early and late endosomes, COP vesicles, mitochondria and lipid droplets (LDs). The main constituents of the MW are single and double membrane vesicles (DMVs). The latter predominate and the kinetic of their appearance correlates with kinetics of viral RNA replication. DMVs are induced primarily by NS5A whereas NS4B induces single membrane vesicles arguing that MW formation requires the concerted action of several HCV replicase proteins. Three-dimensional reconstructions identify DMVs as protrusions from the ER membrane into the cytosol, frequently connected to the ER membrane via a neck-like structure. In addition, late in infection multi-membrane vesicles become evident, presumably as a result of a stress-induced reaction. Thus, the morphology of the membranous rearrangements induced in HCV-infected cells resemble those of the unrelated picorna-, corona- and arteriviruses, but are clearly distinct from those of the closely related flaviviruses. These results reveal unexpected similarities between HCV and distantly related positive-strand RNA viruses presumably reflecting similarities in cellular pathways exploited by these viruses to establish their membranous replication factories

CiteSeerX

Directory of Open Access Journals

PubMed Central

Hochschulschriftenserver - Universität Frankfurt am Main

FigShare

Context-aware visual exploration of molecular databases

Author: Berthold Michael
Di Fatta Giuseppe
Fiannaca Antonino
Gaglio Salvatore
Rizzo Riccardo
Urso Alfonso
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

Facilitating the visual exploration of scientific data has received increasing attention in the past decade or so. Especially in life science related application areas the amount of available data has grown at a breath taking pace. In this paper we describe an approach that allows for visual inspection of large collections of molecular compounds. In contrast to classical visualizations of such spaces we incorporate a specific focus of analysis, for example the outcome of a biological experiment such as high throughout screening results. The presented method uses this experimental data to select molecular fragments of the underlying molecules that have interesting properties and uses the resulting space to generate a two dimensional map based on a singular value decomposition algorithm and a self organizing map. Experiments on real datasets show that the resulting visual landscape groups molecules of similar chemical properties in densely connected regions

Central Archive at the University of Reading

CiteSeerX

Archivio istituzionale della ricerca - Università di Palermo

Visual and computational analysis of structure-activity relationships in high-throughput screening data

Author: Agrafiotis
Agrafiotis
Ahlberg
Ajay
Ajay
Bayada
Bemis
Bernard
Bonabeau
Brown
Calvert
Card
Chen
Chen
Cho
Christianini
Clark
Clark
Cox
Duda
Edwards
Engels
Frimurer
Gao
Garrido
Ghose
Gillet
Gillet
Hand
Hann
Haupts
Hayward
Hertzberg
Izrailev
Jiang
Jones-Hertzog
Kirew
Kobayashi
Kohonen
Ladd
Lee
Lepre
Martin
Mason
Mello
Meyer
Miller
Mitchell
Oprea
Peter Gedeck
Peter Willett
Poroikov
Rhodes
Roberts
Roberts
Ros
Rusinko
Sadowski
Sadowski
Scherf
Sheridan
Shi
Stanton
Su
Teague
Thompson
Tropsha
Tufte
Tufte
Wagener
Walters
Wang
Wedin
Xie
Xu
Zupan
Publication venue: 'Elsevier BV'
Publication date: 01/08/2001
Field of study

Novel analytic methods are required to assimilate the large volumes of structural and bioassay data generated by combinatorial chemistry and high-throughput screening programmes in the pharmaceutical and agrochemical industries. This paper reviews recent work in visualisation and data mining that can be used to develop structure-activity relationships from such chemical/biological datasets

Crossref

White Rose Research Online