60,351 research outputs found

Graph theoretic methods for the analysis of structural relationships in biological macromolecules

Author: Altschul
Artymiuk
Barnard
Baxevanis
Benning
Berman
Bernstein
Brint
Bron
Bruno
Bryant
Crandell
Dean
Diestel
Doubet
Fan
Feizi
Figueras
Flores
Gardiner
Gati
Good
Gray
Groves
Gruer
Gund
Hagadone
Harrison
Holden
Hutchinson
Jasanoff
Johnson
Kanna
Klausner
Kleywegt
Koch
Kraulis
Lengauer
Lesk
Martin
McGregor
Messmer
Mitchell
Ollis
Pickering
Ray
Raymond
Read
Salton
Samudrala
Sayle
Simon
Srere
Sussenguth
Tesmer
Tinoco
Trinajstic
Tsukada
Ullmann
van Rijsbergen
Willett
Williams
Wilson
Zhang
Publication venue: 'Wiley'
Publication date: 01/01/2005

Subgraph isomorphism and maximum common subgraph isomorphism algorithms from graph theory provide an effective and efficient way of identifying structural relationships between biological macromolecules. They thus provide a natural complement to the pattern matching algorithms that are used in bioinformatics to identify sequence relationships. Examples are provided of the use of graph theory to analyze proteins for which three-dimensional crystallographic or NMR structures are available, focusing on the use of the Bron-Kerbosch clique detection algorithm to identify common folding motifs and of the Ullmann subgraph isomorphism algorithm to identify patterns of amino acid residues. Our methods are also applicable to other types of biological macromolecule, such as carbohydrate and nucleic acid structures.
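
As a rough illustration of the clique detection step named in the abstract, the sketch below implements the Bron-Kerbosch algorithm with pivoting on a small undirected graph. It is not the authors' implementation: the toy graph, the vertex labels, and the function name `bron_kerbosch` are assumptions made for this example, and the step that builds a correspondence graph from two protein structures is not shown.

```python
def bron_kerbosch(adjacency):
    """Return all maximal cliques of an undirected graph.

    adjacency maps each vertex to the set of its neighbours.
    """
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidate vertices, x: already-processed vertices
        if not p and not x:
            cliques.append(sorted(r))
            return
        # Pivot on the vertex with the most neighbours in p to prune branches.
        pivot = max(p | x, key=lambda v: len(adjacency[v] & p))
        for v in list(p - adjacency[pivot]):
            expand(r | {v}, p & adjacency[v], x & adjacency[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adjacency), set())
    return sorted(cliques)


# Hypothetical toy graph: a triangle A-B-C with a tail C-D-E.
graph = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C", "E"},
    "E": {"D"},
}
maximal_cliques = bron_kerbosch(graph)
```

In a structure comparison setting, each vertex would typically stand for a pairing of elements from two structures and each edge for a compatible pair of pairings, so that a large clique corresponds to a large common substructure.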

CiteSeerX

Crossref

White Rose Research Online

Sussex Research Online

Entropy-scaling search of massive biological data

Author: Berger Bonnie
Daniels Noah M.
Danko David Christian
Yu Y. William
Publication venue: 'Elsevier BV'
Publication date: 01/06/2015

Many datasets exhibit a well-defined structure that can be exploited to design faster search tools, but it is not always clear when such acceleration is possible. Here, we introduce a framework for similarity search based on characterizing a dataset's entropy and fractal dimension. We prove that searching scales in time with metric entropy (number of covering hyperspheres), if the fractal dimension of the dataset is low, and scales in space with the sum of metric entropy and information-theoretic entropy (randomness of the data). Using these ideas, we present accelerated versions of standard tools, with no loss in specificity and little loss in sensitivity, for use in three domains: high-throughput drug screening (Ammolite, 150x speedup), metagenomics (MICA, 3.5x speedup of DIAMOND [3,700x BLASTX]), and protein structure search (esFragBag, 10x speedup of FragBag). Our framework can be used to achieve "compressive omics," and the general theory can be readily applied to data science problems outside of biology. Comment: Including supplement: 41 pages, 6 figures, 4 tables, 1 bo
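
The covering-hypersphere idea in the abstract can be sketched as a two-stage search: a coarse pass over cluster centres, then a fine pass over the members of the clusters that survive. The code below is a minimal sketch under stated assumptions, not the Ammolite, MICA, or esFragBag implementation: the greedy clustering, the function names, and the one-dimensional integer data are all hypothetical, and the distance is assumed to be a true metric so that the triangle inequality holds.

```python
def build_cover(points, cluster_radius, dist):
    """Greedily assign each point to the first centre within cluster_radius."""
    clusters = []  # list of (centre, members) pairs
    for p in points:
        for centre, members in clusters:
            if dist(centre, p) <= cluster_radius:
                members.append(p)
                break
        else:
            # No existing centre covers p, so p starts a new cluster.
            clusters.append((p, [p]))
    return clusters


def search(clusters, query, radius, cluster_radius, dist):
    """Return all stored points within radius of query."""
    hits = []
    for centre, members in clusters:
        # Coarse step: by the triangle inequality, a point within `radius`
        # of the query must belong to a cluster whose centre lies within
        # radius + cluster_radius of the query.
        if dist(centre, query) <= radius + cluster_radius:
            # Fine step: check only the members of the surviving clusters.
            hits.extend(m for m in members if dist(m, query) <= radius)
    return hits


def absolute_difference(a, b):
    return abs(a - b)


# Hypothetical data set with three well-separated groups.
data = [0, 4, 9, 50, 53, 98, 101]
cover = build_cover(data, 10, absolute_difference)
found = search(cover, 52, 5, 10, absolute_difference)
```

The fewer clusters needed to cover the data, the cheaper the coarse pass, which is the intuition behind search time scaling with the number of covering hyperspheres.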

arXiv.org e-Print Archive

Elsevier - Publisher Connector

DSpace@MIT

PubMed Central

A computer system to perform structure comparison using TOPS representations of protein structure

Author: Gilbert D
Thornton J
Viksna J
Westhead D
Publication venue: 'Elsevier BV'
Publication date: 01/01/2001

We describe the design and implementation of a fast topology-based method for protein structure comparison. The approach uses the TOPS topological representation of protein structure, aligning two structures using a common discovered pattern and generating a measure of distance derived from an insert score. Heavy use is made of a constraint-based pattern matching algorithm for TOPS diagrams that we have designed and described elsewhere (Gilbert et al., 1999). The comparison system is maintained at the European Bioinformatics Institute and is available over the Web at tops.ebi.ac.uk/tops. Users submit a structure description in Protein Data Bank (PDB) format and can compare it with structures in the entire PDB or a representative subset of protein domains, receiving the results by email.

Brunel University Research Archive

Toward a multilevel representation of protein molecules: comparative approaches to the aggregation/folding propensity problem

Author: Giuliani Alessandro
Livi Lorenzo
Rizzi Antonello
Publication venue: 'Elsevier BV'
Publication date: 29/04/2015

This paper builds upon the fundamental work of Niwa et al. [34], which provides the unique possibility to analyze the relative aggregation/folding propensity of the elements of the entire Escherichia coli (E. coli) proteome in a cell-free standardized microenvironment. The difficulty of the problem comes from the superposition of the driving forces of intra- and inter-molecular interactions, and it is mirrored by the evidence of shifts from folding to aggregation phenotypes caused by single-point mutations [10]. Here we apply several state-of-the-art classification methods from the field of structural pattern recognition, with the aim of comparing different representations of the same proteins gathered from the Niwa et al. database; such representations include sequences and labeled (contact) graphs enriched with chemico-physical attributes. Through this comparison, we are also able to identify some interesting general properties of proteins. Notably, (i) we suggest a threshold around 250 residues discriminating "easily foldable" from "hardly foldable" molecules, consistent with other independent experiments, and (ii) we highlight the relevance of contact graph spectra for folding behavior discrimination and characterization of the E. coli solubility data. The soundness of the experimental results presented in this paper is supported by the statistically relevant relationships discovered between the chemico-physical description of proteins and the developed cost matrix of substitution used in the various discrimination systems. Comment: 17 pages, 3 figures, 46 reference

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza

Topology-based protein structure comparison using a pattern discovery technique

Author: Gilbert D
Thornton J
Viksna J
Westhead D
Publication venue: 'University of Birmingham Library Services'
Publication date: 01/01/2000

Brunel University Research Archive

ConSole: using modularity of contact maps to locate solenoid domains in protein structures.

Author: Godzik Adam
Hrabe Thomas
Publication venue: eScholarship, University of California
Publication date: 01/01/2014

Background: Periodic proteins, characterized by the presence of multiple repeats of short motifs, form an interesting and seldom-studied group. Due to often extreme divergence in sequence, detection and analysis of such motifs is performed more reliably on the structural level. Yet, few algorithms have been developed for the detection and analysis of structures of periodic proteins. Results: ConSole recognizes modularity in protein contact maps, allowing for precise identification of repeats in solenoid protein structures, an important subgroup of periodic proteins. Tests on benchmarks show that ConSole has higher recognition accuracy than Raphael, the only other publicly available solenoid structure detection tool. As a next step of ConSole analysis, we show how detection of solenoid repeats in structures can be used to improve sequence recognition of these motifs and to detect subtle irregularities of repeat lengths in three solenoid protein families. Conclusions: The ConSole algorithm provides a fast and accurate tool to recognize solenoid protein structures as a whole and to identify individual solenoid repeat units from a structure. ConSole is available as a web-based, interactive server and is available for download at http://console.sanfordburnham.org
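
For readers unfamiliar with the contact maps the abstract refers to, the sketch below builds one from a list of residue coordinates: two residues are marked as in contact when their representative atoms lie within a distance cutoff. This only illustrates the input representation and is not ConSole's modularity detection. The 8.0 cutoff, the choice of one coordinate per residue, and the four evenly spaced points are assumptions made for this example.

```python
import math


def contact_map(coords, cutoff=8.0):
    """Return a symmetric boolean matrix of residue-residue contacts.

    coords holds one (x, y, z) tuple per residue; the cutoff value is an
    assumed threshold, not one taken from the ConSole paper.
    """
    n = len(coords)
    contacts = [[False] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                contacts[i][j] = True
                contacts[j][i] = True
    return contacts


# Hypothetical chain of four residues spaced 3.8 units apart on a line.
chain = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
cmap = contact_map(chain)
```

In a repeat protein, contacts between equivalent positions of successive repeats would be expected to show up as regularly spaced bands parallel to the diagonal of such a matrix.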

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Computation of protein geometry and its applications: Packing and function prediction

Author: A. Bondi
A. Goede
A.C. Wallace
A.C.R. Martin
A.E. Todd
B. Lee
B.J. Gellatly
C. Hu
C.A. Orengo
D. Fischer
F. Glaser
F.M. Richards
G. Rhodes
G.M. Crippen
H. Edelsbrunner
J. Liang
J. Tsai
J. Word
J. Zhang
J.L. Finney
K.W. Kratky
L. Guibas
L. Holm
M. Levitt
M. Petitjean
M.L. Connolly
O. Lichtarge
P. Røgen
P.J. Artymiuk
R. Bader
R. Norel
R. Russell
R.A. Laskowski
R.K. Singh
S. Chakravarty
T. Binkowski
T.A. Binkowski
T.J. Richmond
W. Rieping
W. Zheng
X. Li
Y. Harpaz
Y. Tseng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/01/2006

This chapter discusses geometric models of biomolecules and geometric constructs, including the union of balls model, the weighted Voronoi diagram, the weighted Delaunay triangulation, and alpha shapes. These geometric constructs enable fast and analytical computation of shapes of biomolecules (including features such as voids and pockets) and metric properties (such as area and volume). The algorithms for Delaunay triangulation, computation of voids and pockets, as well as volume/area computation are also described. In addition, applications in packing analysis of protein structures and protein function prediction are also discussed. Comment: 32 pages, 9 figure
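
To make the idea of analytical volume computation on a union of balls concrete, the sketch below handles only the simplest case, two overlapping balls, using inclusion-exclusion with the closed-form volume of a sphere-sphere intersection. It is not the alpha shape algorithm described in the chapter, which extends this kind of inclusion-exclusion to many atoms; the function names and the example radii are assumptions made for illustration.

```python
import math


def ball_volume(r):
    """Volume of a ball of radius r."""
    return 4.0 / 3.0 * math.pi * r ** 3


def intersection_volume(r1, r2, d):
    """Volume shared by two balls whose centres are a distance d apart."""
    if d >= r1 + r2:
        return 0.0  # disjoint or touching balls
    if d <= abs(r1 - r2):
        return ball_volume(min(r1, r2))  # one ball lies inside the other
    # Closed-form volume of the lens formed by two intersecting spheres.
    return (math.pi * (r1 + r2 - d) ** 2
            * (d * d + 2.0 * d * (r1 + r2) - 3.0 * (r1 - r2) ** 2)
            / (12.0 * d))


def union_volume(c1, r1, c2, r2):
    """Volume of the union of two balls by inclusion-exclusion."""
    d = math.dist(c1, c2)
    return ball_volume(r1) + ball_volume(r2) - intersection_volume(r1, r2, d)


# Hypothetical pair of unit-radius atoms whose centres are one unit apart.
overlapping = union_volume((0.0, 0.0, 0.0), 1.0, (1.0, 0.0, 0.0), 1.0)
```

With more than two balls, pairwise terms alone overcount regions where three or more balls overlap, which is why a structure such as the alpha shape is needed to select the terms that actually contribute.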

arXiv.org e-Print Archive

Crossref