Search CORE

7 research outputs found

Tableau-based protein substructure search using quadratic programming

Author: A Abyzov
A Caprara
A Caprara
A Guerler
A Harrison
AG Murzin
Alex Stivala
AM Lesk
Anthony Wirth
AP Kamat
AP Singh
AS Konagurthu
AS Konagurthu
B Kolbeck
B Thiruv
BK Koo
D Fischer
D Frishman
D Gilbert
DA Pelta
E Anderson
E Krissinel
GM Torrance
HK Ho
HM Berman
I Majumdar
J Jung
J Shapiro
JA Casbon
JA Hanley
JF Gibrat
JJ Dongarra
L Holm
ML Sierk
O Carugo
Peter J Stuckey
PR Elliott
S Kirillova
S Shi
SB Needleman
SS Krishna
T Hamelryck
T Madej
T Sing
TA Davis
TA Davis
TA Davis
TA Davis
V Sam
W Kabsch
W Xie
Y Ye
Y Ye
Y Ye
Z Gáspári
Z Li
Publication venue: BioMed Central
Publication date: 01/05/2009
Field of study

Abstract Background Searching for proteins that contain similar substructures is an important task in structural biology. The exact solution of most formulations of this problem, including a recently published method based on tableaux, is too slow for practical use in scanning a large database. Results We developed an improved method for detecting substructural similarities in proteins using tableaux. Tableaux are compared efficiently by solving the quadratic program (QP) corresponding to the quadratic integer program (QIP) formulation of the extraction of maximally-similar tableaux. We compare the accuracy of the method in classifying protein folds with some existing techniques. Conclusion We find that including constraints based on the separation of secondary structure elements increases the accuracy of protein structure search using maximally-similar subtableau extraction, to a level where it has comparable or superior accuracy to existing techniques. We demonstrate that our implementation is able to search a structural database in a matter of hours on a standard PC.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Fast and accurate protein substructure searching with simulated annealing and GPUs

Abstract Background Searching a database of protein structures for matches to a query structure, or occurrences of a structural motif, is an important task in structural biology and bioinformatics. While there are many existing methods for structural similarity searching, faster and more accurate approaches are still required, and few current methods are capable of substructure (motif) searching. Results We developed an improved heuristic for tableau-based protein structure and substructure searching using simulated annealing, that is as fast or faster and comparable in accuracy, with some widely used existing methods. Furthermore, we created a parallel implementation on a modern graphics processing unit (GPU). Conclusions The GPU implementation achieves up to 34 times speedup over the CPU implementation of tableau-based structure search with simulated annealing, making it one of the fastest available methods. To the best of our knowledge, this is the first application of a GPU to the protein structural search problem.</p

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

A fast indexing approach for protein structure comparison

Author: Arun S Konagurthu
James Bailey
Kotagiri Ramamohanarao
Lei Zhang
Publication venue: Springer Nature
Publication date: 01/01/2010
Field of study

BACKGROUND: Protein structure comparison is a fundamental task in structural biology. While the number of known protein structures has grown rapidly over the last decade, searching a large database of protein structures is still relatively slow using existing methods. There is a need for new techniques which can rapidly compare protein structures, whilst maintaining high matching accuracy. RESULTS: We have developed IR Tableau, a fast protein comparison algorithm, which leverages the tableau representation to compare protein tertiary structures. IR tableau compares tableaux using information retrieval style feature indexing techniques. Experimental analysis on the ASTRAL SCOP protein structural domain database demonstrates that IR Tableau achieves two orders of magnitude speedup over the search times of existing methods, while producing search results of comparable accuracy. CONCLUSION: We show that it is possible to obtain very significant speedups for the protein structure comparison problem, by employing an information retrieval style approach for indexing proteins. The comparison accuracy achieved is also strong, thus opening the way for large scale processing of very large protein structure databases

Springer - Publisher Connector

PubMed Central

University of Melbourne Institutional Repository

A fast indexing approach for protein structure comparison

Author: A Lesk
A Stivala
A Tversky
AG Murzin
AM Lesk
AP Kamat
Arun S Konagurthu
AS Konagurthu
AS Konagurthu
CA Orengo
E Krissinel
ES Shih
ES Shih
ESC Shih
FM Richards
HM Berman
I Michalopoulos
J Shapiro
James Bailey
JF Gibrat
Kotagiri Ramamohanarao
L Holm
Lei Zhang
M Carpentier
O Carugo
P Jaccard
S Kirillova
SE Brenner
SF Altschul
T Madej
W Lo
W Lo
W Lo
WL Delano
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

MALISAM: a database of structurally analogous motifs in proteins

Author: Altschul
Andreeva
Aravind
B.-H. Kim
Bateman
Brenner
Chothia
Copley
Dietmann
Finkelstein
GODZIK
H. Cheng
Harrison
Holm
Holm
Lin
Lindqvist
Matsuo
Murzin
N. V. Grishin
Oh
Orengo
Orengo
Ponting
Ruczinski
Russell
Sadreyev
Sleigh
Yamashita
Zemla
Zhang
Zhu
Publication venue: Oxford University Press
Publication date
Field of study

MALISAM (manual alignments for structurally analogous motifs) represents the first database containing pairs of structural analogs and their alignments. To find reliable analogs, we developed an approach based on three ideas. First, an insertion together with a part of the evolutionary core of one domain family (a hybrid motif) is analogous to a similar motif contained within the core of another domain family. Second, a motif at an interface, formed by secondary structural elements (SSEs) contributed by two or more domains or subunits contacting along that interface, is analogous to a similar motif present in the core of a single domain. Third, an artificial protein obtained through selection from random peptides or in sequence design experiments not biased by sequences of a particular homologous family, is analogous to a structurally similar natural protein. Each analogous pair is superimposed and aligned manually, as well as by several commonly used programs. Applications of this database may range from protein evolution studies, e.g. development of remote homology inference tools and discriminators between homologs and analogs, to protein-folding research, since in the absence of evolutionary reasons, similarity between proteins is caused by structural and folding constraints. The database is publicly available at http://prodata.swmed.edu/malisam

Crossref

PubMed Central

Conceptual Framework and Methodology for Analysing Previous Molecular Docking Results

Author: Temelkovski D.
Temelkovski D.
Publication venue
Publication date: 01/01/2019
Field of study

Modern drug discovery relies on in-silico computational simulations such as molecular docking. Molecular docking models biochemical interactions to predict where and how two molecules would bind. The results of large-scale molecular docking simulations can provide valuable insight into the relationship between two molecules. This is useful to a biomedical scientist before conducting in-vitro or in-vivo wet-lab experiments. Although this ˝eld has seen great advancements, feedback from biomedical scientists shows that there is a need for storage and further analysis of molecular docking results. To meet this need, biomedical scientists need to have access to computing, data, and network resources, and require speci˝c knowledge or skills they might lack. Therefore, a conceptual framework speci˝cally tailored to enable biomedical scientists to reuse molecular docking results, and a methodology which uses regular input from scientists, has been proposed. The framework is composed of 5 types of elements and 13 interfaces. The methodology is light and relies on frequent communication between biomedical sciences and computer science experts, speci˝ed by particular roles. It shows how developers can bene˝t from using the framework which allows them to determine whether a scenario ˝ts the framework, whether an already implemented element can be reused, or whether a newly proposed tool can be used as an element. Three scenarios that show the versatility of this new framework and the methodology based on it, have been identi˝ed and implemented. A methodical planning and design approach was used and it was shown that the implementations are at least as usable as existing solutions. To eliminate the need for access to expensive computing infrastructure, state-of-the-art cloud computing techniques are used. The implementations enable faster identi˝cation of new molecules for use in docking, direct querying of existing databases, and simpler learning of good molecular docking practice without the need to manually run multiple tools. Thus, the framework and methodol-ogy enable more user-friendly implementations, and less error-prone use of computational methods in drug discovery. Their use could lead to more e˙ective discovery of new drugs

WestminsterResearch

Structural bioinformatics Searching for three-dimensional secondary structural patterns in proteins with ProSMoS

Author: Indraneel Majumdar
Nick V. Grishin
S. Sri Krishna
Shuoyong Shi
Yi Zhong
Publication venue
Publication date
Field of study

Motivation: Many evolutionarily distant, but functionally meaningful links between proteins come to light through comparison of spatial structures. Most programs that assess structural similarity compare two proteins to each other and find regions in common between them. Structural classification experts look for a particular structural motif instead. Programs base similarity scores on superposition or closeness of either Cartesian coordinates or inter-residue contacts. Experts pay more attention to the general orientation of the main chain and mutual spatial arrangement of secondary structural elements. There is a need for a computational tool to find proteins with the same secondary structures, topological connections and spatial architecture, regardless of subtle differences in 3D coordinates. Results: We developed ProSMoS—a Protein Structure Motif Searc

CiteSeerX