Search CORE

2 research outputs found

The LabelHash algorithm for substructure matching

Background: There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. However, experimental protein function determination is expensive and very time-consuming. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Results: We present LabelHash, a novel algorithm for matching substructural motifs to large collections of protein structures. The algorithm consists of two phases. In the first phase the proteins are preprocessed in a fashion that allows for instant lookup of partial matches to any motif. In the second phase, partial matches for a given motif are expanded to complete matches. The general applicability of the algorithm is demonstrated with three different case studies. First, we show that we can accurately identify members of the enolase superfamily with a single motif. Next, we demonstrate how LabelHash can complement SOIPPA, an algorithm for motif identification and pairwise substructure alignment. Finally, a large collection of Catalytic Site Atlas motifs is used to benchmark the performance of the algorithm. LabelHash runs very efficiently in parallel; matching a motif against all proteins in the 95 % sequence identity filtered non-redundant Protein Data Bank typically takes no more than a few minutes. The LabelHash algorithm is available through a web server and as a suite of standalone programs a

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

The Faecal Microbiome of Building-Dwelling Insectivorous Bats (Myotis myotis and Rhinolophus hipposideros) also Contains Antibiotic-Resistant Bacterial Representatives

Author: A Aljorayid
A Gharout-Sait
A López-Baucells
A Nowakiewicz
A Valiente-Banuet
A Vandžurová
AH Parrey
AO Oluduro
B Mohan
BB Chomel
C Di Bella
C Dietz
CC Voigt
CC Voigt
CD Phillips
CH Calisher
CH Calisher
CM McAney
D Nováková
D Russo
DS Daniel
E Afonso
ER Dumont
G Jones
G Neuweiler
G Xiao
G-Z Zhao
H Günthard
H Radhouani
H Wu
I Ezechukwu
I Konieczna
J Li
J Spergser
J Wolkers-Rooijackers
JG Boyles
JK Emerson
K Mühldorfer
K Mühldorfer
K Mühldorfer
M Carrillo-Araujo
M Dietrich
M Kosoy
M Uhrin
M Vengust
MJ Ramos Pereira
MJ Stuckey
MM Galicia
MM Newman
MR Ingala
N González-Quiñonez
NB Simmons
O Gaona
O Gaona
O Gaona
P Bandelj
PD Klite
PK Misra
R Arlettaz
R Hodgkison
RA Medellin
S Banskar
S Banskar
S De Mandal
S Smith
TH Kunz
V Veikkolainen
VC Cláudio
VY Fofanov
WC Hazeleger
WG Weisburg
X Puig-Montserrat
XY Zhu
Y Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref