Search CORE

41 research outputs found

Algorithm engineering for optimal alignment of protein structure distance matrices

Author: A. Andreeva
A. Caprara
A. Marin
A. Schrijver
C. Berbalk
D. Wu
D.A. Pelta
E. Althaus
G. Mayr
Gunnar W. Klau
H. Hasegawa
H.P. Lenhof
I. Wohlers
Inken Wohlers
L. Holm
N. Malod-Dognin
P. Di Lena
R. Andonov
R. Kolodny
R.H. Lathrop
Rumen Andonov
T. Havel
T. Kawabata
W. Xie
W.R. Taylor
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Protein structural alignment is an important problem in computational biology. In this paper, we present first successes on provably optimal pairwise alignment of protein inter-residue distance matrices, using the popular Dali scoring function. We introduce the structural alignment problem formally, which enables us to express a variety of scoring functions used in previous work as special cases in a unified framework. Further, we propose the first mathematical model for computing optimal structural alignments based on dense inter-residue distance matrices. We therefore reformulate the problem as a special graph problem and give a tight integer linear programming model. We then present algorithm engineering techniques to handle the huge integer linear programs of real-life distance matrix alignment problems. Applying these techniques, we can compute provably optimal Dali alignments for the very first time

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Crossref

CWI's Institutional Repository

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

HAL-Rennes 1

A Mathematical Framework for Protein Structure Comparison

Author: A Srivastava
A Srivastava
A Zemla
AG Murzin
Anuj Srivastava
AR Ortiz
AS Konagurthu
B Kolbeck
C Berbalk
CA Orengo
CA Orengo
DL Theobald
E Klassen
E Krissinel
F Teichert
G Mayr
H Hasegawa
HM Berman
IN Shindyalov
J Dundas
J Ebert
J Zhang
J Zhang
J Zhu
JF Gibrat
Jinfeng Zhang
K Illergard
L Holm
L Holm
L Lo Conte
M Levitt
M Menke
M Shatsky
M Shatsky
MJ Sippl
N Furnham
O Dror
P Koehl
PD Dobson
QS Du
R Kolodny
R Kolodny
R Mosca
R Mosca
Roland L. Dunbrack
S Kurtek
SH Joshi
SR Eddy
VA Ilyin
W Mio
Wei Liu
WR Taylor
X Zhou
Y Ye
Y Zhang
YJ Huang
Publication venue: Public Library of Science
Publication date: 03/02/2011
Field of study

Comparison of protein structures is important for revealing the evolutionary relationship among proteins, predicting protein functions and predicting protein structures. Many methods have been developed in the past to align two or multiple protein structures. Despite the importance of this problem, rigorous mathematical or statistical frameworks have seldom been pursued for general protein structure comparison. One notable issue in this field is that with many different distances used to measure the similarity between protein structures, none of them are proper distances when protein structures of different sequences are compared. Statistical approaches based on those non-proper distances or similarity scores as random variables are thus not mathematically rigorous. In this work, we develop a mathematical framework for protein structure comparison by treating protein structures as three-dimensional curves. Using an elastic Riemannian metric on spaces of curves, geodesic distance, a proper distance on spaces of curves, can be computed for any two protein structures. In this framework, protein structures can be treated as random variables on the shape manifold, and means and covariance can be computed for populations of protein structures. Furthermore, these moments can be used to build Gaussian-type probability distributions of protein structures for use in hypothesis testing. The covariance of a population of protein structures can reveal the population-specific variations and be helpful in improving structure classification. With curves representing protein structures, the matching is performed using elastic shape analysis of curves, which can effectively model conformational changes and insertions/deletions. We show that our method performs comparably with commonly used methods in protein structure classification on a large manually annotated data set

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central