Search CORE

14 research outputs found

Near-Native Protein Loop Sampling Using Nonparametric Density Estimation Accommodating Sparcity

Unlike the core structural elements of a protein like regular secondary structure, template based modeling (TBM) has difficulty with loop regions due to their variability in sequence and structure as well as the sparse sampling from a limited number of homologous templates. We present a novel, knowledge-based method for loop sampling that leverages homologous torsion angle information to estimate a continuous joint backbone dihedral angle density at each loop position. The φ,ψ distributions are estimated via a Dirichlet process mixture of hidden Markov models (DPM-HMM). Models are quickly generated based on samples from these distributions and were enriched using an end-to-end distance filter. The performance of the DPM-HMM method was evaluated against a diverse test set in a leave-one-out approach. Candidates as low as 0.45 Å RMSD and with a worst case of 3.66 Å were produced. For the canonical loops like the immunoglobulin complementarity-determining regions (mean RMSD <2.0 Å), the DPM-HMM method performs as well or better than the best templates, demonstrating that our automated method recaptures these canonical loops without inclusion of any IgG specific terms or manual intervention. In cases with poor or few good templates (mean RMSD >7.0 Å), this sampling method produces a population of loop structures to around 3.66 Å for loops up to 17 residues. In a direct test of sampling to the Loopy algorithm, our method demonstrates the ability to sample nearer native structures for both the canonical CDRH1 and non-canonical CDRH3 loops. Lastly, in the realistic test conditions of the CASP9 experiment, successful application of DPM-HMM for 90 loops from 45 TBM targets shows the general applicability of our sampling method in loop modeling problem. These results demonstrate that our DPM-HMM produces an advantage by consistently sampling near native loop structure. The software used in this analysis is available for download at http://www.stat.tamu.edu/~dahl/software/cortorgles/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas A&M Repository

Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins

Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

VBN

A quality metric for homology modeling: the H-factor

Author: A Berglund
A Ilari
A Kolinski
A Sali
A Tramontano
A Wlodawer
AC Paiva
AE Keating
AE Torda
AG Murzin
AR Subramanian
AT Brunger
AT Brunger
B Wallner
BW Matthews
C Chothia
C Venclovas
C Venclovas
CG Roessler
CM Summa
CM Summa
D Baker
D Cozzetto
D Frishman
D Petrey
DH Ohlendorf
DT Jones
E di Luccio
E Di Luccio
E Saccenti
EL Sonnhammer
EN Brown
Eric di Luccio
G Chopra
G Vriend
GJ Kleywegt
GJ Kleywegt
H Yang
HW van Vlijmen
I Friedberg
IY Koh
J Kopp
J Moult
J Moult
J Moult
J Warringer
J Zhu
JC Kendrew
JD Thompson
JW Ponder
K Fidelis
K Wuthrich
K Wuthrich
K Wuthrich
KM Misura
KR Acharya
KR Acharya
LJ McGuffin
M Levitt
M Levitt
M Levitt
M Levitt
M Tress
M Tress
M Vasquez
M Wiederstein
MA Hanson
MA Olson
MJ Sippl
MY Shen
N Eswar
N Guex
N Siew
NV Buchete
ON Jensen
P Benkert
P Koehl
P Koehl
P Koehl
P Koehl
PA Alexander
Patrice Koehl
Q Fang
RA Laskowski
RC Edgar
RL Dunbrack Jr
RL Dunbrack Jr
RL Dunbrack Jr
S Grzesiek
SC Lovell
SC Lovell
SR Eddy
T Schwede
WJ Browne
X Yu
X Zhang
Publication venue: BioMed Central
Publication date: 01/02/2011
Field of study

Abstract Background The analysis of protein structures provides fundamental insight into most biochemical functions and consequently into the cause and possible treatment of diseases. As the structures of most known proteins cannot be solved experimentally for technical or sometimes simply for time constraints, <it>in silico </it>protein structure prediction is expected to step in and generate a more complete picture of the protein structure universe. Molecular modeling of protein structures is a fast growing field and tremendous works have been done since the publication of the very first model. The growth of modeling techniques and more specifically of those that rely on the existing experimental knowledge of protein structures is intimately linked to the developments of high resolution, experimental techniques such as NMR, X-ray crystallography and electron microscopy. This strong connection between experimental and <it>in silico </it>methods is however not devoid of criticisms and concerns among modelers as well as among experimentalists. Results In this paper, we focus on homology-modeling and more specifically, we review how it is perceived by the structural biology community and what can be done to impress on the experimentalists that it can be a valuable resource to them. We review the common practices and provide a set of guidelines for building better models. For that purpose, we introduce the H-factor, a new indicator for assessing the quality of homology models, mimicking the R-factor in X-ray crystallography. The methods for computing the H-factor is fully described and validated on a series of test cases. Conclusions We have developed a web service for computing the H-factor for models of a protein structure. This service is freely accessible at <url>http://koehllab.genomecenter.ucdavis.edu/toolkit/h-factor</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Clustering and percolation in protein loop structures

Author: A Andreeva
A Andreeva
A Fiser
A Krokhotin
A Nekouzadeh
A Roy
AG Murzin
AJ Niemi
AJ Niemi
Antti J. Niemi
B Widom
D Chivian
G Cottone
GA Petsko
HM Berman
HW van Vlijmen
I Sillitoe
I Sillitoe
J Moult
J Skolnick
J Vojtěchovskỳ
Jianfeng He
K Fidelis
K Hinsen
KG Wilson
L Schafer
L. D Faddeev
LP Kadanoff
M Chernodub
M Jamroz
M Lundgren
M Lundgren
M Lundgren
MA Olson
ME Fisher
MF Lucas
MJ Ablowitz
MP Jacobson
N Eswar
N Molkenthin
PG De Gennes
S Hu
S Hu
S Hu
S Rackovsky
T Ioannidou
T Schwede
UH Danielsson
W Kabsch
Wilson KG
X Peng
Xubiao Peng
Y Song
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

DaReUS-Loop: accurate loop modeling using fragments from remote or unrelated proteins

Author: A Fiser
A Ganesan
A Martin
A Roy
A Stein
AG Murzin
AM Bonvin
B Oliva
BW Brandt
C Marks
CA Orengo
CB Anfinsen
CH Wu
CM Deane
CS Ring
D Holtby
D Tobi
DA Goldfeld
DJ Mandell
E Michalsky
F Guyon
F Guyon
F Wilcoxon
GR Lee
H Park
HM Berman
HW Vlijmen van
J Ismer
J Moult
J Moult
J Söding
J Wojcik
J-B Reiser
JR López-Blanco
L Holm
L Shi
M Alvim-Gaston
M Fasnacht
M Huse
M Remmert
M-y Shen
MA Marti-Renom
MA Messih
N Fernandez-Fuentes
PW Hildebrand
R Tippana
RP Joosten
S Jones
S Liang
SD Rufino
SJ Wu
SW Wong
X Wang
Y Choi
Y Shen
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2018
Field of study

Abstract Despite efforts during the past decades, loop modeling remains a difficult part of protein structure modeling. Several approaches have been developed in the framework of crystal structures. However, for homology models, the modeling of loops is still far from being solved. We propose DaReUS-Loop, a data-based approach that identifies loop candidates mining the complete set of experimental structures available in the Protein Data Bank. Candidate filtering relies on local conformation profile-profile comparison, together with physico-chemical scoring. Applied to three different template-based test sets, DaReUS-Loop shows significant increase in the number of high-accuracy loops, and significant enhancement for modeling long loops. A special advantage is that our method proposes a prediction confidence score that correlates well with the expected accuracy of the loops. Strikingly, over 50% of successful loop models are derived from unrelated proteins, indicating that fragments under similar constraints tend to adopt similar structure, beyond mere homology

Crossref

Directory of Open Access Journals

HAL-Inserm

Protein Structure Modeling with MODELLER.

Author: A Fiser
A Sali
A Sali
AC May
B John
B Rost
B Webb
B Webb
BR Brooks
C Chothia
C Robinson
D Baker
D Eramian
D Eramian
D Russel
D Schneidman
D Schneidman-Duhovny
E Tjioe
EA Coutsias
F Alber
F Melo
G Wu
H Zhou
HM Berman
HW van Vlijmen
J Holcomb
J Shi
K Ginalski
K Lasker
KT Simons
LJ McGuffin
MA Marti-Renom
MA Marti-Renom
MF Lensink
MP Jacobson
MS Madhusudhan
MS Madhusudhan
MY Shen
N Eswar
N Fernandez-Fuentes
N Srinivasan
PA Steindel
R Das
R Karchin
R Sanchez
R Sanchez
RL Dunbrack Jr
S Goodwin
S Henikoff
S Vajda
S Zhao
SB Needleman
SF Altschul
SP Nguyen
T Schwede
TF Smith
U Pieper
WR Pearson
Y Karami
Y Zhang
Y Zhang
Z Xiang
Publication venue: eScholarship, University of California
Publication date: 01/01/2021
Field of study

Genome sequencing projects have resulted in a rapid increase in the number of known protein sequences. In contrast, only about one-hundredth of these sequences have been characterized at atomic resolution using experimental structure determination methods. Computational protein structure modeling techniques have the potential to bridge this sequence-structure gap. In the following chapter, we present an example that illustrates the use of MODELLER to construct a comparative model for a protein with unknown structure. Automation of a similar protocol has resulted in models of useful accuracy for domains in more than half of all known protein sequences

Crossref

eScholarship - University of California

The pH stability of foot-and-mouth disease virus

Author: A Jurgeit
A Kotecha
A Schneemann
A Vazquez-Calvo
A Vazquez-Calvo
A Vazquez-Calvo
AS Yang
C Vasquez
CE Fricks
D Bashford
D Garriga
DJ Yamashiro
Dong Li
E Domingo
E Domingo
E Luna
E Mullapudi
E Prchla
EE Fry
F Brown
F Caridi
F Caridi
F Sobrino
FF Maree
FM Ellard
GJ Belsham
GJ Belsham
H Wang
HC Levy
Hong Yuan
Huifang Bao
HW van Vlijmen
I Klapper
J Ren
J Srivastava
J Warwicker
JE Johnson
JF Newman
Jie Zhang
Jing Zhang
JK Biswal
K Lyu
KL Shingler
M Bostina
M Huss
M Marsh
M Suomalainen
MA Martin-Acebes
MA Martin-Acebes
MA Martinez
MG Mateu
MJ Grubman
MJ Grubman
MJ Sternberg
PG Stockley
Pinghua Li
Pu Sun
PW Mason
Qifeng Bai
R Acharya
R Mateo
RL Thurlkill
S Alexandersen
S Curry
S Curry
S Lea
SC Han
SE Bakker
T Kampmann
T Liang
T Liang
T Twomey
TJ Tuthill
TR Doel
V Marshansky
V Rincon
X Wang
Xingwen Bai
Xueqing Ma
Yimei Cao
Yingli Chen
YM Cao
Yuanfang Fu
Zaixin Liu
Zengjun Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref