Search CORE

64,115 research outputs found

Sibe: a computation tool to apply protein sequence statistics to predict folding and design in silico.

Author: Cheung Ngaam J
Yu Wookyung
Publication venue: BMC Bioinformatics
Publication date: 01/09/2019
Field of study

BACKGROUND: Evolutionary information contained in the amino acid sequences of proteins specifies the biological function and fold, but exactly what information contained in the protein sequence drives both of these processes? Considerable progress has been made to answer this fundamental question, but it remains challenging to explore the potential space of cooperative interactions between amino acids. Statistical analysis plays a significant role in studying such interactions and its use has expanded in recent years to studies ranging from coevolution-guided rational protein design to protein folding in silico. RESULTS: Here we describe a computational tool named Sibe for use in studies of protein sequence, folding, and design using evolutionary coupling between amino acids as a driving factor. In this study, Sibe is used to identify positionally conserved couplings between pairwise amino acids and aid rational protein design. In this process, pairwise couplings are filtered according to the relative entropy computed from the positional conservations and grouped into several 'blocks', which could contribute to driving protein folding and design. A human β2-adrenergic receptor (β2AR) was used to demonstrate that those 'blocks' contribute the rational design for specifying functional residues. Sibe also provides folding modules based on both the positionally conserved couplings and well-established statistical potentials for simulating protein folding in silico and predicting tertiary structure. Our results show that statistically inferences of basic evolutionary principles, such as conservations and coupled-mutations, can be used to rapidly design a diverse set of proteins and study protein folding. CONCLUSIONS: The developed software Sibe provides a computational tool for systematical analysis from protein primary to its tertiary structure using the evolutionary couplings as a driving factor. Sibe, written in C++, accounts for compatibility with the 'big data' era in biological science, and it primarily focuses on protein sequence analysis, but it is also applicable to extend to other modeling and predictions of experimental measurements

DGIST Library Institutional Repository

Deriving a mutation index of carcinogenicity using protein structure and protein interfaces

Author: A Custodio
A David
A Dixit
A Hamosh
A Pal
AJ Bass
Anna Tramontano
B Reva
B Vogelstein
CJ Richardson
CM Croce
D Chasman
D Sims
D Talavera
D Xu
E Krissinel
EC Chao
ER Mardis
F Damm
Frances Pearl
G Birrane
G De Baets
H Boutselakis
H Carter
H Makishima
IA Adzhubei
IS Moreira
J Carlsson
Jarle Hakas
JM Hurst
JM Izarzugaza
JR Morris
K Wang
Konstantinos Mitsopoulos
L Breiman
L Ding
M Li
M Magrane
Marketa Zvelebil
MR Stratton
MR Stratton
MS Greenblatt
MW MacArthur
MY Frederic
Octavio Espinosa
P Flicek
P Kumar
P Srivastava
PA Chan
PA Futreal
PB Crowley
PC Ng
PC Ng
PD Stenson
PH Lee
PT Wan
PV Hornbeck
PY Chou
R Ferla
R Rajasekaran
RJ Kinsella
S Jones
S Sunyaev
S Velankar
SA Forbes
TM Anne
V Ramensky
W Huang da
W Kabsch
X Wang
X Wang
Y Bromberg
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/

CiteSeerX

Directory of Open Access Journals

FigShare