Search CORE

Springer - Publisher Connector

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Large scale analysis of protein stability in OMIM disease related human protein variants

Author: Aggazio Francesco
Babbi Giulia
Casadio Rita
Fariselli Piero
Martelli Pier Luigi
Savojardo Castrense
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Modern genomic techniques allow to associate several Mendelian human diseases to single residue variations in different proteins. Molecular mechanisms explaining the relationship among genotype and phenotype are still under debate. Change of protein stability upon variation appears to assume a particular relevance in annotating whether a single residue substitution can or cannot be associated to a given disease. Thermodynamic properties of human proteins and of their disease related variants are lacking. In the present work, we take advantage of the available three dimensional structure of human proteins for predicting the role of disease related variations on the perturbation of protein stability

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio istituzionale della ricerca - Università di Padova

Institutional Research Information System University of Turin

Impact of the 237th Residue on the Folding of Human Carbonic Anhydrase II

Author: Alibés
Almstedt
Bordner
Cheng
Crowther
Dobson
Encinar
Feng
Hammarstrom
Hammarstrom
Hammarstrom
Hernández-Santoyo
Hu
Jiang
Kiel
Knaupp
Lindskog
Martensson
Ming-Jie Wu
Pang
Pey
Pocker
Potapov
Reich
Roth
Schymkowitz
Sly
Sly
Svensson
Szczepek
Yan Jiang
Yong-Bin Yan
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/04/2011
Field of study

The deficiency of human carbonic anhydrase II (HCAII) has been recognized to be associated with a disease called CAII deficiency syndrome (CADS). Among the many mutations, the P237H mutation has been characterized to lead to a significant decrease in the activity of the enzyme and in the Gibbs free energy of folding. However, sequence alignment indicated that the 237th residue of CAII is not fully conserved across all species. The FoldX theoretical calculations suggested that this residue did not significantly contribute to the overall folding of HCAII, since all mutants had small ΔΔG values (around 1 kcal/mol). The experimental determination indicated that at least three mutations affect HCAII folding significantly and the P237H mutation was the most deleterious one, suggesting that Pro237 was important to HCAII folding. The discrepancy between theoretical and experimental results suggested that caution should be taken when using the prediction methods to evaluate the details of disease-related mutations

Multidisciplinary Digital Publishing Institute

Multidisciplinary Digital Publishing Institute

A deep-learning sequence-based method to predict protein stability changes upon genetic variations

Author: Benevenuta S.
Birolo G.
Capriotti E.
Fariselli P.
Pancotti C.
Repetto V.
Sanavia T.
Publication venue: 'MDPI AG'
Publication date: 01/01/2021
Field of study

Several studies have linked disruptions of protein stability and its normal functions to disease. Therefore, during the last few decades, many tools have been developed to predict the free energy changes upon protein residue variations. Most of these methods require both sequence and structure information to obtain reliable predictions. However, the lower number of protein structures available with respect to their sequences, due to experimental issues, drastically limits the application of these tools. In addition, current methodologies ignore the antisymmetric property characterizing the thermodynamics of the protein stability: a variation from wild-type to a mutated form of the protein structure (XW→XM) and its reverse process (XM→XW) must have opposite values of the free energy difference (ΔΔGWM=−ΔΔGMW). Here we propose ACDC-NN-Seq, a deep neural network system that exploits the sequence information and is able to incorporate into its architecture the antisymmetry property. To our knowledge, this is the first convolutional neural network to predict protein stability changes relying solely on the protein sequence. We show that ACDC-NN-Seq compares favorably with the existing sequence-based methods

Institutional Research Information System University of Turin

Predicting changes in protein thermostability brought about by single- or multi-site mutations

Author: Chu Xiaoyu
Fan Yunliu
Tian Jian
Wu Ningfeng
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background An important aspect of protein design is the ability to predict changes in protein thermostability arising from single- or multi-site mutations. Protein thermostability is reflected in the change in free energy (ΔΔ<it>G</it>) of thermal denaturation. Results We have developed predictive software, Prethermut, based on machine learning methods, to predict the effect of single- or multi-site mutations on protein thermostability. The input vector of Prethermut is based on known structural changes and empirical measurements of changes in potential energy due to protein mutations. Using a 10-fold cross validation test on the M-dataset, consisting of 3366 mutants proteins from ProTherm, the classification accuracy of random forests and the regression accuracy of random forest regression were slightly better than support vector machines and support vector regression, whereas the overall accuracy of classification and the Pearson correlation coefficient of regression were 79.2% and 0.72, respectively. Prethermut performs better on proteins containing multi-site mutations than those with single mutations. Conclusions The performance of Prethermut indicates that it is a useful tool for predicting changes in protein thermostability brought about by single- or multi-site mutations and will be valuable in the rational design of proteins.</p

Springer - Publisher Connector

Online Research Database In Technology

Regional TMPRSS2 V197M Allele Frequencies Are Correlated with COVID-19 Case Fatality Rates.

Author: Bhak Jong
Bhak Youngjune
Blazyte Asta
Bolser Dan
Cho Yun Sung
Choi Hansol
Jeon Sungwon
Jeon Yeonsu
Kim Byung Chul
Manica Andrea
Ryoo Namhee
Ryu Hyojung
Shin Eun-Seok
Yoon Changhan
Publication venue: Mol Cells
Publication date: 01/09/2021
Field of study

Coronavirus disease, COVID-19 (coronavirus disease 2019), caused by SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), has a higher case fatality rate in European countries than in others, especially East Asian ones. One potential explanation for this regional difference is the diversity of the viral infection efficiency. Here, we analyzed the allele frequencies of a nonsynonymous variant rs12329760 (V197M) in the TMPRSS2 gene, a key enzyme essential for viral infection and found a significant association between the COVID-19 case fatality rate and the V197M allele frequencies, using over 200,000 present-day and ancient genomic samples. East Asian countries have higher V197M allele frequencies than other regions, including European countries which correlates to their lower case fatality rates. Structural and energy calculation analysis of the V197M amino acid change showed that it destabilizes the TMPRSS2 protein, possibly negatively affecting its ACE2 and viral spike protein processing

ScholarWorks@UNIST

Apollo (Cambridge)

Recommended from our members

A base measure of precision for protein stability predictors: structural sensitivity.

Author: Blundell Tom L
Caldararu Octav
Kepp Kasper P
Publication venue: BMC Bioinformatics
Publication date: 01/01/2021
Field of study

BACKGROUND: Prediction of the change in fold stability (ΔΔG) of a protein upon mutation is of major importance to protein engineering and screening of disease-causing variants. Many prediction methods can use 3D structural information to predict ΔΔG. While the performance of these methods has been extensively studied, a new problem has arisen due to the abundance of crystal structures: How precise are these methods in terms of structure input used, which structure should be used, and how much does it matter? Thus, there is a need to quantify the structural sensitivity of protein stability prediction methods. RESULTS: We computed the structural sensitivity of six widely-used prediction methods by use of saturated computational mutagenesis on a diverse set of 87 structures of 25 proteins. Our results show that structural sensitivity varies massively and surprisingly falls into two very distinct groups, with methods that take detailed account of the local environment showing a sensitivity of ~ 0.6 to 0.8 kcal/mol, whereas machine-learning methods display much lower sensitivity (~ 0.1 kcal/mol). We also observe that the precision correlates with the accuracy for mutation-type-balanced data sets but not generally reported accuracy of the methods, indicating the importance of mutation-type balance in both contexts. CONCLUSIONS: The structural sensitivity of stability prediction methods varies greatly and is caused mainly by the models and less by the actual protein structural differences. As a new recommended standard, we therefore suggest that ΔΔG values are evaluated on three protein structures when available and the associated standard deviation reported, to emphasize not just the accuracy but also the precision of the method in a specific study. Our observation that machine-learning methods deemphasize structure may indicate that folded wild-type structures alone, without the folded mutant and unfolded structures, only add modest value for assessing protein stability effects, and that side-chain-sensitive methods overstate the significance of the folded wild-type structure

Apollo (Cambridge)

Computational Refinement of Functional Single Nucleotide Polymorphisms Associated with ATM Gene

Author: A Basu
A Broeks
A Li
A Li
AY Ho
B. Rajith
C Ansong
C Greenman
C Schaffner
C Schaffner
C. George Priya Doss
CT Walsh
D Botstein
D Stoppa-Lyonnet
D Watters
David K. Crockett
E Capriotti
E Grasbon-Frodl
F Bullrich
FF Zhou
G Grillo
GPD C
GPD C
GPD C
GPD C
H Ashkenazy
H Chen
HY Yuan
I Vorechovsky
I Vorechovsky
J Reinders
J Ren
JE Lee
JH Nadeau
JS Palmer
K Julenius
KH Taylor
L Conde
L Izatt
M Cargill
M Mitui
M Toyoshima
MAR Yuille
MT Bedford
N Sandoval
N Sonenberg
OJ Bandele
P Kumar
PC Ng
R B
R Karchin
RL Plackett
S Castellvi-Bel
S Gilad
S Khan
S Matsuoka
S Morandell
S Stilgenbauer
SE Flanagan
SG Becker-Catania
SV Kozlov
SW Doniger
T Fukao
T Sasaki
T Stankovic
T Stankovic
TA Aly
TD Schneider
V Ramensky
W Xu
Y Sun
Y Xue
Publication venue: Public Library of Science
Publication date: 13/04/2012
Field of study

gene are the most common forms of genetic variations that account for various forms of cancer. However, the extent to which SNPs interferes with the gene regulation and affects cancer susceptibility remains largely unknown. gene. gene function can aid in better understanding of genetic differences in disease susceptibility

Public Library of Science (PLOS)

Improving the prediction of disease-related variants using protein three-dimensional structure

Author: B Li
CC Chang
E Capriotti
E Capriotti
E Capriotti
E Capriotti
E Capriotti
E Capriotti
EI Boyle
Emidio Capriotti
G Wainreb
H Berman
H Zhou
HapMap Consortium
International Human Genome Sequencing Consortium
J Pei
JS Kaminker
L Bao
L Bao
M Cargill
MA Care
ML Waters
P Baldi
P Yue
PC Ng
PC Ng
PD Thomas
PD Thomas
R Calabrese
R Guerois
R Karchin
RG Cotton
RJ Dobson
Russ B Altman
SF Altschul
SF Betz
ST Sherry
V Parthiban
V Ramensky
VG Krishnan
W Kabsch
Y Bromberg
YL Yip
Z Wang
ZQ Ye
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: Single Nucleotide Polymorphisms (SNPs) are an important source of human genome variability. Non-synonymous SNPs occurring in coding regions result in single amino acid polymorphisms (SAPs) that may affect protein function and lead to pathology. Several methods attempt to estimate the impact of SAPs using different sources of information. Although sequence-based predictors have shown good performance, the quality of these predictions can be further improved by introducing new features derived from three-dimensional protein structures.Results: In this paper, we present a structure-based machine learning approach for predicting disease-related SAPs. We have trained a Support Vector Machine (SVM) on a set of 3,342 disease-related mutations and 1,644 neutral polymorphisms from 784 protein chains. We use SVM input features derived from the protein's sequence, structure, and function. After dataset balancing, the structure-based method (SVM-3D) reaches an overall accuracy of 85%, a correlation coefficient of 0.70, and an area under the receiving operating characteristic curve (AUC) of 0.92. When compared with a similar sequence-based predictor, SVM-3D results in an increase of the overall accuracy and AUC by 3%, and correlation coefficient by 0.06. The robustness of this improvement has been tested on different datasets and in all the cases SVM-3D performs better than previously developed methods even when compared with PolyPhen2, which explicitly considers in input protein structure information.Conclusion: This work demonstrates that structural information can increase the accuracy of disease-related SAPs identification. Our results also quantify the magnitude of improvement on a large dataset. This improvement is in agreement with previously observed results, where structure information enhanced the prediction of protein stability changes upon mutation. Although the structural information contained in the Protein Data Bank is limiting the application and the performance of our structure-based method, we expect that SVM-3D will result in higher accuracy when more structural date become available. \ua9 2011 Capriotti; licensee BioMed Central Ltd

CiteSeerX

Springer - Publisher Connector