Search CORE

21,088 research outputs found

Prediction of Protein Binding Regions in Disordered Proteins

Author: A De Biasio
A Mohan
A Nakabachi
A Schlessinger
A Sigalov
AA Russo
AH Huber
AH Tong
AK Dunker
AK Dunker
AK Dunker
AS Kim
B Meszaros
B Rost
B Vogelstein
Bálint Mészáros
C Haynes
CA Galea
CA Galea
CJ Oldfield
CJ Oldfield
CM Preston
D Marc
D Wu
E Bochkareva
E Garner
EA Bienkiewicz
EA Waxman
ER Lacy
F Alber
F Ferron
FG Hanisch
H Xie
HD Ochs
HJ Dyson
HJ Dyson
HM Berman
István Simon
J Chen
J Liu
J Sampietro
JB Marchand
JC Hansen
JH Hoh
JJ Ward
JL Fauchere
K Gunasekaran
LM Iakoucheva
LM Iakoucheva
M Fuxreiter
M Fuxreiter
M Fuxreiter
M Hertzog
M Sickmeier
MK Sorenson
N Abdul-Manan
P Di Lello
P Puntervoll
P Romero
P Tompa
P Tompa
P Tompa
PE Wright
PH Kussie
PM Chumakov
R Albert
R Bruschweiler
R Dawson
R Kiss
R Mukhopadhyay
Rita Casadio
Romero
RS Spolar
RW Kriwacki
S Vucetic
SJ Demarest
SV Frankfort
T Fawcett
TD Hurley
V Neduva
V Perez-Brocal
V Vacic
VN Uversky
VN Uversky
VN Uversky
W Kabsch
WA Linke
X Li
Y Cheng
Y Cheng
Y Cheng
Y Zhang
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Zsuzsanna Dosztányi
Publication venue: Public Library of Science
Publication date: 01/05/2009
Field of study

Many disordered proteins function via binding to a structured partner and undergo a disorder-to-order transition. The coupled folding and binding can confer several functional advantages such as the precise control of binding specificity without increased affinity. Additionally, the inherent flexibility allows the binding site to adopt various conformations and to bind to multiple partners. These features explain the prevalence of such binding elements in signaling and regulatory processes. In this work, we report ANCHOR, a method for the prediction of disordered binding regions. ANCHOR relies on the pairwise energy estimation approach that is the basis of IUPred, a previous general disorder prediction method. In order to predict disordered binding regions, we seek to identify segments that are in disordered regions, cannot form enough favorable intrachain interactions to fold on their own, and are likely to gain stabilizing energy by interacting with a globular protein partner. The performance of ANCHOR was found to be largely independent from the amino acid composition and adopted secondary structure. Longer binding sites generally were predicted to be segmented, in agreement with available experimentally characterized examples. Scanning several hundred proteomes showed that the occurrence of disordered binding sites increased with the complexity of the organisms even compared to disordered regions in general. Furthermore, the length distribution of binding sites was different from disordered protein regions in general and was dominated by shorter segments. These results underline the importance of disordered proteins and protein segments in establishing new binding regions. Due to their specific biophysical properties, disordered binding sites generally carry a robust sequence signal, and this signal is efficiently captured by our method. Through its generality, ANCHOR opens new ways to study the essential functional sites of disordered proteins

Crossref

Directory of Open Access Journals

PubMed Central

Critical assessment of protein intrinsic disorder prediction.

Author: CAID Predictors .
DisProt Curators .
Necci M
Piovesan D
Tosatto SCE
Publication venue
Publication date: 01/05/2021
Field of study

Intrinsically disordered proteins, defying the traditional protein structure-function paradigm, are a challenge to study experimentally. Because a large part of our knowledge rests on computational predictions, it is crucial that their accuracy is high. The Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment was established as a community-based blind test to determine the state of the art in prediction of intrinsically disordered regions and the subset of residues involved in binding. A total of 43 methods were evaluated on a dataset of 646 proteins from DisProt. The best methods use deep learning techniques and notably outperform physicochemical methods. The top disorder predictor has Fmax = 0.483 on the full dataset and Fmax = 0.792 following filtering out of bona fide structured regions. Disordered binding regions remain hard to predict, with Fmax = 0.231. Interestingly, computing times among methods can vary by up to four orders of magnitude

UCL Discovery

Recommended from our members

Critical assessment of protein intrinsic disorder prediction.

Author: CAID Predictors
DisProt Curators
Necci Marco
Piovesan Damiano
Tosatto Silvio CE
Publication venue: Nat Methods
Publication date: 01/05/2021
Field of study

Apollo (Cambridge)

Disorder prediction methods, their applicability to different protein targets and their usefulness for guiding experimental studies

Author: Dunker
Jennifer Atkins
Lee
Liam McGuffin
Romero
Samuel Boateng
Thomas Sorensen
Publication venue: 'MDPI AG'
Publication date: 01/08/2015
Field of study

The role and function of a given protein is dependent on its structure. In recent years, however, numerous studies have highlighted the importance of unstructured, or disordered regions in governing a protein’s function. Disordered proteins have been found to play important roles in pivotal cellular functions, such as DNA binding and signalling cascades. Studying proteins with extended disordered regions is often problematic as they can be challenging to express, purify and crystallise. This means that interpretable experimental data on protein disorder is hard to generate. As a result, predictive computational tools have been developed with the aim of predicting the level and location of disorder within a protein. Currently, over 60 prediction servers exist, utilizing different methods for classifying disorder and different training sets. Here we review several good performing, publicly available prediction methods, comparing their application and discussing how disorder prediction servers can be used to aid the experimental solution of protein structure. The use of disorder prediction methods allows us to adopt a more targeted approach to experimental studies by accurately identifying the boundaries of ordered protein domains so that they may be investigated separately, thereby increasing the likelihood of their successful experimental solution

Multidisciplinary Digital Publishing Institute

Central Archive at the University of Reading

Crossref

Directory of Open Access Journals

PubMed Central

Spritz: a server for the prediction of intrinsically disordered regions in protein sequences using kernel machines

Author: Bortolami Oscar
Pollastri Gianluca
Tosatto Silvio C. E.
Vullo Alessandro
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

Intrinsically disordered proteins have long stretches of their polypeptide chain, which do not adopt a single native structure composed of stable secondary and tertiary structure in the absence of binding partners. The prediction of intrinsically disordered regions in proteins from sequence is increasingly becoming of interest, as the presence of many such regions in the complete genome sequences are discovered and important functional roles are associated with them. We have developed a machine learning approach based on two support vector machines (SVM) to discriminate disordered regions from sequence. The SVM are trained and benchmarked on two sets, representing long and short disordered regions. A preliminary version of Spritz was shown to perform consistently well at the recent biannual CASP-6 experiment [Critical Assessment of Techniques for Protein Structure Prediction (CASP), 2004]. The fully developed Spritz method is freely available as a web server at and

PubMed Central

Catalogo dei prodotti della ricerca

Archivio istituzionale della ricerca - Università di Padova

Markov Models of Amino Acid Substitution to Study Proteins with Intrinsically Disordered Regions

Author: Anisimova Maria
Szalkowski Adam M.
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Intrinsically disordered proteins (IDPs) or proteins with disordered regions (IDRs) do not have a well-defined tertiary structure, but perform a multitude of functions, often relying on their native disorder to achieve the binding flexibility through changing to alternative conformations. Intrinsic disorder is frequently found in all three kingdoms of life, and may occur in short stretches or span whole proteins. To date most studies contrasting the differences between ordered and disordered proteins focused on simple summary statistics. Here, we propose an evolutionary approach to study IDPs, and contrast patterns specific to ordered protein regions and the corresponding IDRs.Two empirical Markov models of amino acid substitutions were estimated, based on a large set of multiple sequence alignments with experimentally verified annotations of disordered regions from the DisProt database of IDPs. We applied new methods to detect differences in Markovian evolution and evolutionary rates between IDRs and the corresponding ordered protein regions. Further, we investigated the distribution of IDPs among functional categories, biochemical pathways and their preponderance to contain tandem repeats. disorder prediction using a phylogenetic Hidden Markov Model based on our matrices showed a performance similar to other disorder predictors

Public Library of Science (PLOS)

Repository for Publications and Research Data

Directory of Open Access Journals

PubMed Central

Critical assessment of protein intrinsic disorder prediction

Author: Aykac-Fas Burcu
Bassot Claudio
Benítez Guillermo Ignacio
Bevilacqua Martina
Bitard-Feildel Tristan
Caid Predictors
Callebaut Isabelle
Chasapi Anastasia
Chemes Lucia Beatriz
Cheng Jianlin
Cozzetto Domenico
Davey Norman
Davidović Radoslav
Disprot Curators
Dosztányi Zsuzsanna
Dunker A. Keith
Elofsson Arne
Erdős Gábor
Galzitskaya Oxana Valerianovna
Gao Jianzhao
González-Foutel Nicolás S.
Govindarajan Sudha
Gsponer Jörg
Guharoy Mainak
Hajdu-Soltész Borbála
Hanson Jack
Hatos András
Hoque Md Tamjidul
Horvath Tamas
Hu Gang
Iglesias Valentin
Iqbal Sumaiya
Jones David T.
Kajava Andrey V.
Kovacs Orsolya Panna
Kurgan Lukasz
Lamb John
Lambrughi Matteo
Lazar Tamas
Leclercq Jeremy Y.
Leonardi Emanuela
Litfin Thomas
Lobanov Michail Yu
Macedo-Ribeiro Sandra
Macossay-Castillo Mauricio
Maiani Emiliano
Malhis Nawar
Manso Jose Antonio
Marino-Buslje Cristina
Martínez-Pérez Elizabeth
Meng Fanchi
Minervini Giovanni
Mirabello Claudio
Mičetić Ivan
Monzon Alexander Miguel
Murvai Nikoletta
Mészáros Bálint
Necci Marco
Orlando Gabriele
Ouzounis Christos
Pajkos Mátyás
Paladin Lisanna
Paliwal Kuldip
Palopoli Nicolás
Pancsa Rita
Papaleo Elena
Parisi Gustavo
Peng Zhenling
Pereira Pedro José Barbosa
Piovesan Damiano
Promponas Vasilis J.
Pujols Jordi
Quaglia Federica
Raimondi Daniele
Salvatore Marco
Schad Eva
Sharma Alok
Sharma Ronesh
Sormanni Pietro
Szabo Beata
Szaniszló Tamás
Tamana Stella
Tantos Agnes
Tompa Peter
Tosatto Silvio C. E.
Veljkovic Nevena
Vendruscolo Michele
Ventura Salvador
Vranken Wim
Wallner Björn
Walsh Ian
Wang Chen
Wang Kui
Wang Sheng
Wu Tianqi
Wu Zhonghua
Xu Jinbo
Yan Jing
Zhou Yaoqi
Álvarez Lucía
Publication venue: Nature Methods
Publication date: 01/01/2021
Field of study

Abstract: Intrinsically disordered proteins, defying the traditional protein structure–function paradigm, are a challenge to study experimentally. Because a large part of our knowledge rests on computational predictions, it is crucial that their accuracy is high. The Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment was established as a community-based blind test to determine the state of the art in prediction of intrinsically disordered regions and the subset of residues involved in binding. A total of 43 methods were evaluated on a dataset of 646 proteins from DisProt. The best methods use deep learning techniques and notably outperform physicochemical methods. The top disorder predictor has Fmax = 0.483 on the full dataset and Fmax = 0.792 following filtering out of bona fide structured regions. Disordered binding regions remain hard to predict, with Fmax = 0.231. Interestingly, computing times among methods can vary by up to four orders of magnitude

CONICET Digital

HAL-IRD

Diposit Digital de Documents de la UAB

Apollo (Cambridge)

A new census of protein tandem repeats and their relationship with intrinsic disorder

Author: Anisimova Maria
Delucchi Matteo
Elofsson Arne
Sachenkova Oxana
Schaper Elke
Publication venue: 'MDPI AG'
Publication date: 09/04/2020
Field of study

Protein tandem repeats (TRs) are often associated with immunity-related functions and diseases. Since that last census of protein TRs in 1999, the number of curated proteins increased more than seven-fold and new TR prediction methods were published. TRs appear to be enriched with intrinsic disorder and vice versa. The significance and the biological reasons for this association are unknown. Here, we characterize protein TRs across all kingdoms of life and their overlap with intrinsic disorder in unprecedented detail. Using state-of-the-art prediction methods, we estimate that 50.9% of proteins contain at least one TR, often located at the sequence flanks. Positive linear correlation between the proportion of TRs and the protein length was observed universally, with Eukaryotes in general having more TRs, but when the difference in length is taken into account the difference is quite small. TRs were enriched with disorder-promoting amino acids and were inside intrinsically disordered regions. Many such TRs were homorepeats. Our results support that TRs mostly originate by duplication and are involved in essential functions such as transcription processes, structural organization, electron transport and iron-binding. In viruses, TRs are found in proteins essential for virulence

Multidisciplinary Digital Publishing Institute

ZHAW digitalcollection

Critical assessment of protein intrinsic disorder prediction

Author: Davidović Radoslav S.
Necci Marco
Piovesan Damiano
Tosatto Silvio C. E.
Veljković Nevena V.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Intrinsically disordered proteins, defying the traditional protein structure–function paradigm, are a challenge to study experimentally. Because a large part of our knowledge rests on computational predictions, it is crucial that their accuracy is high. The Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment was established as a community-based blind test to determine the state of the art in prediction of intrinsically disordered regions and the subset of residues involved in binding. A total of 43 methods were evaluated on a dataset of 646 proteins from DisProt. The best methods use deep learning techniques and notably outperform physicochemical methods. The top disorder predictor has F max = 0.483 on the full dataset and F max = 0.792 following filtering out of bona fide structured regions. Disordered binding regions remain hard to predict, with F max = 0.231. Interestingly, computing times among methods can vary by up to four orders of magnitude

Repository of the Vinča Nuclear Institute (VinaR)

D2P2: database of disordered protein predictions

Author: Dosztányi Zsuzsanna
Ishida Takashi
Oates Matt E.
Romero Pedro
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2013
Field of study

We present the Database of Disordered Protein Prediction (D2P2), available at http://d2p2.pro (including website source code). A battery of disorder predictors and their variants, VL-XT, VSL2b, PrDOS, PV2, Espritz and IUPred, were run on all protein sequences from 1765 complete proteomes (to be updated as more genomes are completed). Integrated with these results are all of the predicted (mostly structured) SCOP domains using the SUPERFAMILY predictor. These disorder/structure annotations together enable comparison of the disorder predictors with each other and examination of the overlap between disordered predictions and SCOP domains on a large scale. D2P2 will increase our understanding of the interplay between disorder and structure, the genomic distribution of disorder, and its evolutionary history. The parsed data are made available in a unified format for download as flat files or SQL tables either by genome, by predictor, or for the complete set. An interactive website provides a graphical view of each protein annotated with the SCOP domains and disordered regions from all predictors overlaid (or shown as a consensus). There are statistics and tools for browsing and comparing genomes and their disorder within the context of their position on the tree of life. © The Author(s) 2012. Published by Oxford University Press

Repository of the Academy's Library