Search CORE

14 research outputs found

Prediction of Protein Binding Regions in Disordered Proteins

Author: A De Biasio
A Mohan
A Nakabachi
A Schlessinger
A Sigalov
AA Russo
AH Huber
AH Tong
AK Dunker
AK Dunker
AK Dunker
AS Kim
B Meszaros
B Rost
B Vogelstein
Bálint Mészáros
C Haynes
CA Galea
CA Galea
CJ Oldfield
CJ Oldfield
CM Preston
D Marc
D Wu
E Bochkareva
E Garner
EA Bienkiewicz
EA Waxman
ER Lacy
F Alber
F Ferron
FG Hanisch
H Xie
HD Ochs
HJ Dyson
HJ Dyson
HM Berman
István Simon
J Chen
J Liu
J Sampietro
JB Marchand
JC Hansen
JH Hoh
JJ Ward
JL Fauchere
K Gunasekaran
LM Iakoucheva
LM Iakoucheva
M Fuxreiter
M Fuxreiter
M Fuxreiter
M Hertzog
M Sickmeier
MK Sorenson
N Abdul-Manan
P Di Lello
P Puntervoll
P Romero
P Tompa
P Tompa
P Tompa
PE Wright
PH Kussie
PM Chumakov
R Albert
R Bruschweiler
R Dawson
R Kiss
R Mukhopadhyay
Rita Casadio
Romero
RS Spolar
RW Kriwacki
S Vucetic
SJ Demarest
SV Frankfort
T Fawcett
TD Hurley
V Neduva
V Perez-Brocal
V Vacic
VN Uversky
VN Uversky
VN Uversky
W Kabsch
WA Linke
X Li
Y Cheng
Y Cheng
Y Cheng
Y Zhang
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Zsuzsanna Dosztányi
Publication venue: Public Library of Science
Publication date: 01/05/2009
Field of study

Many disordered proteins function via binding to a structured partner and undergo a disorder-to-order transition. The coupled folding and binding can confer several functional advantages such as the precise control of binding specificity without increased affinity. Additionally, the inherent flexibility allows the binding site to adopt various conformations and to bind to multiple partners. These features explain the prevalence of such binding elements in signaling and regulatory processes. In this work, we report ANCHOR, a method for the prediction of disordered binding regions. ANCHOR relies on the pairwise energy estimation approach that is the basis of IUPred, a previous general disorder prediction method. In order to predict disordered binding regions, we seek to identify segments that are in disordered regions, cannot form enough favorable intrachain interactions to fold on their own, and are likely to gain stabilizing energy by interacting with a globular protein partner. The performance of ANCHOR was found to be largely independent from the amino acid composition and adopted secondary structure. Longer binding sites generally were predicted to be segmented, in agreement with available experimentally characterized examples. Scanning several hundred proteomes showed that the occurrence of disordered binding sites increased with the complexity of the organisms even compared to disordered regions in general. Furthermore, the length distribution of binding sites was different from disordered protein regions in general and was dominated by shorter segments. These results underline the importance of disordered proteins and protein segments in establishing new binding regions. Due to their specific biophysical properties, disordered binding sites generally carry a robust sequence signal, and this signal is efficiently captured by our method. Through its generality, ANCHOR opens new ways to study the essential functional sites of disordered proteins

Crossref

Directory of Open Access Journals

PubMed Central

Systematic analysis of somatic mutations driving cancer: uncovering functional protein regions in disease development

Crossref

InterPro in 2017-beyond protein family and domain annotations

InterPro (http://www.ebi.ac.uk/interpro/) is a freely available database used to classify protein sequences into families and to predict the presence of important domains and sites. InterProScan is the underlying software that allows both protein and nucleic acid sequences to be searched against InterPro's predictive models, which are provided by its member databases. Here, we report recent developments with InterPro and its associated software, including the addition of two new databases (SFLD and CDD), and the functionality to include residue-level annotation and prediction of intrinsic disorder. These developments enrich the annotations provided by InterPro, increase the overall number of residues annotated and allow more specific functional inferences

Crossref

PubMed Central

eScholarship - University of California

The University of Manchester - Institutional Repository

Explore Bristol Research

Archivio istituzionale della ricerca - Università di Padova

An intrinsically disordered proteins community for ELIXIR.

Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) are now recognised as major determinants in cellular regulation. This white paper presents a roadmap for future e-infrastructure developments in the field of IDP research within the ELIXIR framework. The goal of these developments is to drive the creation of high-quality tools and resources to support the identification, analysis and functional characterisation of IDPs. The roadmap is the result of a workshop titled "An intrinsically disordered protein user community proposal for ELIXIR" held at the University of Padua. The workshop, and further consultation with the members of the wider IDP community, identified the key priority areas for the roadmap including the development of standards for data annotation, storage and dissemination; integration of IDP data into the ELIXIR Core Data Resources; and the creation of benchmarking criteria for IDP-related software. Here, we discuss these areas of priority, how they can be implemented in cooperation with the ELIXIR platforms, and their connections to existing ELIXIR Communities and international consortia. The article provides a preliminary blueprint for an IDP Community in ELIXIR and is an appeal to identify and involve new stakeholders

Maastricht University Research Portal

Birkbeck Institutional Research Online

ZORA

Repository of the Academy's Library

UPF Digital Repository

Apollo (Cambridge)

Institute of Cancer Research Repository

Archivio istituzionale della ricerca - Università di Padova

MFSPSSMpred: identifying short disorder-to-order binding regions in disordered proteins based on contextual local evolutionary conservation

Author: A Mohan
C Cheng-Wei
C Chica
C Chih-Chung
C Yugong
Chun Fang
CJ Oldfield
D Zsuzsanna
Daisuke Tominaga
ED Norman
ED Norman
ED Norman
FA Stephen
Hayato Yamana
JH Niall
JM Marcin
JW Jonathan
K Shimizu
KL Ioly
L McGuffin
M Fuxreiter
MD Fatemeh
RC Gonzalez
S Avner
Tamotsu Noguchi
V Vacic
Z Dosztanyi
Z Tuo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Caractérisation des périodes de sécheresse sur le domaine de l'Afrique simulée par le Modèle Régional Canadien du Climat (MRCC5)

Les conséquences des changements climatiques sur la fréquence ainsi que sur l'intensité des précipitations auront un impact direct sur les périodes de sécheresse et par conséquent sur différents secteurs économiques tels que le secteur de l'agriculture. Ainsi, dans cette étude, l'habilité du Modèle Régional Canadien du Climat (MRCC5) à simuler les différentes caractéristiques des périodes de sécheresse est évaluée pour 4 seuils de précipitation soit 0.5 mm, 1 mm, 2 mm et 3 mm. Ces caractéristiques incluent le nombre de jours secs, le nombre de périodes de sécheresse ainsi que le maximum de jours consécutifs sans précipitation associé à une récurrence de 5 ans. Les résultats sont présentés pour des moyennes annuelles et saisonnières. L'erreur de performance est évaluée en comparant le MRCC5 piloté par ERA-Interim aux données d'analyses du GPCP pour le climat présent (1997-2008). L'erreur due aux conditions aux frontières c'est-à-dire les erreurs de pilotage du MRCC5, soit par CanESM2 et par ERA-Interim ainsi que l'évaluation de la valeur ajoutée du MRCC5 face au CanESM2 sont également analysées. L'analyse de ces caractéristiques est également faite dans un contexte de climat changeant pour deux périodes futures, soit 2041-2070 et 2071-2100 à l'aide du MRCC5 piloté par le modèle de circulation générale CanESM2 de même que par le modèle CanESM2 sous le scénario RCP 4.5. Les résultats suggèrent que le MRCC5 piloté par ERA-Interim a tendance à surestimer la moyenne annuelle du nombre de jours secs ainsi que le maximum de jours consécutifs sans précipitation associé à une récurrence de 5 ans dans la plupart des régions de l'Afrique et une tendance à sous-estimer le nombre de périodes de sécheresse. En général, l'erreur de performance est plus importante que l'erreur due aux conditions aux frontières pour les différentes caractéristiques de périodes de sécheresse. Pour les régions équatoriales, les changements appréhendés par le MRCC5 piloté par CanESM2 pour les différentes caractéristiques de périodes de sécheresse et pour deux périodes futures (2041-2070 et 2071-2100), suggèrent une augmentation significatives du nombre de jours secs ainsi que du maximum de jours consécutifs sans précipitation associé à une récurrence de 5 ans. Une diminution significative du nombre de périodes de sécheresse est aussi prévue.\ud ______________________________________________________________________________ \ud MOTS-CLÉS DE L’AUTEUR : Modèle Régional du Climat, Changement climatique, Jours secs, Nombre de périodes de sécheresse, Événement de faible récurrence, Afriqu

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

HAL AMU

HAL Descartes

DI-fusion

Diposit Digital de Documents de la UAB

Hal-Diderot

Archivio istituzionale della ricerca - Università di Padova

USFSP Digital Archive

Crossref

IUPUIScholarWorks

INRIA a CCSD electronic archive server

PubMed Central

Archipel - Université du Québec à Montréal

Repository of the Vinča Nuclear Institute (VinaR)

Scholar Commons - University of South Florida

Pipeline for transferring annotations between proteins beyond globular domains

Author: Dosztanyi Zsuzsanna
Gibson Toby James
Marino Cristina Ester
Martinez Perez Elizabeth
Pajkos Mátyás
Tosatto Silvio C. E.
Publication venue: John Wiley & Sons
Publication date: 01/05/2023
Field of study

Background DisProt is the primary repository of Intrinsically Disordered Proteins (IDPs). This database is manually curated and the annotations there have strong experimental support. Currently, DisProt contains a relatively small number of proteins highlighting the importance of transferring annotations regarding verified disorder state and corresponding functions to homologous proteins in other species. In such a way, providing them with highly valuable information to better understand their biological roles. While the principles and practicalities of homology transfer are well-established for globular proteins, these are largely lacking for disordered proteins. Methods We used DisProt to evaluate the transferability of the annotation terms to orthologous proteins. For each protein, we looked for their orthologs, with the assumption that they will have a similar function. Then, for each protein and their orthologs we made multiple sequence alignments (MSAs). Disordered sequences are fast evolving and can be hard to align: Therefore we implemented alignment quality control steps ensuring robust alignments before mapping the annotations. Results We have designed a pipeline to obtain good quality MSAs and to transfer annotations from any protein to their orthologs. Applying the pipeline to DisProt proteins, from the 1,731 entries with 5,623 annotations we can reach 97,555 orthologs and transfer a total of 301,190 terms by homology. We also provide a web server for consulting the results of DisProt proteins and execute the pipeline for any other protein. The server Homology Transfer IDP (HoTIDP) is accessible at http://hotidp.leloir.org.ar.Fil: Martinez Perez, Elizabeth. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones Bioquímicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones Bioquímicas de Buenos Aires; Argentina. Fundación Instituto Leloir; ArgentinaFil: Pajkos, Mátyás. Eötvös University; ArgentinaFil: Tosatto, Silvio C. E.. Università di Padova; ItaliaFil: Gibson, Toby James. European Molecular Biology Laboratory Heidelberg; AlemaniaFil: Dosztanyi, Zsuzsanna. Eötvös University; ArgentinaFil: Marino, Cristina Ester. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones Bioquímicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones Bioquímicas de Buenos Aires; Argentina. Fundación Instituto Leloir; Argentin

CONICET Digital

(DP2)-P-2: database of disordered protein predictions

Author: Dosztanyi Zsuzsanna
Dunker A. Keith
Ghalwash Mohamed
Gough Julian
Ishida Takashi
Kurgan Lukasz
Mizianty Marcin J.
Oates Matthew
Obradovic Zoran
Romero Pedro
Uversky Vladimir N.
Xue Bin
Publication venue: 'Oxford University Press (OUP)'
Publication date: 29/11/2012
Field of study

We present the Database of Disordered Protein Prediction (D(2)P(2)), available at http://d2p2.pro (including website source code). A battery of disorder predictors and their variants, VL-XT, VSL2b, PrDOS, PV2, Espritz and IUPred, were run on all protein sequences from 1765 complete proteomes (to be updated as more genomes are completed). Integrated with these results are all of the predicted (mostly structured) SCOP domains using the SUPERFAMILY predictor. These disorder/structure annotations together enable comparison of the disorder predictors with each other and examination of the overlap between disordered predictions and SCOP domains on a large scale. D(2)P(2) will increase our understanding of the interplay between disorder and structure, the genomic distribution of disorder, and its evolutionary history. The parsed data are made available in a unified format for download as flat files or SQL tables either by genome, by predictor, or for the complete set. An interactive website provides a graphical view of each protein annotated with the SCOP domains and disordered regions from all predictors overlaid (or shown as a consensus). There are statistics and tools for browsing and comparing genomes and their disorder within the context of their position on the tree of life

USFSP Digital Archive

Crossref

PubMed Central

Scholar Commons - University of South Florida

Explore Bristol Research

Integron-associated mobile gene cassettes code for folded proteins: the structure of Bal32a, a new member of the adaptable α+β barrel family

Author: Curmi Paul
Dixon Nicholas
Dosztanyi Zsuzsanna
Gillings Michael
Harrop Stephen
Holmes Andrew
Mabbutt Bridget
Nevalainen K M Helena
Otting Gottfried
Robinson Andrew
Schaeffer Patrick
Stokes Harold W
Wu Peter
Publication venue: 'Elsevier BV'
Publication date: 11/12/2015
Field of study

The wide-ranging physiology and large genetic variability observed for prokaryotes is largely attributed, not to the prokaryotic genome itself, but rather to mechanisms of lateral gene transfer. Cassette PCR has been used to sample the integron/gene cassette metagenome from different natural environments without laboratory cultivation of the host organism, and without prior knowledge of any target protein sequence. Since over 90% of cassette genes are unrelated to any sequence in the current databases, it is not clear whether these genes code for folded functional proteins. We have selected a sample of eight cassette-encoded genes with no known homologs; five have been isolated as soluble protein products and shown by biophysical techniques to be folded. In solution, at least three of these proteins organise as stable oligomeric assemblies. The tertiary structure of one of these, Bal32a derived from a contaminated soil site, has been solved by X-ray crystallography to 1.8 Å resolution. From the three-dimensional structure, Bal32a is found to be a member of the highly adaptable α+β barrel family of transport proteins and enzymes. In Bal32a, the barrel cavity is unusually deep and inaccessible to solvent. Polar side-chains in its interior are reminiscent of catalytic sites of limonene-1,2-epoxide hydrolase and nogalonic acid methyl ester cyclase. These studies demonstrate the viability of direct sampling of mobile DNA as a route for the discovery of novel proteins

The Australian National University

Disentangling the complexity of low complexity proteins

Author: Ahmed
Alba
Aleksandra Gruca
Andrey V Kajava
Annika Urbanek
Bachmann
Baias
Bennett
Borbála Hajdu-Soltész
Bosshard
Brangwynne
Chavali
Christos A Ouzounis
Coletta
Communie
Dariusz Plewczynski
Darling
Darling
Das
Dobson
Dosztanyi
Dosztanyi
Dosztanyi
Dudola
Dunker
Dzuricky
Eapen
Eapen
Eftekharzadeh
Fan
Finn
Gaspari
Gaspari
Gavira
Guo
Hao
Hao
Harbi
Harrison
Harrison
Huntley
Iakoucheva
Jadlowiec
Jadlowiec
John M Hancock
Jorda
Jorda
Kajava
Kato
Kim
Kirmitzoglou
Knight
Kreil
Kuznetsov
Labaj
Li
Lin
Lin
Lisanna Paladin
Liu
Luo
Lupas
Lupas
Marcin Grynberg
Martin
Martinez
María Velasco
Masino
McDonnell
Meszaros
Mier
Mier
Miguel A Andrade-Navarro
Mittal
Na
Obradovic
Pablo Mier
Palidwor
Pau Bernadó
Peng
Piovesan
Piovesan
Promponas
Quiroz
Rado-Trilla
Rambaran
Regad
Romero
Shin
Silvio C E Tosatto
Simm
Simon
Smith
Smithers
Sophia Petrosian
Spink
Stella Tamana
Suveges
Suzuki
Szappanos
Tautz
Totzeck
Urbanek
Uversky
Uversky
Vasilis J Promponas
Walsh
Wolny
Wootton
Wright
Zhemkov
Zoltán Gáspári
Zsuzsanna Dosztanyi
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019
Field of study

There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, and more generally the overlaps between different properties related to LCRs, using examples. We argue that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements. While the methodologies available to study LCRs are already very advanced, we foresee that a more comprehensive annotation of sequences in the databases will enable the improvement of predictions and a better understanding of the evolution and the connection between structure and function of LCRs. This will require the use of standards for the generation and exchange of data describing all aspects of LCRs

Archivio istituzionale della ricerca - Università di Padova