
345 research outputs found

Intrinsically disordered domains deviate significantly from random sequences in mammalian proteins

Author: A Patil
AK Dunker
Ashwini Patil
C Haynes
CJ Brown
Daron M Standley
HJ Dyson
J Liu
JJ Ward
LM Iakoucheva
M Korb
M Simon
Shunsuke Teraguchi
W Li
Y Minezaki
Publication venue: BioMed Central

Crossref

PubMed Central

Influence of Sequence Changes and Environment on Intrinsically Disordered Proteins

Author: AK Dunker
Amrita Mohan
B Ma
B Rost
C Sander
CJ Tsai
F Ferron
G Rhodes
GW Daughdrill
HJ Dyson
J Zurdo
JJ Ward
K Gunasekaran
K Peng
KA Dill
L Bordoli
M Fuxreiter
OF Lange
P Radivojac
P Romero
P Tompa
PE Wright
Predrag Radivojac
Ruth Nussinov
SK Palaninathan
VJ Hilser
Vladimir N. Uversky
VN Uversky
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2009

Many large-scale studies on intrinsically disordered proteins are implicitly based on the structural models deposited in the Protein Data Bank. Yet, the static nature of deposited models supplies little insight into variation of protein structure and function under diverse cellular and environmental conditions. While the computational predictability of disordered regions provides practical evidence that disorder is an intrinsic property of proteins, the robustness of disordered regions to changes in sequence or environmental conditions has not been systematically studied. We analyzed intrinsically disordered regions in the same or similar proteins crystallized independently and studied their sensitivity to changes in protein sequence and parameters of crystallographic experiments. The observed changes in the existence, position, and length of disordered regions indicate that their appearance in X-ray structures dramatically depends on changes in amino acid sequence and peculiarities of the crystallographic experiment. Our study also raises general questions regarding protein evolution and the regulation of protein structure, dynamics, and function via variations in cellular and environmental conditions.

CiteSeerX

Public Library of Science (PLOS)

Crossref

USFSP Digital Archive

IUPUIScholarWorks

Directory of Open Access Journals

PubMed Central

Scholar Commons - University of South Florida

Predicting mostly disordered proteins by using structure-unknown protein data

Author: AK Dunker
AL Fink
CJ Oldfield
DT Jones
E Garner
EA Weathers
HJ Dyson
J Prilusky
JJ Ward
JW Chen
Kana Shimizu
Kentaro Tomii
LM Iakoucheva
MJ Zvelebil
NS Bogatyreva
P Romero
P Tompa
PE Wright
R Apweiler
R Linding
S Vucetic
Shuichi Hirose
SO Garbuzynskiy
T Joachims
Tamotsu Noguchi
V Receveur-Brechot
VN Uversky
X Li
Y Minezaki
Yoichi Muraoka
Z Dosztanyi
Z Obradovic
ZR Yang
Publication venue: BioMed Central
Publication date: 01/03/2007

BACKGROUND: Predicting intrinsically disordered proteins is important in structural biology because they are thought to carry out various cellular functions even though they have no stable three-dimensional structure. We know the structures of far more ordered proteins than disordered proteins. The structural distribution of proteins in nature can therefore be inferred to differ from that of proteins whose structures have been determined experimentally. We know many more protein sequences than we do protein structures, and many of the known sequences can be expected to be those of disordered proteins. Thus it would be efficient to use the information of structure-unknown proteins in order to avoid training data sparseness. We propose a novel method for predicting which proteins are mostly disordered by using a spectral graph transducer (SGT) and training with a huge amount of structure-unknown sequences as well as structure-known sequences. RESULTS: When the proposed method was evaluated on data that included 82 disordered proteins and 526 ordered proteins, its sensitivity was 0.723 and its specificity was 0.977. It resulted in a Matthews correlation coefficient 0.202 points higher than that obtained using FoldIndex, 0.221 points higher than that obtained using the method based on plotting hydrophobicity against the number of contacts and 0.07 points higher than that obtained using support vector machines (SVMs). To examine robustness against training data sparseness, we investigated the correlation between two results obtained when the method was trained on different datasets and tested on the same dataset. The correlation coefficient for the proposed method is 0.14 higher than that for the method using SVMs.
When the proposed SGT-based method was compared with four per-residue predictors (VL3, GlobPlot, DISOPRED2 and IUPred (long)), its sensitivity was 0.834 for disordered proteins, which is 0.052–0.523 higher than that of the per-residue predictors, and its specificity was 0.991 for ordered proteins, which is 0.036–0.153 higher than that of the per-residue predictors. The proposed method was also evaluated on data that included 417 partially disordered proteins. It predicted the frequency of disordered proteins to be 1.95% for the proteins with 5%–10% disordered sequences, 1.46% for the proteins with 10%–20% disordered sequences and 16.57% for proteins with 20%–40% disordered sequences. CONCLUSION: The proposed method, which utilizes the information of structure-unknown data, predicts disordered proteins more accurately than other methods and is less affected by training data sparseness.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The N-terminal intrinsically disordered domain of Mgm101p is localized to the mitochondrial nucleoid

Author: A Moya
A Schlessinger
AEA Hobbs
AK Dunker
AM Waterhouse
B He
BA Kaufman
C Galea
CJ Brown
D Tillett
David C. Hayward
DF Bogenhagen
E Garner
GD Clark-Walker
George Desmond Clark-Walker
GW Daughdrill
H Dyson
H Hegyi
HJ Dyson
I Miyakawa
JD Nardozzi
JJ Ward
K Itoh
K Okamoto
K Peng
L Dente
M Ito
M Kucej
M Mbantenkhu
MA Larkin
MG Claros
MW Gray
P Tompa
PE Wright
RD Gietz
RW Gilkerson
S Meeusen
Vladimir N. Uversky
X Chen
X Zuo
XJ Chen
Y Elbaz
Z Dosztányi
Zsuzsanna Dosztányi
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2013

The mitochondrial genome maintenance gene, MGM101, is essential for yeasts that depend on mitochondrial DNA replication. Previously, in Saccharomyces cerevisiae, it has been found that the carboxy-terminal two-thirds of Mgm101p has a functional core. Furthermore, there is a high level of amino acid sequence conservation in this region from widely diverse species. By contrast, the amino-terminal region, which is also essential for function, does not have recognizable conservation. Using a bioinformatic approach we find that the functional core from yeast and a corresponding region of Mgm101p from the coral Acropora millepora have an ordered structure, while the N-terminal domains of sequences from yeast and coral are predicted to be disordered. To examine whether ordered and disordered domains of Mgm101p have specific or general functions we made chimeric proteins from yeast and coral by swapping the two regions. We find, by an in vivo assay in S. cerevisiae, that the ordered domain of A. millepora can functionally replace the yeast core region but the disordered domain of the coral protein cannot substitute for its yeast counterpart. Mgm101p is found in the mitochondrial nucleoid along with enzymes and proteins involved in mtDNA replication. By attaching green fluorescent protein to the N-terminal disordered domain of yeast Mgm101p we find that GFP is still directed to the mitochondrial nucleoid where full-length Mgm101p-GFP is targeted.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Australian National University

Repository of the Academy's Library

The Francis Crick Institute

Differences in the Number of Intrinsically Disordered Regions between Yeast Duplicated Proteins, and Their Relationship with Functional Divergence

Author: A Baudot
A Ceol
A Force
A Presser
A Schlessinger
A Wagner
AK Dunker
AL Hughes
Arthur J. Lustig
B Conrad
B Dujon
B He
C Haynes
CJ Brown
CJ Oldfield
D Ekman
Denis C. Shields
DR Scannell
EE Schmidt
FJ Chain
Floriane Montanari
GK Goh
H Hegyi
H Jiang
I Tirosh
JJ Ward
JL Gordon
K Shimizu
KH Wolfe
L Jin
LM Iakoucheva
M Fuxreiter
M Kellis
M Lynch
Nora Khaldi
P Han
P Romero
P Tompa
R Sorek
RJ Edwards
RP Sugino
S Maere
S Raychaudhuri
SH Kim
T Blomme
T Makino
T Vavouri
U Midic
V Neduva
VN Uversky
X He
Y Cheng
Y Van de Peer
Z Dosztanyi
Publication venue: Public Library of Science
Publication date: 15/09/2011

BACKGROUND: Intrinsically disordered regions are enriched in short interaction motifs that play a critical role in many protein-protein interactions. Since new short interaction motifs may easily evolve, they have the potential to rapidly change protein interactions and cellular signaling. In this work we examined the dynamics of gain and loss of intrinsically disordered regions in duplicated proteins to determine whether changes after genome duplication can create functional divergence. For this purpose we used Saccharomyces cerevisiae and the outgroup species Lachancea kluyveri. PRINCIPAL FINDINGS: We find that genes duplicated as part of a genome duplication (ohnologs) are significantly more intrinsically disordered than singletons (p < 2.2e-16, Wilcoxon), reflecting a preference for retaining intrinsically disordered proteins in duplicate. In addition, there have been marked changes in the extent of intrinsic disorder following duplication. A large number of duplicated genes have more intrinsic disorder than their L. kluyveri ortholog (29% for duplicates versus 25% for singletons) and an even greater number have less intrinsic disorder than the L. kluyveri ortholog (37% for duplicates versus 25% for singletons). Finally, we show that the number of physical interactions is significantly greater in the more intrinsically disordered ohnolog of a pair (p = 0.003, Wilcoxon). CONCLUSION: This work shows that intrinsic disorder gain and loss in a protein is a mechanism by which a genome can also diverge and innovate. The higher number of interactors for proteins that have gained intrinsic disorder compared with their duplicates may reflect the acquisition of new interaction partners or new functional roles.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Native aggregation as a cause of origin of temporary cellular structures needed for all forms of cellular activity, signaling and transformations

Author: A Richardson
AB Fulton
AE Mirsky
AK Dunker
AK Lala
AS Troshin
AV Finkelshtein
CB Anfinsen
D Eliezer
D Inners
DD Kemp
DH Williams
DN Nasonov
EB Wilson
GN Ling
H Hegyi
J Bearden Jr
J Heuser
JM Baló-Banga
JM Collins
JM Zheng
KR Porter
LM Iakoucheva
M Chaplin
M Haslbeck
MA Spackman
NA Kasim
P Tompa
PE Wright
RJ Ellis
S Mukhopadhyay
SH Kim
SY Proskuryakov
T Weikl
U Rawat
Vladimir V Matveev
VN Uversky
VV Matveev
Publication venue: BioMed Central
Publication date: 01/01/2010

According to the hypothesis explored in this paper, native aggregation is genetically controlled (programmed) reversible aggregation that occurs when interacting proteins form new temporary structures through highly specific interactions. It is assumed that Anfinsen's dogma may be extended to protein aggregation: composition and amino acid sequence determine not only the secondary and tertiary structure of a single protein, but also the structure of protein aggregates (associates). Cell function is considered as a transition between two states (a two-state model), the resting state and the state of activity (this applies to the cell as a whole and to its individual structures). In the resting state, the key proteins are found in the following inactive forms: natively unfolded and globular. When the cell is activated, secondary structures appear in natively unfolded proteins (including unfolded regions in other proteins), and globular proteins begin to melt and their secondary structures become available for interaction with the secondary structures of other proteins. These temporary secondary structures provide a means for highly specific interactions between proteins. As a result, native aggregation creates temporary structures necessary for cell activity.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The dawn of a new era in cell signalling research

Author: AH Bild
AK Dunker
BJ Mayer
C Jørgensen
CE Holt
JD Scott
JT Chang
M Emaduddin
M Fuxreiter
S de Chadarevian
Stephan M Feller
T Aleksic
TJ Gibson
U Midic
V Csizmok
Publication venue: BioMed Central
Publication date: 01/01/2010

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

Free Cysteine Modulates the Conformation of Human C/EBP Homologous Protein

Author: A Banjac
A Bruhat
A Campen
A Gow
A Kumar
AK Dunker
AL Fink
AP Demchenko
B He
B Xue
BA Johnson
C Bracken
C Hwang
C Shao
CB Chiribau
CJ Bates
CJ Oldfield
CL Anderson
D Ron
DL Eizirik
DP Jones
EA Permyakov
EV Maytin
F Delaglio
F Ferron
FG van der Goot
FQ Schafer
G Bohm
G Bruylants
GE Schulz
GP Meares
GW Daughdrill
H Tsukano
H Yamaguchi
H Yoshida
H Zinszner
HJ Dyson
I Tabas
IM Kuznetsova
J Alter
J Gsponer
J Liu
J Prilusky
JD Malhotra
JM Bourhis
K Gunasekaran
K Namba
K Peng
KD McCullough
Kim Munro
KK Turoverov
LM Iakoucheva
M Aridor
M Matsumoto
M Schroder
M Ubeda
MM Babu
MM Mathews-Roth
Mona N. Rahman
MY Yeh
N Ohoka
NA Bushmarina
OA Obeid
P Cunnea
P Radivojac
P Romero
P Tompa
Peter Csermely
R Gogia
RJ Kaufman
S Chakravarthi
S Chigurupati
S D'Amico
S Oyadomari
SE Moriarty
SH Park
SS Iyer
Steven P. Smith
Vinay K. Singh
VK Singh
Vladimir N. Uversky
VN Uversky
W Zou
X Li
XZ Wang
Y Miyazaki
YS Wei
Z Dosztanyi
Zongchao Jia
Publication venue: Public Library of Science
Publication date: 01/01/2012

The C/EBP Homologous Protein (CHOP) is a nuclear protein that is integral to the unfolded protein response culminating from endoplasmic reticulum stress. Previously, CHOP was shown to comprise extensive disordered regions and to self-associate in solution. In the current study, the intrinsically disordered nature of this protein was characterized further by comprehensive in silico analyses. Using circular dichroism, differential scanning calorimetry and nuclear magnetic resonance, we investigated the global conformation and secondary structure of CHOP and demonstrated, for the first time, that conformational changes in this protein can be induced by the free amino acid L-cysteine. Addition of L-cysteine caused a significant dose-dependent decrease in the protein helicity (dropping from 69.1% to 23.8% in the presence of 1 mM of L-cysteine) and a sequential transition to a more disordered state, unlike that caused by thermal denaturation. Furthermore, the presence of small amounts of free amino acid (80 µM, an 8:1 cysteine:CHOP ratio) during CHOP thermal denaturation altered the molecular mechanism of its melting process, leading to a complex, multi-step transition. On the other hand, high levels (4 mM) of free L-cysteine seemed to cause a complete loss of rigid cooperatively melting structure. These results suggested a potential regulatory function of L-cysteine which may lead to changes in global conformation of CHOP in response to the cellular redox state and/or endoplasmic reticulum stress.

CiteSeerX

Public Library of Science (PLOS)

Crossref

USFSP Digital Archive

Directory of Open Access Journals

PubMed Central

Scholar Commons - University of South Florida

Bayesian statistical modelling of human protein interaction network incorporating protein disorder information

Author: A Dunker
A Patil
A Potapov
A Thomas
AK Dunker
AL Barabasi
Alla Bulashevska
BH Junker
C Haynes
CJ Oldfield
D Ekman
D Strauss
DR Hunter
G Robins
G Singh
GP Singh
H Dyson
H Fröhlich
H Jeong
H Xie
H Yu
I Ispolatov
J Berg
J Nacher
K Shimizu
L Radivojac
LM Iakoucheva
M Girvan
M van Duijn
MS Handcock
O Frank
P Kim
P Romero
P Wright
PS Gill
PW Holland
R Albert
Roland Eils
S Hirose
S Vucetic
S Wasserman
S Wuchty
SE Fienberg
SS Shen-Orr
Svetlana Bulashevska
TAB Snijders
U Stelzl
V Uversky
X Zhu
Y Cheng
Z Dosztanyi
Z Saul
Publication venue: BioMed Central
Publication date: 01/01/2010

BACKGROUND: We present a statistical method for the analysis of biological networks based on the exponential random graph model, namely the p2-model, as opposed to previous descriptive approaches. The model is able to capture generic and structural properties of a network as emergent from local interdependencies and uses a limited number of parameters. Here, we consider one global parameter capturing the density of edges in the network, and local parameters representing each node's contribution to the formation of edges in the network. The modelling suggests a novel definition of important nodes in the network, namely social nodes, as revealed by the local sociality parameters of the model. Moreover, the sociality parameters help to reveal organizational principles of the network. An inherent advantage of our approach is the possibility of hypothesis testing: a priori knowledge about biological properties of the nodes can be incorporated into the statistical model to investigate its influence on the structure of the network. RESULTS: We applied the statistical modelling to the human protein interaction network obtained with Y2H experiments. A Bayesian approach was employed for the estimation of the parameters. We deduced social proteins, essential for the formation of the network, while incorporating into the model information on protein disorder. Intrinsically disordered proteins are those that lack a well-defined three-dimensional structure under physiological conditions. We predicted the fold group (ordered or disordered) of proteins in the network from their primary sequences. The network analysis indicated that protein disorder has a positive effect on the connectivity of proteins in the network, but does not fully explain the interactivity. CONCLUSIONS: The approach opens a perspective to study effects of biological properties of individual entities on the structure of biological networks.
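
To make the roles of the global density parameter and the per-node sociality parameters concrete, the following Python sketch shows one common way such parameters can enter an edge probability, through a logistic link. This is an illustrative assumption and not the paper's exact p2-model specification; the names `edge_probability`, `node_sociality` and `disorder_effect` are hypothetical.

```python
import math


def node_sociality(base, disorder_effect, is_disordered):
    """Hypothetical node term: a baseline plus a covariate effect for disorder."""
    return base + (disorder_effect if is_disordered else 0.0)


def edge_probability(density, sociality_i, sociality_j):
    """Probability of an edge between nodes i and j under a logistic link.

    Assumed form: logit P(edge) = density + sociality_i + sociality_j.
    """
    logit = density + sociality_i + sociality_j
    return 1.0 / (1.0 + math.exp(-logit))


# Illustrative values only: a sparse network (negative density) in which
# a positive disorder effect raises a node's propensity to form edges.
s_ordered = node_sociality(base=0.0, disorder_effect=0.8, is_disordered=False)
s_disordered = node_sociality(base=0.0, disorder_effect=0.8, is_disordered=True)
p_ordered_pair = edge_probability(-3.0, s_ordered, s_ordered)
p_mixed_pair = edge_probability(-3.0, s_ordered, s_disordered)
```

In a sketch of this kind, testing whether disorder matters amounts to asking whether the fitted `disorder_effect` differs credibly from zero, which mirrors the hypothesis-testing use the abstract describes.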

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Modeling Disordered Regions in Proteins Using Rosetta

Author: A Leaver-Fay
A Zemla
AK Dunker
CA Rohl
CJ Oldfield
David Baker
E Alm
HJ Dyson
JJ Ward
Kristina Krassovsky
Michael Tyka
MV Berjanskii
Ray Yu-Ruei Wang
Vladimir N. Uversky
William Sheffler
Y Shen
Yan Han
Publication venue: Public Library of Science
Publication date: 29/07/2011

Protein structure prediction methods such as Rosetta search for the lowest energy conformation of the polypeptide chain. However, the experimentally observed native state is at a minimum of the free energy, rather than the energy. The neglect of the missing configurational entropy contribution to the free energy can be partially justified by the assumption that the entropies of alternative folded states, while very much less than unfolded states, are not too different from one another, and hence can be to a first approximation neglected when searching for the lowest free energy state. The shortcomings of current structure prediction methods may be due in part to the breakdown of this assumption. Particularly problematic are proteins with significant disordered regions which do not populate single low energy conformations even in the native state. We describe two approaches within the Rosetta structure modeling methodology for treating such regions. The first does not require advance knowledge of the regions likely to be disordered; instead these are identified by minimizing a simple free energy function used previously to model protein folding landscapes and transition states. In this model, residues can be either completely ordered or completely disordered; they are considered disordered if the gain in entropy outweighs the loss of favorable energetic interactions with the rest of the protein chain. The second approach requires identification in advance of the disordered regions either from sequence alone using for example the DISOPRED server or from experimental data such as NMR chemical shifts. During Rosetta structure prediction calculations the disordered regions make only unfavorable repulsive contributions to the total energy. We find that the second approach has greater practical utility and illustrate this with examples from de novo structure prediction, NMR structure calculation, and comparative modeling.
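
The two-state rule described for the first approach (a residue is disordered when its entropy gain outweighs the favorable interactions it gives up) can be illustrated with a toy Python sketch. This is a simple greedy heuristic written for illustration, not Rosetta's free energy function or its minimization procedure; `assign_disorder`, `contact_energy` and `entropy_gain` are hypothetical names, and all values are in arbitrary energy units.

```python
def assign_disorder(contact_energy, entropy_gain, max_iter=100):
    """Greedily label residues as ordered ("O") or disordered ("D").

    contact_energy[i][j]: pairwise energy between residues i and j
                          (negative = favorable), assumed symmetric.
    entropy_gain[i]:      entropic benefit (T * dS) of disordering residue i.
    """
    n = len(entropy_gain)
    ordered = [True] * n
    for _ in range(max_iter):
        changed = False
        for i in range(n):
            if not ordered[i]:
                continue
            # Favorable energy lost if residue i stops interacting with
            # the residues that are still ordered.
            lost = -sum(contact_energy[i][j]
                        for j in range(n) if j != i and ordered[j])
            if entropy_gain[i] > lost:
                ordered[i] = False
                changed = True
        if not changed:
            break
    return ["O" if state else "D" for state in ordered]


# Toy example: residues 0 and 1 interact strongly, residue 2 only weakly.
energies = [
    [0.0, -3.0, -0.2],
    [-3.0, 0.0, -0.2],
    [-0.2, -0.2, 0.0],
]
labels = assign_disorder(energies, entropy_gain=[1.0, 1.0, 1.0])
```

Because disordering one residue removes its contacts from its neighbors' totals, the loop repeats until no further residue flips; a proper free energy minimization would also consider re-ordering residues, which this sketch deliberately omits.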

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central