Search CORE

70 research outputs found

NSort/DB: an intra-nuclear compartment protein database

Author: Boden Mikael
Mohamad Nurul
Willadsen Kai
Publication venue: 'Elsevier BV'
Publication date: 01/08/2012
Field of study

Distinct substructures within the nucleus are associated with a wide variety of important nuclear processes. Structures such as chromatin and nuclear pores have specific roles, while others such as Cajal bodies are more functionally varied. Understanding the roles of these membraneless intra-nuclear compartments requires extensive data sets covering nuclear and compartment-associated proteins. NSort/DB is a database providing access to intra- or sub-nuclear compartment associations for the mouse nuclear proteome. Based on resources ranging from large-scale curated data sets to detailed experiments, this data set provides a high-quality set of annotations of non-exclusive association of nuclear proteins with structures such as promyelocytic leukaemia bodies and chromatin. The database is searchable by protein identifier or compartment, and has a documented web service API. The search interface, web service and data download are all freely available online at http://www.nsort.org/db/. Availability of this data set will enable systematic analyses of the protein complements of nuclear compartments, improving our understanding of the diverse functional repertoire of these structures

Elsevier - Publisher Connector

University of Queensland eSpace

Assessing protein similarity with Gene Ontology and its use in subnuclear localization prediction

Author: AK Bjorklund
B Rost
BW Matthews
D Sarda
G Dellaire
H Wu
J Wang
JL Gardy
JL Gardy
K Itoh
K Nakai
K Tu
KC Chou
L Cocco
M Bhasin
MA Harris
P Zhang
PW Lord
PW Lord
R Gentleman
R Nair
R Nair
V Brendel
X Lu
X Wu
Yang Dai
YD Cai
Z Lei
Zhengdeng Lei
ZP Feng
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: The accomplishment of the various genome sequencing projects resulted in accumulation of massive amount of gene sequence information. This calls for a large-scale computational method for predicting protein localization from sequence. The protein localization can provide valuable information about its molecular function, as well as the biological pathway in which it participates. The prediction of localization of a protein at subnuclear level is a challenging task. In our previous work we proposed an SVM-based system using protein sequence information for this prediction task. In this work, we assess protein similarity with Gene Ontology (GO) and then improve the performance of the system by adding a module of nearest neighbor classifier using a similarity measure derived from the GO annotation terms for protein sequences. RESULTS: The performance of the new system proposed here was compared with our previous system using a set of proteins resided within 6 localizations collected from the Nuclear Protein Database (NPD). The overall MCC (accuracy) is elevated from 0.284 (50.0%) to 0.519 (66.5%) for single-localization proteins in leave-one-out cross-validation; and from 0.420 (65.2%) to 0.541 (65.2%) for an independent set of multi-localization proteins. The new system is available at . CONCLUSION: The prediction of protein subnuclear localizations can be largely influenced by various definitions of similarity for a pair of proteins based on different similarity measures of GO terms. Using the sum of similarity scores over the matched GO term pairs for two proteins as the similarity definition produced the best predictive outcome. Substantial improvement in predicting protein subnuclear localizations has been achieved by combining Gene Ontology with sequence information

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Towards defining the nuclear proteome

Author: Fink J Lynn
Gardiner Donald M
Hamilton Nicholas
Hayashizaki Yosihide
Kai Chikatoshi
Karunaratne Seetha
Mahony Donna
Mittal Amit
Suzuki Harukazu
Teasdale Rohan D
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Direct evidence is reported for 2,568 mammalian proteins within the nuclear proteome, consisting of at least 14% of the entire proteome

Crossref

Springer - Publisher Connector

PubMed Central

University of Queensland eSpace

The proteins of intra-nuclear bodies: a data-driven analysis of sequence, interaction and expression

Author: Bodén Mikael
Mohamad Nurul
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Cajal bodies, nucleoli, PML nuclear bodies, and nuclear speckles are morpohologically distinct intra-nuclear structures that dynamically respond to cellular cues. Such nuclear bodies are hypothesized to play important regulatory roles, e.g. by sequestering and releasing transcription factors in a timely manner. While the nucleolus and nuclear speckles have received more attention experimentally, the PML nuclear body and the Cajal body are still incompletely characterized in terms of their roles and protein complement. Results By collating recent experimentally verified data, we find that almost 1000 proteins in the mouse nuclear proteome are known to associate with one or more of the nuclear bodies. Their gene ontology terms highlight their regulatory roles: splicing is confirmed to be a core activity of speckles and PML nuclear bodies house a range of proteins involved in DNA repair. We train support-vector machines to show that nuclear proteins contain discriminative sequence features that can be used to identify their intra-nuclear body associations. Prediction accuracy is highest for nucleoli and nuclear speckles. The trained models are also used to estimate the full protein complement of each nuclear body. Protein interactions are found primarily to link proteins in the nuclear speckles with proteins from other compartments. Cell cycle expression data provide support for increased activity in nucleoli, nuclear speckles and PML nuclear bodies especially during S and G2 phases. Conclusions The large-scale analysis of the mouse nuclear proteome sheds light on the <it>functional </it>organization of <it>physically </it>embodied intra-nuclear compartments. We observe partial support for the hypothesis that the physical organization of the nucleus mirrors functional modularity. However, we are unable to unambiguously identify proteins' intra-nuclear destination, suggesting that critical drivers behind of intra-nuclear translocation are yet to be identified.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UM Digital Repository

An SVM-based system for predicting protein subnuclear localizations

Author: Dai Yang
Lei Zhengdeng
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: The large gap between the number of protein sequences in databases and the number of functionally characterized proteins calls for the development of a fast computational tool for the prediction of subnuclear and subcellular localizations generally applicable to protein sequences. The information on localization may reveal the molecular function of novel proteins, in addition to providing insight on the biological pathways in which they function. The bulk of past work has been focused on protein subcellular localizations. Furthermore, no specific tool has been dedicated to prediction at the subnuclear level, despite its high importance. In order to design a suitable predictive system, the extraction of subtle sequence signals that can discriminate among proteins with different subnuclear localizations is the key. RESULTS: New kernel functions used in a support vector machine (SVM) learning model are introduced for the measurement of sequence similarity. The k-peptide vectors are first mapped by a matrix of high-scored pairs of k-peptides which are measured by BLOSUM62 scores. The kernels, measuring the similarity for sequences, are then defined on the mapped vectors. By combining these new encoding methods, a multi-class classification system for the prediction of protein subnuclear localizations is established for the first time. The performance of the system is evaluated with a set of proteins collected in the Nuclear Protein Database (NPD). The overall accuracy of prediction for 6 localizations is about 50% (vs. random prediction 16.7%) for single localization proteins in the leave-one-out cross-validation; and 65% for an independent set of multi-localization proteins. This integrated system can be accessed at . CONCLUSION: The integrated system benefits from the combination of predictions from several SVMs based on selected encoding methods. Finally, the predictive power of the system is expected to improve as more proteins with known subnuclear localizations become available

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A manually curated network of the PML nuclear body interactome reveals an important role for PML-NBs in SUMOylation dynamics

Author: Dang Thanh Hai
Laukens Kris
Van Damme Ellen
Van Ostade Xaveer
Publication venue: Ivyspring International Publisher
Publication date: 01/01/2010
Field of study

Promyelocytic Leukaemia Protein nuclear bodies (PML-NBs) are dynamic nuclear protein aggregates. To gain insight in PML-NB function, reductionist and high throughput techniques have been employed to identify PML-NB proteins. Here we present a manually curated network of the PML-NB interactome based on extensive literature review including database information. By compiling 'the PML-ome', we highlighted the presence of interactors in the Small Ubiquitin Like Modifier (SUMO) conjugation pathway. Additionally, we show an enrichment of SUMOylatable proteins in the PML-NBs through an in-house prediction algorithm. Therefore, based on the PML network, we hypothesize that PML-NBs may function as a nuclear SUMOylation hotspot

Crossref

PubMed Central

Institutional Repository Universiteit Antwerpen

In vitro nuclear interactome of the HIV-1 Tat protein

Author: A Clery
A Gatignol
A Kuzmichev
A Kuzmichev
A Marchler-Bauer
A Mazumdar
A Munnia
A Pumfery
A You
AB Makar
AJ Ruthenburg
AM Hidalgo-Estevez
AM Pyle
AS Neish
AY Nikolaev
B Kalverda
BB Singh
BJ Breitkreutz
BJ Wilson
BK Das
BM Lunde
BP Ashburner
C Alfarano
C Delphin
C Franz
C Hetzer
C Morris-Desbois
C Musahl
C Schwerk
C Treand
C Underhill
CA Hassig
CA Hassig
CA Johnson
CA Johnson
CA Parada
CD Laherty
CG Lee
CH Herrmann
CK Dreger
CL Jiang
CL Will
CM Grozinger
D Dorner
D Gorlich
D Li
D Maiorano
D Reinberg
D Schulte
D Yasui
DA Bochar
DC Bharucha
DD Fischer
DG Quintana
DR Schmidt
E Agbottah
E Cavellan
E Craig
E Nagoshi
E Nicolas
E Nicolas
EK Sullivan
EP Kransdorf
ER Griffis
F Kashanchi
F Macian
F Peruzzi
FR Bischoff
G Dellaire
G He
G Jiang
G Lattanzi
GL Mayeur
H Cho
H Fujita
H Kato
H Kitagawa
H Sakai
H Takeuchi
H Xiao
H Zhang
H Zhang
HH Ng
HL Kiefer
HT Adler
I Letunic
I Olave
J Becker
J Boeke
J Brady
J Harborth
J Joseph
J Kamine
J Koipally
J Koipally
J Koipally
J Marango
J Moroianu
J Park
J Taplick
J van der Vlag
J Zhou
JA Schmiesing
JA Schmiesing
JF Rual
JH Lee
JH Liu
JJ Coull
JK Tong
JM Bridger
JR Dobosy
JW Critchfield
K Furukawa
K Imai
K Scheffzek
K Yoder
K Zhao
K Zhou
KC Moraes
KI Tatematsu
KL Block
KM Lounsbury
KS McKeegan
L Corsini
L Deng
L Finch
L Mohrmann
L Tickenbrock
L Yang
LA Boyer
LF Garcia-Martinez
Lili Gu
LR Racki
M Ashburner
M Barboric
M Benkirane
M Bienz
M Brackertz
M Brackertz
M Brand
M Bukrinsky
M Chibi
M Fornerod
M Fuchs
M Fujita
M Kneissl
M Kohler
M Meurer
M Ohnishi
M Ott
M Saito
M Sorin
M Stros
M Tyagi
M Yanagida
MA Bhat
MA Hakimi
MA Hakimi
MA Hakimi
MJ Bottomley
ML Phelan
MR Rountree
MV Natsiuk
MY Chou
N Bonifaci
N Epie
N Fujita
N Methot
N Wagner
N Yabuta
Niaobh O'Donoghue
NJ Watkins
Noreen Sheehy
NR Yaseen
NR Yaseen
O Rohr
O Rozenblatt-Rosen
P Cramer
P Gacesa
PA Tucker
PG Young
PM Dehe
PM Dehe
Q Ye
R Berro
R Gamsjaeger
R Mahajan
R Murr
R Truant
R Van Duyne
R Van Duyne
RA Sclafani
RA Silverstein
RC Hillig
RD Adzerikho
RD Finn
RE Kiernan
RH Kehlenbach
RM Ewing
S Bannwarth
S Debernardi
S Mujtaba
S Mujtaba
S Nakielny
S Nekhai
S Nekhai
S Pagans
S Peri
S Shaklai
S Sif
S Takezawa
S Vashee
SA Ansari
SA Denslow
SA Williams
SC Tsai
SK Dhar
SL Forsburg
SM Nicol
Stephen Pennington
T Ammosova
T Hirano
T Mahmoudi
T Otsuki
T Sasaki
T Tasara
T Yoshida
TC Fleischer
TE Harris
TP Cujec
TP Cujec
TW Reichman
U Kutay
U Stelzl
V Bres
VB Cismasiu
VF Zhupan
Virginie W Gautier
VW Gautier
W Antonin
W Fischle
W Fischle
W Fu
W Wang
WE Muller
WE Muller
William W Hall
WM Yang
X Yang
Y Ariumi
Y Bennasser
Y Doyon
Y Ishimi
Y Ishimi
Y Shi
Y Xue
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhang
Y Zhu
YJ Hsieh
YL Yao
YL Yao
YP Li
YP Li
Z Nie
Z You
Z You
ZB Xia
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background One facet of the complexity underlying the biology of HIV-1 resides not only in its limited number of viral proteins, but in the extensive repertoire of cellular proteins they interact with and their higher-order assembly. HIV-1 encodes the regulatory protein Tat (86–101aa), which is essential for HIV-1 replication and primarily orchestrates HIV-1 provirus transcriptional regulation. Previous studies have demonstrated that Tat function is highly dependent on specific interactions with a range of cellular proteins. However they can only partially account for the intricate molecular mechanisms underlying the dynamics of proviral gene expression. To obtain a comprehensive nuclear interaction map of Tat in T-cells, we have designed a proteomic strategy based on affinity chromatography coupled with mass spectrometry. Results Our approach resulted in the identification of a total of 183 candidates as Tat nuclear partners, 90% of which have not been previously characterised. Subsequently we applied <it>in silico </it>analysis, to validate and characterise our dataset which revealed that the Tat nuclear interactome exhibits unique signature(s). First, motif composition analysis highlighted that our dataset is enriched for domains mediating protein, RNA and DNA interactions, and helicase and ATPase activities. Secondly, functional classification and network reconstruction clearly depicted Tat as a polyvalent protein adaptor and positioned Tat at the nexus of a densely interconnected interaction network involved in a range of biological processes which included gene expression regulation, RNA biogenesis, chromatin structure, chromosome organisation, DNA replication and nuclear architecture. Conclusion We have completed the <it>in vitro </it>Tat nuclear interactome and have highlighted its modular network properties and particularly those involved in the coordination of gene expression by Tat. Ultimately, the highly specialised set of molecular interactions identified will provide a framework to further advance our understanding of the mechanisms of HIV-1 proviral gene silencing and activation.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence

Author: A Kumar
A Reinhardt
B Matthews
C Guda
C Guda
CH Wu
G Dellaire
G-P Zhou
H Liu
H-B Shen
H-B Shen
HGE Sutherland
J Cedano
JL Heazlewood
K Nakai
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-C Chou
K-J Park
M Wang
M Wang
MA Andrade
MS Scott
O Emanuelsson
P Lio
Pufeng Du
Q-B Gao
RA Gottlieb
S Hua
S Kawashima
S-Q Wang
W Jassem
W Li
WA BickMore
Y Huang
Y-D Cai
Y-D Cai
Y-D Cai
Yanda Li
Z Lei
Z Yuan
Z-P Feng
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Knowing the submitochondria localization of a mitochondria protein is an important step to understand its function. We develop a method which is based on an extended version of pseudo-amino acid composition to predict the protein localization within mitochondria. This work goes one step further than predicting protein subcellular location. We also try to predict the membrane protein type for mitochondrial inner membrane proteins. RESULTS: By using leave-one-out cross validation, the prediction accuracy is 85.5% for inner membrane, 94.5% for matrix and 51.2% for outer membrane. The overall prediction accuracy for submitochondria location prediction is 85.2%. For proteins predicted to localize at inner membrane, the accuracy is 94.6% for membrane protein type prediction. CONCLUSION: Our method is an effective method for predicting protein submitochondria location. But even with our method or the methods at subcellular level, the prediction of protein submitochondria location is still a challenging problem. The online service SubMito is now available at

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recommended from our members

Detailed prediction of protein sub-nuclear localization

Author: Bodén Mikael
Goldberg Tatyana
Littmann Maria
Rost Burkhard
Seitz Sebastian
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2019
Field of study

Background Sub-nuclear structures or locations are associated with various nuclear processes. Proteins localized in these substructures are important to understand the interior nuclear mechanisms. Despite advances in high-throughput methods, experimental protein annotations remain limited. Predictions of cellular compartments have become very accurate, largely at the expense of leaving out substructures inside the nucleus making a fine-grained analysis impossible. Results Here, we present a new method (LocNuclei) that predicts nuclear substructures from sequence alone. LocNuclei used a string-based Profile Kernel with Support Vector Machines (SVMs). It distinguishes sub-nuclear localization in 13 distinct substructures and distinguishes between nuclear proteins confined to the nucleus and those that are also native to other compartments (traveler proteins). High performance was achieved by implicitly leveraging a large biological knowledge-base in creating predictions by homology-based inference through BLAST. Using this approach, the performance reached AUC = 0.70–0.74 and Q13 = 59–65%. Travelling proteins (nucleus and other) were identified at Q2 = 70–74%. A Gene Ontology (GO) analysis of the enrichment of biological processes revealed that the predicted sub-nuclear compartments matched the expected functionality. Analysis of protein-protein interactions (PPI) show that formation of compartments and functionality of proteins in these compartments highly rely on interactions between proteins. This suggested that the LocNuclei predictions carry important information about function. The source code and data sets are available through GitHub: https://github.com/Rostlab/LocNuclei . Conclusions LocNuclei predicts subnuclear compartments and traveler proteins accurately. These predictions carry important information about functionality and PPIs

Columbia University Academic Commons