HMM_RA: An Improved Method for Alpha-Helical Transmembrane Protein Topology Prediction

Author: Hu Jing
Yan Changhui
Publication venue: Libertas Academica
Publication date: 01/01/2008

α-helical transmembrane (TM) proteins play important and diverse functional roles in cells. The ability to predict the topology of these proteins is important for identifying functional sites and inferring the function of membrane proteins. This paper presents a Hidden Markov Model (referred to as HMM_RA) that can predict the topology of α-helical transmembrane proteins with improved performance. HMM_RA adopts the same structure as the HMMTOP method, which has five modules: inside loop, inside helix tail, membrane helix, outside helix tail and outside loop. Each module consists of one or multiple states. HMM_RA allows protein sequences to be encoded with reduced alphabets. Thus, each state of HMM_RA is associated with n emission probabilities, where n is the size of the reduced alphabet set. Direct comparisons using two standard data sets show that HMM_RA consistently outperforms HMMTOP and TMHMM in topology prediction. Specifically, on a high-quality data set of 83 proteins, HMM_RA outperforms HMMTOP by up to 7.6% in topology accuracy and 6.4% in α-helix location accuracy. On the same data set, HMM_RA outperforms TMHMM by up to 6.4% in topology accuracy and 2.9% in location accuracy. Comparison also shows that HMM_RA achieves performance comparable to that of Phobius, a recently published method.
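
The idea of a reduced alphabet with n emission probabilities per state can be illustrated with a minimal Python sketch. The four-group alphabet, the function names and the pseudocount below are assumptions made for illustration; the abstract does not say which groupings HMM_RA uses or how its emission probabilities are estimated.

```python
from collections import Counter

# Hypothetical reduced alphabet (n = 4). This grouping is illustrative only
# and is NOT taken from the HMM_RA paper.
REDUCED_GROUPS = {
    "h": "AVLIMFWC",  # hydrophobic
    "p": "STNQYGP",   # polar or small
    "+": "KRH",       # positively charged
    "-": "DE",        # negatively charged
}

# Map each of the 20 standard amino acids to its reduced symbol.
RESIDUE_TO_SYMBOL = {
    aa: symbol for symbol, residues in REDUCED_GROUPS.items() for aa in residues
}


def encode(sequence):
    """Rewrite a protein sequence in the reduced alphabet."""
    return "".join(RESIDUE_TO_SYMBOL[aa] for aa in sequence.upper())


def emission_probabilities(encoded_segments, pseudocount=1.0):
    """Estimate one state's n emission probabilities from encoded segments.

    Uses simple counting with additive smoothing (an assumption), so that
    symbols never observed in the training segments keep a non-zero probability.
    """
    symbols = sorted(REDUCED_GROUPS)
    counts = Counter("".join(encoded_segments))
    total = sum(counts[s] for s in symbols) + pseudocount * len(symbols)
    return {s: (counts[s] + pseudocount) / total for s in symbols}
```

With this grouping, each state carries 4 emission probabilities instead of 20, which reduces the number of parameters to estimate from a small training set.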

Rampant exchange of the structure and function of extramembrane domains between membrane and water soluble proteins.

Author: Bowie James U
Han Seong Kyu
Kim Sanguk
Nam Hyun-Jun
Publication venue: eScholarship, University of California
Publication date: 01/01/2013

Of the membrane proteins of known structure, we found that a remarkable 67% of the water soluble domains are structurally similar to water soluble proteins of known structure. Moreover, 41% of known water soluble protein structures share a domain with an already known membrane protein structure. We also found that functional residues are frequently conserved between extramembrane domains of membrane and soluble proteins that share structural similarity. These results suggest membrane and soluble proteins readily exchange domains and their attendant functionalities. The exchanges between membrane and soluble proteins are particularly frequent in eukaryotes, indicating that this is an important mechanism for increasing functional complexity. The high level of structural overlap between the two classes of proteins provides an opportunity to employ the extensive information on soluble proteins to illuminate membrane protein structure and function, for which much less is known. To this end, we employed structure guided sequence alignment to elucidate the functions of membrane proteins in the human genome. Our results bridge the gap of fold space between membrane and water soluble proteins and provide a resource for the prediction of membrane protein function. A database of predicted structural and functional relationships for proteins in the human genome is provided at sbi.postech.ac.kr/emdmp

eScholarship - University of California

A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments

Author: A Krogh
A Thomas
B Rost
E Falkenauer
E Wallin
EL Sonnhammer
F Tekaia
G Tusnady
G von Heijne
GE Tusnady
H Berman
H Shen
H Zhou
J Holland
J Pylouster
JM Cuthbertson
L Kall
M Cserzo
M Suyama
MG Claros
Nazar Zaki
Pierandrea Temussi
R Garey
RY Kahsay
S Hosseini
S Jayasinghe
S Roy
Salah Bouktif
Sanja Lazarova-Molnar
T Hirokawa
T Nugent
T Taylor
Publication venue: Public Library of Science
Publication date: 01/01/2011

Transmembrane helix (TMH) topology prediction is becoming a focal problem in bioinformatics because the structure of TM proteins is difficult to determine using experimental methods. Therefore, methods that can computationally predict the topology of helical membrane proteins are highly desirable. In this paper we introduce TMHindex, a method for detecting TMH segments using only the amino acid sequence information. Each amino acid in a protein sequence is represented by a Compositional Index, which is deduced from a combination of the difference in amino acid occurrences in TMH and non-TMH segments in training protein sequences and the amino acid composition information. Furthermore, a genetic algorithm was employed to find the optimal threshold value for the separation of TMH segments from non-TMH segments. The method successfully predicted 376 out of the 378 TMH segments in a dataset consisting of 70 test protein sequences. The sensitivity and specificity for classifying each amino acid in every protein sequence in the dataset were 0.901 and 0.865, respectively. To assess the generality of TMHindex, we also tested the approach on another standard 73-protein 3D helix dataset. TMHindex correctly predicted 91.8% of proteins based on TM segments. The level of accuracy achieved using TMHindex in comparison to other recent approaches for predicting the topology of TM proteins is a strong argument in favor of our proposed method. Availability: The datasets and software, together with supplementary materials, are available at: http://faculty.uaeu.ac.ae/nzaki/TMHindex.htm
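
The per-residue sensitivity and specificity quoted above can be computed as in the following Python sketch. It assumes the common definitions, sensitivity = TP / (TP + FN) and specificity = TN / (TN + FP); the abstract does not state which definitions the authors used. The label convention ("M" for a TMH residue, any other character for non-TMH) and the function name are hypothetical.

```python
def per_residue_scores(true_labels, predicted_labels, positive="M"):
    """Return (sensitivity, specificity) for per-residue TMH labels.

    Both arguments are equal-length strings with one character per residue.
    `positive` marks a TMH residue (a hypothetical labelling convention).
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label strings must have the same length")

    tp = fp = tn = fn = 0
    for truth, guess in zip(true_labels, predicted_labels):
        if truth == positive and guess == positive:
            tp += 1
        elif truth == positive:
            fn += 1
        elif guess == positive:
            fp += 1
        else:
            tn += 1

    # Guard against division by zero when a class is absent.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity
```

For example, `per_residue_scores("MMMM----", "MMM-M---")` counts three true positives, one false negative, one false positive and three true negatives.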

Public Library of Science (PLOS)

University of Southern Denmark Research Output

Membrane Topology and Predicted RNA-Binding Function of the ‘Early Responsive to Dehydration (ERD4)’ Plant Protein

Functional annotation of uncharacterized genes is the main focus of computational methods in the post-genomic era. These tools search for similarity between proteins on the premise that those sharing sequence or structural motifs usually perform related functions, and are thus particularly useful for membrane proteins. Early responsive to dehydration (ERD) genes are rapidly induced in response to dehydration stress in a variety of plant species. In the present work we characterized the function of the Brassica juncea ERD4 gene using computational approaches. The ERD4 protein, whose function is unknown, possesses the ubiquitous DUF221 domain (residues 312–634) and is conserved in all plant species. We suggest that the protein is localized in the chloroplast membrane with at least nine transmembrane helices. We detected a globular domain of 165 amino acid residues (183–347) in plant ERD4 proteins and expect this to be positioned inside the chloroplast. The structural-functional annotation of the globular domain was arrived at using fold recognition methods, which suggested the presence in its sequence of two tandem RNA-recognition motif (RRM) domains, each folded into a βαββαβ topology. The structure-based sequence alignment with known RNA-binding proteins revealed conservation of two non-canonical ribonucleoprotein sub-motifs in both of the putative RNA-recognition domains of the ERD4 protein. The function of the highly conserved ERD4 protein may thus be associated with its RNA-binding ability during the stress response. This is the first functional annotation of the ERD4 family of proteins, and it can be useful in designing experiments to unravel crucial aspects of the stress tolerance mechanism.

FigShare

Properties of the phage-shock-protein (Psp) regulatory complex that govern signal transduction and induction of the Psp response in Escherichia coli

Author: Abramoff
Adams
Antony J. Mayhew
Becker
Bergler
Bernsel
Bidle
Bogdanov
Bogdanov
Bogdanov
Brown
Cherepanov
Christoph Engl
Corpet
Darwin
Darwin
Dowhan
Elderkin
Elderkin
Engl
Finn
Goran Jovanovic
Gueguen
Guilvout
Haggie
Huvet
Javadpour
Jiang
Joly
Jones
Jovanovic
Jovanovic
Kahsay
Karimova
Karimova
Kleerebezem
Kobayashi
Kusters
Lloyd
Martin Buck
Maxson
Miller
Model
Patricia C. Burrows
Rowley
Russel
Tusnády
von Heijne
Vrancken
Weiner
Weiner
Wigneshweraraj
Zhang
Zhang
Publication venue: Society for General Microbiology
Publication date: 01/01/2010

The phage-shock-protein (Psp) response maintains the proton-motive force (pmf) under extracytoplasmic stress conditions that impair the inner membrane (IM) in bacterial cells. In Escherichia coli transcription of the pspABCDE and pspG genes requires activation of σ54-RNA polymerase by the enhancer-binding protein PspF. A regulatory network comprising PspF–A–C–B–ArcB controls psp expression. One key regulatory point is the negative control of PspF imposed by its binding to PspA. It has been proposed that under stress conditions, the IM-bound sensors PspB and PspC receive and transduce the signal(s) to PspA via protein–protein interactions, resulting in the release of the PspA–PspF inhibitory complex and the consequent induction of psp. In this work we demonstrate that PspB self-associates and interacts with PspC via putative IM regions. We present evidence suggesting that PspC has two topologies and that conserved residue G48 and the putative leucine zipper motif are determinants required for PspA interaction and signal transduction upon stress. We also establish that PspC directly interacts with the effector PspG, and show that PspG self-associates. These results are discussed in the context of formation and function of the Psp regulatory complex

Queen Mary Research Online

imagine

Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium

Author: Brindley Paul J.
Cai Xuepeng
Guo Aijiang
Hou Junling
Jia Wan-Zhong
Li Li
Lou Zhong-Zi
Luo Xuenong
Yan Hong-Bin
Zheng Yadong
Publication venue: Health Sciences Research Commons
Publication date: 04/06/2014

Background: Cysticercosis remains a major neglected tropical disease of humanity in many regions, especially in sub-Saharan Africa, Central America and elsewhere. Owing to emerging drug resistance and the inability of current drugs to prevent re-infection, identification of novel vaccines and chemotherapeutic agents against Taenia solium and related helminth pathogens is a public health priority. The T. solium genome and the predicted proteome were reported recently, providing a wealth of information from which new interventional targets might be identified. In order to characterize and classify the entire repertoire of protease-encoding genes of T. solium, which play fundamental biological roles in all life processes, we analyzed the predicted proteins of this cestode through a combination of bioinformatics tools. Functional annotation was performed to yield insights into the signaling processes relevant to the complex developmental cycle of this tapeworm and to highlight a suite of the proteases as potential intervention targets. Results: Within the genome of this helminth parasite, we identified 200 open reading frames encoding proteases from five clans, which correspond to 1.68% of the 11,902 protein-encoding genes predicted to be present in its genome. These proteases include calpains, cytosolic, mitochondrial signal peptidases, ubiquitylation related proteins, and others. Many not only show significant similarity to proteases in the Conserved Domain Database but also have conserved active sites and catalytic domains. KEGG Automatic Annotation Server (KAAS) analysis indicated that ~60% of these proteases share strong sequence identities with proteins of the KEGG database, which are involved in human disease, metabolic pathways, genetic information processes, cellular processes, environmental information processes and organismal systems. Also, we identified signal peptides and transmembrane helices through comparative analysis with classes of important regulatory proteases. Phylogenetic analysis using a Bayesian approach provided support for inferring functional divergence among regulatory cysteine and serine proteases. Conclusion: Numerous putative proteases were identified for the first time in T. solium, and important regulatory proteases have been predicted. This comprehensive analysis not only complements the growing knowledge base of proteolytic enzymes, but also provides a platform from which to expand knowledge of cestode proteases and to explore their biochemistry and potential as intervention targets.

George Washington University: Health Sciences Research Commons (HSRC)

CoBaltDB: Complete bacterial and archaeal orfeomes subcellular localization database and associated resources

Author: Avner Stéphane
Barloy-Hubler Frédérique
Goudenège David
Lucchetti-Miganeh Céline
Publication venue: BioMed Central
Publication date: 01/01/2010

BACKGROUND: The functions of proteins are strongly related to their localization in cell compartments (for example the cytoplasm or membranes) but the experimental determination of the sub-cellular localization of proteomes is laborious and expensive. A fast and low-cost alternative approach is in silico prediction, based on features of the protein primary sequences. However, biologists are confronted with a very large number of computational tools that use different methods and address various localization features with diverse specificities and sensitivities. As a result, exploiting these computer resources to predict protein localization accurately involves querying all tools and comparing every prediction output; this is a painstaking task. Therefore, we developed a comprehensive database, called CoBaltDB, that gathers all prediction outputs concerning complete prokaryotic proteomes. DESCRIPTION: The current version of CoBaltDB integrates the results of 43 localization predictors for 784 complete bacterial and archaeal proteomes (2,548,292 proteins in total). CoBaltDB supplies a simple user-friendly interface for retrieving and exploring relevant information about predicted features (such as signal peptide cleavage sites and transmembrane segments). Data are organized into three work-sets ("specialized tools", "meta-tools" and "additional tools"). The database can be queried using the organism name, a locus tag or a list of locus tags and may be browsed using numerous graphical and text displays. CONCLUSIONS: With its new functionalities, CoBaltDB is a novel, powerful platform that provides easy access to the results of multiple localization tools and support for predicting prokaryotic protein localizations with higher confidence than previously possible. CoBaltDB is available at http://www.umr6026.univ-rennes1.fr/english/home/research/basic/software/cobalten

Springer - Publisher Connector

University of Essex Research Repository

HAL-Rennes 1

Structure and functional motifs of GCR1, the only plant protein with a GPCR fold?

Author: Abdalla Nuradin Y
Bailey Gregory R
Jordon Sian R D
Reeves Philip J
Reynolds Christopher A
Taddese Bruck
Upton Graham J G
Publication venue: American Society of Plant Biologists (ASPB)
Publication date: 01/01/2013

Whether GPCRs exist in plants is a fundamental biological question. Interest in deorphanizing new G protein coupled receptors (GPCRs) arises because of their importance in signaling. Within plants, this is controversial as genome analysis has identified 56 putative GPCRs, including GCR1, which is reportedly a remote homologue to class A, B and E GPCRs. Of these, GCR2 is not a GPCR; more recently it has been proposed that none are, not even GCR1. We have addressed this disparity between genome analysis and biological evidence through a structural bioinformatics study, involving fold recognition methods, from which only GCR1 emerges as a strong candidate. To further probe GCR1, we have developed a novel helix alignment method, which has been benchmarked against the class A – class B – class F GPCR alignments. In addition, we have presented a mutually consistent set of alignments of GCR1 homologues to class A, class B and class F GPCRs, and shown that GCR1 is closer to class A and/or class B GPCRs than class A, class B or class F GPCRs are to each other. To further probe GCR1, we have aligned transmembrane helix 3 of GCR1 to each of the 6 GPCR classes. Variability comparisons provide additional evidence that GCR1 homologues have the GPCR fold. From the alignments and a GCR1 comparative model we have identified motifs that are common to GCR1, class A, B and E GPCRs. We discuss the possibilities that emerge from this controversial evidence that GCR1 has a GPCR fold.

Highly Divergent Mitochondrial ATP Synthase Complexes in Tetrahymena thermophila

Author: A Bernsel
A Pain
A Villavicencio-Queijeiro
A Zikova
A. B Vaidya
A. J Rodgers
A. M Waterhouse
Akhil B. Vaidya
B Schierwater
B. F Lang
C Börnhovd
C Meisinger
C. F Brunk
C. M Angevine
C. W Greider
D Pogoryelov
D Stock
D. G Smith
D. T Jones
E Bisetto
E Zerbetto
E. A Nash
E. A Vasilyeva
E. J Boekema
E. L. L Sonnhammer
Egbert J. Boekema
F Armougom
F Burki
F Krause
F Pazos
F Rodríguez
F Ronquist
G Burger
G Turner
H Viklund
I Wittig
I. R Collinson
J Felsenstein
J Hermolin
J St-Pierre
J Traba
J. A Eisen
J. D Thompson
J. E Brownell
J. E Walker
J. P Abrahams
J. P Huelsenbeck
Jennifer E. van Eyk
Jonathan A. Eisen
K Bryson
K Katoh
K Kruger
K Peters
K Tamura
K. A Brayton
K. A Conklin
K. K Peachman
L Kilpatrick
Lesley A. Kane
M Johnson
M Vázquez-Acevedo
M Weigt
M. D Unitt
M. E Pullman
M. J Gardner
M. L Sogin
M. M Moradian
M. S Abrahamsen
Michael W. Mather
N Numoto
N. V Dudkina
Natalya V. Dudkina
P Xu
P. D Boyer
Praveen Balabaskaran Nina
R Kagawa
R Rabl
R Sadreyev
R Seshadri
R. D Allen
R. D Finn
R. I Menz
R. L Cross
R. L Gundry
R. Y Kahsay
S Guindon
S Sunderhaus
S Whelan
S. F Altschul
S. P Muench
S. V Brown
T Cavalier-Smith
T Kobayashi
T Meier
T. F Smith
V Hampl
W Junge
W Werhahn
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2010

Tetrahymena ATP synthase, an evolutionarily divergent protein complex, has a very unusual structure and protein composition including a unique Fo subunit a and at least 13 proteins with no orthologs outside of the ciliate lineage

Public Library of Science (PLOS)

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen