161 research outputs found
Discrimination of outer membrane proteins with improved performance
<p>Abstract</p> <p>Background</p> <p>Outer membrane proteins (OMPs) perform diverse functional roles in Gram-negative bacteria. Identification of outer membrane proteins is an important task.</p> <p>Results</p> <p>This paper presents a method for distinguishing outer membrane proteins (OMPs) from non-OMPs (that is, globular proteins and inner membrane proteins (IMPs)). First, we calculated the average residue compositions of OMPs, globular proteins and IMPs separately using a training set. Then for each protein from the test set, its distances to the three groups were calculated based on residue composition using a weighted Euclidean distance (WED) approach. Proteins from the test set were classified into OMP versus non-OMP classes based on the least distance. The proposed method can distinguish between OMPs and non-OMPs with 91.0% accuracy and 0.639 Matthews correlation coefficient (MCC). We then improved the method by including homologous sequences into the calculation of residue composition and using a feature-selection method to select the single residue and di-peptides that were useful for OMP prediction. The final method achieves an accuracy of 96.8% with 0.859 MCC. In direct comparisons, the proposed method outperforms previously published methods.</p> <p>Conclusion</p> <p>The proposed method can identify OMPs with improved performance. It will be very helpful to the discovery of OMPs in a genome scale.</p
Grammatical-Restrained Hidden Conditional Random Fields for Bioinformatics applications
<p>Abstract</p> <p>Background</p> <p>Discriminative models are designed to naturally address classification tasks. However, some applications require the inclusion of grammar rules, and in these cases generative models, such as Hidden Markov Models (HMMs) and Stochastic Grammars, are routinely applied.</p> <p>Results</p> <p>We introduce Grammatical-Restrained Hidden Conditional Random Fields (GRHCRFs) as an extension of Hidden Conditional Random Fields (HCRFs). GRHCRFs while preserving the discriminative character of HCRFs, can assign labels in agreement with the production rules of a defined grammar. The main GRHCRF novelty is the possibility of including in HCRFs prior knowledge of the problem by means of a defined grammar. Our current implementation allows <it>regular grammar </it>rules. We test our GRHCRF on a typical biosequence labeling problem: the prediction of the topology of Prokaryotic outer-membrane proteins.</p> <p>Conclusion</p> <p>We show that in a typical biosequence labeling problem the GRHCRF performs better than CRF models of the same complexity, indicating that GRHCRFs can be useful tools for biosequence analysis applications.</p> <p>Availability</p> <p>GRHCRF software is available under GPLv3 licence at the website</p> <p><url>http://www.biocomp.unibo.it/~savojard/biocrf-0.9.tar.gz.</url></p
IgTM: An algorithm to predict transmembrane domains and topology in proteins
<p>Abstract</p> <p>Background</p> <p>Due to their role of receptors or transporters, membrane proteins play a key role in many important biological functions. In our work we used Grammatical Inference (GI) to localize transmembrane segments. Our GI process is based specifically on the inference of Even Linear Languages.</p> <p>Results</p> <p>We obtained values close to 80% in both specificity and sensitivity. Six datasets have been used for the experiments, considering different encodings for the input sequences. An encoding that includes the topology changes in the sequence (from inside and outside the membrane to it and vice versa) allowed us to obtain the best results. This software is publicly available at: <url>http://www.dsic.upv.es/users/tlcc/bio/bio.html</url></p> <p>Conclusion</p> <p>We compared our results with other well-known methods, that obtain a slightly better precision. However, this work shows that it is possible to apply Grammatical Inference techniques in an effective way to bioinformatics problems.</p
Novel Colicin F-Y of Yersinia frederiksenii Inhibits Pathogenic Yersinia Strains via YiuR-Mediated Reception, TonB Import, and Cell Membrane Pore Formation
A novel colicin type, designated colicin F-Y, was found to be encoded and produced by the strain Yersinia frederiksenii Y27601. Colicin F-Y was active against both pathogenic and nonpathogenic strains of the genus Yersinia. Plasmid YF27601 (5,574 bp) of Y. frederiksenii Y27601 was completely sequenced. The colicin F-Y activity gene (cfyA) and the colicin F-Y immunity gene (cfyI) were identified. The deduced amino acid sequence of colicin F-Y was very similar in its C-terminal pore-forming domain to colicin Ib (69% identity in the last 178 amino acid residues), indicating pore forming as its lethal mode of action. Transposon mutagenesis of the colicin F-Y-susceptible strain Yersinia kristensenii Y276 revealed the yiuR gene (ykris001_4440), which encodes the YiuR outer membrane protein with unknown function, as the colicin F-Y receptor molecule. Introduction of the yiuR gene into the colicin F-Y-resistant strain Y. kristensenii Y104 restored its susceptibility to colicin F-Y. In contrast, the colicin F-Y-resistant strain Escherichia coli TOP10F' acquired susceptibility to colicin F-Y only when both the yiuR and tonB genes from Y. kristensenii Y276 were introduced. Similarities between colicins F-Y and Ib, similarities between the Cir and YiuR receptors, and the detected partial cross-immunity of colicin F-Y and colicin Ib producers suggest a common evolutionary origin of the colicin F-Y-YiuR and colicin Ib-Cir systems
Transmembrane protein topology prediction using support vector machines
Background: Alpha-helical transmembrane (TM) proteins are involved in a wide range of important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion. Many are also prime drug targets, and it has been estimated that more than half of all drugs currently on the market target membrane proteins. However, due to the experimental difficulties involved in obtaining high quality crystals, this class of protein is severely under-represented in structural databases. In the absence of structural data, sequence-based prediction methods allow TM protein topology to be investigated.Results: We present a support vector machine-based (SVM) TM protein topology predictor that integrates both signal peptide and re-entrant helix prediction, benchmarked with full cross-validation on a novel data set of 131 sequences with known crystal structures. The method achieves topology prediction accuracy of 89%, while signal peptides and re-entrant helices are predicted with 93% and 44% accuracy respectively. An additional SVM trained to discriminate between globular and TM proteins detected zero false positives, with a low false negative rate of 0.4%. We present the results of applying these tools to a number of complete genomes. Source code, data sets and a web server are freely available from http://bioinf.cs.ucl.ac.uk/psipred/.Conclusion: The high accuracy of TM topology prediction which includes detection of both signal peptides and re-entrant helices, combined with the ability to effectively discriminate between TM and globular proteins, make this method ideally suited to whole genome annotation of alpha-helical transmembrane proteins
Solution-Based Structural Analysis of the Decaheme Cytochrome, MtrA, by Small-Angle X-ray Scattering and Analytical Ultracentrifugation
The potential exploitation of metal-reducing bacteria as a means for environmental cleanup or alternative fuel is an exciting prospect; however, the cellular processes that would allow for these applications need to be better understood. MtrA is a periplasmic decaheme c-type cytochrome from Shewanella oneidensis involved in the reduction of extracellular iron oxides and therefore is a critical element in Shewanella ability to engage in extracellular charge transfer. As a relatively small 333-residue protein, the heme content is surprisingly high. MtrA is believed to obtain electrons from the inner membrane-bound quinol oxidoreductase, CymA, and shuttle them across the outer membrane to MtrC, another decaheme cytochrome that directly interacts with insoluble metal oxides. How MtrA is able to perform this task is a question of interest. Here through the use of two solution-based techniques, small-angle X-ray scattering (SAXS) and analytical ultracentrifugation (AUC), we present the first structural analysis of MtrA. Our results establish that between 0.5 and 4 mg/mL, MtrA exists as a monomeric protein that is shaped like an extended molecular “wire” with a maximum protein dimension (D[subscript max]) of 104 Å and a rod-like aspect ratio of 2.2 to 2.5. This study contributes to a greater understanding of how MtrA fulfills its role in the redox processes that must occur before electrons reach the outside of the cell.National Science Foundation (U.S.). (0546323)National Institutes of Health (U.S.) (Grant Number F32GM904862)Howard Hughes Medical Institute. InvestigatorNational Science Foundation (U.S.) (Award DMR- 0936384
Bivariate genome-wide association meta-analysis of pediatric musculoskeletal traits reveals pleiotropic effects at the SREBF1/TOM1L2 locus
Bone mineral density is known to be a heritable, polygenic trait whereas genetic variants contributing to lean mass variation remain largely unknown. We estimated the shared SNP heritability and performed a bivariate GWAS meta-analysis of total-body lean mass (TB-LM) and total-body less head bone mineral density (TBLH-BMD) regions in 10,414 children. The estimated SNP heritability is 43% for TBLH-BMD, and 39% for TB-LM, with a shared genetic component of 43%. We identify variants with pleiotropic effects in eight loci, including seven established bone mineral density loci: _WNT4, GALNT3, MEPE, CPED1/WNT16, TNFSF11, RIN3, and PPP6R3/LRP5_. Variants in the _TOM1L2/SREBF1_ locus exert opposing effects TB-LM and TBLH-BMD, and have a stronger association with the former trait. We show that _SREBF1_ is expressed in murine and human osteoblasts, as well as in human muscle tissue. This is the first bivariate GWAS meta-analysis to demonstrate genetic factors with pleiotropic effects on bone mineral density and lean mass
A −436C>A Polymorphism in the Human FAS Gene Promoter Associated with Severe Childhood Malaria
Human genetics and immune responses are considered to critically influence the outcome of malaria infections including life-threatening syndromes caused by Plasmodium falciparum. An important role in immune regulation is assigned to the apoptosis-signaling cell surface receptor CD95 (Fas, APO-1), encoded by the gene FAS. Here, a candidate-gene association study including variant discovery at the FAS gene locus was carried out in a case-control group comprising 1,195 pediatric cases of severe falciparum malaria and 769 unaffected controls from a region highly endemic for malaria in Ghana, West Africa. We found the A allele of c.−436C>A (rs9658676) located in the promoter region of FAS to be significantly associated with protection from severe childhood malaria (odds ratio 0.71, 95% confidence interval 0.58–0.88, pempirical = 0.02) and confirmed this finding in a replication group of 1,412 additional severe malaria cases and 2,659 community controls from the same geographic area. The combined analysis resulted in an odds ratio of 0.71 (95% confidence interval 0.62–0.80, p = 1.8×10−7, n = 6035). The association applied to c.−436AA homozygotes (odds ratio 0.47, 95% confidence interval 0.36–0.60) and to a lesser extent to c.−436AC heterozygotes (odds ratio 0.73, 95% confidence interval 0.63–0.84), and also to all phenotypic subgroups studied, including severe malaria anemia, cerebral malaria, and other malaria complications. Quantitative FACS analyses assessing CD95 surface expression of peripheral blood mononuclear cells of naïve donors showed a significantly higher proportion of CD69+CD95+ cells among persons homozygous for the protective A allele compared to AC heterozygotes and CC homozygotes, indicating a functional role of the associated CD95 variant, possibly in supporting lymphocyte apoptosis
Bivariate genome-wide association meta-analysis of pediatric musculoskeletal traits reveals pleiotropic effects at the SREBF1/TOM1L2 locus
Bone mineral density is known to be a heritable, polygenic trait whereas genetic variants contributing to lean mass variation remain largely unknown. We estimated the shared SNP heritability and performed a bivariate GWAS meta-analysis of total-body lean mass (TB-LM) and total-body less head bone mineral density (TBLH-BMD) regions in 10,414 children. The estimated SNP heritability is 43% for TBLH-BMD, and 39% for TB-LM, with a shared genetic component of 43%. We identify variants with pleiotropic effects in eight loci, including seven established bone mineral density loci: _WNT4, GALNT3, MEPE, CPED1/WNT16, TNFSF11, RIN3, and PPP6R3/LRP5_. Variants in the _TOM1L2/SREBF1_ locus exert opposing effects TB-LM and TBLH-BMD, and have a stronger association with the former trait. We show that _SREBF1_ is expressed in murine and human osteoblasts, as well as in human muscle tissue. This is the first bivariate GWAS meta-analysis to demonstrate genetic factors with pleiotropic effects on bone mineral density and lean mass
Comparative study of the extracellular proteome of Sulfolobus species reveals limited secretion
Although a large number of potentially secreted proteins can be predicted on the basis of genomic distribution of signal sequence-bearing proteins, protein secretion in Archaea has barely been studied. A proteomic inventory and comparison of the growth medium proteins in three hyperthermoacidophiles, i.e., Sulfolobus solfataricus, S. acidocaldarius and S. tokodaii, indicates that only few proteins are freely secreted into the growth medium and that the majority originates from cell envelope bound forms. In S. acidocaldarius both cell-associated and secreted α-amylase activities are detected. Inactivation of the amyA gene resulted in a complete loss of activity, suggesting that the same protein is responsible for the a-amylase activity at both locations. It is concluded that protein secretion in Sulfolobus is a limited process, and it is suggested that the S-layer may act as a barrier for the free diffusion of folded proteins into the medium
- …