21 research outputs found
The performance (coverage metric) of the HMM search on the main dataset using a five-fold cross-validation.
<p>The performance (coverage metric) of the HMM search on the main dataset using a five-fold cross-validation.</p
Variance inamino acid composition of transporter proteins.
<p>The variance in amino acid residues across amino acid transporters, anion transporters, cation transporters, electron transporters, protein/mRNA transporters, sugar transporters, other transporters, and non-transporters is plotted.</p
The performances of the best models on the main dataset for different substrate-specific transporter classes.
<p>The performances of the best models on the main dataset for different substrate-specific transporter classes.</p
Prediction of Membrane Transport Proteins and Their Substrate Specificities Using Primary Sequence Information
<div><p>Background</p><p>Membrane transport proteins (transporters) move hydrophilic substrates across hydrophobic membranes and play vital roles in most cellular functions. Transporters represent a diverse group of proteins that differ in topology, energy coupling mechanism, and substrate specificity as well as sequence similarity. Among the functional annotations of transporters, information about their transporting substrates is especially important. The experimental identification and characterization of transporters is currently costly and time-consuming. The development of robust bioinformatics-based methods for the prediction of membrane transport proteins and their substrate specificities is therefore an important and urgent task.</p><p>Results</p><p>Support vector machine (SVM)-based computational models, which comprehensively utilize integrative protein sequence features such as amino acid composition, dipeptide composition, physico-chemical composition, biochemical composition, and position-specific scoring matrices (PSSM), were developed to predict the substrate specificity of seven transporter classes: amino acid, anion, cation, electron, protein/mRNA, sugar, and other transporters. An additional model to differentiate transporters from non-transporters was also developed. Among the developed models, the biochemical composition and PSSM hybrid model outperformed other models and achieved an overall average prediction accuracy of 76.69% with a Mathews correlation coefficient (MCC) of 0.49 and a receiver operating characteristic area under the curve (AUC) of 0.833 on our main dataset. This model also achieved an overall average prediction accuracy of 78.88% and MCC of 0.41 on an independent dataset.</p><p>Conclusions</p><p>Our analyses suggest that evolutionary information (i.e., the PSSM) and the AAIndex are key features for the substrate specificity prediction of transport proteins. In comparison, similarity-based methods such as BLAST, PSI-BLAST, and hidden Markov models do not provide accurate predictions for the substrate specificity of membrane transport proteins. <i>TrSSP: The Transporter Substrate Specificity Prediction Server</i>, a web server that implements the SVM models developed in this paper, is freely available at <a href="http://bioinfo.noble.org/TrSSP" target="_blank">http://bioinfo.noble.org/TrSSP</a>.</p></div
The performance (coverage metric) of the BLAST search on the main dataset using the standard five-fold cross-validation.
<p>The performance (coverage metric) of the BLAST search on the main dataset using the standard five-fold cross-validation.</p
Amino acid composition of transporter proteins.
<p>The amino acid composition for each amino acid as a percentage of the total number of amino acids for amino acid transporters (blue diamonds), anion transporters (red squares), cation transporters (green triangles), electron transporters (purple x), protein/mRNA transporters (cyan asterisks), sugar transporters (orange circles), other transporters (plus signs), and non-transporters (orange dashes) are plotted.</p
A comparison of the coverage metric of our models with TTRBF.
<p>A comparison of the coverage metric of our models with TTRBF.</p
The performance (coverage metric) of the HMM search on the main dataset using a five-fold cross-validation.
<p>The performance (coverage metric) of the HMM search on the main dataset using a five-fold cross-validation.</p
A comparison of the performance of our model with TTRBF.
<p>* Four substrate-specific transporter classes described in Chen et al. <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0100278#pone.0100278-Chen1" target="_blank">[13]</a>; $ Non-transporter, % other class; ∧ ion transporter.</p
The number of predicted transporters in Human, Drosophila, <i>E. coli</i>, Yeast and <i>A. thaliana</i>.
<p>The number of predicted transporters in Human, Drosophila, <i>E. coli</i>, Yeast and <i>A. thaliana</i>.</p