40 research outputs found

    Incorporating background frequency improves entropy-based residue conservation measures

    Get PDF
    BACKGROUND: Several entropy-based methods have been developed for scoring sequence conservation in protein multiple sequence alignments. High scoring amino acid positions may correlate with structurally or functionally important residues. However, amino acid background frequencies are usually not taken into account in these entropy-based scoring schemes. RESULTS: We demonstrate that using a relative entropy measure that incorporates amino acid background frequency results in improved performance in identifying functional sites from protein multiple sequence alignments. CONCLUSION: Our results suggest that the application of appropriate background frequency information may lead to more biologically relevant results in many areas of bioinformatics

    L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier

    Get PDF
    To understand enzyme functions, identifying the catalytic residues is a usual first step. Moreover, knowledge about catalytic residues is also useful for protein engineering and drug-design. However, to experimentally identify catalytic residues remains challenging for reasons of time and cost. Therefore, computational methods have been explored to predict catalytic residues. Here, we developed a new algorithm, L1pred, for catalytic residue prediction, by using the L1-logreg classifier to integrate eight sequence-based scoring functions. We tested L1pred and compared it against several existing sequence-based methods on carefully designed datasets Data604 and Data63. With ten-fold cross-validation, L1pred showed the area under precision-recall curve (AUPR) and the area under ROC curve (AUC) of 0.2198 and 0.9494 on the training dataset, Data604, respectively. In addition, on the independent test dataset, Data63, it showed the AUPR and AUC values of 0.2636 and 0.9375, respectively. Compared with other sequence-based methods, L1pred showed the best performance on both datasets. We also analyzed the importance of each attribute in the algorithm, and found that all the scores contributed more or less equally to the L1pred performance

    Identifying allosteric fluctuation transitions between different protein conformational states as applied to Cyclin Dependent Kinase 2

    Get PDF
    BACKGROUND: The mechanisms underlying protein function and associated conformational change are dominated by a series of local entropy fluctuations affecting the global structure yet are mediated by only a few key residues. Transitional Dynamic Analysis (TDA) is a new method to detect these changes in local protein flexibility between different conformations arising from, for example, ligand binding. Additionally, Positional Impact Vertex for Entropy Transfer (PIVET) uses TDA to identify important residue contact changes that have a large impact on global fluctuation. We demonstrate the utility of these methods for Cyclin-dependent kinase 2 (CDK2), a system with crystal structures of this protein in multiple functionally relevant conformations and experimental data revealing the importance of local fluctuation changes for protein function. RESULTS: TDA and PIVET successfully identified select residues that are responsible for conformation specific regional fluctuation in the activation cycle of Cyclin Dependent Kinase 2 (CDK2). The detected local changes in protein flexibility have been experimentally confirmed to be essential for the regulation and function of the kinase. The methodologies also highlighted possible errors in previous molecular dynamic simulations that need to be resolved in order to understand this key player in cell cycle regulation. Finally, the use of entropy compensation as a possible allosteric mechanism for protein function is reported for CDK2. CONCLUSION: The methodologies embodied in TDA and PIVET provide a quick approach to identify local fluctuation change important for protein function and residue contacts that contributes to these changes. Further, these approaches can be used to check for possible errors in protein dynamic simulations and have the potential to facilitate a better understanding of the contribution of entropy to protein allostery and function

    Accurate Protein Structure Annotation through Competitive Diffusion of Enzymatic Functions over a Network of Local Evolutionary Similarities

    Get PDF
    High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks

    Separation of Recombination and SOS Response in Escherichia coli RecA Suggests LexA Interaction Sites

    Get PDF
    RecA plays a key role in homologous recombination, the induction of the DNA damage response through LexA cleavage and the activity of error-prone polymerase in Escherichia coli. RecA interacts with multiple partners to achieve this pleiotropic role, but the structural location and sequence determinants involved in these multiple interactions remain mostly unknown. Here, in a first application to prokaryotes, Evolutionary Trace (ET) analysis identifies clusters of evolutionarily important surface amino acids involved in RecA functions. Some of these clusters match the known ATP binding, DNA binding, and RecA-RecA homo-dimerization sites, but others are novel. Mutation analysis at these sites disrupted either recombination or LexA cleavage. This highlights distinct functional sites specific for recombination and DNA damage response induction. Finally, our analysis reveals a composite site for LexA binding and cleavage, which is formed only on the active RecA filament. These new sites can provide new drug targets to modulate one or more RecA functions, with the potential to address the problem of evolution of antibiotic resistance at its root

    Reduced conditioned fear response in mice that lack Dlx1 and show subtype-specific loss of interneurons

    Get PDF
    The inhibitory GABAergic system has been implicated in multiple neuropsychiatric diseases such as schizophrenia and autism. The Dlx homeobox transcription factor family is essential for development and function of GABAergic interneurons. Mice lacking the Dlx1 gene have postnatal subtype-specific loss of interneurons and reduced IPSCs in their cortex and hippocampus. To ascertain consequences of these changes in the GABAergic system, we performed a battery of behavioral assays on the Dlx1 mutant mice, including zero maze, open field, locomotor activity, food intake, rotarod, tail suspension, fear conditioning assays (context and trace), prepulse inhibition, and working memory related tasks (spontaneous alteration task and spatial working memory task). Dlx1 mutant mice displayed elevated activity levels in open field, locomotor activity, and tail suspension tests. These mice also showed deficits in contextual and trace fear conditioning, and possibly in prepulse inhibition. Their learning deficits were not global, as the mutant mice did not differ from the wild-type controls in tests of working memory. Our findings demonstrate a critical role for the Dlx1 gene, and likely the subclasses of interneurons that are affected by the lack of this gene, in behavioral inhibition and associative fear learning. These observations support the involvement of particular components of the GABAergic system in specific behavioral phenotypes related to complex neuropsychiatric diseases

    A comparative study of conservation and variation scores

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Conservation and variation scores are used when evaluating sites in a multiple sequence alignment, in order to identify residues critical for structure or function. A variety of scores are available today but it is not clear how different scores relate to each other.</p> <p>Results</p> <p>We applied 25 conservation and variation scores to alignments from the Catalytic Site Atlas (CSA). We calculated distances among scores based on correlation coefficients, and constructed a dendrogram of the scores by average linking cluster analysis. The cluster analysis showed that most scores fall into one of two groups--substitution matrix based group and frequency based group respectively. We also evaluated the scores' performance in predicting catalytic sites and found that frequency based scores generally perform best.</p> <p>Conclusions</p> <p>Conservation and variation scores can be classified into mainly two large groups. When using a score to predict catalytic sites, frequency based scores that also consider a background distribution are most successful.</p
    corecore