
    A survey of orphan enzyme activities

    Background: Using computational database searches, we have demonstrated previously that no gene sequences could be found for at least 36% of enzyme activities that have been assigned an Enzyme Commission number. Here we present a follow-up literature-based survey involving a statistically significant sample of such "orphan" activities. The survey was intended to determine whether sequences for these enzyme activities are truly unknown, or whether these sequences are absent from the public sequence databases but can be found in the literature. Results: We demonstrate that for ~80% of sampled orphans, the absence of sequence data is bona fide. Our analyses further substantiate the notion that many of these enzyme activities play biologically important roles. Conclusion: This survey points toward significant scientific cost of having such a large fraction of characterized enzyme activities disconnected from sequence data. It also suggests that a larger effort, beginning with a comprehensive survey of all putative orphan activities, would resolve nearly 300 artifactual orphans and reconnect a wealth of enzyme research with modern genomics. For these reasons, we propose that a systematic effort to identify the cognate genes of orphan enzymes be undertaken.

    New Insight into the Transcarbamylase Family: The Structure of Putrescine Transcarbamylase, a Key Catalyst for Fermentative Utilization of Agmatine

    Transcarbamylases reversibly transfer a carbamyl group from carbamylphosphate (CP) to an amine. Although aspartate transcarbamylase and ornithine transcarbamylase (OTC) are well characterized, little was known about putrescine transcarbamylase (PTC), the enzyme that generates CP for ATP production in the fermentative catabolism of agmatine. We demonstrate that PTC (from Enterococcus faecalis), in addition to using putrescine, can utilize L-ornithine as a poor substrate. Crystal structures at 2.5 Å and 2.0 Å resolutions of PTC bound to its respective bisubstrate analog inhibitors for putrescine and ornithine use, N-(phosphonoacetyl)-putrescine and δ-N-(phosphonoacetyl)-L-ornithine, shed light on PTC preference for putrescine. Except for a highly prominent C-terminal helix that projects away and embraces an adjacent subunit, PTC closely resembles OTCs, suggesting recent divergence of the two enzymes. Since differences between the respective 230 and SMG loops of PTC and OTC appeared to account for the differential preference of these enzymes for putrescine and ornithine, we engineered the 230-loop of PTC to make it resemble the SMG loop of OTCs, increasing the activity with ornithine and greatly decreasing the activity with putrescine. We also examined the role of the C-terminal helix, which appears to be a constant and exclusive PTC trait. The enzyme lacking this helix remained active, but the PTC trimer stability appeared decreased, since some of the enzyme eluted as monomers from a gel filtration column. In addition, truncated PTC tended to aggregate to hexamers, as shown both chromatographically and by X-ray crystallography. Therefore, the extra C-terminal helix plays a dual role: it stabilizes the PTC trimer and, by shielding helix 1 of an adjacent subunit, it prevents the supratrimeric oligomerizations of obscure significance observed with some OTCs. Guided by the structural data, we identify signature traits that permit easy and unambiguous annotation of PTC sequences.

    pKa Modulation of the Acid/Base Catalyst within GH32 and GH68: A Role in Substrate/Inhibitor Specificity?

    Glycoside hydrolases of families 32 (GH32) and 68 (GH68) belong to clan GH-J, containing hydrolytic enzymes (sucrose/fructans as donor substrates) and fructosyltransferases (sucrose/fructans as donor and acceptor substrates). In GH32 members, some of the sugar substrates can also function as inhibitors, a regulatory aspect that further adds to the complexity of enzyme functionalities within this family. Although 3D structural information is becoming increasingly available within this clan and huge progress has been made on structure-function relationships, it is not clear why some sugars bind as inhibitors without being catalyzed. Conserved aspartate and glutamate residues are well known to act as nucleophile and acid/base catalysts within this clan. Based on the available 3D structures of enzymes and enzyme-ligand complexes as well as docking simulations, we calculated the pKa of the acid-base catalyst before and after substrate binding. The obtained results strongly suggest that most GH-J members show an acid-base catalyst that is not sufficiently protonated before ligand entrance, while the acid-base catalyst can be fully protonated when a substrate, but not an inhibitor, enters the catalytic pocket. This provides new mechanistic insight into the complex substrate and inhibitor specificities observed within the GH-J clan. Moreover, besides the effect of substrate entrance on its own, we strongly suggest that a highly conserved arginine residue (in the RDP motif) rather than the previously proposed Tyr motif (not conserved) provides the proton to increase the pKa of the acid-base catalyst.

    Annotation Error in Public Databases: Misannotation of Molecular Function in Enzyme Superfamilies

    Due to the rapid release of new data from genome sequencing projects, the majority of protein sequences in public databases have not been experimentally characterized; rather, sequences are annotated using computational analysis. The level of misannotation and the types of misannotation in large public databases are currently unknown and have not been analyzed in depth. We have investigated the misannotation levels for molecular function in four public protein sequence databases (UniProtKB/Swiss-Prot, GenBank NR, UniProtKB/TrEMBL, and KEGG) for a model set of 37 enzyme families for which extensive experimental information is available. The manually curated database Swiss-Prot shows the lowest annotation error levels (close to 0% for most families); the two other protein sequence databases (GenBank NR and TrEMBL) and the protein sequences in the KEGG pathways database exhibit similar and surprisingly high levels of misannotation that average 5%–63% across the six superfamilies studied. For 10 of the 37 families examined, the level of misannotation in one or more of these databases is >80%. Examination of the NR database over time shows that misannotation has increased from 1993 to 2005. The types of misannotation that were found fall into several categories, most associated with “overprediction” of molecular function. These results suggest that misannotation in enzyme superfamilies containing multiple families that catalyze different reactions is a larger problem than has been recognized. Strategies are suggested for addressing some of the systematic problems contributing to these high levels of misannotation.

    Comparative analyses imply that the enigmatic sigma factor 54 is a central controller of the bacterial exterior

    BACKGROUND: Sigma-54 is a central regulator in many pathogenic bacteria and has been linked to a multitude of cellular processes, such as nitrogen assimilation, and to important functional traits such as motility, virulence, and biofilm formation. Until now it has remained obscure whether these phenomena and the control by Sigma-54 share an underlying theme. RESULTS: We have uncovered the commonality by performing a range of comparative genome analyses. A) The presence of Sigma-54 and its associated activators was determined for all sequenced prokaryotes. We observed a phylum-dependent distribution that is suggestive of an evolutionary relationship between Sigma-54 and lipopolysaccharide and flagellar biosynthesis. B) All Sigma-54 activators were identified and annotated. The relation with phosphotransfer-mediated signaling (TCS and PTS) and the transport and assimilation of carboxylates and nitrogen-containing metabolites was substantiated. C) The function annotations represented within the genomic context of all genes encoding Sigma-54, its activators and its promoters were analyzed for intra-phylum representation and inter-phylum conservation. Promoters were localized using a straightforward scoring strategy that was formulated to identify similar motifs. We found clear, highly represented and conserved genetic associations with genes that concern the transport and biosynthesis of the metabolic intermediates of exopolysaccharides, flagella, lipids, lipopolysaccharides, lipoproteins and peptidoglycan. CONCLUSION: Our analyses directly implicate Sigma-54 as a central player in the control over the processes that involve the physical interaction of an organism with its environment, such as the colonization of a host (virulence) or the formation of biofilm.