26 research outputs found

    Bioinformatic evidence for a widely distributed, ribosomally produced electron carrier precursor, its maturation proteins, and its nicotinoprotein redox partners

    Background. Enzymes in the radical SAM (rSAM) domain family serve in a wide variety of biological processes, including RNA modification, enzyme activation, bacteriocin core peptide maturation, and cofactor biosynthesis. Evolutionary pressures and relationships to other cellular constituents impose recognizable grammars on each class of rSAM-containing system, shaping patterns in results obtained through various comparative genomics analyses. Results. An uncharacterized gene cluster found in many Actinobacteria and sporadically in Firmicutes, Chloroflexi, Deltaproteobacteria, and one Archaeal plasmid contains a PqqE-like rSAM protein family that includes Rv0693 from Mycobacterium tuberculosis. Members occur clustered with a strikingly well-conserved small polypeptide we designate "mycofactocin," similar in size to bacteriocins and PqqA, precursor of pyrroloquinoline quinone (PQQ). Partial Phylogenetic Profiling (PPP) based on the distribution of these markers identifies the mycofactocin cluster, but also a second tier of high-scoring proteins. This tier, strikingly, is filled with up to thirty-one members per genome from three variant subfamilies that occur, one each, in three unrelated classes of nicotinoproteins. The pattern suggests these variant enzymes require not only NAD(P), but also the novel gene cluster. Further study was conducted using SIMBAL, a PPP-like tool, to search these nicotinoproteins for subsequences best correlated across multiple genomes to the presence of mycofactocin. For both the short chain dehydrogenase/reductase (SDR) and iron-containing dehydrogenase families, aligning SIMBAL's top-scoring sequences to homologous solved crystal structures shows signals centered over NAD(P)-binding sites rather than over substrate-binding or active site residues. Previous studies on some of these proteins have revealed a non-exchangeable NAD cofactor, such that enzymatic activity in vitro requires an artificial electron acceptor such as N,N-dimethyl-4-nitrosoaniline (NDMA) for the enzyme to cycle. Conclusions. Taken together, these findings suggest that the mycofactocin precursor is modified by the Rv0693 family rSAM protein and other enzymes in its cluster. It becomes an electron carrier molecule that serves in vivo as NDMA and other artificial electron acceptors do in vitro. Subclasses from three different nicotinoprotein families show "only-if" relationships to mycofactocin because they require its presence. This framework suggests a segregated redox pool in which mycofactocin mediates communication among enzymes with non-exchangeable cofactors.

    Detection of Biochemical Pathways by Probabilistic Matching of Phyletic Vectors

    A phyletic vector, also known as a phyletic (or phylogenetic) pattern, is a binary representation of the presences and absences of orthologous genes in different genomes. Joint occurrence of two or more genes in many genomes results in closely similar binary vectors representing these genes, and this similarity between gene vectors may be used as a measure of functional association between genes. Better understanding of quantitative properties of gene co-occurrences is needed for systematic studies of gene function and evolution. We used the probabilistic iterative algorithm Psi-square to find groups of similar phyletic vectors. An extended Psi-square algorithm, in which pseudocounts are implemented, shows better sensitivity in identifying proteins with known functional links than our earlier hierarchical clustering approach. At the same time, the specificity of inferring functional associations between genes in prokaryotic genomes is strongly dependent on the pathway: phyletic vectors of the genes involved in energy metabolism and in de novo biosynthesis of the essential precursors tend to be lumped together, whereas cellular modules involved in secretion, motility, assembly of cell surfaces, biosynthesis of some coenzymes, and utilization of secondary carbon sources tend to be identified with much greater specificity. It appears that the network of gene coinheritance in prokaryotes contains a giant connected component that encompasses most biosynthetic subsystems, along with a series of more independent modules involved in cell interaction with the environment
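
The idea of comparing phyletic vectors can be illustrated with a minimal sketch. It is not the Psi-square algorithm described in the abstract: it swaps in plain Jaccard similarity as a simple stand-in measure, and the gene names, genome columns, and helper functions (`jaccard`, `rank_pairs`) are hypothetical, chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: one binary vector per gene, one position per genome
# (1 = an ortholog is present in that genome, 0 = absent).
PHYLETIC_VECTORS = {
    "geneA": [1, 1, 0, 1, 0, 1],
    "geneB": [1, 1, 0, 1, 0, 0],
    "geneC": [0, 0, 1, 0, 1, 0],
}


def jaccard(u, v):
    """Jaccard similarity: genomes holding both genes / genomes holding either."""
    both = sum(1 for a, b in zip(u, v) if a and b)
    either = sum(1 for a, b in zip(u, v) if a or b)
    return both / either if either else 0.0


def rank_pairs(vectors):
    """Return (score, gene, gene) tuples, most similar pair first."""
    scored = [
        (jaccard(vectors[g], vectors[h]), g, h)
        for g, h in combinations(sorted(vectors), 2)
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    for score, g, h in rank_pairs(PHYLETIC_VECTORS):
        print(f"{g} {h} {score:.2f}")
```

In this toy data, geneA and geneB co-occur in three of the four genomes that carry either gene, so they rank first. A probabilistic method such as the one in the abstract would additionally account for how common each gene is across genomes.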

    The evolutionary signal in metagenome phyletic profiles predicts many gene functions

    Background. The function of many genes is still not known even in model organisms. An increasing availability of microbiome DNA sequencing data provides an opportunity to infer gene function in a systematic manner. Results. We evaluated if the evolutionary signal contained in metagenome phyletic profiles (MPP) is predictive of a broad array of gene functions. The MPPs are an encoding of environmental DNA sequencing data that consists of relative abundances of gene families across metagenomes. We find that such MPPs can accurately predict 826 Gene Ontology functional categories, while drawing on human gut microbiomes, ocean metagenomes, and DNA sequences from various other engineered and natural environments. Overall, in this task, the MPPs are highly accurate, and moreover they provide coverage for a set of Gene Ontology terms largely complementary to standard phylogenetic profiles, derived from fully sequenced genomes. We also find that metagenomes approximated from taxon relative abundance obtained via 16S rRNA gene sequencing may provide surprisingly useful predictive models. Crucially, the MPPs derived from different types of environments can infer distinct, non-overlapping sets of gene functions and therefore complement each other. Consistently, simulations on > 5000 metagenomes indicate that the amount of data is not in itself critical for maximizing predictive accuracy, while the diversity of sampled environments appears to be the critical factor for obtaining robust models. Conclusions. In past work, metagenomics has provided invaluable insight into ecology of various habitats, into diversity of microbial life and also into human health and disease mechanisms. We propose that environmental DNA sequencing additionally constitutes a useful tool to predict biological roles of genes, yielding inferences out of reach for existing comparative genomics approaches
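
As a rough illustration of the encoding described above (relative abundances of gene families across metagenomes), the following sketch builds a per-family profile from raw counts. The abstract does not state the exact normalization used, so the per-sample scaling here is an assumption, and the sample names, family names, counts, and helper functions are all hypothetical.

```python
# Hypothetical toy counts: reads assigned to each gene family per metagenome.
COUNTS = {
    "gut_1": {"famA": 30, "famB": 10, "famC": 60},
    "ocean_1": {"famA": 5, "famB": 45, "famC": 50},
    "soil_1": {"famA": 20, "famB": 20, "famC": 0},
}


def relative_abundances(counts):
    """Scale each metagenome's counts so they sum to 1 (assumed normalization)."""
    result = {}
    for sample, fam_counts in counts.items():
        total = sum(fam_counts.values())
        result[sample] = {
            fam: (n / total if total else 0.0) for fam, n in fam_counts.items()
        }
    return result


def profile(rel, family, samples):
    """Profile of one gene family: its relative abundance in each sample, in order."""
    return [rel[s].get(family, 0.0) for s in samples]


if __name__ == "__main__":
    samples = ["gut_1", "ocean_1", "soil_1"]
    rel = relative_abundances(COUNTS)
    for fam in ("famA", "famB", "famC"):
        print(fam, profile(rel, fam, samples))
```

Each resulting list is a real-valued analogue of a binary phylogenetic profile, with one entry per metagenome instead of one per sequenced genome.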

    TMEM107 recruits ciliopathy proteins to subdomains of the ciliary transition zone and causes Joubert syndrome

    The transition zone (TZ) ciliary subcompartment is thought to control cilium composition and signalling by facilitating a protein diffusion barrier at the ciliary base. TZ defects cause ciliopathies such as Meckel–Gruber syndrome (MKS), nephronophthisis (NPHP) and Joubert syndrome (JBTS). However, the molecular composition and mechanisms underpinning TZ organization and barrier regulation are poorly understood. To uncover candidate TZ genes, we employed bioinformatics (coexpression and co-evolution) and identified TMEM107 as a TZ protein mutated in oral–facial–digital syndrome and JBTS patients. Mechanistic studies in Caenorhabditis elegans showed that TMEM-107 controls ciliary composition and functions redundantly with NPHP-4 to regulate cilium integrity, TZ docking and assembly of membrane to microtubule Y-link connectors. Furthermore, nematode TMEM-107 occupies an intermediate layer of the TZ-localized MKS module by organizing recruitment of the ciliopathy proteins MKS-1, TMEM-231 (JBTS20) and JBTS-14 (TMEM237). Finally, MKS module membrane proteins are immobile and super-resolution microscopy in worms and mammalian cells reveals periodic localizations within the TZ. This work expands the MKS module of ciliopathy-causing TZ proteins associated with diffusion barrier formation and provides insight into TZ subdomain architecture

    Plasmodium falciparum Nucleosomes Exhibit Reduced Stability and Lost Sequence Dependent Nucleosome Positioning

    The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns such as phased di-nucleotide repeats, and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence-dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the Plasmodium histones, compared to mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the Plasmodium nucleosomes are overall less stable, with increased temperature-induced mobility, decreased salt stability of the histones H2A and H2B and considerably reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that Plasmodium histone octamers form the shortest known nucleosome repeat length (155 bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome