3,699 research outputs found

    Horizontal gene transfer and nucleotide compositional anomaly in large DNA viruses

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>DNA viruses have a wide range of genome sizes (5 kb up to 1.2 Mb, compared to 0.16 Mb to 1.5 Mb for obligate parasitic bacteria) that do not correlate with their virulence or the taxonomic distribution of their hosts. The reasons for such large variation are unclear. According to the traditional view of viruses as gifted "gene pickpockets", large viral genome sizes could originate from numerous gene acquisitions from their hosts. We investigated this hypothesis by studying 67 large DNA viruses with genome sizes larger than 150 kb, including the recently characterized giant mimivirus. Given that horizontally transferred DNA often have anomalous nucleotide compositions differing from the rest of the genome, we conducted a detailed analysis of the inter- and intra-genome compositional properties of these viruses. We then interpreted their compositional heterogeneity in terms of possible causes, including strand asymmetry, gene function/expression, and horizontal transfer.</p> <p>Results</p> <p>We first show that the global nucleotide composition and nucleotide word usage of viral genomes are species-specific and distinct from those of their hosts. Next, we identified compositionally anomalous (cA) genes in viral genomes, using a method based on Bayesian inference. The proportion of cA genes is highly variable across viruses and does not exhibit a significant correlation with genome size. The vast majority of the cA genes were of unknown function, lacking homologs in the databases. For genes with known homologs, we found a substantial enrichment of cA genes in specific functional classes for some of the viruses. No significant association was found between cA genes and compositional strand asymmetry. A possible exogenous origin for a small fraction of the cA genes could be confirmed by phylogenetic reconstruction.</p> <p>Conclusion</p> <p>At odds with the traditional dogma, our results argue against frequent genetic transfers to large DNA viruses from their modern hosts. The large genome sizes of these viruses are not simply explained by an increased propensity to acquire foreign genes. This study also confirms that the anomalous nucleotide compositions of the cA genes is sometimes linked to particular biological functions or expression patterns, possibly leading to an overestimation of recent horizontal gene transfers.</p

    Evolutionary rates and gene dispensability associate with replication timing in the Archaeon Sulfolobus islandicus

    Get PDF
    In bacterial chromosomes, the position of a gene relative to the single origin of replication generally reflects its replication timing, how often it is expressed, and consequently, its rate of evolution. However, because some archaeal genomes contain multiple origins of replication, bias in gene dosage caused by delayed replication should be minimized and hence the substitution rate of genes should associate less with chromosome position. To test this hypothesis, six archaeal genomes from the genus Sulfolobus containing three origins of replication were selected, conserved orthologs were identified, and the evolutionary rates (dN and dS) of these orthologs were quantified. Ortholog families were grouped by their consensus position and designated by their proximity to one of the three origins (O1, O2, O3). Conserved orthologs were concentrated near the origins and most variation in genome content occurred distant from the origins. Linear regressions of both synonymous and nonsynonymous substitution rates on distance from replication origins were significantly positive, the rates being greatest in the region furthest from any of the origins and slowest among genes near the origins. Genes near O1 also evolved faster than those near O2 and O3, which suggest that this origin may fire later in the cell cycle. Increased evolutionary rates and gene dispensability are strongly associated with reduced gene expression caused in part by reduced gene dosage during the cell cycle. Therefore, in this genus of Archaea as well as in many Bacteria, evolutionary rates and variation in genome content associate with replication timing

    Chromosome Architecture and Evolution in Bacteria

    Get PDF
    Inferences of organismal molecular evolution have been dominated by comparisons of their constituent genes. Yet the evolutionary histories of genes within Bacterial genomes are not necessarily congruent. Here, Horizontal Gene Transfer (HGT) of sequences across species boundaries can confound these analyses. There does appear to be phylogenetic cohesion, where members of higher taxonomic groups share genotypic similarity despite gene transfer. Herein I examine the rules for governing HGT to determine the impact this process has played in the evolution of Bacteria and Archaea. Bacterial chromosomes are more than simple lists of genes. Genomes must maintain information beyond component genes to direct efficient replication and segregation of their chromosomes. I propose that this structure constrains the process of HGT so that transfer among certain pairs of donors and recipients is favored. I present methods to detect this structure and new theories of bacterial cell biology and evolution based on what this structure reveals. I present evidence that bacterial chromosomes are structured by repetitive sequences termed Architecture IMparting Sequences (AIMS). AIMS are found primarily on leading strands and increase in abundance towards the replication terminus. Bacteria with robustly-identified replication origins and termini all have AIMS, and related AIMS are conserved amongst families of bacteria. We propose that AIMS are under selection to provide DNA binding proteins with polarity information, facilitating identification of the location of the replication terminus. Although AIMS evolved to direct the biology of cell division and replication, the conservation of AIMS among related taxa leads to a secondary effect. Because AIMS are counterselected when in nonpermissive orientations, AIMS constrain both intragenomic and intergenomic rearrangements. Thus HGT frequency will depend on AIMS compatibility between different species. We predict that HGT is most common between bacterial genomes which are more closely related and will impede transfer between species which have dissimilar genome architecture. The additional level of selection reflected by AIMS has resulted in cohesive bacterial groups that reflect common gene pools as a result of biased rates of gene transfer

    Horizontal Gene Transfer Regulation in Bacteria as a “Spandrel” of DNA Repair Mechanisms

    Get PDF
    Horizontal gene transfer (HGT) is recognized as the major force for bacterial genome evolution. Yet, numerous questions remain about the transferred genes, their function, quantity and frequency. The extent to which genetic transformation by exogenous DNA has occurred over evolutionary time was initially addressed by an in silico approach using the complete genome sequence of the Ralstonia solanacearum GMI1000 strain. Methods based on phylogenetic reconstruction of prokaryote homologous genes families detected 151 genes (13.3%) of foreign origin in the R. solanacearum genome and tentatively identified their bacterial origin. These putative transfers were analyzed in comparison to experimental transformation tests involving 18 different genomic DNA positions in the genome as sites for homologous or homeologous recombination. Significant transformation frequency differences were observed among these positions tested regardless of the overall genomic divergence of the R. solanacearum strains tested as recipients. The genomic positions containing the putative exogenous DNA were not systematically transformed at the highest frequencies. The two genomic “hot spots”, which contain recA and mutS genes, exhibited transformation frequencies from 2 to more than 4 orders of magnitude higher than positions associated with other genes depending on the recipient strain. These results support the notion that the bacterial cell is equipped with active mechanisms to modulate acquisition of new DNA in different genomic positions. Bio-informatics study correlated recombination “hot-spots” to the presence of Chi-like signature sequences with which recombination might be preferentially initiated. The fundamental role of HGT is certainly not limited to the critical impact that the very rare foreign genes acquired mainly by chance can have on the bacterial adaptation potential. The frequency to which HGT with homologous and homeologous DNA happens in the environment might have led the bacteria to hijack DNA repair mechanisms in order to generate genetic diversity without losing too much genomic stability

    Translational Selection Is Ubiquitous in Prokaryotes

    Get PDF
    Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome—between 5% and 33%, depending on genome size—while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea

    The Vein Patterning 1 (VEP1) Gene Family Laterally Spread through an Ecological Network

    Get PDF
    Lateral gene transfer (LGT) is a major evolutionary mechanism in prokaryotes. Knowledge about LGT— particularly, multicellular— eukaryotes has only recently started to accumulate. A widespread assumption sees the gene as the unit of LGT, largely because little is yet known about how LGT chances are affected by structural/functional features at the subgenic level. Here we trace the evolutionary trajectory of VEin Patterning 1, a novel gene family known to be essential for plant development and defense. At the subgenic level VEP1 encodes a dinucleotide-binding Rossmann-fold domain, in common with members of the short-chain dehydrogenase/reductase (SDR) protein family. We found: i) VEP1 likely originated in an aerobic, mesophilic and chemoorganotrophic α-proteobacterium, and was laterally propagated through nets of ecological interactions, including multiple LGTs between phylogenetically distant green plant/fungi-associated bacteria, and five independent LGTs to eukaryotes. Of these latest five transfers, three are ancient LGTs, implicating an ancestral fungus, the last common ancestor of land plants and an ancestral trebouxiophyte green alga, and two are recent LGTs to modern embryophytes. ii) VEP1's rampant LGT behavior was enabled by the robustness and broad utility of the dinucleotide-binding Rossmann-fold, which provided a platform for the evolution of two unprecedented departures from the canonical SDR catalytic triad. iii) The fate of VEP1 in eukaryotes has been different in different lineages, being ubiquitous and highly conserved in land plants, whereas fungi underwent multiple losses. And iv) VEP1-harboring bacteria include non-phytopathogenic and phytopathogenic symbionts which are non-randomly distributed with respect to the type of harbored VEP1 gene. Our findings suggest that VEP1 may have been instrumental for the evolutionary transition of green plants to land, and point to a LGT-mediated ‘Trojan Horse’ mechanism for the evolution of bacterial pathogenesis against plants. VEP1 may serve as tool for revealing microbial interactions in plant/fungi-associated environments

    Phylogenomics of the Reproductive Parasite Wolbachia pipientis wMel: A Streamlined Genome Overrun by Mobile Genetic Elements

    Get PDF
    The complete sequence of the 1,267,782 bp genome of Wolbachia pipientis wMel, an obligate intracellular bacteria of Drosophila melanogaster, has been determined. Wolbachia, which are found in a variety of invertebrate species, are of great interest due to their diverse interactions with different hosts, which range from many forms of reproductive parasitism to mutualistic symbioses. Analysis of the wMel genome, in particular phylogenomic comparisons with other intracellular bacteria, has revealed many insights into the biology and evolution of wMel and Wolbachia in general. For example, the wMel genome is unique among sequenced obligate intracellular species in both being highly streamlined and containing very high levels of repetitive DNA and mobile DNA elements. This observation, coupled with multiple evolutionary reconstructions, suggests that natural selection is somewhat inefficient in wMel, most likely owing to the occurrence of repeated population bottlenecks. Genome analysis predicts many metabolic differences with the closely related Rickettsia species, including the presence of intact glycolysis and purine synthesis, which may compensate for an inability to obtain ATP directly from its host, as Rickettsia can. Other discoveries include the apparent inability of wMel to synthesize lipopolysaccharide and the presence of the most genes encoding proteins with ankyrin repeat domains of any prokaryotic genome yet sequenced. Despite the ability of wMel to infect the germline of its host, we find no evidence for either recent lateral gene transfer between wMel and D. melanogaster or older transfers between Wolbachia and any host. Evolutionary analysis further supports the hypothesis that mitochondria share a common ancestor with the α-Proteobacteria, but shows little support for the grouping of mitochondria with species in the order Rickettsiales. With the availability of the complete genomes of both species and excellent genetic tools for the host, the wMel–D. melanogaster symbiosis is now an ideal system for studying the biology and evolution of Wolbachia infections

    Validation of Bacterial Replication Termination Models Using Simulation of Genomic Mutations

    Get PDF
    In bacterial circular chromosomes and most plasmids, the replication is known to be terminated when either of the following occurs: the forks progressing in opposite directions meet at the distal end of the chromosome or the replication forks become trapped by Tus proteins bound to Ter sites. Most bacterial genomes have various polarities in their genomic structures. The most notable feature is polar genomic compositional asymmetry of the bases G and C in the leading and lagging strands, called GC skew. This asymmetry is caused by replication-associated mutation bias, and this “footprint" of the replication machinery suggests that, in contrast to the two known mechanisms, replication termination occurs near the chromosome dimer resolution site dif. To understand this difference between the known replication machinery and genomic compositional bias, we undertook a simulation study of genomic mutations, and we report here how different replication termination models contribute to the generation of replication-related genomic compositional asymmetry. Contrary to naive expectations, our results show that a single finite termination site at dif or at the GC skew shift point is not sufficient to reconstruct the genomic compositional bias as observed in published sequences. The results also show that the known replication mechanisms are sufficient to explain the position of the GC skew shift point

    Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum

    Get PDF
    Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera
    corecore