194 research outputs found

    Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences

    Get PDF
    BACKGROUND: Phylogenetic methods which do not rely on multiple sequence alignments are important tools in inferring trees directly from completely sequenced genomes. Here, we extend the recently described Genome BLAST Distance Phylogeny (GBDP) strategy to compute phylogenetic trees from all completely sequenced plastid genomes currently available and from a selection of mitochondrial genomes representing the major eukaryotic lineages. BLASTN, TBLASTX, or combinations of both are used to locate high-scoring segment pairs (HSPs) between two sequences from which pairwise similarities and distances are computed in different ways resulting in a total of 96 GBDP variants. The suitability of these distance formulae for phylogeny reconstruction is directly estimated by computing a recently described measure of "treelikeness", the so-called ÎŽ value, from the respective distance matrices. Additionally, we compare the trees inferred from these matrices using UPGMA, NJ, BIONJ, FastME, or STC, respectively, with the NCBI taxonomy tree of the taxa under study. RESULTS: Our results indicate that, at this taxonomic level, plastid genomes are much more valuable for inferring phylogenies than are mitochondrial genomes, and that distances based on breakpoints are of little use. Distances based on the proportion of "matched" HSP length to average genome length were best for tree estimation. Additionally we found that using TBLASTX instead of BLASTN and, particularly, combining TBLASTX and BLASTN leads to a small but significant increase in accuracy. Other factors do not significantly affect the phylogenetic outcome. The BIONJ algorithm results in phylogenies most in accordance with the current NCBI taxonomy, with NJ and FastME performing insignificantly worse, and STC performing as well if applied to high quality distance matrices. ÎŽ values are found to be a reliable predictor of phylogenetic accuracy. CONCLUSION: Using the most treelike distance matrices, as judged by their ÎŽ values, distance methods are able to recover all major plant lineages, and are more in accordance with Apicomplexa organelles being derived from "green" plastids than from plastids of the "red" type. GBDP-like methods can be used to reliably infer phylogenies from different kinds of genomic data. A framework is established to further develop and improve such methods. ÎŽ values are a topology-independent tool of general use for the development and assessment of distance methods for phylogenetic inference

    Early evolution without a tree of life

    Get PDF
    Life is a chemical reaction. Three major transitions in early evolution are considered without recourse to a tree of life. The origin of prokaryotes required a steady supply of energy and electrons, probably in the form of molecular hydrogen stemming from serpentinization. Microbial genome evolution is not a treelike process because of lateral gene transfer and the endosymbiotic origins of organelles. The lack of true intermediates in the prokaryote-to-eukaryote transition has a bioenergetic cause

    Fermentation innovation through complex hybridization of wild and domesticated yeasts

    Get PDF
    The most common fermented beverage, lager beer, is produced by interspecies hybrids of the brewing yeast Saccharomyces cerevisiae and its wild relative S. eubayanus. Lager-brewing yeasts are not the only example of hybrid vigour or heterosis in yeasts, but the full breadth of interspecies hybrids associated with human fermentations has received less attention. Here we present a comprehensive genomic analysis of 122 Saccharomyces hybrids and introgressed strains. These strains arose from hybridization events between two to four species. Hybrids with S. cerevisiae contributions originated from three lineages of domesticated S. cerevisiae, including the major wine-making lineage and two distinct brewing lineages. In contrast, the undomesticated parents of these interspecies hybrids were all from wild Holarctic or European lineages. Most hybrids have inherited a mitochondrial genome from a parent other than S. cerevisiae, which recent functional studies suggest could confer adaptation to colder temperatures. A subset of hybrids associated with crisp flavour profiles, including both lineages of lager-brewing yeasts, have inherited inactivated S. cerevisiae alleles of critical phenolic off-flavour genes and/or lost functional copies from the wild parent through multiple genetic mechanisms. These complex hybrids shed light on the convergent and divergent evolutionary trajectories of interspecies hybrids and their impact on innovation in lager brewing and other diverse fermentation industries.Fil: Langdon, Quinn K.. University of Wisconsin; Estados UnidosFil: Peris, David. University of Wisconsin; Estados Unidos. Consejo Superior de Investigaciones Científicas; EspañaFil: Baker, Emily Clare. University of Wisconsin; Estados UnidosFil: Opulente, Dana A.. University of Wisconsin; Estados UnidosFil: Nguyen, Huu-Vang. Université Paris-Saclay; Francia. Institut National de la Recherche Agronomique; FranciaFil: Bond, Ursula. Trinity College; Estados UnidosFil: Gonçalves, Paula. Universidade Nova de Lisboa; PortugalFil: Sampaio, José Paulo. Universidade Nova de Lisboa; PortugalFil: Libkind Frati, Diego. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Patagonia Norte. Instituto Andino Patagónico de Tecnologías Biológicas y Geoambientales. Universidad Nacional del Comahue. Instituto Andino Patagónico de Tecnologías Biológicas y Geoambientales; ArgentinaFil: Hittinger, Chris. University of Wisconsin; Estados Unido

    The complete plastid genome sequence of Welwitschia mirabilis: an unusually compact plastome with accelerated divergence rates

    Get PDF
    Background Welwitschia mirabilis is the only extant member of the family Welwitschiaceae, one of three lineages of gnetophytes, an enigmatic group of gymnosperms variously allied with flowering plants or conifers. Limited sequence data and rapid divergence rates have precluded consensus on the evolutionary placement of gnetophytes based on molecular characters. Here we report on the first complete gnetophyte chloroplast genome sequence, from Welwitschia mirabilis, as well as analyses on divergence rates of protein-coding genes, comparisons of gene content and order, and phylogenetic implications. Results The chloroplast genome of Welwitschia mirabilis [GenBank: EU342371] is comprised of 119,726 base pairs and exhibits large and small single copy regions and two copies of the large inverted repeat (IR). Only 101 unique gene species are encoded. The Welwitschia plastome is the most compact photosynthetic land plant plastome sequenced to date; 66% of the sequence codes for product. The genome also exhibits a slightly expanded IR, a minimum of 9 inversions that modify gene order, and 19 genes that are lost or present as pseudogenes. Phylogenetic analyses, including one representative of each extant seed plant lineage and based on 57 concatenated protein-coding sequences, place Welwitschia at the base of all seed plants (distance, maximum parsimony) or as the sister to Pinus (the only conifer representative) in a monophyletic gymnosperm clade (maximum likelihood, bayesian). Relative rate tests on these gene sequences show the Welwitschia sequences to be evolving at faster rates than other seed plants. For these genes individually, a comparison of average pairwise distances indicates that relative divergence in Welwitschia ranges from amounts about equal to other seed plants to amounts almost three times greater than the average for non-gnetophyte seed plants. Conclusion Although the basic organization of the Welwitschia plastome is typical, its compactness, gene content and high nucleotide divergence rates are atypical. The current lack of additional conifer plastome sequences precludes any discrimination between the gnetifer and gnepine hypotheses of seed plant relationships. However, both phylogenetic analyses and shared genome features identified here are consistent with either of the hypotheses that link gnetophytes with conifers, but are inconsistent with the anthophyte hypothesis

    A scenario of mitochondrial genome evolution in maize based on rearrangement events

    Get PDF
    Background: Despite their monophyletic origin, animal and plant mitochondrial genomes have been described as exhibiting different modes of evolution. Indeed, plant mitochondrial genomes feature a larger size, a lower mutation rate and more rearrangements than their animal counterparts. Gene order variation in animal mitochondrial genomes is often described as being due to translocation and inversion events, but tandem duplication followed by loss has also been proposed as an alternative process. In plant mitochondrial genomes, at the species level, gene shuffling and duplicate occurrence are such that no clear phylogeny has ever been identified, when considering genome structure variation. Results: In this study we analyzed the whole sequences of eight mitochondrial genomes from maize and teosintes in order to comprehend the events that led to their structural features, i.e. the order of genes, tRNAs, rRNAs, ORFs, pseudogenes and non-coding sequences shared by all mitogenomes and duplicate occurrences. We suggest a tandem duplication model similar to the one described in animals, except that some duplicates can remain. Thi

    A comparative study of nemertean complete mitochondrial genomes, including two new ones for Nectonemertes cf. mirabilis and Zygeupolia rubens, may elucidate the fundamental pattern for the phylum Nemertea

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The mitochondrial genome is important for studying genome evolution as well as reconstructing the phylogeny of organisms. Complete mitochondrial genome sequences have been reported for more than 2200 metazoans, mainly vertebrates and arthropods. To date, from a total of about 1275 described nemertean species, only three complete and two partial mitochondrial DNA sequences from nemerteans have been published. Here, we report the entire mitochondrial genomes for two more nemertean species: <it>Nectonemertes </it>cf. <it>mirabilis </it>and <it>Zygeupolia rubens</it>.</p> <p>Results</p> <p>The sizes of the entire mitochondrial genomes are 15365 bp for <it>N</it>. cf. <it>mirabilis </it>and 15513 bp for <it>Z. rubens</it>. Each circular genome contains 37 genes and an AT-rich non-coding region, and overall nucleotide composition is AT-rich. In both species, there is significant strand asymmetry in the distribution of nucleotides, with the coding strand being richer in T than A and in G than C. The AT-rich non-coding regions of the two genomes have some repeat sequences and stem-loop structures, both of which may be associated with the initiation of replication or transcription. The 22 tRNAs show variable substitution patterns in nemerteans, with higher sequence conservation in genes located on the H strand. Gene arrangement of <it>N</it>. cf. <it>mirabilis </it>is identical to that of <it>Paranemertes </it>cf. <it>peregrina</it>, both of which are Hoplonemertea, while that of <it>Z. rubens </it>is the same as in <it>Lineus viridis</it>, both of which are Heteronemertea. Comparison of the gene arrangements and phylogenomic analysis based on concatenated nucleotide sequences of the 12 mitochondrial protein-coding genes revealed that species with closer relationships share more identical gene blocks.</p> <p>Conclusion</p> <p>The two new mitochondrial genomes share many features, including gene contents, with other known nemertean mitochondrial genomes. The tRNA families display a composite substitution pathway. Gene order comparison to the proposed ground pattern of Bilateria and some lophotrochozoans suggests that the nemertean ancestral mitochondrial gene order most closely resembles the heteronemertean type. Phylogenetic analysis proposes a sister-group relationship between Hetero- and Hoplonemertea, which supports one of two recent alternative hypotheses of nemertean phylogeny.</p

    Finding an optimal inversion median: Experimental results

    Get PDF
    We derive a branch-and-bound algorithm to find an optimal inversion median of three signed permutations. The algorithm prunes to manageable size an extremely large search tree using simple geometric properties of the problem and a newly available linear-time routine for inversion distance. Our experiments on simulated data sets indicate that the algorithm finds optimal medians in reasonable time for genomes of medium size when distances are not too large, as commonly occurs in phylogeny reconstruction. In addition, we have compared inversion and breakpoint medians, and found that inversion medians generally score significantly better and tend to be far more unique, which should make them valuable in median-based tree-building algorithms

    Reconstruction of ancestral chromosome architecture and gene repertoire reveals principles of genome evolution in a model yeast genus

    No full text
    International audienceReconstructing genome history is complex but necessary to reveal quantitative principles governing genome evolution. Such reconstruction requires recapitulating into a single evolutionary framework the evolution of genome architecture and gene repertoire. Here, we reconstructed the genome history of the genus Lachancea that appeared to cover a continuous evolutionary range from closely related to more diverged yeast species. Our approach integrated the generation of a high-quality genome data set; the development of AnChro, a new algorithm for reconstructing ancestral genome architecture; and a comprehensive analysis of gene repertoire evolution. We found that the ancestral genome of the genus Lachancea contained eight chromosomes and about 5173 protein-coding genes. Moreover, we characterized 24 horizontal gene transfers and 159 putative gene creation events that punctuated species diversification. We retraced all chromosomal rearrangements, including gene losses, gene duplications, chromosomal inversions and translocations at single gene resolution. Gene duplications outnumbered losses and balanced rearrangements with 1503, 929, and 423 events, respectively. Gene content variations between extant species are mainly driven by differential gene losses, while gene duplications remained globally constant in all lineages. Remarkably, we discovered that balanced chromosomal rearrangements could be responsible for up to 14% of all gene losses by disrupting genes at their breakpoints. Finally, we found that nonsynonymous substitutions reached fixation at a coordinated pace with chromosomal inversions, translocations, and duplications, but not deletions. Overall, we provide a granular view of genome evolution within an entire eukaryotic genus, linking gene content, chromosome rearrangements , and protein divergence into a single evolutionary framework
    • 

    corecore