90 research outputs found

    Tracing the Maternal Line in Glacial-Interglacial Migrations of \u3ci\u3ePopulus tremuloides\u3c/i\u3e: Finding Trees for Future Sustainable Forests by Searching in the Past

    Get PDF
    Maintaining and planting sustainable forests is fundamental in perpetuating the essential functions of these ecosystems. A central aspect of managing forests for future resilience is the consideration of past migration and evolution of trees using genetic and genomic data to ensure that functionally appropriate diversity is conserved and utilized. In our study, we generated and compared genetic and genomic data from the plastome to better understand phylogeography and molecular evolution in the tree species Populus tremuloides (aspen). With these analyses, we found evidence of divergence and migration between northern and southern sites. Additionally, evidence of deep incomplete plastome sorting across the Salicaceae was found when examining insertion–deletion (indel) sites associated with DNA repair. By examining these indels in plastomic genes with introns across Salicaceae, we found a strong correlation between the abundance of DNA repair with genomic position and transcript abundance. From our findings, we conclude that previously ignored plastomic data are essential in understanding phylogeography and the evolution of key metabolic processes for improved aspen forest planning. Given the propensity of aspen forests to host high levels of biodiversity, rapidly sequester carbon, absorb excess nitrogen, and efficiently regulate snowmelt, improvements to planning and conservation will be highly impactful

    Chloroplast Genomes in Populus (Salicaceae): Comparisons From an Intensively Sampled Genus Reveal Dynamic Patterns of Evolution

    Get PDF
    The chloroplast is one of two organelles containing a separate genome that codes for essential and distinct cellular functions such as photosynthesis. Given the importance of chloroplasts in plant metabolism, the genomic architecture and gene content have been strongly conserved through long periods of time and as such are useful molecular tools for evolutionary inferences. At present, complete chloroplast genomes from over 4000 species have been deposited into publicly accessible databases. Despite the large number of complete chloroplast genomes, comprehensive analyses regarding genome architecture and gene content have not been conducted for many lineages with complete species sampling. In this study, we employed the genus Populus to assess how more comprehensively sampled chloroplast genome analyses can be used in understanding chloroplast evolution in a broadly studied lineage of angiosperms. We conducted comparative analyses across Populus in order to elucidate variation in key genome features such as genome size, gene number, gene content, repeat type and number, SSR (Simple Sequence Repeat) abundance, and boundary positioning between the four main units of the genome. We found that some genome annotations were variable across the genus owing in part from errors in assembly or data checking and from this provided corrected annotations. We also employed complete chloroplast genomes for phylogenetic analyses including the dating of divergence times throughout the genus. Lastly, we utilized re-sequencing data to describe the variations of pan-chloroplast genomes at the population level for P. euphratica. The analyses used in this paper provide a blueprint for the types of analyses that can be conducted with publicly available chloroplast genomes as well as methods for building upon existing datasets to improve evolutionary inference

    Comparative analysis of organellar genomes between diploid and tetraploid Chrysanthemum indicum with its relatives

    Get PDF
    Chrysanthemum indicum, a species native to Eastern Asia is well known as one of the progenitor species of the cultivated Chrysanthemum which is grown for its ornamental and medicinal value. Previous genomic studies on Chrysanthemum have largely ignored the dynamics of plastid genome (plastome) and mitochondria genome (mitogenome) evolution when analyzing this plant lineage. In this study, we sequenced and assembled the plastomes and mitogenomes of diploid and tetraploid C. indicum as well as the morphologically divergent variety C. indicum var. aromaticum. We used published data from 27 species with both plastome and mitogenome complete sequences to explore differences in sequence evolution between the organellar genomes. The size and structure of organellar genome between diploid and tetraploid C. indicum were generally similar but the tetraploid C. indicum and C. indicum var. aromaticum were found to contain unique sequences in the mitogenomes which also contained previously undescribed open reading frames (ORFs). Across Chrysanthemum mitogenome structure varied greatly but sequences transferred from plastomes in to the mitogenomes were conserved. Finally, differences observed between mitogenome and plastome gene trees may be the result of the difference in the rate of sequence evolution between genes in these two genomes. In total the findings presented here greatly expand the resources for studying Chrysanthemum organellar genome evolution with possible applications to conservation, breeding, and gene banking in the future

    The genomic and bulked segregant analysis of \u3ci\u3eCurcuma alismatifolia\u3c/i\u3e revealed its diverse bract pigmentation

    Get PDF
    Compared with most flowers where the showy part comprises specialized leaves (petals) directly subtending the reproductive structures, most Zingiberaceae species produce showy ‘‘flowers’’ through modifications of leaves (bracts) subtending the true flowers throughout an inflorescence. Curcuma alismatifolia, belonging to the Zingiberaceae family, a plant species originating from Southeast Asia, has become increasingly popular in the flower market worldwide because of its varied and esthetically pleasing bracts produced in different cultivars. Here, we present the chromosome-scale genome assembly of C. alismatifolia ‘‘Chiang Mai Pink’’ and explore the underlying mechanisms of bract pigmentation. Comparative genomic analysis revealed C. alismatifolia contains a residual signal of wholegenome duplication. Duplicated genes, including pigment-related genes, exhibit functional and structural differentiation resulting in diverse bract colors among C. alismatifolia cultivars. In addition, we identified the key genes that produce different colored bracts in C. alismatifolia, such as F3\u275’H, DFR, ANS and several transcription factors for anthocyanin synthesis, as well as chlH and CAO in the chlorophyll synthesis pathway by conducting transcriptomic analysis, bulked segregant analysis using both DNA and RNA data, and population genomic analysis. This work provides data for understanding the mechanism of bract pigmentation and will accelerate breeding in developing novel cultivars with richly colored bracts in C. alismatifolia and related species. It is also important to understand the variation in the evolution of the Zingiberaceae family

    Are Differences in Genomic Data Sets due to True Biological Variants or Errors in Genome Assembly: An Example from Two Chloroplast Genomes

    No full text
    DNA sequencing has been revolutionized by the development of high-throughput sequencing technologies. Plummeting costs and the massive throughput capacities of second and third generation sequencing platforms have transformed many fields of biological research. Concurrently, new data processing pipelines made rapid de novo genome assemblies possible. However, high quality data are critically important for all investigations in the genomic era. We used chloroplast genomes of one Oryza species (O. australiensis) to compare differences in sequence quality: one genome (GU592209) was obtained through Illumina sequencing and reference-guided assembly and the other genome (KJ830774) was obtained via target enrichment libraries and shotgun sequencing. Based on the whole genome alignment, GU592209 was more similar to the reference genome (O. sativa: AY522330) with 99.2% sequence identity (SI value) compared with the 98.8% SI values in the KJ830774 genome; whereas the opposite result was obtained when the SI values in coding and noncoding regions of GU592209 and KJ830774 were compared. Additionally, the junctions of two single copies and repeat copies in the chloroplast genome exhibited differences. Phylogenetic analyses were conducted using these sequences, and the different data sets yielded dissimilar topologies: phylogenetic replacements of the two individuals were remarkably different based on whole genome sequencing or SNP data and insertions and deletions (indels) data. Thus, we concluded that the genomic composition of GU592209 was heterogeneous in coding and non-coding regions. These findings should impel biologists to carefully consider the quality of sequencing and assembly when working with next-generation data

    Complete chloroplast genome of carnation (Caryophyllaceae: Dianthus caryophyllus L.)

    No full text
    The complete carnation (Dianthus caryophyllus L.) chloroplast (cp) genome was determined and analyzed in this study. The circular cp genome had a total length of 149,595 bp, made up of a pair of 24,808 bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,883 bp and a small single-copy region (SSC) of 17,096 bp. The overall GC content of the genome is 36.32%. The annotated genome contained 112 unique genes, including 78 protein-coding genes, four ribosomal RNA genes, and 30 tRNA genes. Among these, 16 are duplicated in the inverted repeat regions, and 17 genes contain introns. With the new complete cp genome of carnation, we were also able to correct improperly assembled cp genomes from published data that had indicated a missing one rpl2 copy in the IR regions

    The Complete Chloroplast Genome of Catha edulis: A Comparative Analysis of Genome Features with Related Species

    No full text
    Qat (Catha edulis, Celastraceae) is a woody evergreen species with great economic and cultural importance. It is cultivated for its stimulant alkaloids cathine and cathinone in East Africa and southwest Arabia. However, genome information, especially DNA sequence resources, for C. edulis are limited, hindering studies regarding interspecific and intraspecific relationships. Herein, the complete chloroplast (cp) genome of Catha edulis is reported. This genome is 157,960 bp in length with 37% GC content and is structurally arranged into two 26,577 bp inverted repeats and two single-copy areas. The size of the small single-copy and the large single-copy regions were 18,491 bp and 86,315 bp, respectively. The C. edulis cp genome consists of 129 coding genes including 37 transfer RNA (tRNA) genes, 8 ribosomal RNA (rRNA) genes, and 84 protein coding genes. For those genes, 112 are single copy genes and 17 genes are duplicated in two inverted regions with seven tRNAs, four rRNAs, and six protein coding genes. The phylogenetic relationships resolved from the cp genome of qat and 32 other species confirms the monophyly of Celastraceae. The cp genomes of C. edulis, Euonymus japonicus and seven Celastraceae species lack the rps16 intron, which indicates an intron loss took place among an ancestor of this family. The cp genome of C. edulis provides a highly valuable genetic resource for further phylogenomic research, barcoding and cp transformation in Celastraceae

    Evaluation of Intracellular Gene Transfers from Plastome to Nuclear Genome across Progressively Improved Assemblies for Arabidopsis thaliana and Oryza sativa

    No full text
    DNA originating from organellar genomes are regularly discovered in nuclear sequences during genome assembly. Nevertheless, such insertions are sometimes omitted during the process of nuclear genome assembly because the inserted DNA is assigned to organellar genomes, leading to a systematic underestimation of their frequency. With the rapid development of high-throughput sequencing technology, more inserted fragments from organelle genomes can now be detected. Therefore, it is necessary to be aware of the insertion events from organellar genomes during nuclear genome assembly to properly attribute the impact and rate of such insertions in the evolution of nuclear genomes. Here, we investigated the impact of intracellular gene transfer (IGT) from the plastome to the nuclear genome using genome assemblies that were refined through time with technological improvements from two model species, Arabidopsis thaliana and Oryza sativa. We found that IGT from the plastome to the nuclear genome is a dynamic and ongoing process in both A. thaliana and O. sativa, and mostly occurred recently, as the majority of transferred sequences showed over 95% sequence similarity with plastome sequences of origin. Differences in the plastome-to-nuclear genome IGT between A. thaliana and O. sativa varied among the different assembly versions and were associated with the quality of the nuclear genome assembly. IGTs from the plastome to nuclear genome occurred more frequently in intergenic regions, which were often associated with transposable elements (TEs). This study provides new insights into intracellular genome evolution and nuclear genome assembly by characterizing and comparing IGT from the plastome into the nuclear genome for two model plant species

    Phylogenetic trees were constructed for nine species from the rice tribe using different methods, and two Bayesian trees are shown for the whole genome sequence and the insertion-deletion data.

    No full text
    <p>A. The whole genome sequence data were used with four different methods, Bayesian inference (BI), maximum parsimony (MP), maximum likelihood (ML) and neighbor-joining (NJ). Numbers above the branches are the posterior probabilities for BI and bootstrap values of MP, ML and NJ, respectively. B. The coding data from insertions and deletions (indels) were used with three different methods, Bayesian inference (BI) and maximum parsimony (MP), and two neighbor-joining (NJ) methods, for two different sets of coded data. Numbers above the branches are the posterior probabilities for BI and bootstrap values of MP and NJ. Branch length is proportional to the number of substitutions, as indicated by the scale bar. Stars represent the different positions for <i>O. australiensis</i> (GU592340) in the two trees.</p

    Characterization of the whole chloroplast genome of Chikusichloa mutica and its comparison with other rice tribe (Oryzeae) species

    No full text
    Chloroplast genomes are a significant genomic resource in plant species and have been used in many research areas. The complete genomic information from wild crop species could supply a valuable genetic reservoir for breeding. Chikusichloa mutica is one of the most important wild distant relatives of cultivated rice. In this study, we sequenced and characterized its complete chloroplast (cp) genome and compared it with other species in the same tribe. The whole cp genome sequence is 136,603 bp in size and exhibits a typical quadripartite structure with large and small single-copy regions (LSC, 82,327 bp; SSC, 12,598 bp) separated by a pair of 20,839-bp inverted repeats (IRA, B). A total of 110 unique genes are annotated, including 76 protein-coding genes, 4 ribosomal RNA genes and 30 tRNA genes. The genome structure, gene order, GC content, and other features are similar to those of other angiosperm cp genomes. When comparing the cp genomes between Oryzinae and Zizaniinae subtribes, the main differences were found between the junction regions and distribution of simple sequence repeats (SSRs). In comparing the two Chikusichloa species, the genomes were only 40 bp different in length and 108 polymorphic sites, including 83 single nucleotide substitutions (SNPs) and 25 insertion-deletions (Indels), were found between the whole cp genomes. The complete cp genome of C. mutica will be an important genetic tool for future breeding programs and understanding the evolution of wild rice relatives
    • …
    corecore