217 research outputs found

    Genome-Wide SNP discovery and genomic characterization in avocado (Persea americana Mill.)

    Modern crop breeding is based on the use of genetically and phenotypically diverse plant material and, consequently, a proper understanding of population structure and genetic diversity is essential for the effective development of breeding programs. An example is avocado, a woody perennial fruit crop native to Mesoamerica with an increasing popularity worldwide. Despite its commercial success, there are important gaps in the molecular tools available to support on-going avocado breeding programs. In order to fill this gap, in this study, an avocado ‘Hass’ draft assembly was developed and used as reference to study 71 avocado accessions which represent the three traditionally recognized avocado horticultural races or subspecies (Mexican, Guatemalan and West Indian). An average of 5.72 M reads per individual and a total of 7,108 single nucleotide polymorphism (SNP) markers were produced for the 71 accessions analyzed. These molecular markers were used in a study of genetic diversity and population structure. The results broadly separate the accessions studied according to their botanical race in four main groups: Mexican, Guatemalan, West Indian and an additional group of Guatemalan × Mexican hybrids. The high number of SNP markers developed in this study will be a useful genomic resource for the avocado community

    Mining transcriptomic data to study the origins and evolution of a plant allopolyploid complex

    Allopolyploidy combines two progenitor genomes in the same nucleus. It is a common speciation process, especially in plants. Deciphering the origins of polyploid species is a complex problem due to, among other things, extinct progenitors, multiple origins, gene flow between different polyploid populations, and loss of parental contributions through gene or chromosome loss. Among the perennial species of Glycine, the plant genus that includes the cultivated soybean (G. max), are eight allopolyploid species, three of which are studied here. Previous crossing studies and molecular systematic results from two nuclear gene sequences led to hypotheses of origin for these species from among extant diploid species. We use several phylogenetic and population genomics approaches to clarify the origins of the genomes of three of these allopolyploid species using single nucleotide polymorphism data and a guided transcriptome assembly. The results support the hypothesis that all three polyploid species are fixed hybrids combining the genomes of the two putative parents hypothesized on the basis of previous work. Based on mapping to the soybean reference genome, there appear to be no large regions for which one homoeologous contribution is missing. Phylogenetic analyses of 27 selected transcripts using a coalescent approach also are consistent with multiple origins for these allopolyploid species, and suggest that origins occurred within the last several hundred thousand years

    Expression‐level support for gene dosage sensitivity in three Glycine subgenus Glycine polyploids and their diploid progenitors

    Retention or loss of paralogs following duplication correlates strongly with the function of the gene and whether the gene was duplicated by whole-genome duplication (WGD) or by small-scale duplication. Selection on relative gene dosage (to maintain proper stoichiometry among interacting proteins) has been invoked to explain these patterns of duplicate gene retention and loss. In order for gene dosage to be visible to natural selection, there must necessarily be a correlation between gene copy number and gene expression level (transcript abundance), but this has rarely been examined. We used RNA-Seq data from seven Glycine subgenus Glycine species (three recently formed allotetraploids and their four diploid progenitors) to determine if expression patterns and gene dosage responses at the level of transcription are consistent with selection on relative gene dosage. As predicted, metabolic pathways and gene ontologies that are putatively dosage-sensitive based on duplication history exhibited reduced expression variance across species, and more coordinated expression responses to recent WGD, relative to putatively dosage-insensitive networks. We conclude that selection on relative dosage has played an important role in shaping gene networks in Glycine

    Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data

    Background: The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost-effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present. Results: We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae). Conclusion: Organelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA

    TobEA: an atlas of tobacco gene expression from seed to senescence.

    RIGHTS: This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may: copy, distribute, and display the work; make derivative works; or make commercial use of the work, under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are. BACKGROUND: Transcriptomics has resulted in the development of large data sets and tools for the progression of functional genomics and systems biology in many model organisms. Currently there is no commercially available microarray to allow such expression studies in Nicotiana tabacum (tobacco). RESULTS: A custom designed Affymetrix tobacco expression microarray was generated from a set of over 40k unigenes and used to measure gene expression in 19 different tobacco samples to produce the Tobacco Expression Atlas (TobEA). TobEA provides a snapshot of the transcriptional activity for thousands of tobacco genes in different tissues throughout the lifecycle of the plant and enables the identification of the biological processes occurring in these different tissues. 772 of 2513 transcription factors previously identified in tobacco were mapped to the array, with 87% of them being expressed in at least one tissue in the atlas. Putative transcriptional networks were identified based on the co-expression of these transcription factors. Several interactions in a floral identity transcription factor network were consistent with previous results from other plant species. To broaden access and maximise the benefit of TobEA, a set of tools was developed to provide researchers with expression information on their genes of interest via the Solanaceae Genomics Network (SGN) web site. The array has also been made available for public use via the Nottingham Arabidopsis Stock Centre microarray service. CONCLUSIONS: The generation of a tobacco expression microarray is an important development for research in this model plant. The data provided by TobEA represents a valuable resource for plant functional genomics and systems biology research and can be used to identify gene targets for both fundamental and applied scientific applications in tobacco

    Demethylation of oligogalacturonides by FaPE1 in the fruits of the wild strawberry Fragaria vesca triggers metabolic and transcriptional changes associated with defence and development of the fruit

    Ectopic expression of the strawberry (Fragaria × ananassa) gene FaPE1 encoding pectin methyl esterase produced in the wild species Fragaria vesca partially demethylated oligogalacturonides (OGAs), which conferred partial resistance of ripe fruits to the fungus Botrytis cinerea. Analyses of metabolic and transcriptional changes in the receptacle of the transgenic fruits revealed channelling of metabolites to aspartate and aromatic amino acids as well as phenolics, flavanones, and sesquiterpenoids, which was in parallel with the increased expression of some genes related to plant defence. The results illustrate the changes associated with resistance to B. cinerea in the transgenic F. vesca. These changes were accompanied by a significant decrease in the auxin content of the receptacle of the ripe fruits of transgenic F. vesca, and enhanced expression of some auxin-repressed genes. The role of these OGAs in fruit development was revealed by the larger size of the ripe fruits in transgenic F. vesca. When taken together, these results show that in cultivated F. ananassa FaPE1 participates in the de-esterification of pectins and the generation of partially demethylated OGAs, which might reinforce the plant defence system and play an active role in fruit development

    From manual curation to visualization of gene families and networks across Solanaceae plant species

    High-quality manual annotation methods and practices need to be scaled to the increased rate of genomic data production. Curation based on gene families and gene networks is one approach that can significantly increase both curation efficiency and quality. The Sol Genomics Network (SGN; http://solgenomics.net) is a comparative genomics platform, with genetic, genomic and phenotypic information of the Solanaceae family and its closely related species that incorporates a community-based gene and phenotype curation system. In this article, we describe a manual curation system for gene families aimed at facilitating curation, querying and visualization of gene interaction patterns underlying complex biological processes, including an interface for efficiently capturing information from experiments with large data sets reported in the literature. Well-annotated multigene families are useful for further exploration of genome organization and gene evolution across species. As an example, we illustrate the system with the multigene transcription factor families, WRKY and Small Auxin Up-regulated RNA (SAUR), which both play important roles in responding to abiotic stresses in plants

    Natural variation in stress response gene activity in the allopolyploid Arabidopsis suecica

    Background: Allopolyploids contain genomes composed of more than two complete sets of chromosomes that originate from at least two species. Allopolyploidy has been suggested as an important evolutionary mechanism that can lead to instant speciation. Arabidopsis suecica is a relatively recent allopolyploid species, suggesting that its natural accessions might be genetically very similar to each other. Nonetheless, subtle phenotypic differences have been described between different geographic accessions of A. suecica grown in a common garden. Results: To determine the degree of genomic similarity between different populations of A. suecica, we obtained transcriptomic sequence, quantified SNP variation within the gene space, and analyzed gene expression levels genome-wide from leaf material grown in controlled lab conditions. Despite their origin from the same progenitor species, the two accessions of A. suecica used in our study show genomic and transcriptomic variation. We report significant gene expression differences between the accessions, mostly in genes with stress-related functions. Among the differentially expressed genes, there are a surprising number of homoeologs coordinately regulated between sister accessions. Conclusions: Many of these homoeologous genes and other differentially expressed genes affect transpiration and stomatal regulation, suggesting that they might be involved in the establishment of the phenotypic differences between the two accessions

    SolCyc: a database hub at the Sol Genomics Network (SGN) for the manual curation of metabolic networks in Solanum and Nicotiana specific databases

    SolCyc is the entry portal to pathway/genome databases (PGDBs) for major species of the Solanaceae family hosted at the Sol Genomics Network. Currently, SolCyc comprises six organism-specific PGDBs for tomato, potato, pepper, petunia, tobacco and one Rubiaceae, coffee. The metabolic networks of those PGDBs have been computationally predicted by the PathoLogic component of the Pathway Tools software using the manually curated multi-domain database MetaCyc (http://www.metacyc.org/) as reference. SolCyc has been recently extended by taxon-specific databases, i.e. the family-specific SolanaCyc database, containing only curated data pertinent to species of the nightshade family, and NicotianaCyc, a genus-specific database that stores all relevant metabolic data of the Nicotiana genus. Through manual curation of the published literature, new metabolic pathways have been created in those databases, which are complemented by the continuously updated, relevant species-specific pathways from MetaCyc. At present, SolanaCyc comprises 199 pathways and 29 superpathways and NicotianaCyc accounts for 72 pathways and 13 superpathways. Curator-maintained, taxon-specific databases such as SolanaCyc and NicotianaCyc are characterized by an enrichment of data specific to these taxa and are free of falsely predicted pathways. Both databases have been used to update recently created Nicotiana-specific databases for Nicotiana tabacum, Nicotiana benthamiana, Nicotiana sylvestris and Nicotiana tomentosiformis by propagating verifiable data into those PGDBs. In addition, in-depth curation of the pathways in N. tabacum has been carried out, which resulted in the elimination of 156 pathways from the 569 pathways predicted by Pathway Tools. Together, in-depth curation of the predicted pathway network and the supplementation with curated data from taxon-specific databases have substantially improved the curation status of the species-specific N. tabacum PGDB. The implementation of this strategy will significantly advance the curation status of all organism-specific databases in SolCyc, resulting in the improvement of database accuracy, data analysis and visualization of biochemical networks in those species

    Complete Plastome Sequences from Glycine syndetika, and Six Additional Perennial Wild Relatives of Soybean

    Organelle sequences have a long history of utility in phylogenetic analyses. Chloroplast sequences when combined with nuclear data can help resolve relationships among flowering plant genera, and within genera incongruence can point to reticulate evolution. Plastome sequences are becoming plentiful because they are increasingly easier to obtain. Complete plastome sequences allow us to detect rare rearrangements and test the tempo of sequence evolution. Chloroplast sequences are generally considered a nuisance to be kept to a minimum in bacterial artificial chromosome libraries. Here, we sequenced two bacterial artificial chromosomes per species to generate complete plastome sequences from seven species. The plastome sequences from Glycine syndetika and six other perennial Glycine species are similar in arrangement and gene content to the previously published soybean plastome. Repetitive sequences were detected in high frequencies as in soybean, but further analysis showed that repeat sequence numbers are inflated. Previous chloroplast-based phylogenetic trees for perennial Glycine were incongruent with nuclear gene-based phylogenetic trees. We tested whether the hypothesis of introgression was supported by the complete plastomes. Alignment of complete plastome sequences and Bayesian analysis allowed us to date putative hybridization events supporting the hypothesis of introgression and chloroplast “capture.”