
    2b-RAD genotyping for population genomic studies of Chagas disease vectors: Rhodnius ecuadoriensis in Ecuador

    Background: Rhodnius ecuadoriensis is the main triatomine vector of Chagas disease (American trypanosomiasis) in Southern Ecuador and Northern Peru. Genomic approaches and next generation sequencing technologies have become powerful tools for investigating population diversity and structure, which is a key consideration for vector control. Here we assess the effectiveness of three different 2b restriction site-associated DNA (2b-RAD) genotyping strategies in R. ecuadoriensis to provide sufficient genomic resolution to tease apart microevolutionary processes and undertake some pilot population genomic analyses. Methodology/Principal findings: The 2b-RAD protocol was carried out in-house at a non-specialized laboratory using 20 R. ecuadoriensis adults collected from the central coast and southern Andean region of Ecuador, from June 2006 to July 2013. 2b-RAD sequencing was performed on an Illumina MiSeq instrument, and the data were analyzed with the STACKS de novo pipeline for loci assembly and Single Nucleotide Polymorphism (SNP) discovery. Preliminary population genomic analyses (global AMOVA and Bayesian clustering) were implemented. Our results showed that the 2b-RAD genotyping protocol is effective for R. ecuadoriensis and likely for other triatomine species. However, only BcgI and CspCI restriction enzymes provided a number of markers suitable for population genomic analysis at the read depth we generated. Our preliminary genomic analyses detected a signal of genetic structuring across the study area. Conclusions/Significance: Our findings suggest that 2b-RAD genotyping is both a cost-effective and methodologically simple approach for generating high resolution genomic data for Chagas disease vectors with the power to distinguish between different vector populations at epidemiologically relevant scales. As such, 2b-RAD represents a powerful tool in the hands of medical entomologists with limited access to specialized molecular biological equipment. 
Author summary: Understanding Chagas disease vector (triatomine) population dispersal is key for the design of control measures tailored for the epidemiological situation of a particular region. In Ecuador, Rhodnius ecuadoriensis is a cause of concern for Chagas disease transmission, since it is widely distributed from the central coast to southern Ecuador. Here, a genome-wide sequencing (2b-RAD) approach was performed in 20 specimens from four communities from Manabí (central coast) and Loja (southern) provinces of Ecuador, and the effectiveness of three type IIB restriction enzymes was assessed. The findings of this study show that this genotyping methodology is cost effective in R. ecuadoriensis and likely in other triatomine species. In addition, preliminary population genomic analysis results detected a signal of population structure among geographically distinct communities and genetic variability within communities. As such, 2b-RAD shows significant promise as a relatively low-tech solution for determination of vector population genomics, dynamics, and spread

    De novo sequencing of the Hypericum perforatum L. flower transcriptome to identify potential genes that are related to plant reproduction sensu lato

    Background: St. John's wort (Hypericum perforatum L.) is a medicinal plant that produces important metabolites with antidepressant and anticancer activities. Recently gained biological information has shown that this species is also an attractive model system for the study of a naturally occurring form of asexual reproduction called apomixis, which allows cloning plants through seeds. In aposporic gametogenesis, one or multiple somatic cells belonging to the ovule nucellus change their fate by dividing mitotically and developing functionally unreduced embryo sacs by mimicking sexual gametogenesis. Although the introduction of apomixis into agronomically important crops could have revolutionary implications for plant breeding, the genetic control of this mechanism of seed formation is still not well understood for most of the model species investigated so far. We used Roche 454 technology to sequence the entire H. perforatum flower transcriptome of whole flower buds and single flower verticils collected from obligately sexual and unrelated highly or facultatively apomictic genotypes, which enabled us to identify RNAs that are likely exclusive to flower organs (i.e., sepals, petals, stamens and carpels) or reproductive strategies (i.e., sexual vs. apomictic). Results: Here we sequenced and annotated the flower transcriptome of H. perforatum with particular reference to reproductive organs and processes. In particular, in our study we characterized approximately 37,000 transcripts found expressed in male and/or female reproductive organs, including tissues or cells of sexual and apomictic flower buds. Ontological annotation was applied to identify major biological processes and molecular functions involved in flower development and plant reproduction. 
Starting from this dataset, we were able to recover and annotate a large number of transcripts related to meiosis, gametophyte/gamete formation, and embryogenesis, as well as genes that are exclusively or preferentially expressed in sexual or apomictic libraries. Real-Time RT-qPCR assays on pistils and anthers collected at different developmental stages from accessions showing alternative modes of reproduction were used to identify potential genes that are related to plant reproduction sensu lato in H. perforatum. Conclusions: Our approach of sequencing flowers from two fully obligate sexual genotypes and two unrelated highly apomictic genotypes, in addition to different flower parts dissected from a facultatively apomictic accession, enabled us to analyze the complexity of the flower transcriptome according to its main reproductive organs as well as for alternative reproductive behaviors. Both annotation and expression data provided original results supporting the hypothesis that apomixis in H. perforatum relies upon spatial or temporal mis-expression of genes acting during female sexual reproduction. The present analyses aim to pave the way toward a better understanding of the molecular basis of flower development and plant reproduction, by identifying genes or RNAs that may differentiate or regulate the sexual and apomictic reproductive pathways in H. perforatum

    Single cell transcriptome analysis using next generation sequencing.

    The heterogeneity of tissues, especially in cancer research, is a central issue in transcriptome analysis. In recent years, research has primarily focused on the development of methods for single cell analysis. Single cell analysis aims at gaining (novel) insights into biological processes of healthy and diseased cells. Some of the challenges in transcriptome analysis concern low abundance of sample starting material, necessary sample amplification steps and subsequent analysis. In this study, two fundamentally different approaches to amplification were compared using next-generation sequencing analysis: I. exponential amplification using polymerase chain reaction (PCR) and II. linear amplification. For both approaches, protocols for single cell extraction, cell lysis, cDNA synthesis, cDNA amplification and preparation of next-generation sequencing libraries were developed. We could successfully show that transcriptome analysis of low numbers of cells is feasible with both exponential and linear amplification. Exponential amplification gave the highest amplification rates, up to 10^6. The reproducibility of results is a strength of the linear amplification method. The analysis of next generation sequencing data in single cell samples showed detectable expression in at least 16,000 genes. The variance between samples results in a need to work with a greater number of biological replicates. In summary, single cell transcriptome analysis with next generation sequencing is possible, but improvements leading to a higher yield of transcriptome reads are required. In the near future, by comparing single cancer cells with healthy ones for example, a basis for improved prognosis and diagnosis can be realised

    GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data

    Background: With its simple library preparation and robust approach to genome reduction, genotyping-by-sequencing (GBS) is a flexible and cost-effective strategy for SNP discovery and genotyping, provided an appropriate reference genome is available. For resource-limited curation, research, and breeding programs of underutilized plant genetic resources, however, even low-depth references may not be within reach, despite declining sequencing costs. Such programs would find value in an open-source bioinformatics pipeline that can maximize GBS data usage and perform high-density SNP genotyping in the absence of a reference. Results: The GBS SNP-Calling Reference Optional Pipeline (GBS-SNP-CROP) developed and presented here adopts a clustering strategy to build a population-tailored “Mock Reference” from the same GBS data used for downstream SNP calling and genotyping. Designed for libraries of paired-end (PE) reads, GBS-SNP-CROP maximizes data usage by eliminating unnecessary data culling due to imposed read-length uniformity requirements. Using 150 bp PE reads from a GBS library of 48 accessions of tetraploid kiwiberry (Actinidia arguta), GBS-SNP-CROP yielded on average three times as many SNPs as TASSEL-GBS analyses (32 and 64 bp tag lengths) and over 18 times as many as TASSEL-UNEAK, with fewer genotyping errors in all cases, as evidenced by comparing the genotypic characterizations of biological replicates. Using the published reference genome of a related diploid species (A. chinensis), the reference-based version of GBS-SNP-CROP behaved similarly to TASSEL-GBS in terms of the number of SNPs called but had an improved read depth distribution and fewer genotyping errors. Our results also indicate that the sets of SNPs detected by the different pipelines above are largely orthogonal to one another; thus GBS-SNP-CROP may be used to augment the results of alternative analyses, whether or not a reference is available. 
Conclusions: By achieving high-density SNP genotyping in populations for which no reference genome is available, GBS-SNP-CROP is worth consideration by curators, researchers, and breeders of under-researched plant genetic resources. In cases where a reference is available, especially if from a related species or when the target population is particularly diverse, GBS-SNP-CROP may complement other reference-based pipelines by extracting more information per sequencing dollar spent. The current version of GBS-SNP-CROP is available at https://github.com/halelab/GBS-SNP-CROP.gi

    Assessing the Gene Content of the Megagenome: Sugar Pine (Pinus lambertiana).

    Sugar pine (Pinus lambertiana Douglas) is within the subgenus Strobus with an estimated genome size of 31 Gbp. Transcriptomic resources are of particular interest in conifers due to the challenges presented in their megagenomes for gene identification. In this study, we present the first comprehensive survey of the P. lambertiana transcriptome through deep sequencing of a variety of tissue types to generate more than 2.5 billion short reads. Third generation, long reads generated through PacBio Iso-Seq have been included for the first time in conifers to combat the challenges associated with de novo transcriptome assembly. A technology comparison is provided here to contribute to the otherwise scarce comparisons of second and third generation transcriptome sequencing approaches in plant species. In addition, the transcriptome reference was essential for gene model identification and quality assessment in the parallel project responsible for sequencing and assembly of the entire genome. In this study, the transcriptomic data were also used to address questions surrounding lineage-specific Dicer-like proteins in conifers. These proteins play a role in the control of transposable element proliferation and the related genome expansion in conifers

    Terminal restriction fragment length polymorphism is an “old school” reliable technique for swift microbial community screening in anaerobic digestion

    The microbial community in anaerobic digestion has been analysed through microbial fingerprinting techniques, such as terminal restriction fragment length polymorphism (TRFLP), for decades. In the last decade, high-throughput 16S rRNA gene amplicon sequencing has replaced these techniques, but the time-consuming and complex nature of high-throughput techniques is a potential bottleneck for full-scale anaerobic digestion application, when monitoring community dynamics. Here, the bacterial and archaeal TRFLP profiles were compared with 16S rRNA gene amplicon profiles (Illumina platform) of 25 full-scale anaerobic digestion plants. The α-diversity analysis revealed a higher richness based on Illumina data, compared with the TRFLP data. This coincided with a clear difference in community organisation, Pareto distribution, and co-occurrence network statistics, i.e., betweenness centrality and normalised degree. The β-diversity analysis showed a similar clustering profile for the Illumina, bacterial TRFLP and archaeal TRFLP data, based on different distance measures and independent of phylogenetic identification, with pH and temperature as the two key operational parameters determining microbial community composition. The combined knowledge of temporal dynamics and projected clustering in the β-diversity profile, based on the TRFLP data, distinctly showed that TRFLP is a reliable technique for swift microbial community dynamics screening in full-scale anaerobic digestion plants