181,080 research outputs found
Capturing the ‘ome’ : the expanding molecular toolbox for RNA and DNA library construction
All sequencing experiments and most functional genomics screens rely on the generation of libraries to comprehensively capture pools of targeted sequences. In the past decade especially, driven by the progress in the field of massively parallel sequencing, numerous studies have comprehensively assessed the impact of particular manipulations on library complexity and quality, and characterized the activities and specificities of several key enzymes used in library construction. Fortunately, careful protocol design and reagent choice can substantially mitigate many of these biases, and enable reliable representation of sequences in libraries. This review aims to guide the reader through the vast expanse of literature on the subject to promote informed library generation, independent of the application
Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.
BAckground:
Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences.
Results:
We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates.
Conclusion:
We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material
Rapid, ultra low coverage copy number profiling of cell-free DNA as a precision oncology screening strategy.
Current cell-free DNA (cfDNA) next generation sequencing (NGS) precision oncology workflows are typically limited to targeted and/or disease-specific applications. In advanced cancer, disease burden and cfDNA tumor content are often elevated, yielding unique precision oncology opportunities. We sought to demonstrate the utility of a pan-cancer, rapid, inexpensive, whole genome NGS of cfDNA approach (PRINCe) as a precision oncology screening strategy via ultra-low coverage (~0.01x) tumor content determination through genome-wide copy number alteration (CNA) profiling. We applied PRINCe to a retrospective cohort of 124 cfDNA samples from 100 patients with advanced cancers, including 76 men with metastatic castration-resistant prostate cancer (mCRPC), enabling cfDNA tumor content approximation and actionable focal CNA detection, while facilitating concordance analyses between cfDNA and tissue-based NGS profiles and assessment of cfDNA alteration associations with mCRPC treatment outcomes. Therapeutically relevant focal CNAs were present in 42 (34%) cfDNA samples, including 36 of 93 (39%) mCRPC patient samples harboring AR amplification. PRINCe identified pre-treatment cfDNA CNA profiles facilitating disease monitoring. Combining PRINCe with routine targeted NGS of cfDNA enabled mutation and CNA assessment with coverages tuned to cfDNA tumor content. In mCRPC, genome-wide PRINCe cfDNA and matched tissue CNA profiles showed high concordance (median Pearson correlation = 0.87), and PRINCe detectable AR amplifications predicted reduced time on therapy, independent of therapy type (Kaplan-Meier log-rank test, chi-square = 24.9, p < 0.0001). Our screening approach enables robust, broadly applicable cfDNA-based precision oncology for patients with advanced cancer through scalable identification of therapeutically relevant CNAs and pre-/post-treatment genomic profiles, enabling cfDNA- or tissue-based precision oncology workflow optimization
Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies
In the past few years, many studies investigated the anaerobic digestion microbiome by means of 16S rRNA amplicon sequencing. Results obtained from these studies were compared to each other without taking into consideration the followed procedure for amplicons preparation and data analysis. This negligence was mainly due to the lack of knowledge regarding the biases influencing specific steps of the microbiome investigation process. In the present study, the main technical aspects of the 16S rRNA analysis were checked giving special attention to the approach used for high throughput sequencing. More specifically, the microbial compositions of three laboratory scale biogas reactors were analyzed before and after addition of sodium oleate by sequencing the microbiome with three different approaches: 16S rRNA amplicon sequencing, shotgun DNA and shotgun RNA. This comparative analysis revealed that, in amplicon sequencing, abundance of some taxa (Euryarchaeota and Spirochaetes) was biased by the inefficiency of universal primers to hybridize all the templates. Reliability of the results obtained was also influenced by the number of hypervariable regions under investigation. Finally, amplicon sequencing and shotgun DNA underestimated the Methanoculleus genus, probably due to the low 16S rRNA gene copy number encoded in this taxon
Comparison of TCGA and GENIE genomic datasets for the detection of clinically actionable alterations in breast cancer.
Whole exome sequencing (WES), targeted gene panel sequencing and single nucleotide polymorphism (SNP) arrays are increasingly used for the identification of actionable alterations that are critical to cancer care. Here, we compared The Cancer Genome Atlas (TCGA) and the Genomics Evidence Neoplasia Information Exchange (GENIE) breast cancer genomic datasets (array and next generation sequencing (NGS) data) in detecting genomic alterations in clinically relevant genes. We performed an in silico analysis to determine the concordance in the frequencies of actionable mutations and copy number alterations/aberrations (CNAs) in the two most common breast cancer histologies, invasive lobular and invasive ductal carcinoma. We found that targeted sequencing identified a larger number of mutational hotspots and clinically significant amplifications that would have been missed by WES and SNP arrays in many actionable genes such as PIK3CA, EGFR, AKT3, FGFR1, ERBB2, ERBB3 and ESR1. The striking differences between the number of mutational hotspots and CNAs generated from these platforms highlight a number of factors that should be considered in the interpretation of array and NGS-based genomic data for precision medicine. Targeted panel sequencing was preferable to WES to define the full spectrum of somatic mutations present in a tumor
Salmonella produce microRNA-like RNA fragment Sal-1 in the infected cells to facilitate intracellular survival.
Salmonella have developed a sophisticated machinery to evade immune clearance and promote survival in the infected cells. Previous studies were mostly focused on either bacteria itself or host cells, the interaction mechanism of host-pathogen awaits further exploration. In the present study, we show that Salmonella can exploit mammalian cell non-classical microRNA processing machinery to further process bacterial small non-coding RNAs into microRNA-like fragments. Sal-1, one such fragment with the highest copy number in the infected cells, is derived from Salmonella 5-leader of the ribosomal RNA transcript and has a stem structure-containing precursor. Processing of Sal-1 precursors to mature Sal-1 is dependent on host cell Argonaute 2 (AGO2) but not Dicer. Functionally, depleting cellular Sal-1 strongly renders the Salmonella bacteria less resistant to the host defenses both in vitro and in vivo. In conclusion, we demonstrate a novel strategy for Salmonella evading the host immune clearance, in which Salmonella produce microRNA-like functional RNA fragments to establish a microenvironment facilitating bacterial survival
Advances in Microfluidics and Lab-on-a-Chip Technologies
Advances in molecular biology are enabling rapid and efficient analyses for
effective intervention in domains such as biology research, infectious disease
management, food safety, and biodefense. The emergence of microfluidics and
nanotechnologies has enabled both new capabilities and instrument sizes
practical for point-of-care. It has also introduced new functionality, enhanced
sensitivity, and reduced the time and cost involved in conventional molecular
diagnostic techniques. This chapter reviews the application of microfluidics
for molecular diagnostics methods such as nucleic acid amplification,
next-generation sequencing, high resolution melting analysis, cytogenetics,
protein detection and analysis, and cell sorting. We also review microfluidic
sample preparation platforms applied to molecular diagnostics and targeted to
sample-in, answer-out capabilities
PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets
As 16S rRNA gene targeted massively parallel sequencing has become a common tool for microbial diversity investigations, numerous advances have been made to minimize the influence of sequencing and chimeric PCR artifacts through rigorous quality control measures. However, there has been little effort towards understanding the effect of multi-template PCR biases on microbial community structure. In this study, we used three bacterial and three archaeal mock communities consisting of, respectively, 33 bacterial and 24 archaeal 16S rRNA gene sequences combined in different proportions to compare the influences of (1) sequencing depth, (2) sequencing artifacts (sequencing errors and chimeric PCR artifacts), and (3) biases in multi-template PCR, towards the interpretation of community structure in pyrosequencing datasets. We also assessed the influence of each of these three variables on α- and β-diversity metrics that rely on the number of OTUs alone (richness) and those that include both membership and the relative abundance of detected OTUs (diversity). As part of this study, we redesigned bacterial and archaeal primer sets that target the V3–V5 region of the 16S rRNA gene, along with multiplexing barcodes, to permit simultaneous sequencing of PCR products from the two domains. We conclude that the benefits of deeper sequencing efforts extend beyond greater OTU detection and result in higher precision in β-diversity analyses by reducing the variability between replicate libraries, despite the presence of more sequencing artifacts. Additionally, spurious OTUs resulting from sequencing errors have a significant impact on richness or shared-richness based α- and β-diversity metrics, whereas metrics that utilize community structure (including both richness and relative abundance of OTUs) are minimally affected by spurious OTUs. However, the greatest obstacle towards accurately evaluating community structure are the errors in estimated mean relative abundance of each detected OTU due to biases associated with multi-template PCR reactions
- …
