7 research outputs found

    Using Nanopore Sequencing to Interrogate the Genome and Epigenome

    Get PDF
    This work involves using native RNA nanopore sequencing to directly characterize the transcriptome of a human cell line, GM12878. We demonstrated several new methods, and findings, including newly discovered isoforms, allele-specific isoforms, measurement of polyadenylation length, and even measurement of RNA modifications. We also describe an application of nanopore RNA sequencing and chemical labeling to measure the secondary structure of RNA. Lastly, we demonstrate an analysis framework for looking at a new file format for single-molecule/long-read modification data

    Nanopore native RNA sequencing of a human poly(A) transcriptome

    Get PDF
    High-throughput complementary DNA sequencing technologies have advanced our understanding of transcriptome complexity and regulation. However, these methods lose information contained in biological RNA because the copied reads are often short and modifications are not retained. We address these limitations using a native poly(A) RNA sequencing strategy developed by Oxford Nanopore Technologies. Our study generated 9.9 million aligned sequence reads for the human cell line GM12878, using thirty MinION flow cells at six institutions. These native RNA reads had a median length of 771 bases, and a maximum aligned length of over 21,000 bases. Mitochondrial poly(A) reads provided an internal measure of read-length quality. We combined these long nanopore reads with higher accuracy short-reads and annotated GM12878 promoter regions to identify 33,984 plausible RNA isoforms. We describe strategies for assessing 3′ poly(A) tail length, base modifications and transcript haplotypes

    Direct detection of RNA modifications and structure using single-molecule nanopore sequencing

    No full text
    Summary: Modifications are present on many classes of RNA, including tRNA, rRNA, and mRNA. These modifications modulate diverse biological processes such as genetic recoding and mRNA export and folding. In addition, modifications can be introduced to RNA molecules using chemical probing strategies that reveal RNA structure and dynamics. Many methods exist to detect RNA modifications by short-read sequencing; however, limitations on read length inherent to short-read-based methods dissociate modifications from their native context, preventing single-molecule modification analysis. Here, we demonstrate direct RNA nanopore sequencing to detect endogenous and exogenous RNA modifications on long RNAs at the single-molecule level. We detect endogenous 2′-O-methyl and base modifications across E. coli and S. cerevisiae ribosomal RNAs as shifts in current signal and dwell times distally through interactions with the helicase motor protein. We further use the 2′-hydroxyl reactive SHAPE reagent acetylimidazole to probe RNA structure at the single-molecule level with readout by direct nanopore sequencing

    Plasma virome and the risk of blood-borne infection in persons with substance use disorder.

    No full text
    There is an urgent need for innovative methods to reduce transmission of bloodborne pathogens like HIV and HCV among people who inject drugs (PWID). We investigate if PWID who acquire non-pathogenic bloodborne viruses like anelloviruses and pegiviruses might be at greater risk of acquiring a bloodborne pathogen. PWID who later acquire HCV accumulate more non-pathogenic viruses in plasma than matched controls who do not acquire HCV infection. Additionally, phylogenetic analysis of those non-pathogenic virus sequences reveals drug use networks. Here we find first in Baltimore and confirm in San Francisco that the accumulation of non-pathogenic viruses in PWID is a harbinger for subsequent acquisition of pathogenic viruses, knowledge that may guide the prioritization of the public health resources to combat HIV and HCV

    Epigenetic patterns in a complete human genome

    No full text
    The completion of a telomere-to-telomere human reference genome, T2T-CHM13, has resolved complex regions of the genome, including repetitive and homologous regions. Here, we present a high-resolution epigenetic study of previously unresolved sequences, representing entire acrocentric chromosome short arms, gene family expansions, and a diverse collection of repeat classes. This resource precisely maps CpG methylation (32.28 million CpGs), DNA accessibility, and short-read datasets (166,058 previously unresolved chromatin immunoprecipitation sequencing peaks) to provide evidence of activity across previously unidentified or corrected genes and reveals clinically relevant paralog-specific regulation. Probing CpG methylation across human centromeres from six diverse individuals generated an estimate of variability in kinetochore localization. This analysis provides a framework with which to investigate the most elusive regions of the human genome, granting insights into epigenetic regulation
    corecore