124 research outputs found
MetMap Enables Genome-Scale Methyltyping for Determining Methylation States in Populations
The ability to assay genome-scale methylation patterns using high-throughput sequencing makes it possible to carry out association studies to determine the relationship between epigenetic variation and phenotype. While bisulfite sequencing can determine a methylome at high resolution, cost inhibits its use in comparative and population studies. MethylSeq, based on sequencing of fragment ends produced by a methylation-sensitive restriction enzyme, is a method for methyltyping (survey of methylation states) and is a site-specific and cost-effective alternative to whole-genome bisulfite sequencing. Despite its advantages, the use of MethylSeq has been restricted by biases in MethylSeq data that complicate the determination of methyltypes. Here we introduce a statistical method, MetMap, that produces corrected site-specific methylation states from MethylSeq experiments and annotates unmethylated islands across the genome. MetMap integrates genome sequence information with experimental data, in a statistically sound and cohesive Bayesian Network. It infers the extent of methylation at individual CGs and across regions, and serves as a framework for comparative methylation analysis within and among species. We validated MetMap's inferences with direct bisulfite sequencing, showing that the methylation status of sites and islands is accurately inferred. We used MetMap to analyze MethylSeq data from four human neutrophil samples, identifying novel, highly unmethylated islands that are invisible to sequence-based annotation strategies. The combination of MethylSeq and MetMap is a powerful and cost-effective tool for determining genome-scale methyltypes suitable for comparative and association studies
Single-neuron RNA-Seq: technical feasibility and reproducibility
Understanding brain function involves improved knowledge about how the genome specifies such a large diversity of neuronal types. Transcriptome analysis of single neurons has been previously described using gene expression microarrays. Using high-throughput transcriptome sequencing (RNA-Seq), we have developed a method to perform single-neuron RNA-Seq. Following electrophysiology recording from an individual neuron, total RNA was extracted by aspirating the cellular contents into a fine glass electrode tip. The mRNAs were reverse transcribed and amplified to construct a single neuron cDNA library, and subsequently subjected to high-throughput sequencing. This approach was applicable to both individual neurons cultured from embryonic mouse hippocampus, as well as neocortical neurons from live brain slices. We found that the average pairwise Spearman’s rank correlation coefficient of gene expression level expressed as RPKM (reads per kilobase of transcript per million mapped reads) was 0.51 between five cultured neuronal cells, whereas the same measure between three cortical layer V neurons in situ was 0.25. The data suggest that there may be greater heterogeneity of the cortical neurons, as compared to neurons in vitro. The results demonstrate the technical feasibility and reproducibility of RNA-Seq in capturing a part of the transcriptome landscape of single neurons, and confirmed that morphologically identical neurons, even from the same region, have distinct gene expression patterns
Duplex-specific nuclease efficiently removes rRNA for prokaryotic RNA-seq
Next-generation sequencing has great potential for application in bacterial transcriptomics. However, unlike eukaryotes, bacteria have no clear mechanism to select mRNAs over rRNAs; therefore, rRNA removal is a critical step in sequencing-based transcriptomics. Duplex-specific nuclease (DSN) is an enzyme that, at high temperatures, degrades duplex DNA in preference to single-stranded DNA. DSN treatment has been successfully used to normalize the relative transcript abundance in mRNA-enriched cDNA libraries from eukaryotic organisms. In this study, we demonstrate the utility of this method to remove rRNA from prokaryotic total RNA. We evaluated the efficacy of DSN to remove rRNA by comparing it with the conventional subtractive hybridization (Hyb) method. Illumina deep sequencing was performed to obtain transcriptomes from Escherichia coli grown under four growth conditions. The results clearly showed that our DSN treatment was more efficient at removing rRNA than the Hyb method was, while preserving the original relative abundance of mRNA species in bacterial cells. Therefore, we propose that, for bacterial mRNA-seq experiments, DSN treatment should be preferred to Hyb-based methods.
Single-Cell Transcriptome Analysis Reveals Dynamic Changes in lncRNA Expression during Reprogramming
Cellular reprogramming highlights the epigenetic plasticity of the somatic cell state. Long noncoding RNAs (lncRNAs) have emerging roles in epigenetic regulation, but their potential functions in reprogramming cell fate have been largely unexplored. We used single-cell RNA sequencing to characterize the expression patterns of over 16,000 genes, including 437 lncRNAs, during defined stages of reprogramming to pluripotency. Self-organizing maps (SOMs) were used as an intuitive way to structure and interrogate transcriptome data at the single-cell level. Early molecular events during reprogramming involved the activation of Ras signaling pathways, along with hundreds of lncRNAs. Loss-of-function studies showed that activated lncRNAs can repress lineage-specific genes, while lncRNAs activated in multiple reprogramming cell types can regulate metabolic gene expression. Our findings demonstrate that reprogramming cells activate defined sets of functionally relevant lncRNAs and provide a resource to further investigate how dynamic changes in the transcriptome reprogram cell state
Phase II Open Label Study of Valproic Acid in Spinal Muscular Atrophy
UNLABELLED:Preliminary in vitro and in vivo studies with valproic acid (VPA) in cell lines and patients with spinal muscular atrophy (SMA) demonstrate increased expression of SMN, supporting the possibility of therapeutic benefit. We performed an open label trial of VPA in 42 subjects with SMA to assess safety and explore potential outcome measures to help guide design of future controlled clinical trials. Subjects included 2 SMA type I ages 2-3 years, 29 SMA type II ages 2-14 years and 11 type III ages 2-31 years, recruited from a natural history study. VPA was well-tolerated and without evident hepatotoxicity. Carnitine depletion was frequent and temporally associated with increased weakness in two subjects. Exploratory outcome measures included assessment of gross motor function via the modified Hammersmith Functional Motor Scale (MHFMS), electrophysiologic measures of innervation including maximum ulnar compound muscle action potential (CMAP) amplitudes and motor unit number estimation (MUNE), body composition and bone density via dual-energy X-ray absorptiometry (DEXA), and quantitative blood SMN mRNA levels. Clear decline in motor function occurred in several subjects in association with weight gain; mean fat mass increased without a corresponding increase in lean mass. We observed an increased mean score on the MHFMS scale in 27 subjects with SMA type II (p<or=0.001); however, significant improvement was almost entirely restricted to participants <5 years of age. Full length SMN levels were unchanged and Delta7SMN levels were significantly reduced for 2 of 3 treatment visits. In contrast, bone mineral density (p<or=0.0036) and maximum ulnar CMAP scores (p<or=0.0001) increased significantly. CONCLUSIONS:While VPA appears safe and well-tolerated in this initial pilot trial, these data suggest that weight gain and carnitine depletion are likely to be significant confounding factors in clinical trials. This study highlights potential strengths and limitations of various candidate outcome measures and underscores the need for additional controlled clinical trials with VPA targeting more restricted cohorts of subjects. TRIAL REGISTRATION:ClinicalTrials.gov
SMA CARNI-VAL Trial Part I: Double-Blind, Randomized, Placebo-Controlled Trial of L-Carnitine and Valproic Acid in Spinal Muscular Atrophy
Valproic acid (VPA) has demonstrated potential as a therapeutic candidate for spinal muscular atrophy (SMA) in vitro and in vivo.Two cohorts of subjects were enrolled in the SMA CARNIVAL TRIAL, a non-ambulatory group of "sitters" (cohort 1) and an ambulatory group of "walkers" (cohort 2). Here, we present results for cohort 1: a multicenter phase II randomized double-blind intention-to-treat protocol in non-ambulatory SMA subjects 2-8 years of age. Sixty-one subjects were randomized 1:1 to placebo or treatment for the first six months; all received active treatment the subsequent six months. The primary outcome was change in the modified Hammersmith Functional Motor Scale (MHFMS) score following six months of treatment. Secondary outcomes included safety and adverse event data, and change in MHFMS score for twelve versus six months of active treatment, body composition, quantitative SMN mRNA levels, maximum ulnar CMAP amplitudes, myometry and PFT measures.At 6 months, there was no difference in change from the baseline MHFMS score between treatment and placebo groups (difference = 0.643, 95% CI = -1.22-2.51). Adverse events occurred in >80% of subjects and were more common in the treatment group. Excessive weight gain was the most frequent drug-related adverse event, and increased fat mass was negatively related to change in MHFMS values (p = 0.0409). Post-hoc analysis found that children ages two to three years that received 12 months treatment, when adjusted for baseline weight, had significantly improved MHFMS scores (p = 0.03) compared to those who received placebo the first six months. A linear regression analysis limited to the influence of age demonstrates young age as a significant factor in improved MHFMS scores (p = 0.007).This study demonstrated no benefit from six months treatment with VPA and L-carnitine in a young non-ambulatory cohort of subjects with SMA. Weight gain, age and treatment duration were significant confounding variables that should be considered in the design of future trials.Clinicaltrials.gov NCT00227266
Targeted bisulfite sequencing by solution hybrid selection and massively parallel sequencing
We applied a solution hybrid selection approach to the enrichment of CpG islands (CGIs) and promoter sequences from the human genome for targeted high-throughput bisulfite sequencing. A single lane of Illumina sequences allowed accurate and quantitative analysis of ~1 million CpGs in more than 21 408 CGIs and more than 15 946 transcriptional regulatory regions. Of the CpGs analyzed, 77–84% fell on or near capture probe sequences; 69–75% fell within CGIs. More than 85% of capture probes successfully yielded quantitative DNA methylation information of targeted regions. Differentially methylated regions (DMRs) were identified in the 5′-end regulatory regions, as well as the intra- and intergenic regions, particularly in the X-chromosome among the three breast cancer cell lines analyzed. We chose 46 candidate loci (762 CpGs) for confirmation with PCR-based bisulfite sequencing and demonstrated excellent correlation between two data sets. Targeted bisulfite sequencing of three DNA methyltransferase (DNMT) knockout cell lines and the wild-type HCT116 colon cancer cell line revealed a significant decrease in CpG methylation for the DNMT1 knockout and DNMT1, 3B double knockout cell lines, but not in DNMT3B knockout cell line. We demonstrated the targeted bisulfite sequencing approach to be a powerful method to uncover novel aberrant methylation in the cancer epigenome. Since all targets were captured and sequenced as a pool through a series of single-tube reactions, this method can be easily scaled up to deal with a large number of samples
A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines
SnowShoes-FTD, developed for fusion transcript detection in paired-end mRNA-Seq data, employs multiple steps of false positive filtering to nominate fusion transcripts with near 100% confidence. Unique features include: (i) identification of multiple fusion isoforms from two gene partners; (ii) prediction of genomic rearrangements; (iii) identification of exon fusion boundaries; (iv) generation of a 5′–3′ fusion spanning sequence for PCR validation; and (v) prediction of the protein sequences, including frame shift and amino acid insertions. We applied SnowShoes-FTD to identify 50 fusion candidates in 22 breast cancer and 9 non-transformed cell lines. Five additional fusion candidates with two isoforms were confirmed. In all, 30 of 55 fusion candidates had in-frame protein products. No fusion transcripts were detected in non-transformed cells. Consideration of the possible functions of a subset of predicted fusion proteins suggests several potentially important functions in transformation, including a possible new mechanism for overexpression of ERBB2 in a HER-positive cell line. The source code of SnowShoes-FTD is provided in two formats: one configured to run on the Sun Grid Engine for parallelization, and the other formatted to run on a single LINUX node. Executables in PERL are available for download from our web site: http://mayoresearch.mayo.edu/mayo/research/biostat/stand-alone-packages.cfm
- …