Search CORE

539 research outputs found

Cloud-scale RNA-sequencing differential expression analysis with Myrna

Author: Hansen Kasper D
Langmead Ben
Leek Jeffrey T
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

As sequencing throughput approaches dozens of gigabases per day, there is a growing need for efficient software for analysis of transcriptome sequencing (RNA-Seq) data. Myrna is a cloud-computing pipeline for calculating differential gene expression in large RNA-Seq datasets. We apply Myrna to the analysis of publicly available data sets and assess the goodness of fit of standard statistical models. Myrna is available from http://bowtie-bio.sf.net/myrna

Springer - Publisher Connector

PubMed Central

BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions

Author: Benjamin Langmead
Kasper D Hansen
Rafael A Irizarry
Publication venue: Springer Nature
Publication date: 01/01/2012
Field of study

DNA methylation is an important epigenetic modification involved in gene regulation, which can now be measured using whole-genome bisulfite sequencing. However, cost, complexity of the data, and lack of comprehensive analytical tools are major challenges that keep this technology from becoming widely applied. Here we present BSmooth, an alignment, quality control and analysis pipeline that provides accurate and precise results even with low coverage data, appropriately handling biological replicates. BSmooth is open source software, and can be downloaded from http://rafalab.jhsph.edu/bsmooth

Springer - Publisher Connector

PubMed Central

Removing technical variability in RNA-seq data using conditional quantile normalization

Author: Hansen Kasper D.
Irizarry Rafael A.
WU Zhijin
Publication venue: Oxford University Press
Publication date: 24/05/2011
Field of study

The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions

PubMed Central

Collection Of Biostatistics Research Archive

Biases in Illumina transcriptome sequencing caused by random hexamer priming

Author: Brenner Steven E.
Dudoit Sandrine
Hansen Kasper D.
Publication venue: Oxford University Press
Publication date: 01/07/2010
Field of study

Generation of cDNA using random hexamer priming induces biases in the nucleotide composition at the beginning of transcriptome sequencing reads from the Illumina Genome Analyzer. The bias is independent of organism and laboratory and impacts the uniformity of the reads along the transcriptome. We provide a read count reweighting scheme, based on the nucleotide frequencies of the reads, that mitigates the impact of the bias

PubMed Central

eScholarship - University of California

“Gap hunting” to characterize clustered probe signals in Illumina methylation array data

Author: Andrew P. Feinberg
Christine Ladd-Acosta
Kasper D. Hansen
M. Daniele Fallin
Shan V. Andrews
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Additional file 6: Figures S26–S31. All remaining SBE site scenarios. Each additional scenario of a SBE site-mapping SNP delimited in Fig. 4 not including the scenario shown in Fig. 5. Each of these figures contains 4 plots, showing every combination of CpG site interrogations on the forward and reverse strand as well as which nucleotide is the reference nucleotide

Springer - Publisher Connector

FigShare

Recommended from our members

Common DNA sequence variation influences 3-dimensional conformation of the human genome.

Author: Chiou Joshua
Fletez-Brant Kipper
Gaulton Kyle J
Gorkin David U
Hansen Kasper D
Hu Ming
Li Yun
Liu Tristin
Noor Amina
Qiu Yunjiang
Ren Bing
Schmitt Anthony D
Sebat Jonathan
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

BACKGROUND:The 3-dimensional (3D) conformation of chromatin inside the nucleus is integral to a variety of nuclear processes including transcriptional regulation, DNA replication, and DNA damage repair. Aberrations in 3D chromatin conformation have been implicated in developmental abnormalities and cancer. Despite the importance of 3D chromatin conformation to cellular function and human health, little is known about how 3D chromatin conformation varies in the human population, or whether DNA sequence variation between individuals influences 3D chromatin conformation. RESULTS:To address these questions, we perform Hi-C on lymphoblastoid cell lines from 20 individuals. We identify thousands of regions across the genome where 3D chromatin conformation varies between individuals and find that this variation is often accompanied by variation in gene expression, histone modifications, and transcription factor binding. Moreover, we find that DNA sequence variation influences several features of 3D chromatin conformation including loop strength, contact insulation, contact directionality, and density of local cis contacts. We map hundreds of quantitative trait loci associated with 3D chromatin features and find evidence that some of these same variants are associated at modest levels with other molecular phenotypes as well as complex disease risk. CONCLUSION:Our results demonstrate that common DNA sequence variants can influence 3D chromatin conformation, pointing to a more pervasive role for 3D chromatin conformation in human phenotypic variation than previously recognized

eScholarship - University of California