820 research outputs found
BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions
DNA methylation is an important epigenetic modification involved in gene regulation, which can now be measured using whole-genome bisulfite sequencing. However, cost, complexity of the data, and lack of comprehensive analytical tools are major challenges that keep this technology from becoming widely applied. Here we present BSmooth, an alignment, quality control and analysis pipeline that provides accurate and precise results even with low coverage data, appropriately handling biological replicates. BSmooth is open source software, and can be downloaded from http://rafalab.jhsph.edu/bsmooth
“Gap hunting” to characterize clustered probe signals in Illumina methylation array data
Additional file 6: Figures S26–S31. All remaining SBE site scenarios. Each additional scenario of a SBE site-mapping SNP delimited in Fig. 4 not including the scenario shown in Fig. 5. Each of these figures contains 4 plots, showing every combination of CpG site interrogations on the forward and reverse strand as well as which nucleotide is the reference nucleotide
Recommended from our members
Common DNA sequence variation influences 3-dimensional conformation of the human genome.
BACKGROUND:The 3-dimensional (3D) conformation of chromatin inside the nucleus is integral to a variety of nuclear processes including transcriptional regulation, DNA replication, and DNA damage repair. Aberrations in 3D chromatin conformation have been implicated in developmental abnormalities and cancer. Despite the importance of 3D chromatin conformation to cellular function and human health, little is known about how 3D chromatin conformation varies in the human population, or whether DNA sequence variation between individuals influences 3D chromatin conformation. RESULTS:To address these questions, we perform Hi-C on lymphoblastoid cell lines from 20 individuals. We identify thousands of regions across the genome where 3D chromatin conformation varies between individuals and find that this variation is often accompanied by variation in gene expression, histone modifications, and transcription factor binding. Moreover, we find that DNA sequence variation influences several features of 3D chromatin conformation including loop strength, contact insulation, contact directionality, and density of local cis contacts. We map hundreds of quantitative trait loci associated with 3D chromatin features and find evidence that some of these same variants are associated at modest levels with other molecular phenotypes as well as complex disease risk. CONCLUSION:Our results demonstrate that common DNA sequence variants can influence 3D chromatin conformation, pointing to a more pervasive role for 3D chromatin conformation in human phenotypic variation than previously recognized
Age and sun exposure-related widespread genomic blocks of hypomethylation in nonmalignant skin
BACKGROUND: Aging and sun exposure are the leading causes of skin cancer. It has been shown that epigenetic changes, such as DNA methylation, are well established mechanisms for cancer, and also have emerging roles in aging and common disease. Here, we directly ask whether DNA methylation is altered following skin aging and/or chronic sun exposure in humans. RESULTS: We compare epidermis and dermis of both sun-protected and sun-exposed skin derived from younger subjects (under 35 years old) and older subjects (over 60 years old), using the Infinium HumanMethylation450 array and whole genome bisulfite sequencing. We observe large blocks of the genome that are hypomethylated in older, sun-exposed epidermal samples, with the degree of hypomethylation associated with clinical measures of photo-aging. We replicate these findings using whole genome bisulfite sequencing, comparing epidermis from an additional set of younger and older subjects. These blocks largely overlap known hypomethylated blocks in colon cancer and we observe that these same regions are similarly hypomethylated in squamous cell carcinoma samples. CONCLUSIONS: These data implicate large scale epigenomic change in mediating the effects of environmental damage with photo-aging. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-015-0644-y) contains supplementary material, which is available to authorized users
Cloud-scale RNA-sequencing differential expression analysis with Myrna
As sequencing throughput approaches dozens of gigabases per day, there is a growing need for efficient software for analysis of transcriptome sequencing (RNA-Seq) data. Myrna is a cloud-computing pipeline for calculating differential gene expression in large RNA-Seq datasets. We apply Myrna to the analysis of publicly available data sets and assess the goodness of fit of standard statistical models. Myrna is available from http://bowtie-bio.sf.net/myrna
Removing technical variability in RNA-seq data using conditional quantile normalization
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions
Biases in Illumina transcriptome sequencing caused by random hexamer priming
Generation of cDNA using random hexamer priming induces biases in the nucleotide composition at the beginning of transcriptome sequencing reads from the Illumina Genome Analyzer. The bias is independent of organism and laboratory and impacts the uniformity of the reads along the transcriptome. We provide a read count reweighting scheme, based on the nucleotide frequencies of the reads, that mitigates the impact of the bias
Highlights from the Pierre Auger Observatory
The Pierre Auger Observatory is the world's largest cosmic ray observatory.
Our current exposure reaches nearly 40,000 km str and provides us with an
unprecedented quality data set. The performance and stability of the detectors
and their enhancements are described. Data analyses have led to a number of
major breakthroughs. Among these we discuss the energy spectrum and the
searches for large-scale anisotropies. We present analyses of our X
data and show how it can be interpreted in terms of mass composition. We also
describe some new analyses that extract mass sensitive parameters from the 100%
duty cycle SD data. A coherent interpretation of all these recent results opens
new directions. The consequences regarding the cosmic ray composition and the
properties of UHECR sources are briefly discussed.Comment: 9 pages, 12 figures, talk given at the 33rd International Cosmic Ray
Conference, Rio de Janeiro 201
The Pierre Auger Observatory III: Other Astrophysical Observations
Astrophysical observations of ultra-high-energy cosmic rays with the Pierre
Auger ObservatoryComment: Contributions to the 32nd International Cosmic Ray Conference,
Beijing, China, August 201
Update on the correlation of the highest energy cosmic rays with nearby extragalactic matter
Data collected by the Pierre Auger Observatory through 31 August 2007 showed
evidence for anisotropy in the arrival directions of cosmic rays above the
Greisen-Zatsepin-Kuz'min energy threshold, \nobreak{eV}. The
anisotropy was measured by the fraction of arrival directions that are less
than from the position of an active galactic nucleus within 75 Mpc
(using the V\'eron-Cetty and V\'eron catalog). An updated
measurement of this fraction is reported here using the arrival directions of
cosmic rays recorded above the same energy threshold through 31 December 2009.
The number of arrival directions has increased from 27 to 69, allowing a more
precise measurement. The correlating fraction is , compared
with expected for isotropic cosmic rays. This is down from the early
estimate of . The enlarged set of arrival directions is
examined also in relation to other populations of nearby extragalactic objects:
galaxies in the 2 Microns All Sky Survey and active galactic nuclei detected in
hard X-rays by the Swift Burst Alert Telescope. A celestial region around the
position of the radiogalaxy Cen A has the largest excess of arrival directions
relative to isotropic expectations. The 2-point autocorrelation function is
shown for the enlarged set of arrival directions and compared to the isotropic
expectation.Comment: Accepted for publication in Astroparticle Physics on 31 August 201
- …
