
    BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions

    DNA methylation is an important epigenetic modification involved in gene regulation, which can now be measured using whole-genome bisulfite sequencing. However, cost, complexity of the data, and lack of comprehensive analytical tools are major challenges that keep this technology from becoming widely applied. Here we present BSmooth, an alignment, quality control and analysis pipeline that provides accurate and precise results even with low coverage data, appropriately handling biological replicates. BSmooth is open source software, and can be downloaded from http://rafalab.jhsph.edu/bsmooth

    “Gap hunting” to characterize clustered probe signals in Illumina methylation array data

    Additional file 6: Figures S26–S31. All remaining SBE site scenarios. Each additional scenario of a SBE site-mapping SNP delimited in Fig. 4 not including the scenario shown in Fig. 5. Each of these figures contains 4 plots, showing every combination of CpG site interrogations on the forward and reverse strand as well as which nucleotide is the reference nucleotide

    Age and sun exposure-related widespread genomic blocks of hypomethylation in nonmalignant skin

    BACKGROUND: Aging and sun exposure are the leading causes of skin cancer. It has been shown that epigenetic changes, such as DNA methylation, are well established mechanisms for cancer, and also have emerging roles in aging and common disease. Here, we directly ask whether DNA methylation is altered following skin aging and/or chronic sun exposure in humans. RESULTS: We compare epidermis and dermis of both sun-protected and sun-exposed skin derived from younger subjects (under 35 years old) and older subjects (over 60 years old), using the Infinium HumanMethylation450 array and whole genome bisulfite sequencing. We observe large blocks of the genome that are hypomethylated in older, sun-exposed epidermal samples, with the degree of hypomethylation associated with clinical measures of photo-aging. We replicate these findings using whole genome bisulfite sequencing, comparing epidermis from an additional set of younger and older subjects. These blocks largely overlap known hypomethylated blocks in colon cancer and we observe that these same regions are similarly hypomethylated in squamous cell carcinoma samples. CONCLUSIONS: These data implicate large scale epigenomic change in mediating the effects of environmental damage with photo-aging. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-015-0644-y) contains supplementary material, which is available to authorized users

    Cloud-scale RNA-sequencing differential expression analysis with Myrna

    As sequencing throughput approaches dozens of gigabases per day, there is a growing need for efficient software for analysis of transcriptome sequencing (RNA-Seq) data. Myrna is a cloud-computing pipeline for calculating differential gene expression in large RNA-Seq datasets. We apply Myrna to the analysis of publicly available data sets and assess the goodness of fit of standard statistical models. Myrna is available from http://bowtie-bio.sf.net/myrna

    Removing technical variability in RNA-seq data using conditional quantile normalization

    The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show that RNA-seq data demonstrate unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find guanine-cytosine content (GC-content) has a strong sample-specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here, we describe a statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content and quantile normalization to correct for global distortions
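    The quantile normalization step mentioned in the abstract above can be illustrated with a minimal sketch. This shows only plain quantile normalization across samples, not the full conditional quantile normalization algorithm (the regression on GC-content is omitted). The function name `quantile_normalize` and the genes-by-samples matrix layout are assumptions made for illustration.

```python
import numpy as np

def quantile_normalize(values):
    """Give every column (sample) the same empirical distribution.

    `values` is a genes-by-samples matrix (an assumed layout).
    Ties within a column are broken by position, which is a simplification.
    """
    values = np.asarray(values, dtype=float)
    # Sort each sample independently.
    sorted_cols = np.sort(values, axis=0)
    # Reference distribution: mean across samples at each rank.
    reference = sorted_cols.mean(axis=1)
    # Rank (0-based) of every entry within its own column.
    ranks = values.argsort(axis=0).argsort(axis=0)
    # Replace each entry by the reference value at its rank.
    return reference[ranks]

expression = np.array([[5, 4, 3],
                       [2, 1, 4],
                       [3, 6, 6],
                       [4, 2, 8]], dtype=float)
normalized = quantile_normalize(expression)
```

    After this step each column of `normalized` holds the same set of values, while the ordering of genes within each sample is unchanged.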

    Biases in Illumina transcriptome sequencing caused by random hexamer priming

    Generation of cDNA using random hexamer priming induces biases in the nucleotide composition at the beginning of transcriptome sequencing reads from the Illumina Genome Analyzer. The bias is independent of organism and laboratory and impacts the uniformity of the reads along the transcriptome. We provide a read count reweighting scheme, based on the nucleotide frequencies of the reads, that mitigates the impact of the bias
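    A reweighting scheme of the kind described above might be sketched as follows. This is a simplified illustration, not the published definition: it assumes the weight of a read is the frequency of its starting k-mer at interior read positions divided by the frequency of the same k-mer at the read start. The function names and the `interior_start` and `n_interior` parameters are hypothetical.

```python
from collections import Counter

def kmer_weights(reads, k=6, interior_start=24, n_interior=6):
    """Weight per starting k-mer: interior frequency / start frequency."""
    start_counts, interior_counts = Counter(), Counter()
    min_len = interior_start + n_interior + k - 1
    for read in reads:
        if len(read) < min_len:
            continue  # too short to cover the interior window
        start_counts[read[:k]] += 1
        for i in range(interior_start, interior_start + n_interior):
            interior_counts[read[i:i + k]] += 1
    total_start = sum(start_counts.values())
    total_interior = sum(interior_counts.values())
    if total_start == 0:
        return {}
    return {
        kmer: (interior_counts[kmer] / total_interior) / (count / total_start)
        for kmer, count in start_counts.items()
    }

def reweighted_count(reads, weights, k=6):
    """Sum of read weights; reads with an unseen start k-mer get weight 1."""
    return sum(weights.get(read[:k], 1.0) for read in reads)
```

    Under these assumptions, k-mers that are over-represented at read starts relative to the read interior receive weights below 1, so the reads they begin contribute less to the reweighted count.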

    Highlights from the Pierre Auger Observatory

    The Pierre Auger Observatory is the world's largest cosmic ray observatory. Our current exposure reaches nearly 40,000 km$^2$ sr and provides us with an unprecedented quality data set. The performance and stability of the detectors and their enhancements are described. Data analyses have led to a number of major breakthroughs. Among these we discuss the energy spectrum and the searches for large-scale anisotropies. We present analyses of our $X_{\max}$ data and show how it can be interpreted in terms of mass composition. We also describe some new analyses that extract mass sensitive parameters from the 100% duty cycle SD data. A coherent interpretation of all these recent results opens new directions. The consequences regarding the cosmic ray composition and the properties of UHECR sources are briefly discussed. Comment: 9 pages, 12 figures, talk given at the 33rd International Cosmic Ray Conference, Rio de Janeiro 201

    The Pierre Auger Observatory III: Other Astrophysical Observations

    Astrophysical observations of ultra-high-energy cosmic rays with the Pierre Auger Observatory. Comment: Contributions to the 32nd International Cosmic Ray Conference, Beijing, China, August 201

    Update on the correlation of the highest energy cosmic rays with nearby extragalactic matter

    Data collected by the Pierre Auger Observatory through 31 August 2007 showed evidence for anisotropy in the arrival directions of cosmic rays above the Greisen-Zatsepin-Kuz'min energy threshold, $6\times 10^{19}$ eV. The anisotropy was measured by the fraction of arrival directions that are less than $3.1^\circ$ from the position of an active galactic nucleus within 75 Mpc (using the Véron-Cetty and Véron 12th catalog). An updated measurement of this fraction is reported here using the arrival directions of cosmic rays recorded above the same energy threshold through 31 December 2009. The number of arrival directions has increased from 27 to 69, allowing a more precise measurement. The correlating fraction is $(38^{+7}_{-6})\%$, compared with $21\%$ expected for isotropic cosmic rays. This is down from the early estimate of $(69^{+11}_{-13})\%$. The enlarged set of arrival directions is examined also in relation to other populations of nearby extragalactic objects: galaxies in the 2 Microns All Sky Survey and active galactic nuclei detected in hard X-rays by the Swift Burst Alert Telescope. A celestial region around the position of the radiogalaxy Cen A has the largest excess of arrival directions relative to isotropic expectations. The 2-point autocorrelation function is shown for the enlarged set of arrival directions and compared to the isotropic expectation. Comment: Accepted for publication in Astroparticle Physics on 31 August 201