163 research outputs found
jMOSAiCS: joint analysis of multiple ChIP-seq datasets
The ChIP-seq technique enables genome-wide mapping of in vivo protein-DNA interactions and chromatin states. Current analytical approaches for ChIP-seq analysis are largely geared towards single-sample investigations, and have limited applicability in comparative settings that aim to identify combinatorial patterns of enrichment across multiple datasets. We describe a novel probabilistic method, jMOSAiCS, for jointly analyzing multiple ChIP-seq datasets. We demonstrate its usefulness with a wide range of data-driven computational experiments and with a case study of histone modifications on GATA1-occupied segments during erythroid differentiation. jMOSAiCS is open source software and can be downloaded from Bioconductor [1]
Parameter estimation for robust HMM analysis of ChIP-chip data
Tiling arrays are an important tool for the study of transcriptional activity, protein-DNA interactions and chromatin structure on a genome-wide scale at high resolution. Although hidden Markov models have been used successfully to analyse tiling array data, parameter estimation for these models is typically ad hoc. Especially in the context of ChIP-chip experiments, no standard procedures exist to obtain parameter estimates from the data. Common methods for the calculation of maximum likelihood estimates such as the Baum-Welch algorithm or Viterbi training are rarely applied in the context of tiling array analysis. Results: Here we develop a hidden Markov model for the analysis of chromatin structure ChIP-chip tiling array data, using t emission distributions to increase robustness towards outliers. Maximum likelihood estimates are used for all model parameters. Two different approaches to parameter estimation are investigated and combined into an efficient procedure. Conclusion: We illustrate an efficient parameter estimation procedure that can be used for HMM based methods in general and leads to a clear increase in performance when compared to the use of ad hoc estimates. The resulting hidden Markov model outperforms established methods like TileMap in the context of histone modification studies.13 page(s
A Phylogenetic Mixture Model for the Evolution of Gene Expression
Microarray platforms are used increasingly to make comparative inferences through genome-wide surveys of gene expression. Although recent studies focus on describing the evidence for natural selection using estimates of the within- and between-taxa mutational variances, these methods do not explicitly or flexibly account for predicted nonindependence due to phylogenetic associations between measurements. In the interest of parsing the effects of selection: we introduce a mixture model for the comparative analysis of variation in gene expression across multiple taxa. This class of models isolates the phylogenetic signal from the nonphylogenetic and the heritable signal from the nonheritable while measuring the proper amount of correction. As a result, the mixture model resolves outstanding differences between existing models, relates different ways to estimate the across taxa variance, and induces a likelihood ratio test for selection. We investigate by simulation and application the feasibility and utility of estimation of the required parameters and the power of the proposed test. We illustrate analysis under this mixture model with a gene duplication family data set
Analyzing ChIP-chip Data Using Bioconductor
Analyzing ChIP-chip Data Using Bioconducto
An Integrated Pipeline for the Genome-Wide Analysis of Transcription Factor Binding Sites from ChIP-Seq
ChIP-Seq has become the standard method for genome-wide profiling DNA association
of transcription factors. To simplify analyzing and interpreting ChIP-Seq data,
which typically involves using multiple applications, we describe an integrated,
open source, R-based analysis pipeline. The pipeline addresses data input, peak
detection, sequence and motif analysis, visualization, and data export, and can
readily be extended via other R and Bioconductor packages. Using a standard
multicore computer, it can be used with datasets consisting of tens of thousands
of enriched regions. We demonstrate its effectiveness on published human
ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring
motifs that were consistent with the literature but not detected by other
methods. Our pipeline provides the first complete set of Bioconductor tools for
sequence and motif analysis of ChIP-Seq and ChIP-chip data
Discovering Transcription Factor Binding Sites in Highly Repetitive Regions of Genomes with Multi-Read Analysis of ChIP-Seq Data
Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is rapidly replacing chromatin immunoprecipitation combined with genome-wide tiling array analysis (ChIP-chip) as the preferred approach for mapping transcription-factor binding sites and chromatin modifications. The state of the art for analyzing ChIP-seq data relies on using only reads that map uniquely to a relevant reference genome (uni-reads). This can lead to the omission of up to 30% of alignable reads. We describe a general approach for utilizing reads that map to multiple locations on the reference genome (multi-reads). Our approach is based on allocating multi-reads as fractional counts using a weighted alignment scheme. Using human STAT1 and mouse GATA1 ChIP-seq datasets, we illustrate that incorporation of multi-reads significantly increases sequencing depths, leads to detection of novel peaks that are not otherwise identifiable with uni-reads, and improves detection of peaks in mappable regions. We investigate various genome-wide characteristics of peaks detected only by utilization of multi-reads via computational experiments. Overall, peaks from multi-read analysis have similar characteristics to peaks that are identified by uni-reads except that the majority of them reside in segmental duplications. We further validate a number of GATA1 multi-read only peaks by independent quantitative real-time ChIP analysis and identify novel target genes of GATA1. These computational and experimental results establish that multi-reads can be of critical importance for studying transcription factor binding in highly repetitive regions of genomes with ChIP-seq experiments
Magnetic crystals and helical liquids in alkaline-earth fermionic gases
The joint action of a synthetic gauge potential and of atomic contact repulsion in a one-dimensional alkaline-earth(-like) fermionic gas with nuclear spin I leads to the existence of a hierarchy of fractional insulating and conducting states with intriguing properties. We unveil the existence and the features of those phases by means of both analytical bosonization techniques and numerical methods based on the density-matrix renormalization group algorithm. In particular, we show that the gapless phases can support helical modes, whereas the gapped states, which appear under certain conditions, are characterised both by density and magnetic order. Several distinct features emerge solely for spin I larger than 1/2, thus making their study with cold-atoms unique. We will finally argue that these states are related to the properties of an unconventional fractional quantum Hall effect in the thin-torus limit. The properties of this hierarchy of states can be experimentally studied in state-of-the-art cold-atom laboratories
- …