4 research outputs found
Benchmarking of 4C-seq pipelines based on real and simulated data
"jats:sec" "jats:title"Motivation"/jats:title" "jats:p"With its capacity for high-resolution data output in one region of interest, chromosome conformation capture combined with high-throughput sequencing (4C-seq) is a state-of-the-art next-generation sequencing technique that provides epigenetic insights, and regularly advances current medical research. However, 4C-seq data are complex and prone to biases, and while specialized programs exist, an unbiased, extensive benchmarking is still lacking. Furthermore, neither substantial datasets with fully characterized ground truth, nor simulation programs for realistic 4C-seq data have been published."/jats:p" "/jats:sec" "jats:sec" "jats:title"Results"/jats:title" "jats:p"We conducted a benchmarking study on 66 4C-seq samples from 20 datasets, and developed a novel 4C-seq simulation software, Basic4CSim, to allow for detailed comparisons of 4C-seq algorithms on 50 simulated datasets with 10–120 samples each. Simulations and benchmarking were adapted to address different characteristics of 4C-seq data. Simulated data were compared with published samples to validate simulation settings. We identified differences between 4C-seq algorithms in terms of precision, recall, interaction structure, and run time, and observed general trends. Novel differential pipeline versions of single-sample based 4C-seq algorithms were included in the benchmarking. While no single tool was optimally suited for both near-cis and far-cis, and both single-sample and differential analyses, choosing a high-performing algorithm variant did improve results considerably. For near-cis scenarios, r3Cseq, peakC and FourCSeq offered high precision, while fourSig demonstrated high overall F1 scores in far-cis analyses. Finally, 4C-seq simulations may aid in the development of improved analysis algorithms."/jats:p" "/jats:sec" "jats:sec" "jats:title"Availability and implementation"/jats:title" "jats:p"Basic4CSim is available at https://github.com/walter–ca/Basic4CSim."/jats:p" "/jats:sec" "jats:sec" "jats:title"Supplementary information"/jats:title" "jats:p"Supplementary data are available at Bioinformatics online."/jats:p" "/jats:sec
Document type: Articl
Recommended from our members
Label-free enrichment of avian Leucocytozoon using flow cytrometric sorting
Chakarov N, Greiner J, Hauser S, et al. Label-free enrichment of avian Leucocytozoon using flow cytrometric sorting. Parasitology. 2012;139(12):1547-1552
Efficient animal-serum free 3D cultivation method for adult human neural crest-derived stem cell therapeutics
Greiner J, Hauser S, Widera D, et al. Efficient animal-serum free 3D cultivation method for adult human neural crest-derived stem cell therapeutics. European Cells & Materials. 2011;22:403-419
Temporal autoregulation during human PU.1 locus SubTAD formation
Epigenetic control of gene expression occurs within discrete spatial chromosomal units called topologically associating domains (TADs), but the exact spatial requirements of most genes are unknown; this is of particular interest for genes involved in cancer. We therefore applied high-resolution chromosomal conformation capture sequencing to map the three-dimensional (3D) organization of the human locus encoding the key myeloid transcription factor PU.1 in healthy monocytes and acute myeloid leukemia (AML) cells. We identified a dynamic similar to 75-kb unit (SubTAD) as the genomic region in which spatial interactions between PU.1 gene regulatory elements occur during myeloid differentiation and are interrupted in AML. Within this SubTAD, proper initiation of the spatial chromosomal interactions requires PU.1 autoregulation and recruitment of the chromatin-adaptor protein LDB1 (LIM domain-binding protein 1). However, once these spatial interactions have occurred, LDB1 stabilizes them independently of PU.1 autoregulation. Thus, our data support that PU.1 autoregulates its expression in a "hit-and-run" manner by initiating stable chromosomal loops that result in a transcriptionally active chromatin architecture