Search CORE

40 research outputs found

RUbioSeq+: A multiplatform application that executes parallelized pipelines to analyse next-generation sequencing data

Author: López-Fernández Hugo
Rubio-Camarillo Miriam
Publication venue: 'Elsevier BV'
Publication date: 26/10/2017
Field of study

This is the peer reviewed version of the following article: Computer Methods and Programs in Biomedine 138 (2016): 73-81, which has been published in final form at http://dx.doi.org/10.1016/j.cmpb.2016.10.008Background and objective To facilitate routine analysis and to improve the reproducibility of the results, next-generation sequencing (NGS) analysis requires intuitive, efficient and integrated data processing pipelines. Methods We have selected well-established software to construct a suite of automated and parallelized workflows to analyse NGS data for DNA-seq (single-nucleotide variants (SNVs) and indels), CNA-seq, bisulfite-seq and ChIP-seq experiments. Results Here, we present RUbioSeq+, an updated and extended version of RUbioSeq, a multiplatform application that incorporates a suite of automated and parallelized workflows to analyse NGS data. This new version includes: (i) an interactive graphical user interface (GUI) that facilitates its use by both biomedical researchers and bioinformaticians, (ii) a new pipeline for ChIP-seq experiments, (iii) pair-wise comparisons (case–control analyses) for DNA-seq experiments, (iv) and improvements in the parallelized and multithreaded execution options. Results generated by our software have been experimentally validated and accepted for publication. Conclusions RUbioSeq+ is free and open to all users at http://rubioseq.bioinfo.cnio.es/.M.R-C is funded by the BLUEPRINT Consortium (FP7/ 2007-2013) under grant agreement number 282510. J.M.F is funded by the INB Node 2 - CNIO, a member of Proteored - PRB2-ISCIII and is supported by grant PT13/0001, of the PE I+D+i 2013-2016, funded by ISCIII and FEDER. H.L-F is funded by a postdoctoral fellowship from the Xunta de Galicia. F.F-R and D.G-P are funded by the European Union's Seventh Framework Programme FP7/REGPOT 2012 2013.1 under grant agreement n° 316265 (BIOCAPS) and the "Platform of integration of intelligent techniques for analysis of biomedical information" project (TIN2013-47153-C3-3-R) financed by the Spanish Ministry of Economy and Competitiveness C.FT is funded by the "Spanish National Youth Guarantee Implementation Plan” (2013/2016) financed by the Spanish Ministry of Economy and Competitivenes

Biblos-e Archivo

Q&A: ChIP-seq technologies and the study of gene regulation

Author: A Barski
A Barski
A Goren
A Visel
AP Boyle
C Taslim
C Zang
DA Nix
Edison T Liu
G Bourque
H Xu
I Kozarewa
J Rozowsky
JC Dohm
L Teytelman
M Guttman
Mikael Huss
MJ Fullwood
PV Kharchenko
R Jothi
S Pepke
Sebastian Pott
TD Laajala
TS Mikkelsen
VB Vega
X Chen
Y Zhang
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

10.1186/1741-7007-8-56BMC Biology85

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

The Ensembl Regulatory Build

Author: A Barski
A Visel
AL Dixon
AR Quinlan
B Ren
BE Stranger
CM Koch
CY McLean
Daniel R Zerbino
DR Zerbino
DS Johnson
E Lieberman-Aiden
FANTOM The
GA Maston
H Li
H Xu
I Keshet
J Dostie
J Ernst
J Ernst
J Severin
JD Buenrostro
M Esteller
M Kellis
M Levine
M Weber
MJ Fullwood
ML Freedman
MM Hoffman
MM Hoffman
Nathan Johnson
P Flicek
P Fraser
Paul R Flicek
PG Giresi
R Andersson
R Jaenisch
R Lister
R Margueron
RE Thurman
RJ Klose
RM Kuhn
SIS Grewal
Steven P Wilder
T Jenuwein
The ENCODE project consortium
Thomas Juettemann
TS Mikkelsen
V Curwen
Y Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A computational model for histone mark propagation reproduces the distribution of heterochromatin in different human cell types

Author: Jensen Ole Nørregaard
Schwämmle Veit
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Chromatin is a highly compact and dynamic nuclear structure that consists of DNA and associated proteins. The main organizational unit is the nucleosome, which consists of a histone octamer with DNA wrapped around it. Histone proteins are implicated in the regulation of eukaryote genes and they carry numerous reversible post-translational modifications that control DNA-protein interactions and the recruitment of chromatin binding proteins. Heterochromatin, the transcriptionally inactive part of the genome, is densely packed and contains histone H3 that is methylated at Lys 9 (H3K9me). The propagation of H3K9me in nucleosomes along the DNA in chromatin is antagonizing by methylation of H3 Lysine 4 (H3K4me) and acetylations of several lysines, which is related to euchromatin and active genes. We show that the related histone modifications form antagonized domains on a coarse scale. These histone marks are assumed to be initiated within distinct nucleation sites in the DNA and to propagate bi-directionally. We propose a simple computer model that simulates the distribution of heterochromatin in human chromosomes. The simulations are in agreement with previously reported experimental observations from two different human cell lines. We reproduced different types of barriers between heterochromatin and euchromatin providing a unified model for their function. The effect of changes in the nucleation site distribution and of propagation rates were studied. The former occurs mainly with the aim of (de-)activation of single genes or gene groups and the latter has the power of controlling the transcriptional programs of entire chromosomes. Generally, the regulatory program of gene transcription is controlled by the distribution of nucleation sites along the DNA string.Comment: 24 pages,9 figures, 1 table + supplementary materia

arXiv.org e-Print Archive

Public Library of Science (PLOS)

University of Southern Denmark Research Output

FigShare

Bivalent-Like Chromatin Markers Are Predictive for Transcription Start Site Distribution in Human

Author: A Antos
A Barski
A Valouev
BE Bernstein
C Cayrou
D Allis
DA Gilchrist
DE Schones
DE Schones
E Segal
E Segal
E Valen
EA Rach
H Kawaji
H Xu
J Kim
J Sun
JE Ohm
JK Sims
JQ Svejstrup
K Koh
K Nishioka
L Balakrishnan
LA Sanz
Leonardo Mariño-Ramírez
LJ Core
M Megraw
MC Frith
MG Guenther
Michael Q. Zhang
O Ram
P Carninci
P Carninci
PJ Sabo
R Karlic
RA Hoskins
S Nechaev
S Saxonov
ST Jensen
T Hastie
T Schurmann
TEP Consortium
TM Spektor
TN Mavrich
TY Roh
V Matys
VK Rakyan
X Wang
Xiaotu Ma
Y Field
Y Zhang
Y Zhang
Z Wang
Z Zhang
Z Zhang
Zhihua Zhang
Publication venue: Public Library of Science
Publication date: 29/06/2012
Field of study

Deep sequencing of 5′ capped transcripts has revealed a variety of transcription initiation patterns, from narrow, focused promoters to wide, broad promoters. Attempts have already been made to model empirically classified patterns, but virtually no quantitative models for transcription initiation have been reported. Even though both genetic and epigenetic elements have been associated with such patterns, the organization of regulatory elements is largely unknown. Here, linear regression models were derived from a pool of regulatory elements, including genomic DNA features, nucleosome organization, and histone modifications, to predict the distribution of transcription start sites (TSS). Importantly, models including both active and repressive histone modification markers, e.g. H3K4me3 and H4K20me1, were consistently found to be much more predictive than models with only single-type histone modification markers, indicating the possibility of “bivalent-like” epigenetic control of transcription initiation. The nucleosome positions are proposed to be coded in the active component of such bivalent-like histone modification markers. Finally, we demonstrated that models trained on one cell type could successfully predict TSS distribution in other cell types, suggesting that these models may have a broader application range

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

A diffusion model for the coordination of DNA replication in Schizosaccharomyces pombe

Author: Allison J. R.
Grand R. S.
Kaykov A.
Martienssen R. A.
Masuda K.
Nurse P.
O'Sullivan J. M.
Pichugina T.
Schierding W.
Sugawara T.
Ueno M.
Uewaki J.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/01/2016
Field of study

The locations of proteins and epigenetic marks on the chromosomal DNA sequence are believed to demarcate the eukaryotic genome into distinct structural and functional domains that contribute to gene regulation and genome organization. However, how these proteins and epigenetic marks are organized in three dimensions remains unknown. Recent advances in proximity-ligation methodologies and high resolution microscopy have begun to expand our understanding of these spatial relationships. Here we use polymer models to examine the spatial organization of epigenetic marks, euchromatin and heterochromatin, and origins of replication within the Schizosaccharomyces pombe genome. These models incorporate data from microscopy and proximity-ligation experiments that inform on the positions of certain elements and contacts within and between chromosomes. Our results show a striking degree of compartmentalization of epigenetic and genomic features and lead to the proposal of a diffusion based mechanism, centred on the spindle pole body, for the coordination of DNA replication in S. pombe

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

DISMISS: detection of stranded methylation in MeDIP-Seq data

Author
Publication venue: BioMed Central
Publication date: 29/07/2016
Field of study

Springer - Publisher Connector

Evaluation of Algorithm Performance in ChIP-Seq Peak Detection

Author: Facciotti Marc T.
Wilbanks Elizabeth G.
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Next-generation DNA sequencing coupled with chromatin immunoprecipitation (ChIP-seq) is revolutionizing our ability to interrogate whole genome protein-DNA interactions. Identification of protein binding sites from ChIP-seq data has required novel computational tools, distinct from those used for the analysis of ChIP-Chip experiments. The growing popularity of ChIP-seq spurred the development of many different analytical programs (at last count, we noted 31 open source methods), each with some purported advantage. Given that the literature is dense and empirical benchmarking challenging, selecting an appropriate method for ChIP-seq analysis has become a daunting task. Herein we compare the performance of eleven different peak calling programs on common empirical, transcription factor datasets and measure their sensitivity, accuracy and usability. Our analysis provides an unbiased critical assessment of available technologies, and should assist researchers in choosing a suitable tool for handling ChIP-seq data

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains

Author: A Rada-Iglesias
AR Quinlan
C Zang
CE Grant
D Kim
EG Wilbanks
EJ Wagner
J Wang
Joshua Starmer
M Micsinai
PJ Collins
Q Song
S Anders
TC Lystig
Terry Magnuson
VW Zhou
X Feng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/03/2016
Field of study

Abstract Background Correctly identifying genomic regions enriched with histone modifications and transcription factors is key to understanding their regulatory and developmental roles. Conceptually, these regions are divided into two categories, narrow peaks and broad domains, and different algorithms are used to identify each one. Datasets that span these two categories are often analyzed with a single program for peak calling combined with an ad hoc method for domains. Results We developed hiddenDomains, which identifies both peaks and domains, and compare it to the leading algorithms using H3K27me3, H3K36me3, GABP, ESR1 and FOXA ChIP-seq datasets. The output from the programs was compared to qPCR-validated enriched and depleted sites, predicted transcription factor binding sites, and highly-transcribed gene bodies. With every method, hiddenDomains, performed as well as, if not better than algorithms dedicated to a specific type of analysis. Conclusions hiddenDomains performs as well as the best domain and peak calling algorithms, making it ideal for analyzing ChIP-seq datasets, especially those that contain a mixture of peaks and domains

Crossref

Springer - Publisher Connector

PubMed Central

Carolina Digital Repository