The Francis Crick Institute

Sussex Research Online

eFORGE v2.0: updated analysis of cell type-specific signal in epigenomic data

Author: Beck S
Bourque G
Breeze CE
Dunham I
Lazar J
Neph S
Reynolds AP
Stamatoyannopoulos JA
Teschendorff AE
van Dongen J
Vierstra J
Publication date: 04/06/2019

SUMMARY: The Illumina Infinium EPIC BeadChip is a new high-throughput array for DNA methylation analysis, extending the earlier 450k array by over 400,000 new sites. Previously, a method named eFORGE was developed to provide insights into cell type-specific and cell composition effects for 450k data. Here, we present a significantly updated and improved version of eFORGE that can analyse both EPIC and 450k array data. New features include analysis of chromatin states, TF motifs and DNase I footprints, providing tools for EWAS interpretation and epigenome editing. AVAILABILITY: eFORGE v2.0 is implemented as a web tool available from https://eforge.altiusinstitute.org and https://eforge-tf.altiusinstitute.org/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

UCL Discovery

Genetic, environmental and stochastic factors in monozygotic twin discordance with a focus on epigenetic differences

Author: A Chess
A Harder
A Imamura
A Itsara
A Petronis
A van Oudenaarden
A Victoria
AAGL Schinzel
AEaM Chang
AJ Notini
AW Norris
B Gottlieb
BG Forde
BM Javierre
C Desplan
CEG Bruder
D Boomsma
D Freedman
D Galetzka
D Porteous
DR Cox
E Ballestar
E Whitelaw
EL Dempster
FN Haque
G Kuratomi
G Machin
G Mari
G Vassart
George C Ebers
GM Martin
H Yamagishi
H Youssoufian
HM Gardiner
HR Raynes
HS Wong
IP Pogribny
J Dube
J Mill
JA Stamatoyannopoulos
JD Watson
JN Hirschhorn
JP Dumanski
JR Ecker
JT Bell
Julia M Morahan
K Demissie
K Gartner
KW Kinzler
L Kaplan
M Baiget
M Dichgans
M Kaern
M Zernicka-Goetz
MA Ramosarroyo
MF Fraga
MH Gollob
MI McCarthy
MJ Daly
MJ Hoffmann
MM Braun
MP Lee
NA Oates
NA Youngson
NCK Tan
NY Souren
P Donnelly
P Gringras
P Korsten
P Poulsen
R Acosta-Rojas
R Bajoria
R Hirschhorn
R Jaenisch
R Losick
R Saffery
R Weksberg
RA Quintero
RE Hoskins
RP Erickson
RS Spielman
S Bagchi
S Singh
S Tierling
SA Frank
SaM Szymanski
SE Baranzini
Sreeram V Ramagopalan
T Kato
TB Agbabiaka
TJ Bouchard
V Shotelersuk
VK Rakyan
VW Hu
Witold Czyz
WN Spellacy
YH Jiang
Z Kaminsky
ZA Kaminsky
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012

PMCID: PMC3566971. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Springer - Publisher Connector

Oxford University Research Archive

Queen Mary Research Online

A genetic variant in the LDLR promoter is responsible for part of the LDL-cholesterol variability in primary hypercholesterolemia

Author: BE Bernstein
D Tejedor
DE Reich
DJ Rader
DM Waterworth
Emilio Ros
F Civeira
Fernando Civeira
I De Castro-Orós
Isabel De Castro-Orós
J Goldstein
JA Gómez-Gerique
JA Riancho
JA Stamatoyannopoulos
Javier Pérez-López
Jose A Casasnovas
Jose C Rodríguez-Rey
K Quandt
L Palacios
M Abifadel
M Bourbon
Marta Ledesma
Miguel Pocoví
Montserrat Cofán
Montserrat León
MS Sandhu
MV Rockman
P Mozas
PJ Talmud
PJ Wittkopp
R Worsley-Hunt
Rocio Mateo-Gallego
S Kathiresan
S Pampín
S Sanna
Soraya Rebollar
TL Innerarity
TM Teslovich
VD Marinescu
Publication venue: 'Springer Science and Business Media LLC'

Molecular identification of Sicilian (δβ)°-thalassemia associated with β-thalassemia and hemoglobin S in Brazil

Author: A. Fattori
Alter BP
Belhani M
Costa FF
Craig JE
Dacie JV
Efremov GD
Esposito G
F.F. Costa
Henthorn P
Kattamis C
Kinney TR
M.F. Sonati
Mirabile E
Morengo-Rowe AJ
Ottolenghi S
Pembrey MF
S.T.O. Saad
Stamatoyannopoulos G
T.G. de Andrade
Trent RJ
Weatherall DJ
Wolff JA
Zelkowitz L
Publication venue: 'FapUNIFESP (SciELO)'

The Hellenic type of nondeletional hereditary persistence of fetal hemoglobin results from a novel mutation (g.-109G>T) in the HBG2 gene promoter

Author: A Papachatzopoulou
A Ronchi
AE Ronchi
Angelos Kalamaras
AS Tan
Christos Chassanidis
Farzin Pourfarzad
G Stamatoyannopoulos
George P. Patrinos
GP Patrinos
HY Luo
JA Bollekens
K Indrak
M Berry
M Losekoot
M Tasiopoulou
M Wijgerde
Manoussos N. Papadakis
Marios Phylactides
MN Papadakis
Nikolaos K. Vamvakopoulos
P Kollia
Panagoula Kollia
R Gelinas
RA Swank
RC Hardison
S Baal van
Sophia Likousi
TH Huisman
Vassiliki Aleporou-Marinou
Vassilis Maroulis
Z Chen
Publication venue: Springer-Verlag
Publication date: 01/01/2008

Nondeletional hereditary persistence of fetal hemoglobin (nd-HPFH), a rare hereditary condition resulting in elevated levels of fetal hemoglobin (Hb F) in adults, is associated with promoter mutations in the human fetal globin (HBG1 and HBG2) genes. In this paper, we report a novel type of nd-HPFH due to an HBG2 gene promoter mutation (HBG2:g.-109G>T). This mutation, located at the 3′ end of the HBG2 distal CCAAT box, was initially identified in an adult female subject of Central Greek origin and results in elevated Hb F levels (4.1%) and significantly increased Gγ-globin chain production (79.2%). Family studies and DNA analysis revealed that the HBG2:g.-109G>T mutation is also found in family members in compound heterozygosity with the HBG2:g.-158C>T single nucleotide polymorphism or the silent HBB:g.-101C>T β-thalassemia mutation, resulting, in the latter case, in significantly elevated Hb F levels (14.3%). Electrophoretic mobility shift analysis revealed that the HBG2:g.-109G>T mutation abolishes a transcription factor binding site, consistent with previous observations using DNA footprinting analysis, suggesting that guanine at position HBG2/1:g.-109 is critical for NF-E3 binding. These data suggest that the HBG2:g.-109G>T mutation has a functional role in increasing HBG2 transcription and is responsible for the HPFH phenotype observed in our index cases.

Springer - Publisher Connector

University of Thessaly Institutional Repository

An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding

Author: A Arvey
A Marson
A Meissner
AC Mullen
AK Tewari
Akshay Kakumanu
B Langmead
C Taslim
Carolyn A. Morrison
D Strumpf
David K. Gifford
E Redhead
EO Mazzoni
Esteban O. Mazzoni
H Ji
H Niwa
H Xu
HS Rhee
Hynek Wichterle
Ilya Ioshikhes
J-CD Heng
JA Granek
JA Stamatoyannopoulos
JP Ferguson
K Liang
KS Zaret
M Berger
M Ku
MAT Figueiredo
Matthew D. Edwards
MD Robinson
MH Kagey
MP Creyghton
P Huggins
PB Rahl
R Jothi
RI Sherwood
Richard I. Sherwood
S John
S Mahony
SG Landt
Shaun Mahony
TL Bailey
TS Mikkelsen
X Chen
X Zeng
Y Guo
Y Zhang
Z Shao
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2013

Regulatory proteins can bind to different sets of genomic targets in various cell types or conditions. To reliably characterize such condition-specific regulatory binding we introduce MultiGPS, an integrated machine learning approach for the analysis of multiple related ChIP-seq experiments. MultiGPS is based on a generalized Expectation Maximization framework that shares information across multiple experiments for binding event discovery. We demonstrate that our framework enables the simultaneous modeling of sparse condition-specific binding changes, sequence dependence, and replicate-specific noise sources. MultiGPS encourages consistency in reported binding event locations across multiple-condition ChIP-seq datasets and provides accurate estimation of ChIP enrichment levels at each event. MultiGPS's multi-experiment modeling approach thus provides a reliable platform for detecting differential binding enrichment across experimental conditions. We demonstrate the advantages of MultiGPS with an analysis of Cdx2 binding in three distinct developmental contexts. By accurately characterizing condition-specific Cdx2 binding, MultiGPS enables novel insight into the mechanistic basis of Cdx2 site selectivity. Specifically, the condition-specific Cdx2 sites characterized by MultiGPS are highly associated with pre-existing genomic context, suggesting that such sites are pre-determined by cell-specific regulatory architecture. However, MultiGPS-defined condition-independent sites are not predicted by pre-existing regulatory signals, suggesting that Cdx2 can bind to a subset of locations regardless of genomic environment. A summary of this paper appears in the proceedings of the RECOMB 2014 conference, April 2–5. National Science Foundation (U.S.) (Graduate Research Fellowship under Grant 0645960); National Institutes of Health (U.S.) (grant P01 NS055923); Pennsylvania State University. Center for Eukaryotic Gene Regulatio

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

Harvard University - DASH

Columbia University Academic Commons

Public Library of Science (PLOS)

Predicting Human Nucleosome Occupancy from Primary Sequence

Nucleosomes are the fundamental repeating unit of chromatin and comprise the structural building blocks of the living eukaryotic genome. Micrococcal nuclease (MNase) has long been used to delineate nucleosomal organization. Microarray-based nucleosome mapping experiments in yeast chromatin have revealed regularly-spaced translational phasing of nucleosomes. These data have been used to train computational models of sequence-directed nucleosome positioning, which have identified ubiquitous strong intrinsic nucleosome positioning signals. Here, we successfully apply this approach to nucleosome positioning experiments from human chromatin. The predictions made by the human-trained and yeast-trained models are strongly correlated, suggesting a shared mechanism for sequence-based determination of nucleosome occupancy. In addition, we observed striking complementarity between classifiers trained on experimental data from weakly versus heavily digested MNase samples. In the former case, the resulting model accurately identifies nucleosome-forming sequences; in the latter, the classifier excels at identifying nucleosome-free regions. Using this model we are able to identify several characteristics of nucleosome-forming and nucleosome-disfavoring sequences. First, by combining results from each classifier applied de novo across the human ENCODE regions, the classifier reveals distinct sequence composition and periodicity features of nucleosome-forming and nucleosome-disfavoring sequences. Short runs of dinucleotide repeats appear as a hallmark of nucleosome-disfavoring sequences, while nucleosome-forming sequences contain short periodic runs of GC base pairs. Second, we show that nucleosome phasing is most frequently predicted flanking nucleosome-free regions. The results suggest that the major mechanism of nucleosome positioning in vivo is boundary-event-driven and affirm the classical statistical positioning theory of nucleosome organization.

Chromatin loop anchors are associated with genome instability in cancer and recombination hotspots in the germline

Author: A Auton
A Canela
A Gardini
A Liaw
A Losada
AL Valton
AR Quinlan
AS Kudlicki
B Charlesworth
B Gel
B Schuster-Bockler
BJ Taylor
BL Moore
C Bertoli
C Grey
CE Grant
CJ Lord
CM Carvalho
CM Manville
Colin A. Semple
CS Walsh
CT Ong
CY McLean
D Hnisz
D Perera
DG Lupianez
DR Zerbino
E Guillou
E Hatchi
E Splinter
Encode Project Consortium
EP Nora
F Baudat
F McNicoll
F Pratto
G Coop
G Fudenberg
G McVicker
GA McVean
J Feichtinger
J MacArthur
J Weischenfeldt
JA Rosenfeld
JA Stamatoyannopoulos
JHI Haarhuis
JM Engreitz
JN Strathern
JR Dixon
JS Gehring
K Brick
K Hilmi
L Uuskula-Reimand
LJ Valentijn
M Peifer
MA Reijns
MH Nichols
PA Northcott
R Hänsel-Hertsch
R Katainen
R Sabarinathan
S Besenbacher
S Courbet
S Groschel
S Morganella
S Myers
S Nik-Zainal
SS Rao
SV Lensing
TJ Hudson
TL Bailey
TW Glover
V Dileep
VB Kaiser
Vera B. Kaiser
W Winckler
WA Flavahan
WM Hicks
Y Drier
Y Liu
Y Zhang
Z Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2018

BACKGROUND: Chromatin loops form a basic unit of interphase nuclear organization, with chromatin loop anchor points providing contacts between regulatory regions and promoters. However, the mutational landscape at these anchor points remains under-studied. Here, we describe the unusual patterns of somatic mutations and germline variation associated with loop anchor points and explore the underlying features influencing these patterns. RESULTS: Analyses of whole genome sequencing datasets reveal that anchor points are strongly depleted for single nucleotide variants (SNVs) in tumours. Despite low SNV rates in their genomic neighbourhood, anchor points emerge as sites of evolutionary innovation, showing enrichment for structural variant (SV) breakpoints and a peak of SNVs at focal CTCF sites within the anchor points. Both CTCF-bound and non-CTCF anchor points harbour an excess of SV breakpoints in multiple tumour types and are prone to double-strand breaks in cell lines. Common fragile sites, which are hotspots for genome instability, also show elevated numbers of intersecting loop anchor points. Recurrently disrupted anchor points are enriched for genes with functions in cell cycle transitions and regions associated with predisposition to cancer. We also discover a novel class of CTCF-bound anchor points which overlap meiotic recombination hotspots and are enriched for the core PRDM9 binding motif, suggesting that the anchor points have been foci for diversity generated during recent human evolution. CONCLUSIONS: We suggest that the unusual chromatin environment at loop anchor points underlies the elevated rates of variation observed, marking them as sites of regulatory importance but also genomic fragility.