Search CORE

315,851 research outputs found

Statistical inference with anchored Bayesian mixture of regressions models: A case study analysis of allometric data

Author: Kunkel Deborah
Peruggia Mario
Publication venue
Publication date: 10/05/2019
Field of study

We present a case study in which we use a mixture of regressions model to improve on an ill-fitting simple linear regression model relating log brain mass to log body mass for 100 placental mammalian species. The slope of this regression model is of particular scientific interest because it corresponds to a constant that governs a hypothesized allometric power law relating brain mass to body mass. A specific line of investigation is to determine whether the regression parameters vary across subgroups of related species. We model these data using an anchored Bayesian mixture of regressions model, which modifies the standard Bayesian Gaussian mixture by pre-assigning small subsets of observations to given mixture components with probability one. These observations (called anchor points) break the relabeling invariance typical of exchangeable model specifications (the so-called label-switching problem). A careful choice of which observations to pre-classify to which mixture components is key to the specification of a well-fitting anchor model. In the article we compare three strategies for the selection of anchor points. The first assumes that the underlying mixture of regressions model holds and assigns anchor points to different components to maximize the information about their labeling. The second makes no assumption about the relationship between x and y and instead identifies anchor points using a bivariate Gaussian mixture model. The third strategy begins with the assumption that there is only one mixture regression component and identifies anchor points that are representative of a clustering structure based on case-deletion importance sampling weights. We compare the performance of the three strategies on the allometric data set and use auxiliary taxonomic information about the species to evaluate the model-based classifications estimated from these models

arXiv.org e-Print Archive

Chromatin loop anchors are associated with genome instability in cancer and recombination hotspots in the germline

Author: A Auton
A Canela
A Gardini
A Liaw
A Losada
AL Valton
AR Quinlan
AS Kudlicki
B Charlesworth
B Gel
B Schuster-Bockler
BJ Taylor
BL Moore
C Bertoli
C Grey
CE Grant
CJ Lord
CM Carvalho
CM Manville
Colin A. Semple
CS Walsh
CT Ong
CY McLean
D Hnisz
D Perera
DG Lupianez
DR Zerbino
E Guillou
E Hatchi
E Splinter
Encode Project Consortium
EP Nora
F Baudat
F McNicoll
F Pratto
G Coop
G Fudenberg
G Fudenberg
G McVicker
GA McVean
J Feichtinger
J MacArthur
J Weischenfeldt
JA Rosenfeld
JA Stamatoyannopoulos
JHI Haarhuis
JM Engreitz
JN Strathern
JR Dixon
JS Gehring
K Brick
K Hilmi
L Uuskula-Reimand
LJ Valentijn
M Peifer
MA Reijns
MH Nichols
PA Northcott
R Hänsel-Hertsch
R Katainen
R Sabarinathan
S Besenbacher
S Courbet
S Groschel
S Morganella
S Myers
S Myers
S Nik-Zainal
SS Rao
SV Lensing
TJ Hudson
TL Bailey
TW Glover
TW Glover
V Dileep
VB Kaiser
Vera B. Kaiser
W Winckler
WA Flavahan
WM Hicks
Y Drier
Y Liu
Y Zhang
Z Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2018
Field of study

Abstract Background Chromatin loops form a basic unit of interphase nuclear organization, with chromatin loop anchor points providing contacts between regulatory regions and promoters. However, the mutational landscape at these anchor points remains under-studied. Here, we describe the unusual patterns of somatic mutations and germline variation associated with loop anchor points and explore the underlying features influencing these patterns. Results Analyses of whole genome sequencing datasets reveal that anchor points are strongly depleted for single nucleotide variants (SNVs) in tumours. Despite low SNV rates in their genomic neighbourhood, anchor points emerge as sites of evolutionary innovation, showing enrichment for structural variant (SV) breakpoints and a peak of SNVs at focal CTCF sites within the anchor points. Both CTCF-bound and non-CTCF anchor points harbour an excess of SV breakpoints in multiple tumour types and are prone to double-strand breaks in cell lines. Common fragile sites, which are hotspots for genome instability, also show elevated numbers of intersecting loop anchor points. Recurrently disrupted anchor points are enriched for genes with functions in cell cycle transitions and regions associated with predisposition to cancer. We also discover a novel class of CTCF-bound anchor points which overlap meiotic recombination hotspots and are enriched for the core PRDM9 binding motif, suggesting that the anchor points have been foci for diversity generated during recent human evolution. Conclusions We suggest that the unusual chromatin environment at loop anchor points underlies the elevated rates of variation observed, marking them as sites of regulatory importance but also genomic fragility

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer