
232 research outputs found

Cis and Trans Effects of Human Genomic Variants on Gene Expression

Author: A Boyd
AC Nica
AC Nica
AH Talukder
Alfonso Buil
AR Wood
AS Dimas
BE Stranger
BE Stranger
BU Schraml
C Wallace
Christopher David Brown
David M. Evans
DF Conrad
DJ Gaffney
DL Nicolae
DM Greenawalt
Donald F. Conrad
E Grundberg
E Klopocki
EE Schadt
Emmanouil T. Dermitzakis
George Davey Smith
GR Abecasis
H Huang
HJ Westra
J Ding
J Millstein
J Zhu
JD Storey
JE Powell
JK Pickrell
John P. Kemp
Julien Bryois
K Hildner
Karen M. Ho
LA Hindorff
M Gutierrez-Arcelus
M Scutari
Matthew Hurles
ND Miller
NL Barbosa-Morais
Panos Deloukas
RW Jones
SB Montgomery
SB Montgomery
SJ Loughran
Stephen B. Montgomery
Susan Ring
T Lappalainen
TR Insel
W Wang
Y Li
Y Li
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2014

This work was funded by the Louis-Jeantet Foundation (http://www.jeantet.ch/), the European Research Council (Grant ID: 260927 http://erc.europa.eu/), the Swiss National Foundation (Grant ID: 130342 http://www.snf.ch), NCCR Frontiers In Genetics (http://www.frontiers-in-genetics.org), the UK Medical Research Council (http://www.mrc.ac.uk) and the Wellcome Trust (Grant ID: 092731).

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Queen Mary Research Online

Explore Bristol Research

University of Queensland eSpace

Archive ouverte UNIGE

FigShare

Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations

Author: AJ Myers
AL Dixon
Alexandra C. Nica
Antigone S. Dimas
AS Dimas
Barbara E. Stranger
BE Stranger
BE Stranger
Claude Beazley
E Zeggini
EE Schadt
Emmanouil T. Dermitzakis
ET Dermitzakis
G Hom
GA McVean
Greg Gibson
HB Fraser
HH Goring
Inês Barroso
JC Barrett
JK Pritchard
L Franke
LAJH Hindorff
M Parkes
MF Moffatt
P Goyette
RA Eeles
RJ Loos
Stephen B. Montgomery
TI Pollin
V Emilsson
V Plagnol
VD Peltekova
VG Cheung
Y Chen
Publication venue: Public Library of Science
Publication date: 01/01/2010

The recent success of genome-wide association studies (GWAS) is now followed by the challenge to determine how the reported susceptibility variants mediate complex traits and diseases. Expression quantitative trait loci (eQTLs) have been implicated in disease associations through overlaps between eQTLs and GWAS signals. However, the abundance of eQTLs and the strong correlation structure (LD) in the genome make it likely that some of these overlaps are coincidental and not driven by the same functional variants. In the present study, we propose an empirical methodology, which we call Regulatory Trait Concordance (RTC), that accounts for local LD structure and integrates eQTLs and GWAS results in order to reveal the subset of association signals that are due to cis eQTLs. We simulate genomic regions of various LD patterns with either one or two causal variants and show that our score outperforms SNP correlation metrics, be they statistical (r²) or historical (D'). Following the observation of a significant abundance of regulatory signals among currently published GWAS loci, we apply our method with the goal of prioritizing relevant genes for each of the respective complex traits. We detect several potential disease-causing regulatory effects, with a strong enrichment for immunity-related conditions, consistent with the nature of the cell line tested (LCLs). Furthermore, we present an extension of the method in trans, where interrogating the whole genome for downstream effects of the disease variant can be informative regarding its unknown primary biological effect. We conclude that integrating cellular phenotype associations with organismal complex traits will facilitate the biological interpretation of the genetic effects on these traits.
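
For readers unfamiliar with the two SNP correlation metrics that the abstract above uses as baselines, the following is a minimal sketch of how r² and D' are commonly computed from phased haplotypes at two biallelic SNPs. It does not implement the RTC score itself; the function name `ld_metrics` and the 0/1 allele coding are assumptions made for illustration only.

```python
def ld_metrics(haplotypes):
    """Return (r2, d_prime) for two biallelic SNPs.

    haplotypes: list of (a, b) tuples, one per phased haplotype,
    with alleles coded 0 or 1 at the first and second SNP.
    (Hypothetical helper for illustration; not part of the cited study.)
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n            # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n            # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    d = p_ab - p_a * p_b                               # raw disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic in the sample")

    r2 = d * d / denom                                 # squared allelic correlation

    # D' rescales |D| by its maximum possible value given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    return r2, d_prime


if __name__ == "__main__":
    # Three of the four possible haplotypes are present, so D' is 1 while r2 is below 1.
    sample = [(1, 1)] * 4 + [(0, 0)] * 4 + [(1, 0)] * 2
    print(ld_metrics(sample))
```

The example sample illustrates why the two metrics can disagree: D' reaches 1 whenever at least one of the four haplotypes is absent, whereas r² reaches 1 only when the two SNPs carry identical information.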

CiteSeerX

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Archive ouverte UNIGE

Data analysis issues for allele-specific expression using Illumina's GoldenGate assay.

Author: A Gimelbrant
AC Tan
Antigone S Dimas
AS Dimas
BE Stranger
BJ Main
C Daelemans
Caroline Daelemans
D Serre
Emmanouil T Dermitzakis
GK Smyth
GK Smyth
GK Smyth
HS Lo
HT Bjornsson
International HapMap Consortium
International HapMap Consortium
J Oosting
J Staaf
JB Fan
JC Knight
K Zhang
KB Meyer
KK Dobbin
Matthew E Ritchie
Matthew S Forrest
ME Ritchie
MJ Dunning
MJ Dunning
ML Martin-Magniette
MP Lee
Panagiotis Deloukas
PH van Bilsen
PR Buckland
PV Pant
R Development Core Team
S Davis
Simon Tavaré
X Feng
Publication venue: BMC Bioinformatics
Publication date: 01/01/2010

BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform exploits coding SNPs to obtain relative expression measurements for alleles at approximately 1500 positions in the genome. RESULTS: We analyze data from a mixture experiment where genomic DNA samples from pairs of individuals of known genotypes are pooled to create allelic imbalances at varying levels for the majority of SNPs on the array. We observe that GoldenGate has less sensitivity at detecting subtle allelic imbalances (around 1.3 fold) compared to extreme imbalances, and note the benefit of applying local background correction to the data. Analysis of data from a dye-swap control experiment allowed us to quantify dye-bias, which can be reduced considerably by careful normalization. The need to filter the data before carrying out further downstream analysis to remove non-responding probes, which show either weak, or non-specific signal for each allele, was also demonstrated. Throughout this paper, we find that a linear model analysis of the data from each SNP is a flexible modelling strategy that allows for testing of allelic imbalances in each sample when replicate hybridizations are available. CONCLUSIONS: Our analysis shows that local background correction carried out by Illumina's software, together with quantile normalization of the red and green channels within each array, provides optimal performance in terms of false positive rates. In addition, we strongly encourage intensity-based filtering to remove SNPs which only measure non-specific signal. 
We anticipate that a similar analysis strategy will prove useful when quantifying ASE on Illumina's higher density Infinium BeadChips.
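
The quantile normalization of the red and green channels mentioned in the abstract above can be sketched as follows. This is a generic two-channel quantile normalization written under simple assumptions (equal-length intensity lists, ties broken by position); it is not the authors' pipeline or Illumina's software, and the function name `quantile_normalize` is hypothetical.

```python
def quantile_normalize(red, green):
    """Force two intensity channels from one array onto a common distribution.

    Each value is replaced by the mean of the red and green values that share
    its rank, so both channels end up with identical sorted values.
    (Illustrative sketch only; ties are broken by original position.)
    """
    if len(red) != len(green):
        raise ValueError("channels must have the same number of probes")

    # Indices of each channel in ascending order of intensity.
    order_r = sorted(range(len(red)), key=red.__getitem__)
    order_g = sorted(range(len(green)), key=green.__getitem__)

    # Reference distribution: rank-wise mean across the two channels.
    reference = [(red[i] + green[j]) / 2 for i, j in zip(order_r, order_g)]

    norm_r = [0.0] * len(red)
    norm_g = [0.0] * len(green)
    for rank, (i, j) in enumerate(zip(order_r, order_g)):
        norm_r[i] = reference[rank]   # put the reference value back at the probe's position
        norm_g[j] = reference[rank]
    return norm_r, norm_g


if __name__ == "__main__":
    r, g = quantile_normalize([5.0, 2.0, 3.0], [4.0, 1.0, 9.0])
    print(r, g)
```

After normalization the two channels share the same set of values, which removes channel-wide intensity differences such as dye bias while preserving each probe's rank within its channel.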

Crossref

Springer - Publisher Connector

PubMed Central

Apollo (Cambridge)

University of Melbourne Institutional Repository

Archive ouverte UNIGE

Detection of regulator genes and eQTLs in gene networks

Author: A Butte
A Chatr-Aryamontri
A Clauset
A Joshi
A Joshi
A Kundaje
AA Shabalin
AJ Enright
AJ Walhout
AS Dimas
B Schwanhausser
B Zhang
B Zhang
C Cenik
CO Daub
D Koller
DA Cusanovich
DM Greenawalt
E Bonnet
E Ravasz
E Segal
EC Neto
EC Neto
EC Neto
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EJ Foss
F Grubert
F Yue
FA Cubillos
FW Albert
G Hemani
G Nicholson
GD Smith
GH Golub
H Foroughi Asl
H Talukdar
HN Kadarmideen
J Millstein
J Qi
J Zhu
J Zhu
J Zhu
JE Aten
JF Ayroles
JJ Faith
JL Björkegren
JS Liu
K Basso
K Qu
KG Ardlie
L Wu
LA Hindorff
LH Hartwell
LS Chen
M Ashburner
M Civelek
M Georges
M Gerstein
M Medvedovic
M Schmidt
M Scutari
MA Schaub
MB Eisen
MD Ritchie
ME Goddard
MEJ Newman
MEJ Newman
MV Rockman
MV Rockman
N Friedman
N Friedman
N Friedman
N Laird
O Stegle
P Langfelder
P Langfelder
P Langfelder
P Lu
R Sharan
R Sharan
RB Brem
RW Williams
S Lee
S Roy
S Tavazoie
SI Lee
SM Waszak
SS Rao
T Lappalainen
T Michoel
TA Manolio
TF Mackay
The ENCODE
TS Furey
VG Cheung
W Cookson
W Zhang
Y Chen
Y Li
Publication venue: Springer Science and Business Media LLC
Publication date: 01/11/2016

Genetic differences between individuals associated with quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated with differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and software to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.

arXiv.org e-Print Archive

Crossref

Patterns of Cis Regulatory Variation in Diverse Human Populations

The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. 
Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.

Public Library of Science (PLOS)

CiteSeerX

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universität Tübingen

MPG.PuRe

Explore Bristol Research

Archive ouverte UNIGE

FigShare

University of Queensland eSpace

Comparison of Strategies to Detect Epistasis from eQTL Data

Author: AJ Iafrate
AS Dimas
B Maher
BE Stranger
C Herold
D Curtis
DM Evans
E Lee
H Lango Allen
HJ Cordell
Ioannis Xenarios
J Gayan
J Marchini
J Ronald
J Zhu
JD Storey
JN Hirschhorn
KA Frazer
Karen Kapur
L De Lobel
LJ Jensen
M Emily
M Kellis
Momiao Xiong
NA Sinnott-Armstrong
RB Brem
RB Brem
S Purcell
S Suthram
SI Lee
Sven Bergmann
T Schupbach
Thierry Schüpbach
WS Bush
X Zhang
YV Sun
Zoltán Kutalik
Publication venue: Public Library of Science
Publication date: 01/01/2011

Genome-wide association studies have been instrumental in identifying genetic variants associated with complex traits such as human disease or gene expression phenotypes. It has been proposed that extending existing analysis methods by considering interactions between pairs of loci may uncover additional genetic effects. However, the large number of possible two-marker tests presents significant computational and statistical challenges. Although several strategies to detect epistasis effects have been proposed and tested for specific phenotypes, so far there has been no systematic attempt to compare their performance using real data. We made use of thousands of gene expression traits from linkage and eQTL studies to compare the performance of different strategies. In yeast, we found that using information from marginal associations between markers and phenotypes to detect epistatic effects yielded a lower false discovery rate (FDR) than a strategy solely using biological annotation, whereas results from human data were inconclusive. For future studies whose aim is to discover epistatic effects, we recommend incorporating information about marginal associations between SNPs and phenotypes instead of relying solely on biological annotation. Improved methods to discover epistatic effects will result in a more complete understanding of complex genetic effects.

Public Library of Science (PLOS)

Crossref

Serveur académique lausannois

Directory of Open Access Journals

PubMed Central

Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS

Author: A Gerrits
AL Stark
AS Dimas
D Levy
Dan L. Nicolae
DB Goldstein
DJ Klionsky
DL Nicolae
E Choy
EE Schadt
EE Schadt
EE Schadt
ER Gamazon
Eric Gamazon
GR Abecasis
Greg Gibson
HR Coleman
I Hovatta
J Hampe
JC Barrett
JN Hirschhorn
K Bullaughey
L Shi
LA Hindorff
LA Hindorff
M Comabella
M. Eileen Dolan
MJ Cowley
Nancy J. Cox
P Kraft
RA Irizarry
S Duan
S Purcell
Shiwei Duan
TA Manolio
V Emilsson
Wei Zhang
Publication venue: Public Library of Science
Publication date: 01/04/2010

Although genome-wide association studies (GWAS) of complex traits have yielded more reproducible associations than had been discovered using any other approach, the loci characterized to date do not account for much of the heritability of such traits and, in general, have not led to improved understanding of the biology underlying complex phenotypes. Using a web site we developed to serve results of expression quantitative trait locus (eQTL) studies in lymphoblastoid cell lines from HapMap samples (http://www.scandb.org), we show that single nucleotide polymorphisms (SNPs) associated with complex traits (from http://www.genome.gov/gwastudies/) are significantly more likely to be eQTLs than minor-allele-frequency–matched SNPs chosen from high-throughput GWAS platforms. These findings are robust across a range of thresholds for establishing eQTLs (p-values from 10^-4 to 10^-8), and a broad spectrum of human complex traits. Analyses of GWAS data from the Wellcome Trust studies confirm that annotating SNPs with a score reflecting the strength of the evidence that the SNP is an eQTL can improve the ability to discover true associations and clarify the nature of the mechanism driving the associations. Our results, showing that trait-associated SNPs are more likely to be eQTLs and that application of this information can enhance discovery of trait-associated SNPs for complex phenotypes, raise the possibility that we can utilize this information both to increase the heritability explained by identifiable genetic factors and to gain a better understanding of the biology underlying complex traits.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche.

Author: A Kong
A Lomniczi
A Murphy
A Schröder
AA D’Aloisio
Aarno Palotie
AB Migliano
Adamo Pio D’adamo
Aida K. Dieffenbach
AL Dixon
Albert Hofman
Albert V. Smith
Albertine J. Oldehinkel
Alexander Teumer
Alison M. Dunning
Andrea D. Coviello
Andrea Ganna
Andres Metspalu
Andrew C. Heath
Andrew D. Johnson
André G. Uitterlinden
Angela Cox
Anja Rudolph
Anna M. Storniolo
Anna Murray
Anneli Pouta
Annika Lindblom
Antonietta Robino
AP Abreu
Arto Mannermaa
AS Dimas
AV Segrè
B Kabakchiev
B Zhang
BE Stranger
Behrooz Z. Alizadeh
Ben A. Oostra
Bernardo Bonanni
Bjarke Feenstra
BP Fairfax
Bruce H. R. Wolffenbuttel
C Colantuoni
C Liu
C-J Partsch
Carl Blomqvist
Catharina A. Hartman
Cathy E. Elks
CE Elks
Christian Gieger
Chunyan He
CJ Willer
Cornelia M. van Duijn
CP Schaaf
Craig E. Pennell
D Mehta
D Sasayama
DA Cusanovich
Daniel F. Gudbjartsson
Daniel I. Chasman
Daniel L. Koller
David Couper
David J. Hunter
David Karasik
David P. Strachan
David Schlessinger
Debbie Lawlor
Deborah J. Thompson
Diana L. Cousminer
Dieter Flesch-Janys
Diether Lambrechts
Dirkje S. Postma
DM Greenawalt
Doris Stöckl
Dorret I. Boomsma
Douglas F. Easton
Douglas P. Kiel
E Grundberg
E Grundberg
E Grundberg
Eco E. J. de Geus
EE Schadt
EK Speliotes
EL Heinzen
Eleonora Porcu
Elisabeth Widen
Elizabeth A. Streeten
Ellen A. Nohr
Ellen W. Demerath
Emmi Tikkanen
Enda M. Byrne
Eric Boerwinkle
Erik Ingelsson
Erin K. Wagner
Eva Albrecht
Evelin Mihailov
F Innocenti
F Nagl
F Zou
Felix Day
Femke Atsma
Fergus J. Couch
Fernando Rivadeneira
Frank B. Hu
Frank Geller
GA Heap
George Davey Smith
George McMahon
Georgia Chenevix-Trench
Gerard Waeber
Gisli Masson
Gonneke Willemsen
Graham G. Giles
Grant W. Montgomery
Gudmar Thorleifsson
Gudny Eiriksdottir
H Lango Allen
H-J Westra
Harold Snieder
Heather A. Boyd
Heli Nevanlinna
Henri Wallaschofski
Henrik Flyger
Henry Völzke
Hermann Brenner
HHH Göring
Hiltrud Brauch
Hoda Anton-Culver
IK Temple
Ilja M. Nolte
Immaculata De Vivo
Irene L. Andrulis
J Ding
J Huang
J Yang
J Yang
J Yang
J. Margriet Collée
JA Webster
Javier Benitez
JC Barrett
Jenny A. Visser
Jenny Chang-Claude
JF Degner
Jing Hua Zhao
Jinhui Chen
Joanne M. Murabito
Joe Dennis
Johan G. Eriksson
John L. Hopper
John P. Rice
John R. B. Perry
Jonathan Tyrer
Jouke-Jan Hottenga
JR Gibbs
Julia A. Knight
Julian Peto
Julie E. Buring
K Estrada
K Hao
K Michailidou
Kamila Czene
Kari Stefansson
Kathryn L. Lunetta
Katri Pylkäs
Kay-Tee Khaw
Ken K. Ong
KS Kompass
Kyriaki Michailidou
L Liang
Laura Crisponi
Laura J. Bierut
Laura M. Yerges-Armstrong
Lavinia Paternoster
LB Barreiro
LFG Silveira
Lili Milani
Lisette Stolk
Lude Franke
Luigi Ferrucci
Lynda M. Rose
M Horikoshi
M Rantalainen
Maartje J. Hooning
Mads Melbye
Manjeet K. Bolla
Marek Zygmunt
Margaret J. Wright
Marjanka K. Schmidt
Marjo-Riitta Järvelin
Mark I. McCarthy
Massimo Mangino
Matthias W. Beckmann
Meir J. Stampfer
Melanie Waldenberger
Melissa E. Garcia
Mellissa C. Southey
Michael J. Econs
Montserrat García-Closas
Munro Peacock
N Ruf
Najaf Amin
Nancy L. Pedersen
Natalia Tšernikova
Nicholas G. Martin
Nicholas J. Timpson
Nicholas J. Wareham
Nora Franceschini
P Prentice
Paolo Gasparini
Paolo Peterlongo
Paolo Radice
Pascal Guénel
Patrick F. McArdle
Patrick Neven
Patrick Sulem
Patrik K. Magnusson
Paul D. P. Pharoah
Paul M. Ridker
Per Hall
Peter A. Fasching
Peter Devilee
Peter Kraft
Peter Vollenweider
Q Li
Qin Wang
Reedik Mägi
Robert Winqvist
Robin Haring
Roger L. Milne
RP Grinspon
RS Huang
RSN Fehrmann
Ruth J. F. Loos
S Cho
S Constantin
S Sulzbacher
Sandra Lai
Sanela Kjellqvist
SE Parker
Serena Sanna
Sheila Ulivi
Stefania Bandinelli
Stephen Chanock
Stig E. Bojesen
Suiqun Guo
Susan Ring
Sven Bergmann
T Kwan
T Maeda
T Zeller
Tamara B. Harris
Tanguy Corre
Tatiana Foroud
Teresa Ferreira
Thorkild I. A. Sørensen
Tim D. Spector
Tune H. Pers
Tõnu Esko
Ulla Sovio
Unnur Thorsteinsdottir
Ute Hamann
V Emilsson
Veikko Salomaa
Veli-Matti Kosma
Vilmundur Gudnason
W Zheng
Wei Q. Ang
Wendy L. McArdle
Y Idaghdour
Y Stelzer
Z Zadik
Zoltan Kutalik
Publication venue: Nature Publishing Group
Publication date: 01/01/2014

Age at menarche is a marker of timing of puberty in females. It varies widely between individuals, is a heritable trait and is associated with risks for obesity, type 2 diabetes, cardiovascular disease, breast cancer and all-cause mortality. Studies of rare human disorders of puberty and animal models point to a complex hypothalamic-pituitary-hormonal regulation, but the mechanisms that determine pubertal timing and underlie its links to disease risk remain unclear. Here, using genome-wide and custom-genotyping arrays in up to 182,416 women of European descent from 57 studies, we found robust evidence (P < 5 × 10^-8) for 123 signals at 106 genomic loci associated with age at menarche. Many loci were associated with other pubertal traits in both sexes, and there was substantial overlap with genes implicated in body mass index and various diseases, including rare disorders of puberty. Menarche signals were enriched in imprinted regions, with three loci (DLK1-WDR25, MKRN3-MAGEL2 and KCNK9) demonstrating parent-of-origin-specific associations concordant with known parental expression patterns. Pathway analyses implicated nuclear hormone receptors, particularly retinoic acid and γ-aminobutyric acid-B2 receptor signalling, among novel mechanisms that regulate pubertal timing in humans. Our findings suggest a genetic architecture involving at least hundreds of common variants in the coordinated timing of the pubertal transition.

VU Research Portal

University of Groningen

Carolina Digital Repository

Leiden University Scholary Publications

PuSH

White Rose Research Online

Online Research Database In Technology

Archivio istituzionale della ricerca - Università di Trieste

Proceedings - University of Groningen

LSHTM Research Online

Copenhagen University Research Information System

PubMed Central

Oxford University Research Archive

Apollo (Cambridge)

University of Melbourne Institutional Repository

Explore Bristol Research

Serveur académique lausannois

Spiral - Imperial College Digital Repository

Institute of Cancer Research Repository

University of Queensland eSpace

Archivio della ricerca - Università degli studi di Napoli Federico II

Crossref

IUPUIScholarWorks

ARTS repository - University of Groningen

EUR Research Repository

Open Research Exeter

University of Southern Denmark Research Output

King's Research Portal

St George's Online Research Archive

Dissertations of the University of Groningen

Modified penetrance of coding variants by cis-regulatory variation contributes to disease risk

Coding variants represent many of the strongest associations between genotype and phenotype; however, they exhibit interindividual differences in effect, termed 'variable penetrance'. Here, we study how cis-regulatory variation modifies the penetrance of coding variants. Using functional genomic and genetic data from the Genotype-Tissue Expression Project (GTEx), we observed that in the general population, purifying selection has depleted haplotype combinations predicted to increase pathogenic coding variant penetrance. Conversely, in cancer and autism patients, we observed an enrichment of penetrance-increasing haplotype configurations for pathogenic variants in disease-implicated genes, providing evidence that regulatory haplotype configuration of coding variants affects disease risk. Finally, we experimentally validated this model by editing a Mendelian single-nucleotide polymorphism (SNP) using CRISPR/Cas9 on distinct expression haplotypes with the transcriptome as a phenotypic readout. Our results demonstrate that joint regulatory and coding variant effects are an important part of the genetic architecture of human traits and contribute to modified penetrance of disease-causing variants.

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Helsingin yliopiston digitaalinen arkisto

Using Stochastic Causal Trees to Augment Bayesian Networks for Modeling eQTL Datasets

Author: AFM Smith
Ambuj K Singh
AS Dimas
AV Werhli
BE Stranger
BJ Chen
D Heckerman
D Husmeier
D Husmeier
D Madigan
DC Kulp
DJ Lockhart
DM Ruderfer
E Chaibub Neto
EE Schadt
EO Perlstein
GA Churchill
J Pearl
J Zhu
J Zhu
J Zhu
JD Storey
JJ Faith
JJ Keurentjes
Kyle C Chipman
M Ashburner
M Morley
M Schena
MH Kutner
N Bing
N Friedman
N Friedman
O Litvin
RB Brem
RB Brem
RC Jansen
RW Doerge
S Imoto
S Mukherjee
SI Lee
W Pan
W Zhang
W Zou
Y Benjamini
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2011

BACKGROUND: The combination of genotypic and genome-wide expression data arising from segregating populations offers an unprecedented opportunity to model and dissect complex phenotypes. The immense potential offered by these data derives from the fact that genotypic variation is the sole source of perturbation and can therefore be used to reconcile changes in gene expression programs with the parental genotypes. To date, several methodologies have been developed for modeling eQTL data. These methods generally leverage genotypic data to resolve causal relationships among gene pairs implicated as associates in the expression data. In particular, leading studies have augmented Bayesian networks with genotypic data, providing a powerful framework for learning and modeling causal relationships. While these initial efforts have provided promising results, one major drawback associated with these methods is that they are generally limited to resolving causal orderings for transcripts most proximal to the genomic loci. In this manuscript, we present a probabilistic method capable of learning the causal relationships between transcripts at all levels in the network. We use the information provided by our method as a prior for Bayesian network structure learning, resulting in enhanced performance for gene network reconstruction. RESULTS: Using established protocols to synthesize eQTL networks and corresponding data, we show that our method achieves improved performance over existing leading methods. For the goal of gene network reconstruction, our method achieves improvements in recall ranging from 20% to 90% across a broad range of precision levels and for datasets of varying sample sizes. Additionally, we show that the learned networks can be utilized for expression quantitative trait loci mapping, resulting in upwards of 10-fold increases in recall over traditional univariate mapping.
CONCLUSIONS: Using the information from our method as a prior for Bayesian network structure learning yields large improvements in accuracy for the tasks of gene network reconstruction and expression quantitative trait loci mapping. In particular, our method is effective for establishing causal relationships between transcripts located both proximally and distally from genomic loci.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central