Search CORE

33 research outputs found

Analysis of case-control association studies with known risk variants

Author: Altshuler David
Dermitzakis Emmanouil T.
Groop Leif
Haiman Christopher A.
Henderson Brian E.
Kolonel Laurence N.
Kraft Peter
Marchand Loic Le
Patterson Nick
Paşaniuc Bogdan
Pollack Samuela
Price Alkes L.
Stranger Barbara E.
Voight Benjamin
Waters Kevin
Zaitlen Noah
Publication venue
Publication date: 02/08/2017
Field of study

Motivation: The question of how to best use information from known associated variants when conducting disease association studies has yet to be answered. Some studies compute a marginal P-value for each Several Nucleotide Polymorphisms independently, ignoring previously discovered variants. Other studies include known variants as covariates in logistic regression, but a weakness of this standard conditioning strategy is that it does not account for disease prevalence and non-random ascertainment, which can induce a correlation structure between candidate variants and known associated variants even if the variants lie on different chromosomes. Here, we propose a new conditioning approach, which is based in part on the classical technique of liability threshold modeling. Roughly, this method estimates model parameters for each known variant while accounting for the published disease prevalence from the epidemiological literature. Results: We show via simulation and application to empirical datasets that our approach outperforms both the no conditioning strategy and the standard conditioning strategy, with a properly controlled false-positive rate. Furthermore, in multiple data sets involving diseases of low prevalence, standard conditioning produces a severe drop in test statistics whereas our approach generally performs as well or better than no conditioning. Our approach may substantially improve disease gene discovery for diseases with many known risk variants. Availability: LTSOFT software is available online http://www.hsph.harvard.edu/faculty/alkes-price/software/ Contact: [email protected]; [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin

RERO DOC Digital Library

Recommended from our members

Leveraging population admixture to characterize the heritability of complex traits.

Author: Assimes Themistocles L
Berndt Sonja I
Bhatia Gaurav
Blot William J
Chanock Stephen
Franceschini Nora
Goodman Phyllis G
Gusev Alexander
Haiman Christopher
He Jing
Hennis Anselm JM
Hsing Ann
Ingles Sue A
Isaacs William
Kittles Rick A
Klein Eric A
Kooperberg Charles
Lange Leslie A
Nemesure Barbara
Pasaniuc Bogdan
Patterson Nick
Pollack Samuela
Price Alkes L
Reich David
Reiner Alex P
Rybicki Benjamin A
Sankararaman Sriram
Stanford Janet L
Stevens Victoria L
Stram Daniel
Strom Sara S
Tandon Arti
Tang Hua
Vilhjálmsson Bjarni J
Whitsel Eric A
Wilson James G
Witte John S
Xu Jianfeng
Young Taylor
Zaitlen Noah
Zhang Jianqi
Publication venue: eScholarship, University of California
Publication date: 01/12/2014
Field of study

Despite recent progress on estimating the heritability explained by genotyped SNPs (h(2)g), a large gap between h(2)g and estimates of total narrow-sense heritability (h(2)) remains. Explanations for this gap include rare variants or upward bias in family-based estimates of h(2) due to shared environment or epistasis. We estimate h(2) from unrelated individuals in admixed populations by first estimating the heritability explained by local ancestry (h(2)γ). We show that h(2)γ = 2FSTCθ(1 - θ)h(2), where FSTC measures frequency differences between populations at causal loci and θ is the genome-wide ancestry proportion. Our approach is not susceptible to biases caused by epistasis or shared environment. We applied this approach to the analysis of 13 phenotypes in 21,497 African-American individuals from 3 cohorts. For height and body mass index (BMI), we obtained h(2) estimates of 0.55 ± 0.09 and 0.23 ± 0.06, respectively, which are larger than estimates of h(2)g in these and other data but smaller than family-based estimates of h(2)

eScholarship - University of California

Recommended from our members

Genetically Determined Plasma Lipid Levels and Risk of Diabetic Retinopathy: A Mendelian Randomization Study.

Author: Asian Genetic Epidemiology Network Consortium
Burdon Kathryn P
Chen Ching J
Chen Yii-Der Ida
Cheng Ching-Yu
Chong Yong He
Christiansen Mark W
Cotch Mary Frances
Craig Jamie E
Fan Qiao
Gan Alfred
Gudnason Vilmundur
Guo Xiuqing
Hai Yang
Hancock Heather
Hanis Craig L
Huang Yu-Chuen
Hung Yi-Jen
Ipp Eli
Jensen Richard A
Jia Yucheng
Kaidonis Georgia
Kim Jihye
Klein Barbara EK
Klein Ronald
Kuo Jane
Lee Wen-Jane
Li Xiaohui
Liao Wen-Ling
Liew Gerald
Mitchell Paul
Penman Alan
Pollack Samuela
Price Alkes
Psaty Bruce M
Rotter Jerome I
Sandow Kevin
Smith Albert V
Sobrin Lucia
Stanwyck Lynn K
Tan Gavin
Tsai Fuu-Jen
Wang Jie Jin
Wong Tien Yin
Publication venue: eScholarship, University of California
Publication date: 01/12/2017
Field of study

Results from observational studies examining dyslipidemia as a risk factor for diabetic retinopathy (DR) have been inconsistent. We evaluated the causal relationship between plasma lipids and DR using a Mendelian randomization approach. We pooled genome-wide association studies summary statistics from 18 studies for two DR phenotypes: any DR (N = 2,969 case and 4,096 control subjects) and severe DR (N = 1,277 case and 3,980 control subjects). Previously identified lipid-associated single nucleotide polymorphisms served as instrumental variables. Meta-analysis to combine the Mendelian randomization estimates from different cohorts was conducted. There was no statistically significant change in odds ratios of having any DR or severe DR for any of the lipid fractions in the primary analysis that used single nucleotide polymorphisms that did not have a pleiotropic effect on another lipid fraction. Similarly, there was no significant association in the Caucasian and Chinese subgroup analyses. This study did not show evidence of a causal role of the four lipid fractions on DR. However, the study had limited power to detect odds ratios less than 1.23 per SD in genetically induced increase in plasma lipid levels, thus we cannot exclude that causal relationships with more modest effect sizes exist

eScholarship - University of California

Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits

Author: A Gusev
A Helgason
A Kong
AL Price
Alkes L. Price
B Maher
B Pasaniuc
B Towne
BJ Hayes
Bogdan Pasaniuc
DB Goldstein
EA Stahl
EE Eichler
G Gibson
G Pilia
Gaurav Bhatia
HC So
HM Kang
IJ Deary
IJ Deary
J McClellan
J Yang
J Yang
J Yang
JE Powell
JM Murabito
K Silventoinen
KS Kendler
LA Hindorff
MF Feitosa
Nick Patterson
Noah Zaitlen
NR Wray
O Zuk
Peter Kraft
Peter M. Visscher
PM Visscher
PM Visscher
PM Visscher
PM Visscher
PM Visscher
S Vattikuti
Samuela Pollack
SH Lee
SP Dickson
SR Browning
SR Browning
TA Manolio
WG Hill
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/09/2012
Field of study

Important knowledge about the determinants of complex human phenotypes can be obtained from the estimation of heritability, the fraction of phenotypic variation in a population that is determined by genetic factors. Here, we make use of extensive phenotype data in Iceland, long-range phased genotypes, and a population-wide genealogical database to examine the heritability of 11 quantitative and 12 dichotomous phenotypes in a sample of 38,167 individuals. Most previous estimates of heritability are derived from family-based approaches such as twin studies, which may be biased upwards by epistatic interactions or shared environment. Our estimates of heritability, based on both closely and distantly related pairs of individuals, are significantly lower than those from previous studies. We examine phenotypic correlations across a range of relationships, from siblings to first cousins, and find that the excess phenotypic correlation in these related individuals is predominantly due to shared environment as opposed to dominance or epistasis. We also develop a new method to jointly estimate narrow-sense heritability and the heritability explained by genotyped SNPs. Unlike existing methods, this approach permits the use of information from both closely and distantly related pairs of individuals, thereby reducing the variance of estimates of heritability explained by genotyped SNPs while preventing upward bias. Our results show that common SNPs explain a larger proportion of the heritability than previously thought, with SNPs present on Illumina 300K genotyping arrays explaining more than half of the heritability for the 23 phenotypes examined in this study. Much of the remaining heritability is likely to be due to rare alleles that are not captured by standard genotyping arrays

DSpace@MIT

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Recommended from our members

Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types.

Author: Anttila Verneri
Bernstein Bradley E
Brainstorm Consortium
Buenrostro Jason D
Byrnes Andrea
Finucane Hilary K
Gazal Steven
Genovese Giulio
Gusev Alexander
Lareau Caleb
Loh Po-Ru
Macosko Evan
McCarroll Steven
Neale Benjamin M
Perry John RB
Pollack Samuela
Price Alkes L
Raychaudhuri Soumya
Reshef Yakir A
Saunders Arpiar
Shoresh Noam
Slowikowski Kamil
Publication venue: Nat Genet
Publication date: 01/04/2018
Field of study

We introduce an approach to identify disease-relevant tissues and cell types by analyzing gene expression data together with genome-wide association study (GWAS) summary statistics. Our approach uses stratified linkage disequilibrium (LD) score regression to test whether disease heritability is enriched in regions surrounding genes with the highest specific expression in a given tissue. We applied our approach to gene expression data from several sources together with GWAS summary statistics for 48 diseases and traits (average N = 169,331) and found significant tissue-specific enrichments (false discovery rate (FDR) < 5%) for 34 traits. In our analysis of multiple tissues, we detected a broad range of enrichments that recapitulated known biology. In our brain-specific analysis, significant enrichments included an enrichment of inhibitory over excitatory neurons for bipolar disorder, and excitatory over inhibitory neurons for schizophrenia and body mass index. Our results demonstrate that our polygenic approach is a powerful way to leverage gene expression data for interpreting GWAS signals

Apollo (Cambridge)

Replication and fine mapping of asthma-associated loci in individuals of African ancestry

Author: Barr R. Graham
Burkart Kristin M.
Gajdos Zofia K.
Heckbert Susan R.
Hirschhorn Joel N.
Jacobs David R.
Kantor David B.
Kumar Rajesh
Loehr Laura R.
London Stephanie J.
Lyon Helen
Meng Yan
O’Connor George T.
Palmer Cameron D.
Papanicolaou George
Petrini Marcy F.
Pollack Samuela
Price Alkes L.
Smith Lewis J.
White Wendy B.
Young Taylor R.
Publication venue
Publication date: 01/01/2013
Field of study

Asthma originates from genetic and environmental factors with about half the risk of disease attributable to heritable causes. Genome-wide association studies, mostly in populations of European ancestry, have identified numerous asthma-associated single nucleotide polymorphisms (SNPs). Studies in populations with diverse ancestries allow both for identification of robust associations that replicate across ethnic groups and for improved resolution of associated loci due to different patterns of linkage disequilibrium between ethnic groups. Here we report on an analysis of 745 African-American subjects with asthma and 3,238 African-American control subjects from the Candidate Gene Association Resource (CARe) Consortium, including analysis of SNPs imputed using 1,000 Genomes reference panels and adjustment for local ancestry. We show strong evidence that variation near RAD50/IL13, implicated in studies of European ancestry individuals, replicates in individuals largely of African ancestry. Fine mapping in African ancestry populations also refined the variants of interest for this association. We also provide strong or nominal evidence of replication at loci near ORMDL3/GSDMB, IL1RLML18R1, and 10pl4, all previously associated with asthma in European or Japanese populations, but not at the PYHIN1 locus previously reported in studies of African-American samples. These results improve the understanding of asthma genetics and further demonstrate the utility of genetic studies in populations other than those of largely European ancestry

PubMed Central

Carolina Digital Repository

Multiethnic Genome-Wide Association Study of Diabetic Retinopathy Using Liability Threshold Modeling of Duration of Diabetes and Glycemic Control

Author: Adler Sharon G.
Ahlqvist Emma
Ahn Jeeyun
Cheng Ching-Yu
Christiansen Mark
Colhoun Helen M.
Davoudi Samaneh
Dimitrov Latchezar M.
Doney Alexander
Freedman Barry I.
Groop Leif
Guo Xiuqing
H-H Sheu Wayne
Hadjadj Samy
Hai Yang
Hosseini S. Mohsen
Ida Chen Yii-Der
Igo Robert P.
Imamura Minako
Ipp Eli
Jensen Richard A.
Jia Yucheng
Kim Jihye
Kubo Michiaki
Kuo Jane Z.
Lee I-Te
Leong Aaron
Li Xiaohui
Marre Michel
McCarthy Mark I.
Mckean-Cowdin Roberta
Meng Weihua
Mitchell Paul
Morris Andrew
Ng Maggie C. Y.
Nousome Darryl
Palmer Colin
Park Kyu Hyung
Paterson Andrew D.
Pollack Samuela
Price Alkes
Rossin Elizabeth J.
Sedor John R.
Segrè Ayellet V.
Shah Kaanan
Smith Albert V.
Sobrin Luca
Stanwyck Lynn K.
Takahashi Atsushi
Tan Gavin S.
Taylor Kent D.
Tregouet David-Alexandre
Varma Rohit
Publication venue
Publication date: 28/11/2018
Field of study

Correction: Volume69, Issue6 Page1306-1306 DOI10.2337/db20-er06a Published JUN 2020To identify genetic variants associated with diabetic retinopathy (DR), we performed a large multiethnic genome-wide association study. Discovery included eight European cohorts (n = 3,246) and seven African American cohorts (n = 2,611). We meta-analyzed across cohorts using inverse-variance weighting, with and without liability threshold modeling of glycemic control and duration of diabetes. Variants with a P valuePeer reviewe

Lund University Publications

Edinburgh Research Explorer

Helsingin yliopiston digitaalinen arkisto

University of Dundee Online Publications

Genome-wide Comparison of African-Ancestry Populations from CARe and Other Cohorts Reveals Signals of Natural Selection

The study of recent natural selection in human populations has important applications to human history and medicine. Positive natural selection drives the increase in beneficial alleles and plays a role in explaining diversity across human populations. By discovering traits subject to positive selection, we can better understand the population level response to environmental pressures including infectious disease. Our study examines unusual population differentiation between three large data sets to detect natural selection. The populations examined, African Americans, Nigerians, and Gambians, are genetically close to one another (FST < 0.01 for all pairs), allowing us to detect selection even with moderate changes in allele frequency. We also develop a tree-based method to pinpoint the population in which selection occurred, incorporating information across populations. Our genome-wide significant results corroborate loci previously reported to be under selection in Africans including HBB and CD36. At the HLA locus on chromosome 6, results suggest the existence of multiple, independent targets of population-specific selective pressure. In addition, we report a genome-wide significant (p = 1.36 × 10−11) signal of selection in the prostate stem cell antigen (PSCA) gene. The most significantly differentiated marker in our analysis, rs2920283, is highly differentiated in both Africa and East Asia and has prior genome-wide significant associations to bladder and gastric cancers

Carolina Digital Repository

Informed Conditioning on Clinical Covariates Increases Power in Case-Control Association Studies

Author: Aage Haugen
AL Price
AL Price
Albert Rosenberger
Alkes L. Price
Angela Risch
Ann W. Morgan
Anne Barton
Anthony G. Wilson
Barry I. Freedman
Benjamin Voight
BF Voight
Bogdan Pasaniuc
Brian E. Henderson
C Wallace
Carl D. Langefeld
Christopher Haiman
CI Amos
CL Kuo
D Campa
D Clayton
D Cox
D Thomas
DA Schaumberg
Daniel I. Chasman
David Altshuler
David C. Christiani
David J. Friedman
David J. Hunter
David Scherf
Debra A. Schaumberg
DJ Hunter
Donald W. Bowden
DS Falconer
Eric Tchetgen Tchetgen
ESBD Lander
G Genovese
G Jin
G Maskarinec
Giulio Genovese
GM Monsees
GV Kryukov
H Holm
HC So
Heike Bickeböller
J Dong
J Marchini
Jane Worthington
JK Field
JM Neuhaus
Joachim Heinrich
John K. Field
JR Perry
JRB Perry
Kevin M. Waters
KL Ellis
KM Waters
Laurence N. Kolonel
LD Robinson
Leif Groop
Loic Le Marchand
LT Guey
Lynne J. Hocking
M Imielinski
M Pirinen
Maria Teresa Landi
Marilyn Cornelis
Martin Walshaw
Michael Meister
ML Freedman
N Chatterjee
N Risch
N Zaitlen
N Zaitlen
Nick Patterson
NJ Risch
NJ Wald
Noah Zaitlen
NR Wray
Olaide Y. Raji
P Armitage
P Kraft
P Sulem
Pamela J. Hicks
Paul Wordsworth
Peter Kraft
Peter M. Visscher
PM Ridker
Robert M. Plenge
S Kathiresan
S Lindstrom
S Raychaudhuri
S Rose
S Van Gestel
S Zienolddiny
Samuela Pollack
Sara Lindström
SH Lee
Shanbeh Zienolddiny
SJ Chanock
Sophia Steer
Steve Eyre
T Lumley
TH Hamza
TJ Vanderweele
TM Frayling
W Thomson
WG Hill
WW Piegorsch
Z Kote-Jarai
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low–BMI cases are larger than those estimated from high–BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-control-covariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value = 1×10−9). The improvement varied across diseases with a 16% median increase in χ2 test statistics and a commensurate increase in power. This suggests that applying our method to existing and future association studies of these diseases may identify novel disease loci

Directory of Open Access Journals

The University of Manchester - Institutional Repository

PuSH

White Rose Research Online

FigShare

University of Queensland eSpace

Lund University Publications

Crossref

Harvard University - DASH

PubMed Central

Oxford University Research Archive

King's Research Portal

Recommended from our members

Leveraging population admixture to explain missing heritability of complex traits

Author: Assimes Themistocles L.
Berndt Sonja I.
Bhatia Gaurav
Blot William J.
Chanock Stephen
Franceschini Nora
Goodman Phyllis G.
Gusev Alexander
Haiman Christopher
He Jing
Hennis Anselm JM
Hsing Ann
Ingles Sue A.
Isaacs William
Kittles Rick A.
Klein Eric A.
Kooperberg Charles
Lange Leslie A.
Nemesure Barbara
Pasaniuc Bogdan
Patterson Nick
Pollack Samuela
Price Alkes L.
Reich David
Reiner Alex P.
Rybicki Benjamin A.
Sankararaman Sriram
Stanford Janet L.
Stevens Victoria L
Stram Daniel
Strom Sara S.
Tandon Arti
Tang Hua
Vilhjálmsson Bjarni J.
Whitsel Eric A
Wilson James G.
Witte John S.
Xu Jianfeng
Young Taylor
Zaitlen Noah
Zhang Jianqi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/07/2015
Field of study

Despite recent progress on estimating the heritability explained by genotyped SNPs (hg2), a large gap between hg2 and estimates of total narrow-sense heritability (h2) remains. Explanations for this gap include rare variants, or upward bias in family-based estimates of h2 due to shared environment or epistasis. We estimate h2 from unrelated individuals in admixed populations by first estimating the heritability explained by local ancestry (hγ2). We show that hγ2 = 2FSTCθ(1−θ)h2, where FSTC measures frequency differences between populations at causal loci and θ is the genome-wide ancestry proportion. Our approach is not susceptible to biases caused by epistasis or shared environment. We examined 21,497 African Americans from three cohorts, analyzing 13 phenotypes. For height and BMI, we obtained h2 estimates of 0.55 ± 0.09 and 0.23 ± 0.06, respectively, which are larger than estimates of hg2 in these and other data, but smaller than family-based estimates of h2

Harvard University - DASH