51 research outputs found

The cost of large numbers of hypothesis tests on power, effect size and sample size

Author: A Ray
CR Genovese
DJ Dow
DR Nyholt
J Li
JA Todd
JD Storey
JM Bland
JM Cheverud
JP Shaffer
KN Conneely
L C Lazzeroni
LC Lazzeroni
M Baker
P Billingsley
PH Westfall
PJ Veazie
R Bender
RA Fisher
RG Miller
S Stigler
SP Selwood
WYS Wang
Y Benjamini
Publication venue: Nature Publishing Group

Advances in high-throughput biology and computer science are driving an exponential increase in the number of hypothesis tests in genomics and other scientific disciplines. Studies using current genotyping platforms frequently include a million or more tests. In addition to the monetary cost, this increase imposes a statistical cost owing to the multiple testing corrections needed to avoid large numbers of false-positive results. To safeguard against the resulting loss of power, some have suggested sample sizes on the order of tens of thousands that can be impractical for many diseases or may lower the quality of phenotypic measurements. This study examines the relationship between the number of tests on the one hand and power, detectable effect size or required sample size on the other. We show that once the number of tests is large, power can be maintained at a constant level, with comparatively small increases in the effect size or sample size. For example, at the 0.05 significance level, a 13% increase in sample size is needed to maintain 80% power for ten million tests compared with one million tests, whereas a 70% increase in sample size is needed for 10 tests compared with a single test. Relative costs are less when measured by increases in the detectable effect size. We provide an interactive Excel calculator to compute power, effect size or sample size when comparing study designs or genome platforms involving different numbers of hypothesis tests. The results are reassuring in an era of extreme multiple testing.
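The sample-size figures quoted in the abstract above can be approximated with a short calculation. The sketch below is not the authors' Excel calculator. It assumes a Bonferroni correction, a two-sided z-test, and the usual normal approximation in which the required sample size for a fixed effect size is proportional to (z_crit + z_power)^2. The function names `relative_sample_size` and `relative_effect_size` are hypothetical.

```python
from statistics import NormalDist


def relative_sample_size(m_new, m_old, alpha=0.05, power=0.80):
    """Ratio n(m_new) / n(m_old) of required sample sizes.

    Assumptions (not taken from the paper): Bonferroni correction,
    two-sided z-test, fixed effect size, normal approximation.
    """
    std_normal = NormalDist()
    z_power = std_normal.inv_cdf(power)

    def z_sum(m):
        # Two-sided critical value at the Bonferroni-adjusted level alpha / m.
        z_crit = -std_normal.inv_cdf(alpha / (2 * m))
        return z_crit + z_power

    # For a fixed effect size, n is proportional to (z_crit + z_power)^2.
    return (z_sum(m_new) / z_sum(m_old)) ** 2


def relative_effect_size(m_new, m_old, alpha=0.05, power=0.80):
    """Ratio of detectable effect sizes at a fixed sample size.

    Under the same assumptions, the detectable effect size scales with
    (z_crit + z_power), i.e. the square root of the sample-size ratio.
    """
    return relative_sample_size(m_new, m_old, alpha, power) ** 0.5


if __name__ == "__main__":
    for m_old, m_new in [(1, 10), (10**6, 10**7)]:
        n_ratio = relative_sample_size(m_new, m_old)
        d_ratio = relative_effect_size(m_new, m_old)
        print(f"{m_old:>9,} -> {m_new:>10,} tests: "
              f"sample size x{n_ratio:.2f}, effect size x{d_ratio:.2f}")
```

Under these assumptions the sample-size ratios come out at roughly 1.70 (10 tests versus 1) and 1.13 (ten million versus one million), in line with the 70% and 13% increases stated in the abstract. The corresponding effect-size ratios are the square roots of those values, which is why the relative cost is smaller when measured by detectable effect size.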

Crossref

PubMed Central

Genetic Variation in Selenoprotein Genes, Lifestyle, and Risk of Colon and Rectal Cancer

Author: Abbie Lundgreen
AE Damdimopoulos
AP Lothrop
B Welbourn
Bill Welbourn
C Meplan
Christopher Corcoran
D Behne
D Sibbing
E Kumaraswamy
ES Arner
Georgina L. Hold
H Schenk
J Hesketh
K Liu
KN Conneely
LC Clark
Martha L. Slattery
ML Slattery
MLLA Slattery
N Fradejas
P Gresner
PH Westfall
QA Sun
R Irons
RF Burk
RL Poerschke
Roger K. Wolff
S Edwards
S Holm
U Haug
U Peters
VA Shchedrina
WS Samowitz
X Zhou
X Zhu
YJ Kim
Publication venue: Public Library of Science
Publication date: 17/05/2012

BACKGROUND: Associations between selenium and cancer have directed attention to the role of selenoproteins in the carcinogenic process. METHODS: We used data from two population-based case-control studies of colon (n = 1555 cases, 1956 controls) and rectal (n = 754 cases, 959 controls) cancer. We evaluated the association between genetic variation in TXNRD1, TXNRD2, TXNRD3, C11orf31 (SelH), SelW, SelN1, SelS, SepX, and SeP15 with colorectal cancer risk. RESULTS: After adjustment for multiple comparisons, several associations were observed. Two SNPs in TXNRD3 were associated with rectal cancer (rs11718498 dominant OR 1.42 95% CI 1.16,1.74 pACT 0.0036 and rs9637365 recessive OR 0.70 95% CI 0.55,0.90 pACT 0.0208). Four SNPs in SepN1 were associated with rectal cancer (rs11247735 recessive OR 1.30 95% CI 1.04,1.63 pACT 0.0410; rs2072749 GGvsAA OR 0.53 95% CI 0.36,0.80 pACT 0.0159; rs4659382 recessive OR 0.58 95% CI 0.39,0.86 pACT 0.0247; rs718391 dominant OR 0.76 95% CI 0.62,0.94 pACT 0.0300). Interaction between these genes and exposures that could influence these genes showed numerous significant associations after adjustment for multiple comparisons. Two SNPs in TXNRD1 and four SNPs in TXNRD2 interacted with aspirin/NSAID to influence colon cancer; one SNP in TXNRD1, two SNPs in TXNRD2, and one SNP in TXNRD3 interacted with aspirin/NSAIDs to influence rectal cancer. Five SNPs in TXNRD2 and one in SelS, SeP15, and SelW1 interacted with estrogen to modify colon cancer risk; one SNP in SelW1 interacted with estrogen to alter rectal cancer risk. Several SNPs in this candidate pathway influenced survival after diagnosis with colon cancer (SeP15 and SepX1 increased HRR) and rectal cancer (SepX1 increased HRR). CONCLUSIONS: Findings support an association between selenoprotein genes and colon and rectal cancer development and survival after diagnosis. Given the interactions observed, it is likely that the impact of cancer susceptibility from genotype is modified by lifestyle.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers

Author: A Genz
B Devlin
B Han
BL Browning
Buhm Han
D Altshuler
DA Williams
DJ Schaid
DL Nicolae
DR Nyholt
DY Lin
E Eskin
E Jorgenson
Eleazar Eskin
F Dudbridge
F Yates
FS Collins
G Kimmel
GU Yule
Hyun Min Kang
I Pe'er
J Li
J Marchini
JD Storey
JK Pritchard
JM Cheverud
John D. Storey
KN Conneely
LA Wasserman
N Risch
N Zaitlen
NA Zaitlen
P de Bakker
PD Sasieni
PH Westfall
RJ Klein
S Purcell
SR Browning
SR Seaman
TA Louis
TR Bhangale
V Hajivassiliou
V Moskvina
Y Benjamini
Publication venue: Public Library of Science
Publication date: 01/04/2009

With the development of high-throughput sequencing and genotyping technologies, the number of markers collected in genetic association studies is growing rapidly, increasing the importance of methods for correcting for multiple hypothesis testing. The permutation test is widely considered the gold standard for accurate multiple testing correction, but it is often computationally impractical for these large datasets. Recently, several studies proposed efficient alternative approaches to the permutation test based on the multivariate normal distribution (MVN). However, they cannot accurately correct for multiple testing in genome-wide association studies for two reasons. First, these methods require partitioning of the genome into many disjoint blocks and ignore all correlations between markers from different blocks. Second, the true null distribution of the test statistic often fails to follow the asymptotic distribution at the tails of the distribution. We propose an accurate and efficient method for multiple testing correction in genome-wide association studies—SLIDE. Our method accounts for all correlation within a sliding window and corrects for the departure of the true null distribution of the statistic from the asymptotic distribution. In simulations using the Wellcome Trust Case Control Consortium data, the error rate of SLIDE's corrected p-values is more than 20 times smaller than the error rate of the previous MVN-based methods' corrected p-values, while SLIDE is orders of magnitude faster than the permutation test and other competing methods. We also extend the MVN framework to the problem of estimating the statistical power of an association study with correlated markers and propose an efficient and accurate power estimation method SLIP. SLIP and SLIDE are available at http://slide.cs.ucla.edu

Public Library of Science (PLOS)

Crossref

SNU Open Repository and Archive

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Common Polymorphisms in MTNR1B, G6PC2 and GCK Are Associated with Increased Fasting Plasma Glucose and Impaired Beta-Cell Function in Chinese Subjects

BACKGROUND: Previous studies identified melatonin receptor 1B (MTNR1B), islet-specific glucose 6 phosphatase catalytic subunit-related protein (G6PC2), glucokinase (GCK) and glucokinase regulatory protein (GCKR) as candidate genes for type 2 diabetes (T2D) acting through elevated fasting plasma glucose (FPG). We examined the associations of the reported common variants of these genes with T2D and glucose homeostasis in three independent Chinese cohorts. METHODOLOGY/PRINCIPAL FINDINGS: Five single nucleotide polymorphisms (SNPs), MTNR1B rs10830963, G6PC2 rs16856187 and rs478333, GCK rs1799884 and GCKR rs780094, were genotyped in 1644 controls (583 adults and 1061 adolescents) and 1342 T2D patients. The G-allele of MTNR1B rs10830963 and the C-alleles of both G6PC2 rs16856187 and rs478333 were associated with higher FPG (6.6×10^{-5} < P < 0.0034) in healthy controls. In addition to our previous report for association with FPG, the A-allele of GCK rs1799884 was also associated with reduced homeostasis model assessment of beta-cell function (HOMA-B) (P = 0.0015). Together with GCKR rs780094, the risk alleles of these SNPs exhibited a dosage effect in their associations with increased FPG (P = 2.9×10^{-9}) and reduced HOMA-B (P = 1.1×10^{-3}). Meta-analyses strongly supported additive effects of MTNR1B rs10830963 and G6PC2 rs16856187 on FPG. CONCLUSIONS/SIGNIFICANCE: Common variants of MTNR1B, G6PC2 and GCK are associated with elevated FPG and impaired insulin secretion, both individually and jointly, suggesting that these risk alleles may precipitate or perpetuate hyperglycemia in predisposed individuals.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

DNA methylation signature of chronic low-grade inflammation and its role in cardio-respiratory diseases

Author: Baccarelli AA
Beekman M
Bell JT
BIOS consortium
Bis JC
Bodinier B
Boerwinkle E
Brenner H
Brody JA
Cardona A
Chadeau-Hyam M
Chambers JC
Chen W
Colicino E
Conneely KN
Cugliari G
Deary IJ
Dehghan A
Deloukas P
Doerr M
Elliott P
Farlik M
Fiorito G
Fisher K
Ghanbari M
Grabe HJ
Graf-Schindler J
Guan W
Gào X
Heijmans BT
Herzig K-H
Hurme MA
Iacoviello L
Joehanes R
Järvelin M-R
Karhunen V
Kasela S
Koenig W
Kooner JS
Krogh V
Kuehnel B
Kähönen M
Köttgen A
Lehne B
Lehtimäki T
Levy D
Li S
Loh M
Mandaviya PR
Marioni RE
Marouli E
Matullo G
Milani L
Miller AH
Mishra PP
Mustafa R
Nauck M
Ong KK
Palaniswamy S
Panico S
Peters A
Psaty BM
Raitakari OT
Rathmann W
Relton C
Robinson O
Sacerdote C
Schlosser P
Selvin E
Smith AK
Sotoodehnia N
Tanaka T
Teumer A
Torres MA
Tsai P-C
Tumino R
Tzala E
Van der Harst P
Van Meurs JBJ
Verweij N
Vineis P
Waite LL
Waldenberger M
Walton E
Weninger W
Wielscher M
Wilson R
Xia Y
Zhang T
Zhang Y
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2022

We performed a multi-ethnic epigenome-wide association study on 22,774 individuals to describe the DNA methylation signature of chronic low-grade inflammation as measured by C-reactive protein (CRP). We find 1,511 independent differentially methylated loci associated with CRP. These CpG sites show correlation structures across chromosomes, and are primarily situated in euchromatin, depleted in CpG islands. These genomic loci are predominantly situated in transcription factor binding sites and genomic enhancer regions. Mendelian randomization analysis suggests altered CpG methylation is a consequence of increased blood CRP levels. Mediation analysis reveals obesity and smoking as important underlying driving factors for changed CpG methylation. Finally, we find that an activated CpG signature significantly increases the risk for cardiometabolic diseases and COPD.

Spiral - Imperial College Digital Repository

Leiden University Scholarly Publications

PuSH

Queen Mary Research Online

The association of polymorphisms in hormone metabolism pathway genes, menopausal hormone therapy, and breast cancer risk: a nested case-control study in the California Teachers Study cohort

Author: A Forsti
A Jansson
A Kalliokoski
AH Eliassen
AH Payne
AM Dunning
Argyrios Ziogas
BT Zhu
C Justenhoven
C Mao
C Schairer
CA Haiman
CK Edlund
CM Farquhar
Collaborative Group on Hormonal Factors in Breast Cancer
David Van Den Berg
DJ Hunter
DO Stram
Eunjung Lee
F Canzian
Fredrick Schumacher
Giske Ursin
H Ding
HL Olsson
Hoda Anton-Culver
HS Feigelson
I Tamai
J Li
JE Rossouw
JL Kelsey
Juan Pablo Lewinger
JY Choi
Katherine D Henderson
KN Conneely
KP Economopoulos
KW Reding
L Beckmann
L Bernstein
L Le Marchand
L Speroff
L Yao
Leslie Bernstein
LF Masson
M Becchis
MM Gaudet
N Gamage
Pamela L Horn-Ross
PD Pharoah
RG Tirona
RK Ross
S Kwong
SS Tworoger
Susan L Neuhausen
T Key
T Nozawa
T Saxena
The MARIE-GENICA Consortium on Genetic Susceptibility for Menopausal Hormone Therapy Related Breast Cancer Risk
TR Rebbeck
UV Onay
VN Kristensen
VW Setiawan
W Zheng
WM van der Deure
Y Ji
Y Li
Y Sun
YL Low
Z Wang
Publication venue: BioMed Central
Publication date: 01/04/2011

INTRODUCTION: The female sex steroids estrogen and progesterone are important in breast cancer etiology. It therefore seems plausible that variation in genes involved in metabolism of these hormones may affect breast cancer risk, and that these associations may vary depending on menopausal status and use of hormone therapy. METHODS: We conducted a nested case-control study of breast cancer in the California Teachers Study cohort. We analyzed 317 tagging single nucleotide polymorphisms (SNPs) in 24 hormone pathway genes in 2746 non-Hispanic white women: 1351 cases and 1395 controls. Odds ratios (ORs) and 95% confidence intervals (CIs) were estimated by fitting conditional logistic regression models using all women or subgroups of women defined by menopausal status and hormone therapy use. P values were adjusted for multiple correlated tests (P ACT). RESULTS: The strongest associations were observed for SNPs in SLCO1B1, a solute carrier organic anion transporter gene, which transports estradiol-17β-glucuronide and estrone-3-sulfate from the blood into hepatocytes. Ten of 38 tagging SNPs of SLCO1B1 showed significant associations with postmenopausal breast cancer risk; 5 SNPs (rs11045777, rs11045773, rs16923519, rs4149057, rs11045884) remained statistically significant after adjusting for multiple testing within this gene (P ACT = 0.019-0.046). In postmenopausal women who were using combined estrogen-progestin therapy (EPT) at cohort enrollment, the OR of breast cancer was 2.31 (95% CI = 1.47-3.62) per minor allele of rs4149013 in SLCO1B1 (P = 0.0003; within-gene P ACT = 0.002; overall P ACT = 0.023). SNPs in other hormone pathway genes evaluated in this study were not associated with breast cancer risk in premenopausal or postmenopausal women. CONCLUSIONS: We found evidence that genetic variation in SLCO1B1 is associated with breast cancer risk in postmenopausal women, particularly among those using EPT.

Crossref

PubMed Central

eScholarship - University of California

Genome-wide association studies identify 137 genetic loci for DNA methylation biomarkers of aging

Author: Arnett D
Bandinelli S
Bell JT
Binder AM
Boerwinkle E
Boomsma DI
Broer L
Chen W
Christensen K
Conneely KN
Correa A
Davies G
de Geus EJC
Deary IJ
Dugué P-A
Durda P
Elliott HR
Elliott P
Ferrucci L
Fornage M
Genetics of DNA Methylation Consortium
Gieger C
Guo X
Harris SE
Hayward C
Hemani G
Horvath S
Hägg S
Imboden M
Irvin M
Jeong A
Jung J
Kaprio J
Kardia SLR
Kasela S
Katrinli S
Kresovich JK
Kuo P-L
Kähönen M
Lawlor DA
Lehtimäki T
Li S
Lohoff FW
Lu AT
Lunetta KL
Mangino M
Marioni RE
Mason D
Matias-Garcia PR
McCartney DL
McIntosh AM
Mengel-From J
Milani L
Milne RL
Min JL
Mishra PP
Moore AZ
Murabito JM
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
Nygaard M
Ollikainen M
Palviainen T
Pankow JS
Patki A
Pedersen NL
Peters A
Polidoro S
Porteous DJ
Probst-Hensch N
Raffield LM
Raitakari O
Ratliff SM
Reiner AP
Relton CL
Rich SS
Richardson TG
Richmond RC
Ritz B
Robinson O
Rotter JI
Sandler DP
Sillanpää E
Smith AK
Smith JA
Sobczyk MK
Soerensen M
Southey MC
Strauch K
Sun D
Tanaka T
Taylor JA
Tillin T
Tiwari H
Tsai P-C
Uitterlinden AG
Van Den Berg DJ
van der Zee MD
van Dongen J
van Meurs JBJ
Vineis P
Waldenberger M
Walker RM
Wang X
Wang Y
Wilson JG
Wright J
Xia R
Xu Z
Yao J
Yet I
Zhao W
Publication date: 01/01/2021

BACKGROUND: Biological aging estimators derived from DNA methylation data are heritable and correlate with morbidity and mortality. Consequently, identification of genetic and environmental contributors to the variation in these measures in populations has become a major goal in the field. RESULTS: Leveraging DNA methylation and SNP data from more than 40,000 individuals, we identify 137 genome-wide significant loci, of which 113 are novel, from genome-wide association study (GWAS) meta-analyses of four epigenetic clocks and epigenetic surrogate markers for granulocyte proportions and plasminogen activator inhibitor 1 levels, respectively. We find evidence for shared genetic loci associated with the Horvath clock and expression of transcripts encoding genes linked to lipid metabolism and immune function. Notably, these loci are independent of those reported to regulate DNA methylation levels at constituent clock CpGs. A polygenic score for GrimAge acceleration showed strong associations with adiposity-related traits, educational attainment, parental longevity, and C-reactive protein levels. CONCLUSION: This study illuminates the genetic architecture underlying epigenetic aging and its shared genetic contributions with lifestyle factors and longevity

UCL Discovery

Multiple testing correction in linear mixed models

Author: A Cortes
A Genz
A Kirby
A Köttgen
AM Davie
B Han
B Pasaniuc
BJ Bennett
BL Browning
Buhm Han
BW Parks
BZ He
C Lippert
C Sabatti
CC Park
CR Farber
D Altshuler
D Lee
DE Reich
DL Aylor
DY Lin
E Kostem
E Org
E Zeggini
EK Speliotes
Eleazar Eskin
EN Smith
F Hormozdiari
F Le Gall
Farhad Hormozdiari
G Consortium
G Kichaev
GR Abecasis
H Hakonarson
HM Kang
J Flint
J Hagmann
J Listgarten
J Yang
J Yu
JH Sul
Jong Wha J. Joo
JWJ Joo
KN Conneely
M Abney
M Fakiola
MI McCarthy
N Fusi
N Zaitlen
NA Bokulich
NA Furlotte
P-RR Loh
R Sladek
RA Gibbs
RB Brem
S Ripke
SR Seaman
V Hajivassiliou
V Williams
W Chen
W Huang
W Valdar
W Zhang
X Gao
X Zhou
Y Lu
Y Okada
Z Sidák
Z Zhang
Publication venue: Springer Science and Business Media LLC
Publication date: 01/04/2016

BACKGROUND: Multiple hypothesis testing is a major issue in genome-wide association studies (GWAS), which often analyze millions of markers. The permutation test is considered to be the gold standard in multiple testing correction as it accurately takes into account the correlation structure of the genome. Recently, the linear mixed model (LMM) has become the standard practice in GWAS, addressing issues of population structure and insufficient power. However, none of the current multiple testing approaches are applicable to LMM. RESULTS: We were able to estimate per-marker thresholds as accurately as the gold standard approach in real and simulated datasets, while reducing the time required from months to hours. We applied our approach to mouse, yeast, and human datasets to demonstrate the accuracy and efficiency of our approach. CONCLUSIONS: We provide an efficient and accurate multiple testing correction approach for linear mixed models. We further provide an intuition about the relationships between per-marker threshold, genetic relatedness, and heritability, based on our observations in real data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0903-6) contains supplementary material, which is available to authorized users

Crossref

PubMed Central

eScholarship - University of California

Epigenetic Signatures of Cigarette Smoking

Author: Absher DM
Ambatipudi S
Arnett DK
Aslibekyan S
Baccarelli AA
Bandinelli S
Barrdahl M
Binder EB
Bressler J
Brody JA
Colicino E
Conneely KN
Deary IJ
DeMeo DL
Demerath EW
Dhingra R
Ding J
Elks CE
Ferrucci L
Fornage M
Gharib SA
Grove ML
Guan W
Herceg Z
Hernandez DG
Hofman A
Hou L
Huan T
Irvin MR
Joehanes R
Just AC
Kardia SLR
Kiel DP
Klengel T
Kunze S
Levy D
Liang L
Liu C
Liu Y
Lohman K
London SJ
Mandaviya PR
Marioni RE
McRae AF
Melzer D
Mendelson MM
Moreno-Macias H
O'Connor GT
Ong KK
Pankow JS
Peters A
Pilling LC
Psaty BM
Ressler KJ
Reynolds LM
Rodriguez CJ
Romieu I
Schwartz J
Sha J
Shah SH
Singleton AB
Smith AK
Smith JA
Sotoodehnia N
Starr JM
Swenson BR
Taylor KD
Turner ST
Uitterlinden AG
van Meurs JBJ
Vineis P
Visscher PM
Vokonas PS
Waldenberger M
Wang-Sattler R
Ware EB
Wareham NJ
Wray NR
Xu T
Yao C
Yousefi P
Zhao W
Zhi D
Publication venue: Circulation: Cardiovascular Genetics
Publication date: 01/01/2016

BACKGROUND: DNA methylation leaves a long-term signature of smoking exposure and is one potential mechanism by which tobacco exposure predisposes to adverse health outcomes, such as cancers, osteoporosis, lung, and cardiovascular disorders. METHODS AND RESULTS: To comprehensively determine the association between cigarette smoking and DNA methylation, we conducted a meta-analysis of genome-wide DNA methylation assessed using the Illumina BeadChip 450K array on 15 907 blood-derived DNA samples from participants in 16 cohorts (including 2433 current, 6518 former, and 6956 never smokers). Comparing current versus never smokers, 2623 cytosine-phosphate-guanine sites (CpGs), annotated to 1405 genes, were statistically significantly differentially methylated at Bonferroni threshold of P<1×10^{-7} (18 760 CpGs at false discovery rate <0.05). Genes annotated to these CpGs were enriched for associations with several smoking-related traits in genome-wide studies including pulmonary function, cancers, inflammatory diseases, and heart disease. Comparing former versus never smokers, 185 of the CpGs that differed between current and never smokers were significant at P<1×10^{-7} (2623 CpGs at false discovery rate <0.05), indicating a pattern of persistent altered methylation, with attenuation, after smoking cessation. Transcriptomic integration identified effects on gene expression at many differentially methylated CpGs. CONCLUSIONS: Cigarette smoking has a broad impact on genome-wide methylation that, at many loci, persists many years after smoking cessation. Many of the differentially methylated genes were novel genes with respect to biological effects of smoking and might represent therapeutic targets for prevention or treatment of tobacco-related diseases. Methylation at these sites could also serve as sensitive and stable biomarkers of lifetime exposure to tobacco smoke. Biotechnology and Biological Sciences Research Council, British Heart Foundation, Cancer Research UK, Medical Research Council, National Institutes of Health, Royal Society, Wellcome Trust

Maastricht University Research Portal

EUR Research Repository

PubMed Central

Edinburgh Research Explorer

Erasmus University Digital Repository

Apollo (Cambridge)

MPG.PuRe