Search CORE

1,451 research outputs found

Unsupervised empirical Bayesian multiple testing with external covariates

Author: Ferkingstad Egil
Frigessi Arnoldo
Kong Augustine
Rue Håvard
Thorleifsson Gudmar
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2008
Field of study

In an empirical Bayesian setting, we provide a new multiple testing method, useful when an additional covariate is available, that influences the probability of each null hypothesis being true. We measure the posterior significance of each test conditionally on the covariate and the data, leading to greater power. Using covariate-based prior information in an unsupervised fashion, we produce a list of significant hypotheses which differs in length and order from the list obtained by methods not taking covariate-information into account. Covariate-modulated posterior probabilities of each null hypothesis are estimated using a fast approximate algorithm. The new method is applied to expression quantitative trait loci (eQTL) data.Comment: Published in at http://dx.doi.org/10.1214/08-AOAS158 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Crossref

Local False Discovery Rate Based Methods for Multiple Testing of One-Way Classified Hypotheses

Author: Sarkar Sanat K.
Zhao Zhigen
Publication venue
Publication date: 29/07/2019
Field of study

This paper continues the line of research initiated in \cite{Liu:Sarkar:Zhao:2016} on developing a novel framework for multiple testing of hypotheses grouped in a one-way classified form using hypothesis-specific local false discovery rates (Lfdr's). It is built on an extension of the standard two-class mixture model from single to multiple groups, defining hypothesis-specific Lfdr as a function of the conditional Lfdr for the hypothesis given that it is within a significant group and the Lfdr for the group itself and involving a new parameter that measures grouping effect. This definition captures the underlying group structure for the hypotheses belonging to a group more effectively than the standard two-class mixture model. Two new Lfdr based methods, possessing meaningful optimalities, are produced in their oracle forms. One, designed to control false discoveries across the entire collection of hypotheses, is proposed as a powerful alternative to simply pooling all the hypotheses into a single group and using commonly used Lfdr based method under the standard single-group two-class mixture model. The other is proposed as an Lfdr analog of the method of \cite{Benjamini:Bogomolov:2014} for selective inference. It controls Lfdr based measure of false discoveries associated with selecting groups concurrently with controlling the average of within-group false discovery proportions across the selected groups. Simulation studies and real-data application show that our proposed methods are often more powerful than their relevant competitors.Comment: 26 pages, 17 figure

arXiv.org e-Print Archive

A correction for sample overlap in genome-wide association studies in a polygenic pleiotropy-informed framework.

Author: Agartz I
Agerbo E
Albus M
Alexander M
Amin F
Andreassen BK
Andreassen OA
Bacanu SA
Begemann M
Belliveau RA
Bene J
Bevilacqua E
Bigdeli TB
Black DW
Bruggeman R
Buccola NG
Buckner RL
Bulik-Sullivan B
Cahn W
Cai G
Cairns MJ
Campion D
Cantor RM
Carr VJ
Carrera N
Catts SV
Chambert KD
Chan RCK
Chen EYH
Chen RYL
Cheng W
Cheung EFC
Chong SA
Cloninger CR
Cohen D
Cohen N
Collier DA
Cormican P
Corvin A
Craddock N
Crespo-Facorro B
Crowley JJ
Curtis D
Davidson M
Davis KL
de Haan L
Degenhardt F
DeLisi LE
Demontis D
Dikeos D
Dinan T
Donohoe G
Drapeau E
Duan J
Dudbridge F
Durmishi N
Eichhammer P
Eriksson J
Escott-Price V
Essioux L
Fanous AH
Farh KH
Farrell MS
Favero JD
Frank J
Franke L
Freedman R
Freimer NB
Friedl M
Friedman JI
Frigessi A
Fromer M
Genovese G
Georgieva L
Gershon ES
Giegling I
Giusti-Rodriguez P
Godard S
Goldstein JI
Golimbet V
Gopal S
Gratten J
Hammer C
Hamshere ML
Hansen M
Hansen T
Haroutunian V
Hartmann AM
Henskens FA
Herms S
Hirschhorn JN
Huang H
LeBlanc M
Lee P
Neale BM
Pers TH
Ripke S
Thompson WK
Walters JTR
Zuber V
Publication venue: BMC Genomics
Publication date: 01/01/2018
Field of study

BACKGROUND: There is considerable evidence that many complex traits have a partially shared genetic basis, termed pleiotropy. It is therefore useful to consider integrating genome-wide association study (GWAS) data across several traits, usually at the summary statistic level. A major practical challenge arises when these GWAS have overlapping subjects. This is particularly an issue when estimating pleiotropy using methods that condition the significance of one trait on the signficance of a second, such as the covariate-modulated false discovery rate (cmfdr). RESULTS: We propose a method for correcting for sample overlap at the summary statistic level. We quantify the expected amount of spurious correlation between the summary statistics from two GWAS due to sample overlap, and use this estimated correlation in a simple linear correction that adjusts the joint distribution of test statistics from the two GWAS. The correction is appropriate for GWAS with case-control or quantitative outcomes. Our simulations and data example show that without correcting for sample overlap, the cmfdr is not properly controlled, leading to an excessive number of false discoveries and an excessive false discovery proportion. Our correction for sample overlap is effective in that it restores proper control of the false discovery rate, at very little loss in power. CONCLUSIONS: With our proposed correction, it is possible to integrate GWAS summary statistics with overlapping samples in a statistical framework that is dependent on the joint distribution of the two GWAS

University of Liverpool Repository

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

UNSWorks

Spiral - Imperial College Digital Repository

Queen Mary Research Online

NORA - Norwegian Open Research Archives

White Rose Research Online

UCrea

UCL Discovery

eScholarship - University of California

Apollo (Cambridge)

Utrecht University Repository

University of Melbourne Institutional Repository

Recommended from our members

Covariate-assisted ranking and screening for large-scale two-sample inference

Author: Cai T. Tony
Sun Wenguang
Wang Weinan
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Two-sample multiple testing has a wide range of applications. The conventionalpractice first reduces the original observations to a vector of p-values and then chooses a cutoffto adjust for multiplicity. However, this data reduction step could cause significant loss ofinformation and thus lead to suboptimal testing procedures.We introduce a new framework fortwo-sample multiple testing by incorporating a carefully constructed auxiliary variable in inferenceto improve the power. A data-driven multiple-testing procedure is developed by employinga covariate-assisted ranking and screening (CARS) approach that optimally combines the informationfrom both the primary and the auxiliary variables. The proposed CARS procedureis shown to be asymptotically valid and optimal for false discovery rate control. The procedureis implemented in the R package CARS. Numerical results confirm the effectiveness of CARSin false discovery rate control and show that it achieves substantial power gain over existingmethods. CARS is also illustrated through an application to the analysis of a satellite imagingdata set for supernova detection

eScholarship - University of California

Leveraging genomic annotations and pleiotropic enrichment for improved replication rates in schizophrenia GWAS

Author: Andreassen Ole A.
Bettella Francesco
Chen Chi-Hua
Chen Qiang
Cichon Sven
Dale Anders M.
Desikan Rahul S.
Devor Anna
Djurovic Srdjan
Holland Dominic
Li Wen
Nöthen Markus M.
O’Donovan Michael
Rietschel Marcella
Schork Andrew J.
Thompson Wesley K.
Visscher Peter M.
Wang Yunpeng
Weinberger Daniel R.
Werge Thomas
Witoelar Aree
Zuber Verena
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

Most of the genetic architecture of schizophrenia (SCZ) has not yet been identified. Here, we apply a novel statistical algorithm called Covariate-Modulated Mixture Modeling (CM3), which incorporates auxiliary information (heterozygosity, total linkage disequilibrium, genomic annotations, pleiotropy) for each single nucleotide polymorphism (SNP) to enable more accurate estimation of replication probabilities, conditional on the observed test statistic (“z-score”) of the SNP. We use a multiple logistic regression on z-scores to combine information from auxiliary information to derive a “relative enrichment score” for each SNP. For each stratum of these relative enrichment scores, we obtain nonparametric estimates of posterior expected test statistics and replication probabilities as a function of discovery z-scores, using a resampling-based approach that repeatedly and randomly partitions meta-analysis sub-studies into training and replication samples. We fit a scale mixture of two Gaussians model to each stratum, obtaining parameter estimates that minimize the sum of squared differences of the scale-mixture model with the stratified nonparametric estimates. We apply this approach to the recent genome-wide association study (GWAS) of SCZ (n = 82,315), obtaining a good fit between the model-based and observed effect sizes and replication probabilities. We observed that SNPs with low enrichment scores replicate with a lower probability than SNPs with high enrichment scores even when both they are genome-wide significant (p < 5x10-8). There were 693 and 219 independent loci with model-based replication rates ≥80% and ≥90%, respectively. Compared to analyses not incorporating relative enrichment scores, CM3 increased out-of-sample yield for SNPs that replicate at a given rate. This demonstrates that replication probabilities can be more accurately estimated using prior enrichment information with CM3

University of Newcastle's Digital Repository

Online Research @ Cardiff

Directory of Open Access Journals

Copenhagen University Research Information System

PubMed Central

eScholarship - University of California

NORA - Norwegian Open Research Archives

University of Queensland eSpace

FigShare

Weighted False Discovery Rate Control in Large-Scale Multiple Testing

Author: Basu Pallavi
Cai T. Tony
Das Kiranmoy
Sun Wenguang
Publication venue
Publication date: 09/05/2017
Field of study

The use of weights provides an effective strategy to incorporate prior domain knowledge in large-scale inference. This paper studies weighted multiple testing in a decision-theoretic framework. We develop oracle and data-driven procedures that aim to maximize the expected number of true positives subject to a constraint on the weighted false discovery rate. The asymptotic validity and optimality of the proposed methods are established. The results demonstrate that incorporating informative domain knowledge enhances the interpretability of results and precision of inference. Simulation studies show that the proposed method controls the error rate at the nominal level, and the gain in power over existing methods is substantial in many settings. An application to genome-wide association study is discussed.Comment: Revise

arXiv.org e-Print Archive

ScholarlyCommons@Penn

FigShare

Plasma protein biomarkers for depression and schizophrenia by multi analyte profiling of case-control collections.

Author: Alexander Robert C.
Brittain Claire
Bullmore Edward T.
Domenici Enrico
Giegling Ina
Holsboer Florian
McKeown Astrid
Merlo-Pich Emilio
Middleton Lefkos
Miller Sam
Muglia Pierandrea
Prokopenko Inga
Rujescu Dan
Tozzi Federica
Turck Christoph W.
Willé David R.
Publication venue: Ludwig-Maximilians-Universität München
Publication date: 01/01/2010
Field of study

Despite significant research efforts aimed at understanding the neurobiological underpinnings of psychiatric disorders, the diagnosis and the evaluation of treatment of these disorders are still based solely on relatively subjective assessment of symptoms. Therefore, biological markers which could improve the current classification of psychiatry disorders, and in perspective stratify patients on a biological basis into more homogeneous clinically distinct subgroups, are highly needed. In order to identify novel candidate biological markers for major depression and schizophrenia, we have applied a focused proteomic approach using plasma samples from a large case-control collection. Patients were diagnosed according to DSM criteria using structured interviews and a number of additional clinical variables and demographic information were assessed. Plasma samples from 245 depressed patients, 229 schizophrenic patients and 254 controls were submitted to multi analyte profiling allowing the evaluation of up to 79 proteins, including a series of cytokines, chemokines and neurotrophins previously suggested to be involved in the pathophysiology of depression and schizophrenia. Univariate data analysis showed more significant p-values than would be expected by chance and highlighted several proteins belonging to pathways or mechanisms previously suspected to be involved in the pathophysiology of major depression or schizophrenia, such as insulin and MMP-9 for depression, and BDNF, EGF and a number of chemokines for schizophrenia. Multivariate analysis was carried out to improve the differentiation of cases from controls and identify the most informative panel of markers. The results illustrate the potential of plasma biomarker profiling for psychiatric disorders, when conducted in large collections. The study highlighted a set of analytes as candidate biomarker signatures for depression and schizophrenia, warranting further investigation in independent collections

Directory of Open Access Journals

Open Access LMU

PubMed Central

Spiral - Imperial College Digital Repository

MPG.PuRe

Digging for gold nuggets : uncovering novel candidate genes for variation in gastrointestinal nematode burden in a wild bird species

Author: Piertney S. B.
Wenzel M. A.
Publication venue
Publication date: 16/02/2015
Field of study

Acknowledgements This study was funded by a BBSRC studentship (MAWenzel) and NERC grants NE/H00775X/1 and NE/D000602/1 (SB Piertney). The authors are grateful to Marianne James, Mario Roder and Keliya Bai for field-work assistance, Lucy M.I. Webster and Steve Paterson for help during prior development of genetic markers,Heather Ritchie for helpful comments on manuscript drafts and all estate owners, factors and keepers for access to field sites, most particularly MJ Taylor and Mike Nisbet (Airlie), Neil Brown (Allargue), RR Gledson and David Scrimgeour (Delnadamph), Andrew Salvesen and John Hay (Dinnet), Stuart Young and Derek Calder (Edinglassie), Kirsty Donald and DavidBusfield (Glen Dye), Neil Hogbin and Ab Taylor (Glen Muick), Alistair Mitchell (Glenlivet), Simon Blackett, Jim Davidson and Liam Donald (Invercauld), Richard Cooke and Fred Taylor (Invermark), Shaila Rao and Christopher Murphy (Mar Lodge), and Ralph Peters and Philip Astor (Tillypronie)Peer reviewedPostprin

Aberdeen University Research

ZENODO

Dryad Digital Repository (Duke University)

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Electronic Archiving System