1,784 research outputs found

    Estimation of the Multiple Testing Burden for Genomewide Association Studies of Common Variants

    Get PDF
    Genomewide association studies are an exciting strategy in genetics, recently becoming feasible. While pioneering studies are being underway, it is already clear that the analytic issue of determining the significance of results presents a challenge to the wide community of researchers. Rather than each study engaging in independent evaluation, there is need in the community to have standards set for whole-genome significance. 
We have therefore undertaken the task of developing such standards, based on data collected with collaborators in the International Haplotype Map Consortium. We report an estimated burden of a million independent tests genomewide in Europeans, and twice that number in Africans. We further identify the sensitivity of the testing burden to the required significance level, with implications to staged design of association studies.
&#xa

    Stochastic Rainfall-runoff Model with Explicit Soil Moisture Dynamics

    Get PDF
    Stream runoff is perhaps the most poorly represented process in ecohydrological stochastic soil moisture models. Here we present a rainfall-runoff model with a new stochastic description of runoff linked to soil moisture dynamics. We describe the rainfall-runoff system as the joint probability density function (PDF) of rainfall, soil moisture and runoff forced by random, instantaneous jumps of rainfall. We develop a master equation for the soil moisture PDF that accounts explicitly for a general state-dependent rainfall-runoff transformation. This framework is then used to derive the joint rainfall-runoff and soil moisture-runoff PDFs. Runoff is initiated by a soil moisture threshold and a linear progressive partitioning of rainfall based on the soil moisture status. We explore the dependence of the PDFs on the rainfall occurrence PDF (homogeneous or state-dependent Poisson process) and the rainfall magnitude PDF (exponential or mixed-exponential distribution). We calibrate the model to 63 years of rainfall and runoff data from the Upper Little Tennessee watershed (USA) and show how the new model can reproduce the measured runoff PDF

    Public platform with 39,472 exome control samples enables association studies without genotype sharing

    Get PDF
    Acquiring a sufficiently powered cohort of control samples matched to a case sample can be time-consuming or, in some cases, impossible. Accordingly, an ability to leverage genetic data from control samples that were already collected elsewhere could dramatically improve power in genetic association studies. Sharing of control samples can pose significant challenges, since most human genetic data are subject to strict sharing regulations. Here, using the properties of singular value decomposition and subsampling algorithm, we developed a method allowing selection of the best-matching controls in an external pool of samples compliant with personal data protection and eliminating the need for genotype sharing. We provide access to a library of 39,472 exome sequencing controls at http://dnascore.net enabling association studies for case cohorts lacking control subjects. Using this approach, control sets can be selected from this online library with a prespecified matching accuracy, ensuring well-calibrated association analysis for both rare and common variants

    Meta-analysis fine-mapping is often miscalibrated at single-variant resolution

    Get PDF
    Meta-analysis is pervasively used to combine multiple genome-wide association studies (GWASs). Fine-mapping of meta-analysis studies is typically performed as in a single-cohort study. Here, we first demonstrate that heterogeneity (e.g., of sample size, phenotyping, imputation) hurts calibration of meta-analysis fine-mapping. We propose a summary statistics-based quality-control (QC) method, suspicious loci analysis of meta-analysis summary statistics (SLALOM), that identifies suspicious loci for meta-analysis fine-mapping by detecting outliers in association statistics. We validate SLALOM in simulations and the GWAS Catalog. Applying SLALOM to 14 meta-analyses from the Global Biobank Meta-analysis Initiative (GBMI), we find that 67% of loci show suspicious patterns that call into question fine-mapping accuracy. These predicted suspicious loci are significantly depleted for having nonsynonymous variants as lead variant (2.7×; Fisher's exact p = 7.3 × 10−4). We find limited evidence of fine-mapping improvement in the GBMI meta-analyses compared with individual biobanks. We urge extreme caution when interpreting fine-mapping results from meta-analysis of heterogeneous cohorts.</p

    The critical needs and challenges for genetic architecture studies in Africa

    Get PDF
    Human genetic studies have long been vastly Eurocentric, raising a key question about the generalizability of these study findings to other populations. Because humans originated in Africa, these populations retain more genetic diversity, and yet individuals of African descent have been tremendously underrepresented in genetic studies. The diversity in Africa affords ample opportunities to improve fine-mapping resolution for associated loci, discover novel genetic associations with phenotypes, build more generalizable genetic risk prediction models, and better understand the genetic architecture of complex traits and diseases subject to varying environmental pressures. Thus, it is both ethically and scientifically imperative that geneticists globally surmount challenges that have limited progress in African genetic studies to date. Additionally, African investigators need to be meaningfully included, as greater inclusivity and enhanced research capacity afford enormous opportunities to accelerate genomic discoveries that translate more effectively to all populations. We review the advantages, challenges, and examples of genetic architecture studies of complex traits and diseases in Africa. For example, with greater genetic diversity comes greater ancestral heterogeneity; this higher level of understudied diversity can yield novel genetic findings, but some methods that assume homogeneous population structure and work well in European populations may work less well in the presence of greater heterogeneity in African populations. Consequently, we advocate for methodological development that will accelerate studies important for all populations, especially those currently underrepresented in genetics.Peer reviewe

    Base-specific mutational intolerance near splice sites clarifies the role of nonessential splice nucleotides

    Get PDF
    Variation in RNA splicing (i.e., alternative splicing) plays an important role in many diseases. Variants near 5' and 3' splice sites often affect splicing, but the effects of these variants on splicing and disease have not been fully characterized beyond the two "essential" splice nucleotides flanking each exon. Here we provide quantitative measurements of tolerance to mutational disruptions by position and reference allele-alternative allele combinations. We show that certain reference alleles are particularly sensitive to mutations, regardless of the alternative alleles into which they are mutated. Using public RNA-seq data, we demonstrate that individuals carrying such variants have significantly lower levels of the correctly spliced transcript, compared to individuals without them, and confirm that these specific substitutions are highly enriched for known Mendelian mutations. Our results propose a more refined definition of the "splice region" and offer a new way to prioritize and provide functional interpretation of variants identified in diagnostic sequencing and association studies.Peer reviewe
    • …
    corecore