6 research outputs found
A Framework For Detecting Noncoding Rare-Variant associations of Large-Scale Whole-Genome Sequencing Studies
Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 toPMed samples. We also analyze five non-lipid toPMed traits
Recommended from our members
Insights From a Large-Scale Whole-Genome Sequencing Study of Systolic Blood Pressure, Diastolic Blood Pressure, and Hypertension
BackgroundThe availability of whole-genome sequencing data in large studies has enabled the assessment of coding and noncoding variants across the allele frequency spectrum for their associations with blood pressure.MethodsWe conducted a multiancestry whole-genome sequencing analysis of blood pressure among 51 456 Trans-Omics for Precision Medicine and Centers for Common Disease Genomics program participants (stage-1). Stage-2 analyses leveraged array data from UK Biobank (N=383 145), Million Veteran Program (N=318 891), and Reasons for Geographic and Racial Differences in Stroke (N=10 643) participants, along with whole-exome sequencing data from UK Biobank (N=199 631) participants.ResultsTwo blood pressure signals achieved genome-wide significance in meta-analyses of stage-1 and stage-2 single variant findings (P<5×10-8). Among them, a rare intergenic variant at novel locus, LOC100506274, was associated with lower systolic blood pressure in stage-1 (beta [SE]=-32.6 [6.0]; P=4.99×10-8) but not stage-2 analysis (P=0.11). Furthermore, a novel common variant at the known INSR locus was suggestively associated with diastolic blood pressure in stage-1 (beta [SE]=-0.36 [0.07]; P=4.18×10-7) and attained genome-wide significance in stage-2 (beta [SE]=-0.29 [0.03]; P=7.28×10-23). Nineteen additional signals suggestively associated with blood pressure in meta-analysis of single and aggregate rare variant findings (P<1×10-6 and P<1×10-4, respectively).DiscussionWe report one promising but unconfirmed rare variant for blood pressure and, more importantly, contribute insights for future blood pressure sequencing studies. Our findings suggest promise of aggregate analyses to complement single variant analysis strategies and the need for larger, diverse samples, and family studies to enable robust rare variant identification
Recommended from our members
Genetic determinants of telomere length from 109,122 ancestrally diverse whole-genome sequences in TOPMed
Genetic studies on telomere length are important for understanding age-related diseases. Prior GWAS for leukocyte TL have been limited to European and Asian populations. Here, we report the first sequencing-based association study for TL across ancestrally-diverse individuals (European, African, Asian and Hispanic/Latino) from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. We used whole genome sequencing (WGS) of whole blood for variant genotype calling and the bioinformatic estimation of telomere length in n=109,122 individuals. We identified 59 sentinel variants (p-value <5×10-9) in 36 loci associated with telomere length, including 20 newly associated loci (13 were replicated in external datasets). There was little evidence of effect size heterogeneity across populations. Fine-mapping at OBFC1 indicated the independent signals colocalized with cell-type specific eQTLs for OBFC1 (STN1). Using a multi-variant gene-based approach, we identified two genes newly implicated in telomere length, DCLRE1B (SNM1B) and PARN. In PheWAS, we demonstrated our TL polygenic trait scores (PTS) were associated with increased risk of cancer-related phenotypes
Chromosome Xq23 is associated with lower atherogenic lipid concentrations and favorable cardiometabolic indices
Abstract
Autosomal genetic analyses of blood lipids have yielded key insights for coronary heart disease (CHD). However, X chromosome genetic variation is understudied for blood lipids in large sample sizes. We now analyze genetic and blood lipid data in a high-coverage whole X chromosome sequencing study of 65,322 multi-ancestry participants and perform replication among 456,893 European participants. Common alleles on chromosome Xq23 are strongly associated with reduced total cholesterol, LDL cholesterol, and triglycerides (min P = 8.5 × 10−72), with similar effects for males and females. Chromosome Xq23 lipid-lowering alleles are associated with reduced odds for CHD among 42,545 cases and 591,247 controls (P = 1.7 × 10−4), and reduced odds for diabetes mellitus type 2 among 54,095 cases and 573,885 controls (P = 1.4 × 10−5). Although we observe an association with increased BMI, waist-to-hip ratio adjusted for BMI is reduced, bioimpedance analyses indicate increased gluteofemoral fat, and abdominal MRI analyses indicate reduced visceral adiposity. Co-localization analyses strongly correlate increased CHRDL1 gene expression, particularly in adipose tissue, with reduced concentrations of blood lipids