12 research outputs found
Genetic diversity fuels gene discovery for tobacco and alcohol use
Tobacco and alcohol use are heritable behaviours associated with 15% and 5.3% of worldwide deaths, respectively, due largely to broad increased risk for disease and injury(1-4). These substances are used across the globe, yet genome-wide association studies have focused largely on individuals of European ancestries(5). Here we leveraged global genetic diversity across 3.4 million individuals from four major clines of global ancestry (approximately 21% non-European) to power the discovery and fine-mapping of genomic loci associated with tobacco and alcohol use, to inform function of these loci via ancestry-aware transcriptome-wide association studies, and to evaluate the genetic architecture and predictive power of polygenic risk within and across populations. We found that increases in sample size and genetic diversity improved locus identification and fine-mapping resolution, and that a large majority of the 3,823 associated variants (from 2,143 loci) showed consistent effect sizes across ancestry dimensions. However, polygenic risk scores developed in one ancestry performed poorly in others, highlighting the continued need to increase sample sizes of diverse ancestries to realize any potential benefit of polygenic prediction.Peer reviewe
Do COPD subtypes really exist? COPD heterogeneity and clustering in 10 independent cohorts
BACKGROUND: COPD is a heterogeneous disease, but there is little consensus on specific definitions for COPD subtypes. Unsupervised clustering offers the promise of 'unbiased' data-driven assessment of COPD heterogeneity. Multiple groups have identified COPD subtypes using cluster analysis, but there has been no systematic assessment of the reproducibility of these subtypes. OBJECTIVE: We performed clustering analyses across 10 cohorts in North America and Europe in order to assess the reproducibility of (1) correlation patterns of key COPD-related clinical characteristics and (2) clustering results. METHODS: We studied 17 146 individuals with COPD using identical methods and common COPD-related characteristics across cohorts (FEV1, FEV1/FVC, FVC, body mass index, Modified Medical Research Council score, asthma and cardiovascular comorbid disease). Correlation patterns between these clinical characteristics were assessed by principal components analysis (PCA). Cluster analysis was performed using k-medoids and hierarchical clustering, and concordance of clustering solutions was quantified with normalised mutual information (NMI), a metric that ranges from 0 to 1 with higher values indicating greater concordance. RESULTS: The reproducibility of COPD clustering subtypes across studies was modest (median NMI range 0.17-0.43). For methods that excluded individuals that did not clearly belong to any cluster, agreement was better but still suboptimal (median NMI range 0.32-0.60). Continuous representations of COPD clinical characteristics derived from PCA were much more consistent across studies. CONCLUSIONS: Identical clustering analyses across multiple COPD cohorts showed modest reproducibility. COPD heterogeneity is better characterised by continuous disease traits coexisting in varying degrees within the same individual, rather than by mutually exclusive COPD subtypes.status: publishe
Multi-ancestry transcriptome-wide association analyses yield insights into tobacco use biology and drug repurposing
Most transcriptome-wide association studies (TWASs) so far focus on European ancestry and lack diversity. To overcome this limitation, we aggregated genome-wide association study (GWAS) summary statistics, whole-genome sequences and expression quantitative trait locus (eQTL) data from diverse ancestries. We developed a new approach, TESLA (multi-ancestry integrative study using an optimal linear combination of association statistics), to integrate an eQTL dataset with a multi-ancestry GWAS. By exploiting shared phenotypic effects between ancestries and accommodating potential effect heterogeneities, TESLA improves power over other TWAS methods. When applied to tobacco use phenotypes, TESLA identified 273 new genes, up to 55% more compared with alternative TWAS methods. These hits and subsequent fine mapping using TESLA point to target genes with biological relevance. In silico drug-repurposing analyses highlight several drugs with known efficacy, including dextromethorphan and galantamine, and new drugs such as muscle relaxants that may be repurposed for treating nicotine addiction
Association study in African-admixed populations across the Americas recapitulates asthma risk loci in non-African populations
Rationale: Asthma is a complex disease with striking disparities across racial and ethnic groups. Despite its high burden, representation of African ancestry individuals in asthma genome-wide association studies (GWAS) has been inadequate to date, and true associations in these underrepresented minority groups may have been missed. Here, we report the largest asthma GWAS to date from the Consortium on Asthma among African Ancestry Populations (CAAPA). Methods: CAAPA participants (7009 asthmatics, 7645 controls) were genotyped using the African Diaspora Power Chip (ADPC), an array designed to complement existing genome-wide array data, as well as Illumina’s Multi-Ethnic Genotyping array. Genotypes were imputed using the CAAPA whole genome-sequence reference panel. Logistic mixed effects models were used to test for association between allelic dosage and asthma, separately for each study. Results were meta-analyzed using a meta-regression approach that accounts for heterogeneity in allelic effects among ethnic groups. Results: We identified two novel loci that may be specific to asthma risk in African ancestry populations (lead SNP rs13277810, intronic to LOC101927815, p=3E-8; lead SNP rs114647118, intronic to TATDN1, p=3E-7). We found strong evidence for association at four previously reported asthma loci whose discovery was driven largely by non-African populations (p\u3c0.05/810 candidate SNPs investigated), including the chr12q13 region, a novel locus identified by the Trans-National Asthma Genetic Consortium (TAGC) that has previously not been replicated. Conclusions: We report two associations that may bespecific to asthma risk in African ancestry populations. Our results also suggest some asthma risk loci discovered in non-African populations are relevant in African ancestry populations
Whole-exome sequence analysis of anthropometric traits illustrates challenges in identifying effects of rare genetic variants
Anthropometric traits, measuring body size and shape, are highly heritable and significant clinical risk factors for cardiometabolic disorders. These traits have been extensively studied in genome-wide association studies (GWASs), with hundreds of genome-wide significant loci identified. We performed a whole-exome sequence analysis of the genetics of height, body mass index (BMI) and waist/hip ratio (WHR). We meta-analyzed single-variant and gene-based associations of whole-exome sequence variation with height, BMI, and WHR in up to 22,004 individuals, and we assessed replication of our findings in up to 16,418 individuals from 10 independent cohorts from Trans-Omics for Precision Medicine (TOPMed). We identified four trait associations with single-nucleotide variants (SNVs; two for height and two for BMI) and replicated the LECT2 gene association with height. Our expression quantitative trait locus (eQTL) analysis within previously reported GWAS loci implicated CEP63 and RFT1 as potential functional genes for known height loci. We further assessed enrichment of SNVs, which were monogenic or syndromic variants within loci associated with our three traits. This led to the significant enrichment results for height, whereas we observed no Bonferroni-corrected significance for all SNVs. With a sample size of ∼20,000 whole-exome sequences in our discovery dataset, our findings demonstrate the importance of genomic sequencing in genetic association studies, yet they also illustrate the challenges in identifying effects of rare genetic variants
Whole-genome sequencing in diverse subjects identifies genetic correlates of leukocyte traits: The NHLBI TOPMed program
Many common and rare variants associated with hematologic traits have been discovered through imputation on large-scale reference panels. However, the majority of genome-wide association studies (GWASs) have been conducted in Europeans, and determining causal variants has proved challenging. We performed a GWAS of total leukocyte, neutrophil, lymphocyte, monocyte, eosinophil, and basophil counts generated from 109,563,748 variants in the autosomes and the X chromosome in the Trans-Omics for Precision Medicine (TOPMed) program, which included data from 61,802 individuals of diverse ancestry. We discovered and replicated 7 leukocyte trait associations, including (1) the association between a chromosome X, pseudo-autosomal region (PAR), noncoding variant located between cytokine receptor genes (CSF2RA and CLRF2) and lower eosinophil count; and (2) associations between single variants found predominantly among African Americans at the S1PR3 (9q22.1) and HBB (11p15.4) loci and monocyte and lymphocyte counts, respectively. We further provide evidence indicating that the newly discovered eosinophil-lowering chromosome X PAR variant might be associated with reduced susceptibility to common allergic diseases such as atopic dermatitis and asthma. Additionally, we found a burden of very rare FLT3 (13q12.2) variants associated with monocyte counts. Together, these results emphasize the utility of whole-genome sequencing in diverse samples in identifying associations missed by European-ancestry-driven GWASs
Recommended from our members
Rare genetic variants explain missing heritability in smoking.
Common genetic variants explain less variation in complex phenotypes than inferred from family-based studies, and there is a debate on the source of this 'missing heritability'. We investigated the contribution of rare genetic variants to tobacco use with whole-genome sequences from up to 26,257 unrelated individuals of European ancestries and 11,743 individuals of African ancestries. Across four smoking traits, single-nucleotide-polymorphism-based heritability ([Formula: see text]) was estimated from 0.13 to 0.28 (s.e., 0.10-0.13) in European ancestries, with 35-74% of it attributable to rare variants with minor allele frequencies between 0.01% and 1%. These heritability estimates are 1.5-4 times higher than past estimates based on common variants alone and accounted for 60% to 100% of our pedigree-based estimates of narrow-sense heritability ([Formula: see text], 0.18-0.34). In the African ancestry samples, [Formula: see text] was estimated from 0.03 to 0.33 (s.e., 0.09-0.14) across the four smoking traits. These results suggest that rare variants are important contributors to the heritability of smoking
Recommended from our members
Multi-ancestry transcriptome-wide association analyses yield insights into tobacco use biology and drug repurposing
Most transcriptome-wide association studies (TWASs) so far focus on European ancestry and lack diversity. To overcome this limitation, we aggregated genome-wide association study (GWAS) summary statistics, whole-genome sequences and expression quantitative trait locus (eQTL) data from diverse ancestries. We developed a new approach, TESLA (multi-ancestry integrative study using an optimal linear combination of association statistics), to integrate an eQTL dataset with a multi-ancestry GWAS. By exploiting shared phenotypic effects between ancestries and accommodating potential effect heterogeneities, TESLA improves power over other TWAS methods. When applied to tobacco use phenotypes, TESLA identified 273 new genes, up to 55% more compared with alternative TWAS methods. These hits and subsequent fine mapping using TESLA point to target genes with biological relevance. In silico drug-repurposing analyses highlight several drugs with known efficacy, including dextromethorphan and galantamine, and new drugs such as muscle relaxants that may be repurposed for treating nicotine addiction
Recommended from our members
A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome
The African Diaspora in the Western Hemisphere represents one of the largest forced migrations in
history and had a profound impact on genetic diversity in modern populations. To date, the fine-scale
population structure of descendants of the African Diaspora remains largely uncharacterized. Here we
present genetic variation from deeply sequenced genomes of 642 individuals from North and South
American, Caribbean and West African populations, substantially increasing the lexicon of human
genomic variation and suggesting much variation remains to be discovered in African-admixed populations
in the Americas. We summarize genetic variation in these populations, quantifying the postcolonial
sex-biased European gene flow across multiple regions. Moreover, we refine estimates on the
burden of deleterious variants carried across populations and how this varies with African ancestry. Our
data are an important resource for empowering disease mapping studies in African-admixed individuals
and will facilitate gene discovery for diseases disproportionately affecting individuals of African ancestry
Recommended from our members
A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome
The African Diaspora in the Western Hemisphere represents one of the largest forced migrations in
history and had a profound impact on genetic diversity in modern populations. To date, the fine-scale
population structure of descendants of the African Diaspora remains largely uncharacterized. Here we
present genetic variation from deeply sequenced genomes of 642 individuals from North and South
American, Caribbean and West African populations, substantially increasing the lexicon of human
genomic variation and suggesting much variation remains to be discovered in African-admixed populations
in the Americas. We summarize genetic variation in these populations, quantifying the postcolonial
sex-biased European gene flow across multiple regions. Moreover, we refine estimates on the
burden of deleterious variants carried across populations and how this varies with African ancestry. Our
data are an important resource for empowering disease mapping studies in African-admixed individuals
and will facilitate gene discovery for diseases disproportionately affecting individuals of African ancestry