78 research outputs found
Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens
DNA sequence variation within human leukocyte antigen (HLA) genes mediate susceptibility to a wide range of human diseases. The complex genetic structure of the major histocompatibility complex (MHC) makes it difficult, however, to collect genotyping data in large cohorts. Long-range linkage disequilibrium between HLA loci and SNP markers across the major histocompatibility complex (MHC) region offers an alternative approach through imputation to interrogate HLA variation in existing GWAS data sets. Here we describe a computational strategy, SNP2HLA, to impute classical alleles and amino acid polymorphisms at class I (HLA-A, -B, -C) and class II (-DPA1, -DPB1, -DQA1, -DQB1, and -DRB1) loci. To characterize performance of SNP2HLA, we constructed two European ancestry reference panels, one based on data collected in HapMap-CEPH pedigrees (90 individuals) and another based on data collected by the Type 1 Diabetes Genetics Consortium (T1DGC, 5,225 individuals). We imputed HLA alleles in an independent data set from the British 1958 Birth Cohort (N = 918) with gold standard four-digit HLA types and SNPs genotyped using the Affymetrix GeneChip 500 K and Illumina Immunochip microarrays. We demonstrate that the sample size of the reference panel, rather than SNP density of the genotyping platform, is critical to achieve high imputation accuracy. Using the larger T1DGC reference panel, the average accuracy at four-digit resolution is 94.7% using the low-density Affymetrix GeneChip 500 K, and 96.7% using the high-density Illumina Immunochip. For amino acid polymorphisms within HLA genes, we achieve 98.6% and 99.3% accuracy using the Affymetrix GeneChip 500 K and Illumina Immunochip, respectively. Finally, we demonstrate how imputation and association testing at amino acid resolution can facilitate fine-mapping of primary MHC association signals, giving a specific example from type 1 diabetes
Common Variants in 40 Genes Assessed for Diabetes Incidence and Response to Metformin and Lifestyle Intervention in the Diabetes Prevention Program
OBJECTIVE: Genome-wide association studies have begun to elucidate the genetic architecture of type 2 diabetes. We examined whether single nucleotide polymorphisms (SNPs) identified through targeted complementary approaches affect diabetes incidence in the at-risk population of the Diabetes Prevention Program (DPP) and whether they influence a response to preventive interventions. RESEARCH DESIGN AND METHODS: We selected SNPs identified by prior genome-wide association studies for type 2 diabetes and related traits, or capturing common variation in 40 candidate genes previously associated with type 2 diabetes, implicated in monogenic diabetes, encoding type 2 diabetes drug targets or drug-metabolizing/transporting enzymes, or involved in relevant physiological processes. We analyzed 1,590 SNPs for association with incident diabetes and their interaction with response to metformin or lifestyle interventions in 2,994 DPP participants. We controlled for multiple hypothesis testing by assessing false discovery rates. RESULTS: We replicated the association of variants in the metformin transporter gene SLC47A1 with metformin response and detected nominal interactions in the AMP kinase (AMPK) gene STK11, the AMPK subunit genes PRKAA1 and PRKAA2, and a missense SNP in SLC22A1, which encodes another metformin transporter. The most significant association with diabetes incidence occurred in the AMPK subunit gene PRKAG2 (hazard ratio 1.24, 95% CI 1.09-1.40, P = 7 × 10(-4)). Overall, there were nominal associations with diabetes incidence at 85 SNPs and nominal interactions with the metformin and lifestyle interventions at 91 and 69 mostly nonoverlapping SNPs, respectively. The lowest P values were consistent with experiment-wide 33% false discovery rates. CONCLUSIONS: We have identified potential genetic determinants of metformin response. These results merit confirmation in independent samples
Genetic susceptibility loci for cardiovascular disease and their impact on atherosclerotic plaques
Background:
Atherosclerosis is a chronic inflammatory disease in part caused by lipid uptake in the vascular wall, but the exact underlying mechanisms leading to acute myocardial infarction and stroke remain poorly understood. Large consortia identified genetic susceptibility loci that associate with large artery ischemic stroke and coronary artery disease. However, deciphering their underlying mechanisms are challenging. Histological studies identified destabilizing characteristics in human atherosclerotic plaques that associate with clinical outcome. To what extent established susceptibility loci for large artery ischemic stroke and coronary artery disease relate to plaque characteristics is thus far unknown but may point to novel mechanisms.
Methods:
We studied the associations of 61 established cardiovascular risk loci with 7 histological plaque characteristics assessed in 1443 carotid plaque specimens from the Athero-Express Biobank Study. We also assessed if the genotyped cardiovascular risk loci impact the tissue-specific gene expression in 2 independent biobanks, Biobank of Karolinska Endarterectomy and Stockholm Atherosclerosis Gene Expression.
Results:
A total of 21 established risk variants (out of 61) nominally associated to a plaque characteristic. One variant (rs12539895, risk allele A) at 7q22 associated to a reduction of intraplaque fat, P=5.09×10−6 after correction for multiple testing. We further characterized this 7q22 Locus and show tissue-specific effects of rs12539895 on HBP1 expression in plaques and COG5 expression in whole blood and provide data from public resources showing an association with decreased LDL (low-density lipoprotein) and increase HDL (high-density lipoprotein) in the blood.
Conclusions:
Our study supports the view that cardiovascular susceptibility loci may exert their effect by influencing the atherosclerotic plaque characteristics
Genetic modulation of lipid profiles following lifestyle modification or metformin treatment: The Diabetes Prevention Program
Weight-loss interventions generally improve lipid profiles and reduce cardiovascular disease risk, but effects are variable and may depend on genetic factors. We performed a genetic association analysis of data from 2,993 participants in the Diabetes Prevention Program to test the hypotheses that a genetic risk score (GRS) based on deleterious alleles at 32 lipid-associated single-nucleotide polymorphisms modifies the effects of lifestyle and/or metformin interventions on lipid levels and nuclear magnetic resonance (NMR) lipoprotein subfraction size and number. Twenty-three loci previously associated with fasting LDL-C, HDL-C, or triglycerides replicated (P = 0.04–1×10−17). Except for total HDL particles (r = −0.03, P = 0.26), all components of the lipid profile correlated with the GRS (partial |r| = 0.07–0.17, P = 5×10−5–1×10−19). The GRS was associated with higher baseline-adjusted 1-year LDL cholesterol levels (β = +0.87, SEE±0.22 mg/dl/allele, P = 8×10−5, Pinteraction = 0.02) in the lifestyle intervention group, but not in the placebo (β = +0.20, SEE±0.22 mg/dl/allele, P = 0.35) or metformin (β = −0.03, SEE±0.22 mg/dl/allele, P = 0.90; Pinteraction = 0.64) groups. Similarly, a higher GRS predicted a greater number of baseline-adjusted small LDL particles at 1 year in the lifestyle intervention arm (β = +0.30, SEE±0.012 ln nmol/L/allele, P = 0.01, Pinteraction = 0.01) but not in the placebo (β = −0.002, SEE±0.008 ln nmol/L/allele, P = 0.74) or metformin (β = +0.013, SEE±0.008 nmol/L/allele, P = 0.12; Pinteraction = 0.24) groups. Our findings suggest that a high genetic burden confers an adverse lipid profile and predicts attenuated response in LDL-C levels and small LDL particle number to dietary and physical activity interventions aimed at weight loss
Exome-chip association analysis of intracranial aneurysms
Objective: To investigate to what extent low-frequency genetic variants (with minor allele frequencies <5%) affect the risk of intracranial aneurysms (IAs).
Methods: One thousand fifty-six patients with IA and 2,097 population-based controls from the Netherlands were genotyped with the Illumina HumanExome BeadChip. After quality control (QC) of samples and single nucleotide variants (SNVs), we conducted a single variant analysis using the Fisher exact test. We also performed the variable threshold (VT) test and the sequence kernel association test (SKAT) at different minor allele count (MAC) thresholds of >5 and >0 to test the hypothesis that multiple variants within the same gene are associated with IA risk. Significant results were tested in a replication cohort of 425 patients with IA and 311 controls, and results of the 2 cohorts were combined in a meta-analysis.
Results: After QC, 995 patients with IA and 2,080 controls remained for further analysis. The single variant analysis comprising 46,534 SNVs did not identify significant loci at the genome-wide level. The gene-based tests showed a statistically significant association for fibulin 2 (FBLN2) (best p = 1 × 10-6 for the VT test, MAC >5). Associations were not statistically significant in the independent but smaller replication cohort (p > 0.57) but became slightly stronger in a meta-analysis of the 2 cohorts (best p = 4.8 × 10-7 for the SKAT, MAC ≥1).
Conclusion: Gene-based tests indicated an association for FBLN2, a gene encoding an extracellular matrix protein implicated in vascular wall remodeling, but independent validation in larger cohorts is warranted. We did not identify any significant associations for single low-frequency genetic variants
Recommended from our members
High density genetic mapping identifies new susceptibility loci for rheumatoid arthritis
Summary Using the Immunochip custom single nucleotide polymorphism (SNP) array, designed for dense genotyping of 186 genome wide association study (GWAS) confirmed loci we analysed 11,475 rheumatoid arthritis cases of European ancestry and 15,870 controls for 129,464 markers. The data were combined in meta-analysis with GWAS data from additional independent cases (n=2,363) and controls (n=17,872). We identified fourteen novel loci; nine were associated with rheumatoid arthritis overall and 5 specifically in anti-citrillunated peptide antibody positive disease, bringing the number of confirmed European ancestry rheumatoid arthritis loci to 46. We refined the peak of association to a single gene for 19 loci, identified secondary independent effects at six loci and association to low frequency variants (minor allele frequency <0.05) at 4 loci. Bioinformatic analysis of the data generated strong hypotheses for the causal SNP at seven loci. This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations
A novel MMP12 locus is associated with large artery atherosclerotic stroke using a genome-wide age-at-onset informed approach.
Genome-wide association studies (GWAS) have begun to identify the common genetic component to ischaemic stroke (IS). However, IS has considerable phenotypic heterogeneity. Where clinical covariates explain a large fraction of disease risk, covariate informed designs can increase power to detect associations. As prevalence rates in IS are markedly affected by age, and younger onset cases may have higher genetic predisposition, we investigated whether an age-at-onset informed approach could detect novel associations with IS and its subtypes; cardioembolic (CE), large artery atherosclerosis (LAA) and small vessel disease (SVD) in 6,778 cases of European ancestry and 12,095 ancestry-matched controls. Regression analysis to identify SNP associations was performed on posterior liabilities after conditioning on age-at-onset and affection status. We sought further evidence of an association with LAA in 1,881 cases and 50,817 controls, and examined mRNA expression levels of the nearby genes in atherosclerotic carotid artery plaques. Secondly, we performed permutation analyses to evaluate the extent to which age-at-onset informed analysis improves significance for novel loci. We identified a novel association with an MMP12 locus in LAA (rs660599; p = 2.5×10⁻⁷), with independent replication in a second population (p = 0.0048, OR(95% CI) = 1.18(1.05-1.32); meta-analysis p = 2.6×10⁻⁸). The nearby gene, MMP12, was significantly overexpressed in carotid plaques compared to atherosclerosis-free control arteries (p = 1.2×10⁻¹⁵; fold change = 335.6). Permutation analyses demonstrated improved significance for associations when accounting for age-at-onset in all four stroke phenotypes (p<0.001). Our results show that a covariate-informed design, by adjusting for age-at-onset of stroke, can detect variants not identified by conventional GWAS
Cystatin C and Cardiovascular Disease
Background Epidemiological studies show that high circulating cystatin C is associated with risk of cardiovascular disease (CVD), independent of creatinine-based renal function measurements. It is unclear whether this relationship is causal, arises from residual confounding, and/or is a consequence of reverse causation. Objectives The aim of this study was to use Mendelian randomization to investigate whether cystatin C is causally related to CVD in the general population. Methods We incorporated participant data from 16 prospective cohorts (n = 76,481) with 37,126 measures of cystatin C and added genetic data from 43 studies (n = 252,216) with 63,292 CVD events. We used the common variant rs911119 in CST3 as an instrumental variable to investigate the causal role of cystatin C in CVD, including coronary heart disease, ischemic stroke, and heart failure. Results Cystatin C concentrations were associated with CVD risk after adjusting for age, sex, and traditional risk factors (relative risk: 1.82 per doubling of cystatin C; 95% confidence interval [CI]: 1.56 to 2.13; p = 2.12 × 10−14). The minor allele of rs911119 was associated with decreased serum cystatin C (6.13% per allele; 95% CI: 5.75 to 6.50; p = 5.95 × 10−211), explaining 2.8% of the observed variation in cystatin C. Mendelian randomization analysis did not provide evidence for a causal role of cystatin C, with a causal relative risk for CVD of 1.00 per doubling cystatin C (95% CI: 0.82 to 1.22; p = 0.994), which was statistically different from the observational estimate (p = 1.6 × 10−5). A causal effect of cystatin C was not detected for any individual component of CVD. Conclusions Mendelian randomization analyses did not support a causal role of cystatin C in the etiology of CVD. As such, therapeutics targeted at lowering circulating cystatin C are unlikely to be effective in preventing CVD
Genome-wide Association Study Identifies Five Susceptibility Loci for Follicular Lymphoma outside the HLA Region
Genome-wide association studies (GWASs) of follicular lymphoma (FL) have previously identified human leukocyte antigen (HLA) gene variants. To identify additional FL susceptibility loci, we conducted a large-scale two-stage GWAS in 4,523 case subjects and 13,344 control subjects of European ancestry. Five non-HLA loci were associated with FL risk: 11q23.3 (rs4938573, p = 5.79 × 10−20) near CXCR5; 11q24.3 (rs4937362, p = 6.76 × 10−11) near ETS1; 3q28 (rs6444305, p = 1.10 × 10−10) in LPP; 18q21.33 (rs17749561, p = 8.28 × 10−10) near BCL2; and 8q24.21 (rs13254990, p = 1.06 × 10−8) near PVT1. In an analysis of the HLA region, we identified four linked HLA-DRβ1 multiallelic amino acids at positions 11, 13, 28, and 30 that were associated with FL risk (pomnibus = 4.20 × 10−67 to 2.67 × 10−70). Additional independent signals included rs17203612 in HLA class II (odds ratio [ORper-allele] = 1.44; p = 4.59 × 10−16) and rs3130437 in HLA class I (ORper-allele = 1.23; p = 8.23 × 10−9). Our findings further expand the number of loci associated with FL and provide evidence that multiple common variants outside the HLA region make a significant contribution to FL risk
Genetically predicted longer telomere length is associated with increased risk of B-cell lymphoma subtypes
Evidence from a small number of studies suggests that longer telomere length measured in peripheral leukocytes is associated with an increased risk of non-Hodgkin lymphoma (NHL). However, these studies may be biased by reverse causation, confounded by unmeasured environmental exposures and might miss time points for which prospective telomere measurement would best reveal a relationship between telomere length and NHL risk. We performed an analysis of genetically inferred telomere length and NHL risk in a study of 10 102 NHL cases of the four most common B-cell histologic types and 9562 controls using a genetic risk score (GRS) comprising nine telomere length-associated single-nucleotide polymorphisms. This approach uses existing genotype data and estimates telomere length by weighing the number of telomere length-associated variant alleles an individual carries with the published change in kb of telomere length. The analysis of the telomere length GRS resulted in an association between longer telomere length and increased NHL risk [four B-cell histologic types combined; odds ratio (OR) = 1.49, 95% CI 1.22–1.82, P-value = 8.5 × 10−5]. Subtype-specific analyses indicated that chronic lymphocytic leukemia or small lymphocytic lymphoma (CLL/SLL) was the principal NHL subtype contributing to this association (OR = 2.60, 95% CI 1.93–3.51, P-value = 4.0 × 10−10). Significant interactions were observed across strata of sex for CLL/SLL and marginal zone lymphoma subtypes as well as age for the follicular lymphoma subtype. Our results indicate that a genetic background that favors longer telomere length may increase NHL risk, particularly risk of CLL/SLL, and are consistent with earlier studies relating longer telomere length with increased NHL risk
- …