117 research outputs found
Copy number variation in African Americans
<p>Abstract</p> <p>Background</p> <p>Copy number variants (CNVs) have been identified in several studies to be associated with complex diseases. It is important, therefore, to understand the distribution of CNVs within and among populations. This study is the first report of a CNV map in African Americans.</p> <p>Results</p> <p>Employing a SNP platform with greater than 500,000 SNPs, a first-generation CNV map of the African American genome was generated using DNA from 385 healthy African American individuals, and compared to a sample of 435 healthy White individuals. A total of 1362 CNVs were identified within African Americans, which included two CNV regions that were significantly different in frequency between African Americans and Whites (17q21 and 15q11). In addition, a duplication was identified in 74% of DNAs derived from cell lines that was not present in any of the whole blood derived DNAs.</p> <p>Conclusion</p> <p>The Affymetrix 500 K array provides reliable CNV mapping information. However, using cell lines as a source of DNA may introduce artifacts. The duplication identified in high frequency in Whites and low frequency in African Americans on chromosome 17q21 reflects haplotype specific frequency differences between ancestral groups. The generation of the CNV map will be a valuable tool for identifying disease associated CNVs in African Americans.</p
SNP imputation bias reduces effect size determination
Imputation is a commonly used technique that exploits linkage disequilibrium to infer missing genotypes in genetic datasets, using a well characterized reference population. While there is agreement that the reference population has to match the ethnicity of the query dataset, it is common practice to use the same reference to impute genotypes for a wide variety of phenotypes. We hypothesized that using a reference composed of samples with a different phenotype than the query dataset would introduce imputation bias.To test this hypothesis we used GWAS datasets from amyotrophic lateral sclerosis, Parkinson disease, and Crohn disease. First, we masked and then performed imputation of 100 disease-associated markers and 100 non-associated markers from each study. Two references for imputation were used in parallel: one consisting of healthy controls and another consisting of patients with the same disease. We assessed the discordance (imprecision) and bias (inaccuracy) of imputation by comparing predicted genotypes to those assayed by SNP-chip. We also assessed the bias on the observed effect size when the predicted genotypes were used in a GWAS study.When healthy controls were used as reference for imputation, a significant bias was observed, particularly in the disease-associated markers. Using cases as reference significantly attenuated this bias. For nearly all markers, the direction of the bias favored the non-risk allele. In GWAS studies of the three diseases (with healthy reference controls from the 1000 genomes as reference), the mean OR for disease-associated markers obtained by imputation was lower than that obtained using original assayed genotypes.We found that the bias is inherent to imputation as using different methods did not alter the results. In conclusion, imputation is a powerful method to predict genotypes and estimate genetic risk for GWAS. However, a careful choice of reference population is needed to minimize biases inherent to this approac
Recommended from our members
A specific amino acid motif of HLA-DRB1 mediates risk and interacts with smoking history in Parkinson's disease.
Parkinson's disease (PD) is a neurodegenerative disease in which genetic risk has been mapped to HLA, but precise allelic associations have been difficult to infer due to limitations in genotyping methodology. Mapping PD risk at highest possible resolution, we performed sequencing of 11 HLA genes in 1,597 PD cases and 1,606 controls. We found that susceptibility to PD can be explained by a specific combination of amino acids at positions 70-74 on the HLA-DRB1 molecule. Previously identified as the primary risk factor in rheumatoid arthritis and referred to as the "shared epitope" (SE), the residues Q/R-K/R-R-A-A at positions 70-74 in combination with valine at position 11 (11-V) is highly protective in PD, while risk is attributable to the identical epitope in the absence of 11-V. Notably, these effects are modified by history of cigarette smoking, with a strong protective effect mediated by a positive history of smoking in combination with the SE and 11-V (P = 10-4; odds ratio, 0.51; 95% confidence interval, 0.36-0.72) and risk attributable to never smoking in combination with the SE without 11-V (P = 0.01; odds ratio, 1.51; 95% confidence interval, 1.08-2.12). The association of specific combinations of amino acids that participate in critical peptide-binding pockets of the HLA class II molecule implicates antigen presentation in PD pathogenesis and provides further support for genetic control of neuroinflammation in disease. The interaction of HLA-DRB1 with smoking history in disease predisposition, along with predicted patterns of peptide binding to HLA, provide a molecular model that explains the unique epidemiology of smoking in PD
Recommended from our members
miRNA contributions to pediatric-onset multiple sclerosis inferred from GWAS.
ObjectiveOnset of multiple sclerosis (MS) occurs in childhood for approximately 5% of cases (pediatric MS, or ped-MS). Epigenetic influences are strongly implicated in MS pathogenesis in adults, including the contribution from microRNAs (miRNAs), small noncoding RNAs that affect gene expression by binding target gene mRNAs. Few studies have specifically examined miRNAs in ped-MS, but individuals developing MS at an early age may carry a relatively high burden of genetic risk factors, and miRNA dysregulation may therefore play a larger role in the development of ped-MS than in adult-onset MS. This study aimed to look for evidence of miRNA involvement in ped-MS pathogenesis.MethodsGWAS results from 486 ped-MS cases and 1362 controls from the U.S. Pediatric MS Network and Kaiser Permanente Northern California membership were investigated for miRNA-specific signals. First, enrichment of miRNA-target gene network signals was evaluated using MIGWAS software. Second, SNPs in miRNA genes and in target gene binding sites (miR-SNPs) were tested for association with ped-MS, and pathway analysis was performed on associated target genes.ResultsMIGWAS analysis showed that miRNA-target gene signals were enriched in GWAS (P = 0.038) and identified 39 candidate biomarker miRNA-target gene pairs, including immune and neuronal signaling genes. The miR-SNP analysis implicated dysregulation of miRNA binding to target genes in five pathways, mainly involved in immune signaling.InterpretationEvidence from GWAS suggests that miRNAs play a role in ped-MS pathogenesis by affecting immune signaling and other pathways. Candidate biomarker miRNA-target gene pairs should be further studied for diagnostic, prognostic, and/or therapeutic utility
Transcription-based prediction of response to IFNbeta using supervised computational methods
Changes in cellular functions in response to drug therapy are mediated by specific transcriptional profiles resulting from the induction or repression in the activity of a number of genes, thereby modifying the preexisting gene activity pattern of the drug-targeted cell(s). Recombinant human interferon beta (rIFNbeta) is routinely used to control exacerbations in multiple sclerosis patients with only partial success, mainly because of adverse effects and a relatively large proportion of nonresponders. We applied advanced data-mining and predictive modeling tools to a longitudinal 70-gene expression dataset generated by kinetic reverse-transcription PCR from 52 multiple sclerosis patients treated with rIFNbeta to discover higher-order predictive patterns associated with treatment outcome and to define the molecular footprint that rIFNbeta engraves on peripheral blood mononuclear cells. We identified nine sets of gene triplets whose expression, when tested before the initiation of therapy, can predict the response to interferon beta with up to 86% accuracy. In addition, time-series analysis revealed potential key players involved in a good or poor response to interferon beta. Statistical testing of a random outcome class and tolerance to noise was carried out to establish the robustness of the predictive models. Large-scale kinetic reverse-transcription PCR, coupled with advanced data-mining efforts, can effectively reveal preexisting and drug-induced gene expression signatures associated with therapeutic effects
Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis
Multiple sclerosis (MS), a chronic disorder of the central nervous system and common cause of neurological disability in young adults, is characterized by moderate but complex risk heritability. Here we report the results of a genome-wide association study performed in a 1000 prospective case series of well-characterized individuals with MS and group-matched controls using the Sentrix® HumanHap550 BeadChip platform from Illumina. After stringent quality control data filtering, we compared allele frequencies for 551 642 SNPs in 978 cases and 883 controls and assessed genotypic influences on susceptibility, age of onset, disease severity, as well as brain lesion load and normalized brain volume from magnetic resonance imaging exams. A multi-analytical strategy identified 242 susceptibility SNPs exceeding established thresholds of significance, including 65 within the MHC locus in chromosome 6p21.3. Independent replication confirms a role for GPC5, a heparan sulfate proteoglycan, in disease risk. Gene ontology-based analysis shows a functional dichotomy between genes involved in the susceptibility pathway and those affecting the clinical phenotyp
- …