61 research outputs found

    SNP-based pathway enrichment analysis for genome-wide association studies

    Background: Genome-wide association studies (GWAS) have attracted a surge of interest as a way to discover the genetic basis of complex diseases. Many genetic variations, mostly in the form of single nucleotide polymorphisms (SNPs), have been identified in a wide spectrum of diseases, including diabetes, cancer, and psychiatric diseases. A common theme arising from these studies is that the genetic variations discovered by GWAS explain only a small fraction of the genetic risk associated with complex diseases. New strategies and statistical approaches are needed to address this gap. One such approach is pathway analysis, which considers the genetic variations underlying a biological pathway jointly, rather than separately as in traditional GWAS. A critical challenge in pathway analysis is how to combine evidence of association over multiple SNPs within a gene and multiple genes within a pathway. Most current methods choose the most significant SNP from each gene as a representative, ignoring the joint action of multiple SNPs within a gene. This approach leads to preferential identification of genes with a greater number of SNPs.

    Results: We describe a SNP-based pathway enrichment method for GWAS. The method consists of two main steps: 1) for a given pathway, using an adaptive truncated product statistic to identify all representative (potentially more than one) SNPs of each gene, calculating the average number of representative SNPs per gene, then re-selecting the representative SNPs of genes in the pathway based on this number; and 2) ranking all selected SNPs by the significance of their statistical association with a trait of interest, and testing whether the set of SNPs from a particular pathway is significantly enriched with high ranks using a weighted Kolmogorov-Smirnov test. We applied our method to two large, genetically distinct GWAS data sets of schizophrenia, one European-American (EA) and the other African-American (AA). In the EA data set, we found 22 pathways with a nominal P-value less than or equal to 0.001 and a corresponding false discovery rate (FDR) below 5%. In the AA data set, we found 11 pathways under the same nominal P-value and FDR thresholds. Interestingly, 8 of these pathways overlap with those found in the EA sample. We have implemented our method in a Java software package called SNP Set Enrichment Analysis (SSEA), which has a user-friendly interface and is freely available at http://cbcl.ics.uci.edu/SSEA.

    Conclusions: The SNP-based pathway enrichment method described here offers a new alternative approach for analysing GWAS data. By applying it to schizophrenia GWAS, we show that our method is able to identify statistically significant pathways and, importantly, pathways that can be replicated in large, genetically distinct samples.
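    As an illustration of step 2 in the abstract above, the following is a minimal sketch of a weighted Kolmogorov-Smirnov running-sum statistic in the style of gene set enrichment analysis. It is not the SSEA implementation: the function name, the `weight` exponent, and the choice of score (for example, -log10 of the association P-value) are assumptions made for this example.

```python
def weighted_ks_enrichment(ranked_scores, in_set, weight=1.0):
    """Weighted KS running-sum enrichment score (illustrative sketch).

    ranked_scores: association scores sorted from most to least significant
                   (assumed here to be something like -log10 P).
    in_set:        booleans, True where the SNP at that rank belongs to the
                   pathway's SNP set.
    weight:        hypothetical exponent; 0 gives the unweighted KS statistic.
    Returns the signed maximum deviation of the running sum from zero.
    """
    if len(ranked_scores) != len(in_set):
        raise ValueError("ranked_scores and in_set must have the same length")
    n = len(ranked_scores)
    n_hits = sum(1 for h in in_set if h)
    if n_hits == 0 or n_hits == n:
        raise ValueError("need at least one SNP inside and one outside the set")

    # Normaliser for the weighted steps taken at SNPs inside the set.
    hit_total = sum(abs(s) ** weight for s, h in zip(ranked_scores, in_set) if h)
    if hit_total == 0:
        raise ValueError("all scores inside the set are zero")
    miss_step = 1.0 / (n - n_hits)  # equal step down for SNPs outside the set

    running = 0.0
    best = 0.0
    for score, hit in zip(ranked_scores, in_set):
        if hit:
            running += abs(score) ** weight / hit_total
        else:
            running -= miss_step
        if abs(running) > abs(best):
            best = running
    return best


# Usage: pathway SNPs concentrated at the top of the ranking score near +1,
# pathway SNPs concentrated at the bottom score near -1.
top = weighted_ks_enrichment([5, 4, 3, 2, 1], [True, True, False, False, False])
bottom = weighted_ks_enrichment([5, 4, 3, 2, 1], [False, False, False, True, True])
```

    In enrichment methods of this kind, significance of the score is commonly assessed by permutation; the abstract does not say how SSEA derives its nominal P-values, so that step is left out of the sketch.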

    Association of the rs2242446 polymorphism in the norepinephrine transporter gene SLC6A2 and anxious arousal symptoms of posttraumatic stress disorder

    To the Editor: Recently, we found that greater norepinephrine transporter (NET) availability in the locus ceruleus of trauma survivors with posttraumatic stress disorder (PTSD) was associated with increased severity of anxious arousal (ie, hypervigilance and exaggerated startle) symptoms, but not any of the other empirically derived symptom clusters that characterize this disorder.1 This finding suggests that greater NET availability in the locus ceruleus may serve a compensatory function of clearing elevated synaptic norepinephrine and maintaining anxious arousal symptoms in persons with PTSD.

    Increased CNV-Region Deletions in Mild Cognitive Impairment (MCI) and Alzheimer's Disease (AD) Subjects in the ADNI Sample

    We investigated the genome-wide distribution of CNVs in the Alzheimer's disease (AD) Neuroimaging Initiative (ADNI) sample (146 with AD, 313 with Mild Cognitive Impairment (MCI), and 181 controls). Comparison of single CNVs between cases (MCI and AD) and controls shows overrepresentation of large heterozygous deletions in cases (p-value < 0.0001). The analysis of CNV-Regions identifies 44 copy number variable loci of heterozygous deletions, with more CNV-Regions among affected than controls (p = 0.005). Seven of the 44 CNV-Regions are nominally significant for association with cognitive impairment. We validated and confirmed our main findings with genome re-sequencing of selected patients and controls. The functional pathway analysis of the genes putatively affected by deletions of CNV-Regions reveals enrichment of genes implicated in axonal guidance, cell–cell adhesion, neuronal morphogenesis and differentiation. Our findings support the role of CNVs in AD, and suggest an association between large deletions and the development of cognitive impairment.

    SNPLims: a data management system for genome wide association studies

    Background: Recent progress in genotyping technologies allows the generation of high-density genetic maps using hundreds of thousands of genetic markers for each DNA sample. The availability of this large amount of genotypic data facilitates the whole-genome search for the genetic basis of diseases. A suitable information management system is needed to efficiently manage the data flow produced by whole-genome genotyping and to make it available for further analyses.

    Results: We have developed an information system mainly devoted to the storage and management of SNP genotype data produced by the Illumina platform, loading the raw genotyping outputs into a relational database. The relational database can be accessed to import any existing data and to export user-defined formats compatible with many different genetic analysis programs. After family-based or case-control association analyses have been run, the results can be imported into SNPLims. One of the main features is that the user can rapidly identify and annotate statistically relevant polymorphisms from the large volume of data analyzed. Results can be visualized graphically or written to ASCII comma-separated output files, which can be used as input to further analyses.

    Conclusions: The proposed infrastructure makes it possible to manage a relatively large number of genotypes for each sample and an arbitrary number of samples and phenotypes. Moreover, it enables users to control the quality of the data, to perform the most common screening analyses, and to identify genes that become "candidates" for the disease under consideration.