94 research outputs found

    SNPPicker: High quality tag SNP selection across multiple populations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Linkage Disequilibrium (LD) bin-tagging algorithms identify a reduced set of tag SNPs that can capture the genetic variation in a population without genotyping every single SNP. However, existing tag SNP selection algorithms for designing custom genotyping panels do not take into account all platform dependent factors affecting the likelihood of a tag SNP to be successfully genotyped and many of the constraints that can be imposed by the user.</p> <p>Results</p> <p>SNPPicker optimizes the selection of tag SNPs from common bin-tagging programs to design custom genotyping panels. The application uses a multi-step search strategy in combination with a statistical model to maximize the genotyping success of the selected tag SNPs. User preference toward functional SNPs can also be taken into account as secondary criteria. SNPPicker can also optimize tag SNP selection for a panel tagging multiple populations. SNPPicker can optimize custom genotyping panels including all the assay-specific constraints of Illumina's GoldenGate and Infinium assays.</p> <p>Conclusions</p> <p>A new application has been developed to maximize the success of custom multi-population genotyping panels. SNPPicker also takes into account user constraints including options for controlling runtime. Perl Scripts, Java source code and executables are available under an open source license for download at <url>http://mayoresearch.mayo.edu/mayo/research/biostat/software.cfm</url></p

    GLOSSI: a method to assess the association of genetic loci-sets with complex diseases

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The developments of high-throughput genotyping technologies, which enable the simultaneous genotyping of hundreds of thousands of single nucleotide polymorphisms (SNP) have the potential to increase the benefits of genetic epidemiology studies. Although the enhanced resolution of these platforms increases the chance of interrogating functional SNPs that are themselves causative or in linkage disequilibrium with causal SNPs, commonly used single SNP-association approaches suffer from serious multiple hypothesis testing problems and provide limited insights into combinations of loci that may contribute to complex diseases. Drawing inspiration from Gene Set Enrichment Analysis developed for gene expression data, we have developed a method, named GLOSSI (Gene-loci Set Analysis), that integrates prior biological knowledge into the statistical analysis of genotyping data to test the association of a group of SNPs (loci-set) with complex disease phenotypes. The most significant loci-sets can be used to formulate hypotheses from a functional viewpoint that can be validated experimentally.</p> <p>Results</p> <p>In a simulation study, GLOSSI showed sufficient power to detect loci-sets with less than 10% of SNPs having moderate-to-large effect sizes and intermediate minor allele frequency values. When applied to a biological dataset where no single SNP-association was found in a previous study, GLOSSI was able to identify several loci-sets that are significantly related to blood pressure response to an antihypertensive drug.</p> <p>Conclusion</p> <p>GLOSSI is valuable for association of SNPs at multiple genetic loci with complex disease phenotypes. In contrast to methods based on the Kolmogorov-Smirnov statistic, the approach is parametric and only utilizes information from within the interrogated loci-set. It properly accounts for dependency among SNPs and allows the testing of loci-sets of any size.</p

    A Targeted Genetic Association Study of Epithelial Ovarian Cancer Susceptibility

    Get PDF
    BACKGROUND: Genome-wide association studies have identified several common susceptibility alleles for epithelial ovarian cancer (EOC). To further understand EOC susceptibility, we examined previously ungenotyped candidate variants, including uncommon variants and those residing within known susceptibility loci. RESULTS: At nine of eleven previously published EOC susceptibility regions (2q31, 3q25, 5p15, 8q21, 8q24, 10p12, 17q12, 17q21.31, and 19p13), novel variants were identified that were more strongly associated with risk than previously reported variants. Beyond known susceptibility regions, no variants were found to be associated with EOC risk at genome-wide statistical significance (p \u3c5x10(-8)), nor were any significant after Bonferroni correction for 17,000 variants (p\u3c 3x10-6). METHODS: A customized genotyping array was used to assess over 17,000 variants in coding, non-coding, regulatory, and known susceptibility regions in 4,973 EOC cases and 5,640 controls from 13 independent studies. Susceptibility for EOC overall and for select histotypes was evaluated using logistic regression adjusted for age, study site, and population substructure. CONCLUSION: Given the novel variants identified within the 2q31, 3q25, 5p15, 8q21, 8q24, 10p12, 17q12, 17q21.31, and 19p13 regions, larger follow-up genotyping studies, using imputation where necessary, are needed for fine-mapping and confirmation of low frequency variants that fall below statistical significance

    Bipolar disorder with binge eating behavior: a genome-wide association study implicates PRR5-ARHGAP8

    Get PDF
    Bipolar disorder (BD) is associated with binge eating behavior (BE), and both conditions are heritable. Previously, using data from the Genetic Association Information Network (GAIN) study of BD, we performed genome-wide association (GWA) analyses of BD with BE comorbidity. Here, utilizing data from the Mayo Clinic BD Biobank (969 BD cases, 777 controls), we performed a GWA analysis of a BD subtype defined by BE, and case-only analysis comparing BD subjects with and without BE. We then performed a meta-analysis of the Mayo and GAIN results. The meta-analysis provided genome-wide significant evidence of association between single nucleotide polymorphisms (SNPs) in PRR5-ARHGAP8 and BE in BD cases (rs726170 OR=1.91, P=3.05E-08). In the meta-analysis comparing cases with BD with comorbid BE vs. non-BD controls, a genome-wide significant association was observed at SNP rs111940429 in an intergenic region near PPP1R2P5 (p=1.21E-08). PRR5-ARHGAP8 is a read-through transcript resulting in a fusion protein of PRR5 and ARHGAP8. PRR5 encodes a subunit of mTORC2, a serine/threonine kinase that participates in food intake regulation, while ARHGAP8 encodes a member of the RhoGAP family of proteins that mediate cross-talk between Rho GTPases and other signaling pathways. Without BE information in controls, it is not possible to determine whether the observed association reflects a risk factor for BE in general, risk for BE in individuals with BD, or risk of a subtype of BD with BE. The effect of PRR5-ARHGAP8 on BE risk thus warrants further investigation

    Genome-Wide Transcriptional Profiling Reveals MicroRNA-Correlated Genes and Biological Processes in Human Lymphoblastoid Cell Lines

    Get PDF
    Expression level of many genes shows abundant natural variation in human populations. The variations in gene expression are believed to contribute to phenotypic differences. Emerging evidence has shown that microRNAs (miRNAs) are one of the key regulators of gene expression. However, past studies have focused on the miRNA target genes and used loss- or gain-of-function approach that may not reflect natural association between miRNA and mRNAs.To examine miRNA regulatory effect on global gene expression under endogenous condition, we performed pair-wise correlation coefficient analysis on expression levels of 366 miRNAs and 14,174 messenger RNAs (mRNAs) in 90 immortalized lymphoblastoid cell lines, and observed significant correlations between the two species of RNA transcripts. We identified a total of 7,207 significantly correlated miRNA-mRNA pairs (false discovery rate q<0.01). Of those, 4,085 pairs showed positive correlations while 3,122 pairs showed negative correlations. Gene ontology analyses on the miRNA-correlated genes revealed significant enrichments in several biological processes related to cell cycle, cell communication and signal transduction. Individually, each of three miRNAs (miR-331, -98 and -33b) demonstrated significant correlation with the genes in cell cycle-related biological processes, which is consistent with important role of miRNAs in cell cycle regulation.This study demonstrates feasibility of using naturally expressed transcript profiles to identify endogenous correlation between miRNA and miRNA. By applying this genome-wide approach, we have identified thousands of miRNA-correlated genes and revealed potential role of miRNAs in several important cellular functions. The study results along with accompanying data sets will provide a wealth of high-throughput data to further evaluate the miRNA-regulated genes and eventually in phenotypic variations of human populations

    Integrated genomic characterization of pancreatic ductal adenocarcinoma

    Get PDF
    We performed integrated genomic, transcriptomic, and proteomic profiling of 150 pancreatic ductal adenocarcinoma (PDAC) specimens, including samples with characteristic low neoplastic cellularity. Deep whole-exome sequencing revealed recurrent somatic mutations in KRAS, TP53, CDKN2A, SMAD4, RNF43, ARID1A, TGFβR2, GNAS, RREB1, and PBRM1. KRAS wild-type tumors harbored alterations in other oncogenic drivers, including GNAS, BRAF, CTNNB1, and additional RAS pathway genes. A subset of tumors harbored multiple KRAS mutations, with some showing evidence of biallelic mutations. Protein profiling identified a favorable prognosis subset with low epithelial-mesenchymal transition and high MTOR pathway scores. Associations of non-coding RNAs with tumor-specific mRNA subtypes were also identified. Our integrated multi-platform analysis reveals a complex molecular landscape of PDAC and provides a roadmap for precision medicine
    corecore