35 research outputs found

    Clusters of Adaptive Evolution in the Human Genome

    Get PDF
    Considerable work has been devoted to identifying regions of the human genome that have been subjected to recent positive selection. Although detailed follow-up studies of putatively selected regions are critical for a deeper understanding of human evolutionary history, such studies have received comparably less attention. Recently, we have shown that ALMS1 has been the target of recent positive selection acting on standing variation in Eurasian populations. Here, we describe a careful follow-up analysis of genetic variation across the ALMS1 region, which unexpectedly revealed a cluster of substrates of positive selection. Specifically, through the analysis of SNP data from the HapMap and Human Genome Diversity Project–Centre d’Etude du Polymorphisme Humain samples as well sequence data from the region, we find compelling evidence for three independent and distinct signals of recent positive selection across this 3 Mb region surrounding ALMS1. Moreover, we analyzed the HapMap data to identify other putative clusters of independent selective events and conservatively discovered 19 additional clusters of adaptive evolution. This work has important implications for the interpretation of genome-scans for positive selection in humans and more broadly contributes to a better understanding of how recent positive selection has shaped genetic variation across the human genome

    Patterns of Ancestry, Signatures of Natural Selection, and Genetic Association with Stature in Western African Pygmies

    Get PDF
    African Pygmy groups show a distinctive pattern of phenotypic variation, including short stature, which is thought to reflect past adaptation to a tropical environment. Here, we analyze Illumina 1M SNP array data in three Western Pygmy populations from Cameroon and three neighboring Bantu-speaking agricultural populations with whom they have admixed. We infer genome-wide ancestry, scan for signals of positive selection, and perform targeted genetic association with measured height variation. We identify multiple regions throughout the genome that may have played a role in adaptive evolution, many of which contain loci with roles in growth hormone, insulin, and insulin-like growth factor signaling pathways, as well as immunity and neuroendocrine signaling involved in reproduction and metabolism. The most striking results are found on chromosome 3, which harbors a cluster of selection and association signals between approximately 45 and 60 Mb. This region also includes the positional candidate genes DOCK3, which is known to be associated with height variation in Europeans, and CISH, a negative regulator of cytokine signaling known to inhibit growth hormone-stimulated STAT5 signaling. Finally, pathway analysis for genes near the strongest signals of association with height indicates enrichment for loci involved in insulin and insulin-like growth factor signaling

    Common Treatment, Common Variant: Evolutionary Prediction of Functional Pharmacogenomic Variants

    No full text
    Pharmacogenomics holds the promise of personalized drug efficacy optimization and drug toxicity minimization. Much of the research conducted to date, however, suffers from an ascertainment bias towards European participants. Here, we leverage publicly available, whole genome sequencing data collected from global populations, evolutionary characteristics, and annotated protein features to construct a new in silico machine learning pharmacogenetic identification method called XGB-PGX. When applied to pharmacogenetic data, XGB-PGX outperformed all existing prediction methods and identified over 2000 new pharmacogenetic variants. While there are modest pharmacogenetic allele frequency distribution differences across global population samples, the most striking distinction is between the relatively rare putatively neutral pharmacogene variants and the relatively common established and newly predicted functional pharamacogenetic variants. Our findings therefore support a focus on individual patient pharmacogenetic testing rather than on clinical presumptions about patient race, ethnicity, or ancestral geographic residence. We further encourage more attention be given to the impact of common variation on drug response and propose a new ‘common treatment, common variant’ perspective for pharmacogenetic prediction that is distinct from the types of variation that underlie complex and Mendelian disease. XGB-PGX has identified many new pharmacovariants that are present across all global communities; however, communities that have been underrepresented in genomic research are likely to benefit the most from XGB-PGX’s in silico predictions

    Challenges in Translating GWAS Results to Clinical Care

    No full text
    Clinical genetic testing for Mendelian disorders is standard of care in many cases; however, it is less clear to what extent and in which situations clinical genetic testing may improve preventive efforts, diagnosis and/or prognosis of complex disease. One challenge is that much of the reported research relies on tag single nucleotide polymorphisms (SNPs) to act as proxies for assumed underlying functional variants that are not yet known. Here we use coronary artery disease and melanoma as case studies to evaluate how well reported genetic risk variants tag surrounding variants across population samples in the 1000 Genomes Project Phase 3 data. We performed a simulation study where we randomly assigned a “functional” variant and evaluated how often this simulated functional variant was correctly tagged in diverse population samples. Our results indicate a relatively large error rate when generalizing increased genetic risk of complex disease across diverse population samples, even when generalizing within geographic regions. Our results further highlight the importance of including diverse populations in genome-wide association studies. Future work focused on identifying functional variants will eliminate the need for tag SNPs; however, until functional variants are known, caution should be used in the interpretation of genetic risk for complex disease using tag SNPs

    Genetic and genomic stability across lymphoblastoid cell line expansions

    No full text
    Abstract Objective Lymphoblastoid cell lines are widely used in genetic and genomic studies. Previous work has characterized variant stability in transformed culture and across culture passages. Our objective was to extend this work to evaluate single nucleotide polymorphism and structural variation across cell line expansions, which are commonly used in biorepository distribution. Our study used DNA and cell lines sampled from six research participants. We assayed genome-wide genetic variants and inferred structural variants for DNA extracted from blood, from transformed cell cultures, and from three generations of expansions. Results Single nucleotide variation was stable between DNA and expanded cell lines (ranging from 99.90 to 99.98% concordance). Structural variation was less consistent across expansions (median 33% concordance) with a noticeable decrease in later expansions. In summary, we demonstrate consistency between SNPs assayed from whole blood DNA and LCL DNA; however, more caution should be taken in using LCL DNA to study structural variation

    e-GRASP: an integrated evolutionary and GRASP resource for exploring disease associations

    No full text
    Abstract Background Genome-wide association studies (GWAS) have become a mainstay of biological research concerned with discovering genetic variation linked to phenotypic traits and diseases. Both discrete and continuous traits can be analyzed in GWAS to discover associations between single nucleotide polymorphisms (SNPs) and traits of interest. Associations are typically determined by estimating the significance of the statistical relationship between genetic loci and the given trait. However, the prioritization of bona fide, reproducible genetic associations from GWAS results remains a central challenge in identifying genomic loci underlying common complex diseases. Evolutionary-aware meta-analysis of the growing GWAS literature is one way to address this challenge and to advance from association to causation in the discovery of genotype-phenotype relationships. Description We have created an evolutionary GWAS resource to enable in-depth query and exploration of published GWAS results. This resource uses the publically available GWAS results annotated in the GRASP2 database. The GRASP2 database includes results from 2082 studies, 177 broad phenotype categories, and ~8.87 million SNP-phenotype associations. For each SNP in e-GRASP, we present information from the GRASP2 database for convenience as well as evolutionary information (e.g., rate and timespan). Users can, therefore, identify not only SNPs with highly significant phenotype-association P-values, but also SNPs that are highly replicated and/or occur at evolutionarily conserved sites that are likely to be functionally important. Additionally, we provide an evolutionary-adjusted SNP association ranking (E-rank) that uses cross-species evolutionary conservation scores and population allele frequencies to transform P-values in an effort to enhance the discovery of SNPs with a greater probability of biologically meaningful disease associations. Conclusion By adding an evolutionary dimension to the GWAS results available in the GRASP2 database, our e-GRASP resource will enable a more effective exploration of SNPs not only by the statistical significance of trait associations, but also by the number of studies in which associations have been replicated, and the evolutionary context of the associated mutations. Therefore, e-GRASP will be a valuable resource for aiding researchers in the identification of bona fide, reproducible genetic associations from GWAS results. This resource is freely available at http://www.mypeg.info/egrasp
    corecore