17 research outputs found

    The GWAS-MAP platform for aggregation of results of genome-wide association studies and the GWAS-MAP|homo database of 70 billion genetic associations of human traits

    Get PDF
    Hundreds of genome-wide association studies (GWAS) of human traits are performed each year. The results of GWAS are often published in the form of summary statistics. Information from summary statistics can be used for multiple purposes – from fundamental research in biology and genetics to the search for potential biomarkers and therapeutic targets. While the amount of GWAS summary statistics collected by the scientific community is rapidly increasing, the use of this data is limited by the lack of generally accepted standards. In particular, the researchers who would like to use GWAS summary statistics in their studies have to become aware that the data are scattered across multiple websites, are presented in a variety of formats, and, often, were not quality controlled. Moreover, each available summary statistics analysis tools will ask for data to be presented in their own internal format. To address these issues, we developed GWAS-MAP, a high-throughput platform for aggregating, storing, analyzing, visualizing and providing access to a database of big data that result from region- and genome-wide association studies. The database currently contains information on more than 70 billion associations between genetic variants and human diseases, quantitative traits, and β€œomics” traits. The GWAS-MAP platform and database can be used for studying the etiology of human diseases, building predictive risk models and finding potential biomarkers and therapeutic interventions. In order to demonstrate a typical application of the platform as an approach for extracting new biological knowledge and establishing mechanistic hypotheses, we analyzed varicose veins, a disease affecting on average every third adult in Russia. The results of analysis confirmed known epidemiologic associations for this disease and led us to propose a hypothesis that increased levels of MICB and CD209 proteins in human plasma may increase susceptibility to varicose veins

    The GWAS-MAP|ovis platform for aggregation and analysis of genome-wide association study results in sheep

    Get PDF
    In recent years, the number of genome-wide association studies (GWAS) carried out for various economically important animal traits has been increasing. GWAS discoveries provide summary statistics that can be used both for targeted marker-oriented selection and for studying the genetic control of economically important traits of farm animals. In contrast to research in human genetics, GWAS on farm animals often does not meet generally accepted standards (availability of information about effect and reference alleles, the size and direction of the effect, etc.). This greatly complicates the use of GWAS results for breeding needs. Within the framework of human genetics, there are several technological solutions for researching the harmonized results of GWAS, including one of the largest, the GWAS-MAP platform. For other types of living organisms, including economically important agricultural animals, there are no similar solutions. To our knowledge, no similar solution has been proposed to date for any of the species of economically important animals. As part of this work, we focused on creating a platform similar to GWAS-MAP for working with the results of GWAS of sheep, since sheep breeding is one of the most important branches of agriculture. By analogy with the GWAS-MAP platform for storing, unifying and analyzing human GWAS, we have created the GWAS-MAP|ovis platform. The platform currently contains information on more than 34 million associations between genomic sequence variants and traits of meat production in sheep. The platform can also be used to conduct colocalization analysis, a method that allows one to determine whether the association of a particular locus with two different traits is the result of pleiotropy or whether these traits are associated with different variants that are in linkage disequilibrium. This platform will be useful for breeders to select promising markers for breeding, as well as to obtain information for the introduction of genomic breeding and for scientists to replicate the results obtained

    The GWAS-MAP|ovis platform for aggregation and analysis of genome-wide association study results in sheep.

    Full text link
    peer reviewedIn recent years, the number of genome-wide association studies (GWAS) carried out for various economically important animal traits has been increasing. GWAS discoveries provide summary statistics that can be used both for targeted marker-oriented selection and for studying the genetic control of economically important traits of farm animals. In contrast to research in human genetics, GWAS on farm animals often does not meet generally accepted standards (availability of information about effect and reference alleles, the size and direction of the effect, etc.). This greatly complicates the use of GWAS results for breeding needs. Within the framework of human genetics, there are several technological solutions for researching the harmonized results of GWAS, including one of the largest, the GWAS-MAP platform. For other types of living organisms, including economically important agricultural animals, there are no similar solutions. To our knowledge, no similar solution has been proposed to date for any of the species of economically important animals. As part of this work, we focused on creating a platform similar to GWAS-MAP for working with the results of GWAS of sheep, since sheep breeding is one of the most important branches of agriculture. By analogy with the GWAS-MAP platform for storing, unifying and analyzing human GWAS, we have created the GWAS-MAP|ovis platform. The platform currently contains information on more than 34 million associations between genomic sequence variants and traits of meat production in sheep. The platform can also be used to conduct colocalization analysis, a method that allows one to determine whether the association of a particular locus with two different traits is the result of pleiotropy or whether these traits are associated with different variants that are in linkage disequilibrium. This platform will be useful for breeders to select promising markers for breeding, as well as to obtain information for the introduction of genomic breeding and for scientists to replicate the results obtained.Π’ послСдниС Π³ΠΎΠ΄Ρ‹ увСличиваСтся количСство ΠΏΠΎΠ»Π½ΠΎΠ³Π΅Π½ΠΎΠΌΠ½Ρ‹Ρ… исслСдований ассоциаций (ΠŸΠ“Π˜Π, GWAS), ΠΏΡ€ΠΎΠ²Π΅Π΄Π΅Π½Π½Ρ‹Ρ… для Ρ€Π°Π·Π»ΠΈΡ‡Π½Ρ‹Ρ… экономичСски Π²Π°ΠΆΠ½Ρ‹Ρ… ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠΎΠ² ΠΆΠΈΠ²ΠΎΡ‚Π½Ρ‹Ρ…. Π Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ этих исслСдований прСдставлСны Π² Π²ΠΈΠ΄Π΅ суммарных статистик, ΠΊΠΎΡ‚ΠΎΡ€Ρ‹Π΅ ΠΌΠΎΠΆΠ½ΠΎ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΠΎΠ²Π°Ρ‚ΡŒ для изучСния гСнСтичСского контроля экономичСски Π²Π°ΠΆΠ½Ρ‹Ρ… ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠΎΠ² ΡΠ΅Π»ΡŒΡΠΊΠΎΡ…ΠΎΠ·ΡΠΉΡΡ‚Π²Π΅Π½Π½Ρ‹Ρ… ΠΆΠΈΠ²ΠΎΡ‚Π½Ρ‹Ρ…, Π² Ρ‚ΠΎΠΌ числС ΠΈ ΠΏΡ€ΠΈ Ρ€Π°Π·Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ ΠΌΠ΅Ρ‚ΠΎΠ΄ΠΈΠΊ ΠΌΠ°Ρ€ΠΊΠ΅Ρ€-ΠΎΡ€ΠΈΠ΅Π½Ρ‚ΠΈΡ€ΠΎΠ²Π°Π½Π½ΠΎΠΉ сСлСкции. Π’ Π±ΠΎΠ»ΡŒΡˆΠΈΠ½ΡΡ‚Π²Π΅ случаСв ΠŸΠ“Π˜Π ΡΠ΅Π»ΡŒΡΠΊΠΎΡ…ΠΎΠ·ΡΠΉΡΡ‚Π²Π΅Π½Π½Ρ‹Ρ… ΠΆΠΈΠ²ΠΎΡ‚Π½Ρ‹Ρ… Π½Π΅ ΡΠΎΠΎΡ‚Π²Π΅Ρ‚ΡΡ‚Π²ΡƒΡŽΡ‚ общСпринятым Π² области исслСдований Π³Π΅Π½Π΅Ρ‚ΠΈΠΊΠΈ Ρ‡Π΅Π»ΠΎΠ²Π΅ΠΊΠ° стандартам Ρ„ΠΎΡ€ΠΌΠ°Ρ‚Π° ΠΏΡƒΠ±Π»ΠΈΠΊΠ°Ρ†ΠΈΠΉ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚ΠΎΠ² ΠŸΠ“Π˜Π Π² Π²ΠΈΠ΄Π΅ суммарных статистик (Π½Π°Π»ΠΈΡ‡ΠΈΠ΅ ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΠΈ ΠΎΠ± эффСкторном ΠΈ рСфСрСнсном аллСлях, Π·Π½Π°Ρ‡Π΅Π½ΠΈΠ΅ ΠΈ Π½Π°ΠΏΡ€Π°Π²Π»Π΅Π½ΠΈΠ΅ эффСкта ΠΈ Π΄Ρ€.). Π­Ρ‚ΠΎ сущСствСнно затрудняСт использованиС суммарных статистик для Π½ΡƒΠΆΠ΄ сСлСкции. Π’ области исслСдований Π³Π΅Π½Π΅Ρ‚ΠΈΠΊΠΈ Ρ‡Π΅Π»ΠΎΠ²Π΅ΠΊΠ° имССтся нСсколько тСхнологичСских Ρ€Π΅ΡˆΠ΅Π½ΠΈΠΉ для Π°Π½Π°Π»ΠΈΠ·Π° Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚ΠΎΠ² ΠŸΠ“Π˜Π, Π² Ρ‚ΠΎΠΌ числС ΠΎΠ΄Π½ΠΎ ΠΈΠ· самых ΠΊΡ€ΡƒΠΏΠ½Ρ‹Ρ… – ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠ° GWAS-MAP. Для Π΄Ρ€ΡƒΠ³ΠΈΡ… Π²ΠΈΠ΄ΠΎΠ² ΠΆΠΈΠ²Ρ‹Ρ… ΠΎΡ€Π³Π°Π½ΠΈΠ·ΠΌΠΎΠ², Π²ΠΊΠ»ΡŽΡ‡Π°ΡŽΡ‰ΠΈΡ… ΠΈ экономичСски Π²Π°ΠΆΠ½Ρ‹Ρ… ΡΠ΅Π»ΡŒΡΠΊΠΎΡ…ΠΎΠ·ΡΠΉΡΡ‚Π²Π΅Π½Π½Ρ‹Ρ… ΠΆΠΈΠ²ΠΎΡ‚Π½Ρ‹Ρ…, ΠΏΠΎΠ΄ΠΎΠ±Π½Ρ‹Ρ… Ρ€Π΅ΡˆΠ΅Π½ΠΈΠΉ Π½Π΅Ρ‚. Π’ настоящСй Ρ€Π°Π±ΠΎΡ‚Π΅ ΠΌΡ‹ ΡΡ„ΠΎΠΊΡƒΡΠΈΡ€ΠΎΠ²Π°Π»ΠΈΡΡŒ Π½Π° создании схоТСй ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΡ‹ для Ρ€Π°Π±ΠΎΡ‚Ρ‹ с суммарными статистиками ΠŸΠ“Π˜Π Ρ€Π°Π·Π»ΠΈΡ‡Π½Ρ‹Ρ… ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠΎΠ² ΠΎΠ²Π΅Ρ†, Ρ‚Π°ΠΊ ΠΊΠ°ΠΊ овцСводство Π² послСднСС врСмя становится всС Π±ΠΎΠ»Π΅Π΅ Π°ΠΊΡ‚ΡƒΠ°Π»ΡŒΠ½ΠΎΠΉ ΠΎΠ±Π»Π°ΡΡ‚ΡŒΡŽ сСльского хозяйства. По Π°Π½Π°Π»ΠΎΠ³ΠΈΠΈ с ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠΎΠΉ GWAS-MAP для хранСния, ΡƒΠ½ΠΈΡ„ΠΈΠΊΠ°Ρ†ΠΈΠΈ ΠΈ Π°Π½Π°Π»ΠΈΠ·Π° GWAS Ρ‡Π΅Π»ΠΎΠ²Π΅ΠΊΠ° ΠΌΡ‹ создали ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΡƒ GWAS-MAP|ovis. На сСгодняшний дСнь ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠ° содСрТит ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΡŽ ΠΎ Π±ΠΎΠ»Π΅Π΅ Ρ‡Π΅ΠΌ 34 ΠΌΠ»Π½ ассоциаций ΠΌΠ΅ΠΆΠ΄Ρƒ Π²Π°Ρ€ΠΈΠ°Π½Ρ‚Π°ΠΌΠΈ Π³Π΅Π½ΠΎΠΌΠ½ΠΎΠΉ ΠΏΠΎΡΠ»Π΅Π΄ΠΎΠ²Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ ΠΈ ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠ°ΠΌΠΈ мясной продуктивности. ΠŸΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠ° ΠΌΠΎΠΆΠ΅Ρ‚ Π±Ρ‹Ρ‚ΡŒ использована ΠΈ для провСдСния Π°Π½Π°Π»ΠΈΠ·Π° ΠΊΠΎΠ»ΠΎΠΊΠ°Π»ΠΈΠ·Π°Ρ†ΠΈΠΈ – ΠΌΠ΅Ρ‚ΠΎΠ΄Π°, ΠΊΠΎΡ‚ΠΎΡ€Ρ‹ΠΉ позволяСт ΡƒΡΡ‚Π°Π½ΠΎΠ²ΠΈΡ‚ΡŒ, являСтся Π»ΠΈ ассоциация ΠΎΠΏΡ€Π΅Π΄Π΅Π»Π΅Π½Π½ΠΎΠ³ΠΎ локуса с двумя Ρ€Π°Π·Π½Ρ‹ΠΌΠΈ ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠ°ΠΌΠΈ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚ΠΎΠΌ ΠΏΠ»Π΅ΠΉΠΎΡ‚Ρ€ΠΎΠΏΠΈΠΈ ΠΈΠ»ΠΈ ΠΆΠ΅ Π΄Π°Π½Π½Ρ‹Π΅ ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠΈ ассоциированы с Ρ€Π°Π·Π½Ρ‹ΠΌΠΈ Π²Π°Ρ€ΠΈΠ°Π½Ρ‚Π°ΠΌΠΈ, ΠΊΠΎΡ‚ΠΎΡ€Ρ‹Π΅ находятся Π² нСравновСсии ΠΏΠΎ ΡΡ†Π΅ΠΏΠ»Π΅Π½ΠΈΡŽ. Π­Ρ‚Π° ΠΏΠ»Π°Ρ‚Ρ„ΠΎΡ€ΠΌΠ° Π±ΡƒΠ΄Π΅Ρ‚ ΠΏΠΎΠ»Π΅Π·Π½Π° ΠΊΠ°ΠΊ сСлСкционСрам для Π²Ρ‹Π±ΠΎΡ€Π° пСрспСктивных ΠΌΠ°Ρ€ΠΊΠ΅Ρ€ΠΎΠ² для сСлСкции (эффСкты ΠΈ Π°Π»Π»Π΅Π»ΠΈ Ρ€Π°Π·Π»ΠΈΡ‡Π½Ρ‹Ρ… ΠΌΠ°Ρ€ΠΊΠ΅Ρ€ΠΎΠ², Π²Π»ΠΈΡΡŽΡ‰ΠΈΡ… Π½Π° ΠΈΠ·ΡƒΡ‡Π°Π΅ΠΌΡ‹Π΅ ΠΏΡ€ΠΈΠ·Π½Π°ΠΊΠΈ), Ρ‚Π°ΠΊ ΠΈ для ΡƒΡ‡Π΅Π½Ρ‹Ρ…, Π²Π΅Π΄ΡƒΡ‰ΠΈΡ… исслСдования Π² области Π³Π΅Π½Π΅Ρ‚ΠΈΠΊΠΈ ΠΎΠ²Π΅Ρ†

    Negative heterosis for meiotic recombination rate inΒ spermatocytes of the domestic chicken Gallus gallus

    Get PDF
    Benefits and costs of meiotic recombination are a matter of discussion. Because recombination breaks allele combinations already tested by natural selection and generates new ones of unpredictable fitness, a high recombination rate is generally beneficial for the populations living in a fluctuating or a rapidly changing environment and costly in a stable environment. Besides genetic benefits and costs, there are cytological effects of recombination, both positive and negative. Recombination is necessary for chromosome synapsis and segregation. However, it involves a massive generation of double-strand DNA breaks, erroneous repair of which may lead to germ cell death or various mutations and chromosome rearrangements. Thus, the benefits of recombination (generation of new allele combinations) would prevail over its costs (occurrence of deleterious mutations) as long as the population remains sufficiently heterogeneous. Using immunolocalization of MLH1, a mismatch repair protein, at the synaptonemal complexes, we examined the number and distribution of recombination nodules in spermatocytes of two chicken breeds with high (Pervomai) and low (Russian Crested) recombination rates and their F1 hybrids and backcrosses. We detected negative heterosis for recombination rate in the F1 hybrids. Backcrosses to the Pervomai breed were rather homogenous and showed an intermediate recombination rate. The differences in overall recombination rate between the breeds, hybrids and backcrosses were mainly determined by the differences in the crossing over number in the seven largest macrochromosomes. The decrease in recombination rate in F1 is probably determined by difficulties in homology matching between the DNA sequences of genetically divergent breeds. The suppression of recombination in the hybrids may impede gene flow between parapatric populations and therefore accelerate their genetic divergence

    A network-based conditional genetic association analysis of the human metabolome

    Get PDF
    Background: Genome-wide association studies have identified hundreds of loci that influence a wide variety of complex human traits;however, little is known regarding the biological mechanism of action of these loci. The recent accumulation of functional genomics ("omics"), including metabolomics data, has created new opportunities for studying the functional role of specific changes in the genome. Functional genomic data are characterized by their high dimensionality, the presence of (strong) statistical dependency between traits, and, potentially, complex genetic control. Therefore, the analysis of such data requires specific statistical genetics methods. Results: To facilitate our understanding of the genetic control of omics phenotypes, we propose a trait-centered, network-based conditional genetic association (cGAS) approach for identifying the direct effects of genetic variants on omics-based traits. For each trait of interest, we selected from a biological network a set of other traits to be used as covariates in the cGAS. The network can be reconstructed either from biological pathway databases (a mechanistic approach) or directly from the data, using a Gaussian graphical model applied to the metabolome (a data-driven approach). We derived mathematical expressions that allow comparison of the power of univariate analyses with conditional genetic association analyses. We then tested our approach using data from a population-based Cooperative Health Research in the region of Augsburg (KORA) study (n = 1,784 subjects, 1.7 million single-nucleotide polymorphisms) with measured data for 151 metabolites. Conclusions: We found that compared to single-trait analysis, performing a genetic association analysis that includes biologically relevant covariates can either gain or lose power, depending on specific pleiotropic scenarios, for which we provide empirical examples. In the context of analyzed metabolomics data, the mechanistic network approach had more power compared to the data-driven approach. Nevertheless, we believe that our analysis shows that neither a prior-knowledge-only approach nor a phenotypic-data-only approach is optimal, and we discuss possibilities for improvement

    Genome-wide meta-analysis of 158,000 individuals of European ancestry identifies three loci associated with chronic back pain

    Get PDF
    Back pain is the #1 cause of years lived with disability worldwide, yet surprisingly little is known regarding the biology underlying this symptom. We conducted a genome-wide association study (GWAS) meta-analysis of ch

    Development and application of genomic control methods for genome-wide association studies using non-additive models

    Get PDF
    Genome-wide association studies (GWAS) comprise a powerful tool for mapping genes of complex traits. However, an inflation of the test statistic can occur because of population substructure or cryptic relatedness, which could cause spurious associations. If information on a large number of genetic markers is available, adjusting the analysis results by using the method of genomic control (GC) is possible. GC was originally proposed to correct the Cochran-Armitage additive trend test. For non-additive models, correction has been shown to depend on allele frequencies. Therefore, usage of GC is limited to situations where allele frequencies of null markers and candidate markers are matched. In this work, we extended the capabilities of the GC method for non-additive models, which allows us to use null markers with arbitrary allele frequencies for GC. Analytical expressions for the inflation of a test statistic describing its dependency on allele frequency and several population parameters were obtained for recessive, dominant, and over-dominant models of inheritance. We proposed a method to estimate these required population parameters. Furthermore, we suggested a GC method based on approximation of the correction coefficient by a polynomial of allele frequency and described procedures to correct the genotypic (two degrees of freedom) test for cases when the model of inheritance is unknown. Statistical properties of the described methods were investigated using simulated and real data. We demonstrated that all considered methods were effective in controlling type 1 error in the presence of genetic substructure. The proposed GC methods can be applied to statistical tests for GWAS with various models of inheritance. All methods developed and tested in this work were implemented using R language as a part of the GenABEL package

    Population specific analysis of Yakut exomes

    Get PDF
    We studied the genetic diversity of the Yakut population using exome sequencing. We performed comparative analysis of the Yakut population and the populations that are included in the "1000 Genomes" project and we identified the alleles specific to the Yakut population. We showed, that the Yakuts population is a separate cluster between Europeans and East Asians

    Mendelian randomization of genetically independent aging phenotypes identifies LPA and VCAM1 as biological targets for human aging

    No full text
    Length and quality of life are important to us all, yet identification of promising drug targets for human aging using genetics has had limited success. In the present study, we combine six European-ancestry genome-wide association studies of human aging traitsβ€”healthspan, father and mother lifespan, exceptional longevity, frailty index and self-rated healthβ€”in a principal component framework that maximizes their shared genetic architecture. The first principal component (aging-GIP1) captures both length of life and indices of mental and physical wellbeing. We identify 27 genomic regions associated with aging-GIP1, and provide additional, independent evidence for an effect on human aging for loci near HTT and MAML3 using a study of Finnish and Japanese survival. Using proteome-wide, two-sample, Mendelian randomization and colocalization, we provide robust evidence for a detrimental effect of blood levels of apolipoprotein(a) and vascular cell adhesion molecule 1 on aging-GIP1. Together, our results demonstrate that combining multiple aging traits using genetic principal components enhances the power to detect biological targets for human aging
    corecore