17 research outputs found
The GWAS-MAP platform for aggregation of results of genome-wide association studies and the GWAS-MAP|homo database of 70 billion genetic associations of human traits
Hundreds of genome-wide association studies (GWAS) of human traits are performed each year. The results of GWAS are often published in the form of summary statistics. Information from summary statistics can be used for multiple purposes β from fundamental research in biology and genetics to the search for potential biomarkers and therapeutic targets. While the amount of GWAS summary statistics collected by the scientific community is rapidly increasing, the use of this data is limited by the lack of generally accepted standards. In particular, the researchers who would like to use GWAS summary statistics in their studies have to become aware that the data are scattered across multiple websites, are presented in a variety of formats, and, often, were not quality controlled. Moreover, each available summary statistics analysis tools will ask for data to be presented in their own internal format. To address these issues, we developed GWAS-MAP, a high-throughput platform for aggregating, storing, analyzing, visualizing and providing access to a database of big data that result from region- and genome-wide association studies. The database currently contains information on more than 70 billion associations between genetic variants and human diseases, quantitative traits, and βomicsβ traits. The GWAS-MAP platform and database can be used for studying the etiology of human diseases, building predictive risk models and finding potential biomarkers and therapeutic interventions. In order to demonstrate a typical application of the platform as an approach for extracting new biological knowledge and establishing mechanistic hypotheses, we analyzed varicose veins, a disease affecting on average every third adult in Russia. The results of analysis confirmed known epidemiologic associations for this disease and led us to propose a hypothesis that increased levels of MICB and CD209 proteins in human plasma may increase susceptibility to varicose veins
The GWAS-MAP|ovis platform for aggregation and analysis of genome-wide association study results in sheep
In recent years, the number of genome-wide association studies (GWAS) carried out for various economically important animal traits has been increasing. GWAS discoveries provide summary statistics that can be used both for targeted marker-oriented selection and for studying the genetic control of economically important traits of farm animals. In contrast to research in human genetics, GWAS on farm animals often does not meet generally accepted standards (availability of information about effect and reference alleles, the size and direction of the effect, etc.). This greatly complicates the use of GWAS results for breeding needs. Within the framework of human genetics, there are several technological solutions for researching the harmonized results of GWAS, including one of the largest, the GWAS-MAP platform. For other types of living organisms, including economically important agricultural animals, there are no similar solutions. To our knowledge, no similar solution has been proposed to date for any of the species of economically important animals. As part of this work, we focused on creating a platform similar to GWAS-MAP for working with the results of GWAS of sheep, since sheep breeding is one of the most important branches of agriculture. By analogy with the GWAS-MAP platform for storing, unifying and analyzing human GWAS, we have created the GWAS-MAP|ovis platform. The platform currently contains information on more than 34 million associations between genomic sequence variants and traits of meat production in sheep. The platform can also be used to conduct colocalization analysis, a method that allows one to determine whether the association of a particular locus with two different traits is the result of pleiotropy or whether these traits are associated with different variants that are in linkage disequilibrium. This platform will be useful for breeders to select promising markers for breeding, as well as to obtain information for the introduction of genomic breeding and for scientists to replicate the results obtained
The GWAS-MAP|ovis platform for aggregation and analysis of genome-wide association study results in sheep.
peer reviewedIn recent years, the number of genome-wide association studies (GWAS) carried out for various economically important animal traits has been increasing. GWAS discoveries provide summary statistics that can be used both for targeted marker-oriented selection and for studying the genetic control of economically important traits of farm animals. In contrast to research in human genetics, GWAS on farm animals often does not meet generally accepted standards (availability of information about effect and reference alleles, the size and direction of the effect, etc.). This greatly complicates the use of GWAS results for breeding needs. Within the framework of human genetics, there are several technological solutions for researching the harmonized results of GWAS, including one of the largest, the GWAS-MAP platform. For other types of living organisms, including economically important agricultural animals, there are no similar solutions. To our knowledge, no similar solution has been proposed to date for any of the species of economically important animals. As part of this work, we focused on creating a platform similar to GWAS-MAP for working with the results of GWAS of sheep, since sheep breeding is one of the most important branches of agriculture. By analogy with the GWAS-MAP platform for storing, unifying and analyzing human GWAS, we have created the GWAS-MAP|ovis platform. The platform currently contains information on more than 34 million associations between genomic sequence variants and traits of meat production in sheep. The platform can also be used to conduct colocalization analysis, a method that allows one to determine whether the association of a particular locus with two different traits is the result of pleiotropy or whether these traits are associated with different variants that are in linkage disequilibrium. This platform will be useful for breeders to select promising markers for breeding, as well as to obtain information for the introduction of genomic breeding and for scientists to replicate the results obtained.Π ΠΏΠΎΡΠ»Π΅Π΄Π½ΠΈΠ΅ Π³ΠΎΠ΄Ρ ΡΠ²Π΅Π»ΠΈΡΠΈΠ²Π°Π΅ΡΡΡ ΠΊΠΎΠ»ΠΈΡΠ΅ΡΡΠ²ΠΎ ΠΏΠΎΠ»Π½ΠΎΠ³Π΅Π½ΠΎΠΌΠ½ΡΡ
ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ Π°ΡΡΠΎΡΠΈΠ°ΡΠΈΠΉ (ΠΠΠΠ, GWAS), ΠΏΡΠΎΠ²Π΅Π΄Π΅Π½Π½ΡΡ
Π΄Π»Ρ ΡΠ°Π·Π»ΠΈΡΠ½ΡΡ
ΡΠΊΠΎΠ½ΠΎΠΌΠΈΡΠ΅ΡΠΊΠΈ Π²Π°ΠΆΠ½ΡΡ
ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² ΠΆΠΈΠ²ΠΎΡΠ½ΡΡ
. Π Π΅Π·ΡΠ»ΡΡΠ°ΡΡ ΡΡΠΈΡ
ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ ΠΏΡΠ΅Π΄ΡΡΠ°Π²Π»Π΅Π½Ρ Π² Π²ΠΈΠ΄Π΅ ΡΡΠΌΠΌΠ°ΡΠ½ΡΡ
ΡΡΠ°ΡΠΈΡΡΠΈΠΊ, ΠΊΠΎΡΠΎΡΡΠ΅ ΠΌΠΎΠΆΠ½ΠΎ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΡ Π΄Π»Ρ ΠΈΠ·ΡΡΠ΅Π½ΠΈΡ Π³Π΅Π½Π΅ΡΠΈΡΠ΅ΡΠΊΠΎΠ³ΠΎ ΠΊΠΎΠ½ΡΡΠΎΠ»Ρ ΡΠΊΠΎΠ½ΠΎΠΌΠΈΡΠ΅ΡΠΊΠΈ Π²Π°ΠΆΠ½ΡΡ
ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² ΡΠ΅Π»ΡΡΠΊΠΎΡ
ΠΎΠ·ΡΠΉΡΡΠ²Π΅Π½Π½ΡΡ
ΠΆΠΈΠ²ΠΎΡΠ½ΡΡ
, Π² ΡΠΎΠΌ ΡΠΈΡΠ»Π΅ ΠΈ ΠΏΡΠΈ ΡΠ°Π·ΡΠ°Π±ΠΎΡΠΊΠ΅ ΠΌΠ΅ΡΠΎΠ΄ΠΈΠΊ ΠΌΠ°ΡΠΊΠ΅Ρ-ΠΎΡΠΈΠ΅Π½ΡΠΈΡΠΎΠ²Π°Π½Π½ΠΎΠΉ ΡΠ΅Π»Π΅ΠΊΡΠΈΠΈ. Π Π±ΠΎΠ»ΡΡΠΈΠ½ΡΡΠ²Π΅ ΡΠ»ΡΡΠ°Π΅Π² ΠΠΠΠ ΡΠ΅Π»ΡΡΠΊΠΎΡ
ΠΎΠ·ΡΠΉΡΡΠ²Π΅Π½Π½ΡΡ
ΠΆΠΈΠ²ΠΎΡΠ½ΡΡ
Π½Π΅ ΡΠΎΠΎΡΠ²Π΅ΡΡΡΠ²ΡΡΡ ΠΎΠ±ΡΠ΅ΠΏΡΠΈΠ½ΡΡΡΠΌ Π² ΠΎΠ±Π»Π°ΡΡΠΈ ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ Π³Π΅Π½Π΅ΡΠΈΠΊΠΈ ΡΠ΅Π»ΠΎΠ²Π΅ΠΊΠ° ΡΡΠ°Π½Π΄Π°ΡΡΠ°ΠΌ ΡΠΎΡΠΌΠ°ΡΠ° ΠΏΡΠ±Π»ΠΈΠΊΠ°ΡΠΈΠΉ ΡΠ΅Π·ΡΠ»ΡΡΠ°ΡΠΎΠ² ΠΠΠΠ Π² Π²ΠΈΠ΄Π΅ ΡΡΠΌΠΌΠ°ΡΠ½ΡΡ
ΡΡΠ°ΡΠΈΡΡΠΈΠΊ (Π½Π°Π»ΠΈΡΠΈΠ΅ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ ΠΎΠ± ΡΡΡΠ΅ΠΊΡΠΎΡΠ½ΠΎΠΌ ΠΈ ΡΠ΅ΡΠ΅ΡΠ΅Π½ΡΠ½ΠΎΠΌ Π°Π»Π»Π΅Π»ΡΡ
, Π·Π½Π°ΡΠ΅Π½ΠΈΠ΅ ΠΈ Π½Π°ΠΏΡΠ°Π²Π»Π΅Π½ΠΈΠ΅ ΡΡΡΠ΅ΠΊΡΠ° ΠΈ Π΄Ρ.). ΠΡΠΎ ΡΡΡΠ΅ΡΡΠ²Π΅Π½Π½ΠΎ Π·Π°ΡΡΡΠ΄Π½ΡΠ΅Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ ΡΡΠΌΠΌΠ°ΡΠ½ΡΡ
ΡΡΠ°ΡΠΈΡΡΠΈΠΊ Π΄Π»Ρ Π½ΡΠΆΠ΄ ΡΠ΅Π»Π΅ΠΊΡΠΈΠΈ. Π ΠΎΠ±Π»Π°ΡΡΠΈ ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ Π³Π΅Π½Π΅ΡΠΈΠΊΠΈ ΡΠ΅Π»ΠΎΠ²Π΅ΠΊΠ° ΠΈΠΌΠ΅Π΅ΡΡΡ Π½Π΅ΡΠΊΠΎΠ»ΡΠΊΠΎ ΡΠ΅Ρ
Π½ΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΡ
ΡΠ΅ΡΠ΅Π½ΠΈΠΉ Π΄Π»Ρ Π°Π½Π°Π»ΠΈΠ·Π° ΡΠ΅Π·ΡΠ»ΡΡΠ°ΡΠΎΠ² ΠΠΠΠ, Π² ΡΠΎΠΌ ΡΠΈΡΠ»Π΅ ΠΎΠ΄Π½ΠΎ ΠΈΠ· ΡΠ°ΠΌΡΡ
ΠΊΡΡΠΏΠ½ΡΡ
β ΠΏΠ»Π°ΡΡΠΎΡΠΌΠ° GWAS-MAP. ΠΠ»Ρ Π΄ΡΡΠ³ΠΈΡ
Π²ΠΈΠ΄ΠΎΠ² ΠΆΠΈΠ²ΡΡ
ΠΎΡΠ³Π°Π½ΠΈΠ·ΠΌΠΎΠ², Π²ΠΊΠ»ΡΡΠ°ΡΡΠΈΡ
ΠΈ ΡΠΊΠΎΠ½ΠΎΠΌΠΈΡΠ΅ΡΠΊΠΈ Π²Π°ΠΆΠ½ΡΡ
ΡΠ΅Π»ΡΡΠΊΠΎΡ
ΠΎΠ·ΡΠΉΡΡΠ²Π΅Π½Π½ΡΡ
ΠΆΠΈΠ²ΠΎΡΠ½ΡΡ
, ΠΏΠΎΠ΄ΠΎΠ±Π½ΡΡ
ΡΠ΅ΡΠ΅Π½ΠΈΠΉ Π½Π΅Ρ. Π Π½Π°ΡΡΠΎΡΡΠ΅ΠΉ ΡΠ°Π±ΠΎΡΠ΅ ΠΌΡ ΡΡΠΎΠΊΡΡΠΈΡΠΎΠ²Π°Π»ΠΈΡΡ Π½Π° ΡΠΎΠ·Π΄Π°Π½ΠΈΠΈ ΡΡ
ΠΎΠΆΠ΅ΠΉ ΠΏΠ»Π°ΡΡΠΎΡΠΌΡ Π΄Π»Ρ ΡΠ°Π±ΠΎΡΡ Ρ ΡΡΠΌΠΌΠ°ΡΠ½ΡΠΌΠΈ ΡΡΠ°ΡΠΈΡΡΠΈΠΊΠ°ΠΌΠΈ ΠΠΠΠ ΡΠ°Π·Π»ΠΈΡΠ½ΡΡ
ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² ΠΎΠ²Π΅Ρ, ΡΠ°ΠΊ ΠΊΠ°ΠΊ ΠΎΠ²ΡΠ΅Π²ΠΎΠ΄ΡΡΠ²ΠΎ Π² ΠΏΠΎΡΠ»Π΅Π΄Π½Π΅Π΅ Π²ΡΠ΅ΠΌΡ ΡΡΠ°Π½ΠΎΠ²ΠΈΡΡΡ Π²ΡΠ΅ Π±ΠΎΠ»Π΅Π΅ Π°ΠΊΡΡΠ°Π»ΡΠ½ΠΎΠΉ ΠΎΠ±Π»Π°ΡΡΡΡ ΡΠ΅Π»ΡΡΠΊΠΎΠ³ΠΎ Ρ
ΠΎΠ·ΡΠΉΡΡΠ²Π°. ΠΠΎ Π°Π½Π°Π»ΠΎΠ³ΠΈΠΈ Ρ ΠΏΠ»Π°ΡΡΠΎΡΠΌΠΎΠΉ GWAS-MAP Π΄Π»Ρ Ρ
ΡΠ°Π½Π΅Π½ΠΈΡ, ΡΠ½ΠΈΡΠΈΠΊΠ°ΡΠΈΠΈ ΠΈ Π°Π½Π°Π»ΠΈΠ·Π° GWAS ΡΠ΅Π»ΠΎΠ²Π΅ΠΊΠ° ΠΌΡ ΡΠΎΠ·Π΄Π°Π»ΠΈ ΠΏΠ»Π°ΡΡΠΎΡΠΌΡ GWAS-MAP|ovis. ΠΠ° ΡΠ΅Π³ΠΎΠ΄Π½ΡΡΠ½ΠΈΠΉ Π΄Π΅Π½Ρ ΠΏΠ»Π°ΡΡΠΎΡΠΌΠ° ΡΠΎΠ΄Π΅ΡΠΆΠΈΡ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΡ ΠΎ Π±ΠΎΠ»Π΅Π΅ ΡΠ΅ΠΌ 34 ΠΌΠ»Π½ Π°ΡΡΠΎΡΠΈΠ°ΡΠΈΠΉ ΠΌΠ΅ΠΆΠ΄Ρ Π²Π°ΡΠΈΠ°Π½ΡΠ°ΠΌΠΈ Π³Π΅Π½ΠΎΠΌΠ½ΠΎΠΉ ΠΏΠΎΡΠ»Π΅Π΄ΠΎΠ²Π°ΡΠ΅Π»ΡΠ½ΠΎΡΡΠΈ ΠΈ ΠΏΡΠΈΠ·Π½Π°ΠΊΠ°ΠΌΠΈ ΠΌΡΡΠ½ΠΎΠΉ ΠΏΡΠΎΠ΄ΡΠΊΡΠΈΠ²Π½ΠΎΡΡΠΈ. ΠΠ»Π°ΡΡΠΎΡΠΌΠ° ΠΌΠΎΠΆΠ΅Ρ Π±ΡΡΡ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½Π° ΠΈ Π΄Π»Ρ ΠΏΡΠΎΠ²Π΅Π΄Π΅Π½ΠΈΡ Π°Π½Π°Π»ΠΈΠ·Π° ΠΊΠΎΠ»ΠΎΠΊΠ°Π»ΠΈΠ·Π°ΡΠΈΠΈ β ΠΌΠ΅ΡΠΎΠ΄Π°, ΠΊΠΎΡΠΎΡΡΠΉ ΠΏΠΎΠ·Π²ΠΎΠ»ΡΠ΅Ρ ΡΡΡΠ°Π½ΠΎΠ²ΠΈΡΡ, ΡΠ²Π»ΡΠ΅ΡΡΡ Π»ΠΈ Π°ΡΡΠΎΡΠΈΠ°ΡΠΈΡ ΠΎΠΏΡΠ΅Π΄Π΅Π»Π΅Π½Π½ΠΎΠ³ΠΎ Π»ΠΎΠΊΡΡΠ° Ρ Π΄Π²ΡΠΌΡ ΡΠ°Π·Π½ΡΠΌΠΈ ΠΏΡΠΈΠ·Π½Π°ΠΊΠ°ΠΌΠΈ ΡΠ΅Π·ΡΠ»ΡΡΠ°ΡΠΎΠΌ ΠΏΠ»Π΅ΠΉΠΎΡΡΠΎΠΏΠΈΠΈ ΠΈΠ»ΠΈ ΠΆΠ΅ Π΄Π°Π½Π½ΡΠ΅ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΈ Π°ΡΡΠΎΡΠΈΠΈΡΠΎΠ²Π°Π½Ρ Ρ ΡΠ°Π·Π½ΡΠΌΠΈ Π²Π°ΡΠΈΠ°Π½ΡΠ°ΠΌΠΈ, ΠΊΠΎΡΠΎΡΡΠ΅ Π½Π°Ρ
ΠΎΠ΄ΡΡΡΡ Π² Π½Π΅ΡΠ°Π²Π½ΠΎΠ²Π΅ΡΠΈΠΈ ΠΏΠΎ ΡΡΠ΅ΠΏΠ»Π΅Π½ΠΈΡ. ΠΡΠ° ΠΏΠ»Π°ΡΡΠΎΡΠΌΠ° Π±ΡΠ΄Π΅Ρ ΠΏΠΎΠ»Π΅Π·Π½Π° ΠΊΠ°ΠΊ ΡΠ΅Π»Π΅ΠΊΡΠΈΠΎΠ½Π΅ΡΠ°ΠΌ Π΄Π»Ρ Π²ΡΠ±ΠΎΡΠ° ΠΏΠ΅ΡΡΠΏΠ΅ΠΊΡΠΈΠ²Π½ΡΡ
ΠΌΠ°ΡΠΊΠ΅ΡΠΎΠ² Π΄Π»Ρ ΡΠ΅Π»Π΅ΠΊΡΠΈΠΈ (ΡΡΡΠ΅ΠΊΡΡ ΠΈ Π°Π»Π»Π΅Π»ΠΈ ΡΠ°Π·Π»ΠΈΡΠ½ΡΡ
ΠΌΠ°ΡΠΊΠ΅ΡΠΎΠ², Π²Π»ΠΈΡΡΡΠΈΡ
Π½Π° ΠΈΠ·ΡΡΠ°Π΅ΠΌΡΠ΅ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΈ), ΡΠ°ΠΊ ΠΈ Π΄Π»Ρ ΡΡΠ΅Π½ΡΡ
, Π²Π΅Π΄ΡΡΠΈΡ
ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΡ Π² ΠΎΠ±Π»Π°ΡΡΠΈ Π³Π΅Π½Π΅ΡΠΈΠΊΠΈ ΠΎΠ²Π΅Ρ
Negative heterosis for meiotic recombination rate inΒ spermatocytes of the domestic chicken Gallus gallus
Benefits and costs of meiotic recombination are a matter of discussion. Because recombination breaks allele combinations already tested by natural selection and generates new ones of unpredictable fitness, a high recombination rate is generally beneficial for the populations living in a fluctuating or a rapidly changing environment and costly in a stable environment. Besides genetic benefits and costs, there are cytological effects of recombination, both positive and negative. Recombination is necessary for chromosome synapsis and segregation. However, it involves a massive generation of double-strand DNA breaks, erroneous repair of which may lead to germ cell death or various mutations and chromosome rearrangements. Thus, the benefits of recombination (generation of new allele combinations) would prevail over its costs (occurrence of deleterious mutations) as long as the population remains sufficiently heterogeneous. Using immunolocalization of MLH1, a mismatch repair protein, at the synaptonemal complexes, we examined the number and distribution of recombination nodules in spermatocytes of two chicken breeds with high (Pervomai) and low (Russian Crested) recombination rates and their F1 hybrids and backcrosses. We detected negative heterosis for recombination rate in the F1 hybrids. Backcrosses to the Pervomai breed were rather homogenous and showed an intermediate recombination rate. The differences in overall recombination rate between the breeds, hybrids and backcrosses were mainly determined by the differences in the crossing over number in the seven largest macrochromosomes. The decrease in recombination rate in F1 is probably determined by difficulties in homology matching between the DNA sequences of genetically divergent breeds. The suppression of recombination in the hybrids may impede gene flow between parapatric populations and therefore accelerate their genetic divergence
A network-based conditional genetic association analysis of the human metabolome
Background: Genome-wide association studies have identified hundreds of loci that influence a wide variety of complex human traits;however, little is known regarding the biological mechanism of action of these loci. The recent accumulation of functional genomics ("omics"), including metabolomics data, has created new opportunities for studying the functional role of specific changes in the genome. Functional genomic data are characterized by their high dimensionality, the presence of (strong) statistical dependency between traits, and, potentially, complex genetic control. Therefore, the analysis of such data requires specific statistical genetics methods. Results: To facilitate our understanding of the genetic control of omics phenotypes, we propose a trait-centered, network-based conditional genetic association (cGAS) approach for identifying the direct effects of genetic variants on omics-based traits. For each trait of interest, we selected from a biological network a set of other traits to be used as covariates in the cGAS. The network can be reconstructed either from biological pathway databases (a mechanistic approach) or directly from the data, using a Gaussian graphical model applied to the metabolome (a data-driven approach). We derived mathematical expressions that allow comparison of the power of univariate analyses with conditional genetic association analyses. We then tested our approach using data from a population-based Cooperative Health Research in the region of Augsburg (KORA) study (n = 1,784 subjects, 1.7 million single-nucleotide polymorphisms) with measured data for 151 metabolites. Conclusions: We found that compared to single-trait analysis, performing a genetic association analysis that includes biologically relevant covariates can either gain or lose power, depending on specific pleiotropic scenarios, for which we provide empirical examples. In the context of analyzed metabolomics data, the mechanistic network approach had more power compared to the data-driven approach. Nevertheless, we believe that our analysis shows that neither a prior-knowledge-only approach nor a phenotypic-data-only approach is optimal, and we discuss possibilities for improvement
Genome-wide meta-analysis of 158,000 individuals of European ancestry identifies three loci associated with chronic back pain
Back pain is the #1 cause of years lived with disability worldwide, yet surprisingly little is known regarding the biology underlying this symptom. We conducted a genome-wide association study (GWAS) meta-analysis of ch
Development and application of genomic control methods for genome-wide association studies using non-additive models
Genome-wide association studies (GWAS) comprise a powerful tool for mapping genes of complex traits. However, an inflation of the test statistic can occur because of population substructure or cryptic relatedness, which could cause spurious associations. If information on a large number of genetic markers is available, adjusting the analysis results by using the method of genomic control (GC) is possible. GC was originally proposed to correct the Cochran-Armitage additive trend test. For non-additive models, correction has been shown to depend on allele frequencies. Therefore, usage of GC is limited to situations where allele frequencies of null markers and candidate markers are matched. In this work, we extended the capabilities of the GC method for non-additive models, which allows us to use null markers with arbitrary allele frequencies for GC. Analytical expressions for the inflation of a test statistic describing its dependency on allele frequency and several population parameters were obtained for recessive, dominant, and over-dominant models of inheritance. We proposed a method to estimate these required population parameters. Furthermore, we suggested a GC method based on approximation of the correction coefficient by a polynomial of allele frequency and described procedures to correct the genotypic (two degrees of freedom) test for cases when the model of inheritance is unknown. Statistical properties of the described methods were investigated using simulated and real data. We demonstrated that all considered methods were effective in controlling type 1 error in the presence of genetic substructure. The proposed GC methods can be applied to statistical tests for GWAS with various models of inheritance. All methods developed and tested in this work were implemented using R language as a part of the GenABEL package
Population specific analysis of Yakut exomes
We studied the genetic diversity of the Yakut population using exome sequencing. We performed comparative analysis of the Yakut population and the populations that are included in the "1000 Genomes" project and we identified the alleles specific to the Yakut population. We showed, that the Yakuts population is a separate cluster between Europeans and East Asians
Mendelian randomization of genetically independent aging phenotypes identifies LPA and VCAM1 as biological targets for human aging
Length and quality of life are important to us all, yet identification of promising drug targets for human aging using genetics has had limited success. In the present study, we combine six European-ancestry genome-wide association studies of human aging traitsβhealthspan, father and mother lifespan, exceptional longevity, frailty index and self-rated healthβin a principal component framework that maximizes their shared genetic architecture. The first principal component (aging-GIP1) captures both length of life and indices of mental and physical wellbeing. We identify 27 genomic regions associated with aging-GIP1, and provide additional, independent evidence for an effect on human aging for loci near HTT and MAML3 using a study of Finnish and Japanese survival. Using proteome-wide, two-sample, Mendelian randomization and colocalization, we provide robust evidence for a detrimental effect of blood levels of apolipoprotein(a) and vascular cell adhesion molecule 1 on aging-GIP1. Together, our results demonstrate that combining multiple aging traits using genetic principal components enhances the power to detect biological targets for human aging