Search CORE

27 research outputs found

<i>In silico</i> mapping of coronary artery disease genes

Author: A. V. Kirichenko
G. R. Svishcheva
I. V. Zorkoltseva
N. M. Belonogova
T. I. Axenovich
Publication venue: 'Institute of Cytology and Genetics, SB RAS'
Publication date: 01/01/2020
Field of study

To date, more than 100 loci associated with coronary artery disease (CAD) have been detected in large-scale genome-wide studies. For some of the several hundreds of genes located in these loci, roles in the pathogenesis of the disease have been shown. However, the genetic mechanisms and specific genes controlling this disease are still not fully understood. This study is aimed at in silico search for new CAD genes. We performed a gene-based association analysis, where all polymorphic variants within a gene are analyzed simultaneously. The analysis was based on the results of the genome-wide association studies (GWAS) available from the open databases MICAD (120,575 people, 85,112 markers) and UK Biobank (337,199 people, 10,894,597 markers). We used the sumFREGAT package implementing a wide range of new methods for gene-based association analysis using summary statistics. We found 88 genes demonstrating significant gene-based associations. Forty-four of the identified genes were already known as CAD genes. Furthermore, we identified 28 additional genes in the known CAD loci. They can be considered as new candidate genes. Finally, we identified sixteen new genes (AGPAT4, ARHGEF12, BDP1, DHX58, EHBP1, FBF1, HSPB9, NPBWR2, PDLIM5, PLCB3, PLEKHM2, POU2F3, PRKD2, TMEM136, TTC29 and UTP20) outside the known loci. Information about the functional role of these genes allows us to consider many of them as candidates for CAD. The 41 identified genes did not have significant GWAS signals and they were identified only due to simultaneous consideration of all variants within the gene in the framework of gene-based analysis. These results demonstrate that gene-based association analysis is a powerful tool for gene mapping. The method can utilize huge amounts of GWAS results accumulated in the world to map different traits and diseases. This type of studies is widely available, as it does not require additional material costs

Directory of Open Access Journals

A combined linkage and exome sequencing analysis for electrocardiogram parameters in the erasmus rucphen family study

Author: Amin Najaf
Axenovich Tatiana I.
Demirkan Ayse
Isaacs Aaron
Kirichenko Anatoly V.
Kors Jan A.
Oostra Ben A.
Stricker Bruno H.
Tamar Silva Claudia
Uitterlinden André
van den Berg Marten
van Duijn Cornelia M.
van Leeuwen Elisabeth M.
Willemsen Rob
Witteman Jacqueline C. M.
Zorkoltseva Irina V.
Publication venue
Publication date: 12/05/2020
Field of study

Electrocardiogram (ECG) measurements play a key role in the diagnosis and prediction of cardiac arrhythmias and sudden cardiac death. ECG parameters, such as the PR, QRS, and QT intervals, are known to be heritable and genome-wide association studies of these phenotypes have been successful in identifying common variants; however, a large proportion of the genetic variability of these traits remains to be elucidated. The aim of this study was to discover loci potentially harboring rare variants utilizing variance component linkage analysis in 1547 individuals from a large family-based study, the Erasmus Rucphen Family Study (ERF). Linked regions were further explored using exome sequencing. Five suggestive linkage peaks were identified: two for QT interval (1q24, LOD = 2.63; 2q34, LOD = 2.05), one for QRS interval (1p35, LOD = 2.52) and two for PR interval (9p22, LOD = 2.20; 14q11, LOD = 2.29). Fine-mapping using exome sequence data identified a C > G missense variant (c.713C > G, p.Ser238Cys) in the FCRL2 gene associated with QT (rs74608430; P = 2.8 × 10-4, minor allele frequency = 0.019). Heritability analysis demonstrated that the SNP explained 2.42% of the trait's genetic variability in ERF (P = 0.02). Pathway analysis suggested that the gene is involved in cytosolic Ca2+ levels (P = 3.3 × 10-3) and AMPK stimulated fatty acid oxidation in muscle (P = 4.1 × 10-3). Look-ups in bioinformatics resources showed that expression of FCRL2 is associated with ARHGAP24 and SETBP1 expression. This finding was not replicated in the Rotterdam study. Combining the bioinformatics information with the association and linkage analyses, FCRL2 emerges as a strong candidate gene for QT interval. © 2016 Silva, Zorkoltseva, Amin, Demirkan, van Leeuwen, Kors, van den Berg, Stricker, Uitterlinden, Kirichenko, Witteman, Willemsen, Oostra, Axenovich, van Duijn and Isaacs

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

A Combined Linkage and Exome Sequencing Analysis for Electrocardiogram Parameters in the Erasmus Rucphen Family Study

Author: Aaron Isaacs
Anatoly V. Kirichenko
André G. Uitterlinden
Ayşe Demirkan
Ben A. Oostra
Bruno H. Stricker
Claudia T. Silva
Cornelia M. van Duijn
Elisabeth M. van Leeuwen
Irina V. Zorkoltseva
Jacqueline C. M. Witteman
Jan A. Kors
Marten van den Berg
Najaf Amin
Rob Willemsen
Tatiana I. Axenovich
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2016
Field of study

Electrocardiogram (ECG) measurements play a key role in the diagnosis and prediction of cardiac arrhythmias and sudden cardiac death. ECG parameters, such as the PR, QRS, and QT intervals, are known to be heritable and genome-wide association studies of these phenotypes have been successful in identifying common variants; however, a large proportion of the genetic variability of these traits remains to be elucidated. The aim of this study was to discover loci potentially harboring rare variants utilizing variance component linkage analysis in 1547 individuals from a large family-based study, the Erasmus Rucphen Family Study (ERF). Linked regions were further explored using exome sequencing. Five suggestive linkage peaks were identified: two for QT interval (1q24, LOD = 2.63; 2q34, LOD = 2.05), one for QRS interval (1p35, LOD = 2.52) and two for PR interval (9p22, LOD = 2.20; 14q11, LOD = 2.29). Fine-mapping using exome sequence data identified a C > G missense variant (c.713C > G, p.Ser238Cys) in the FCRL2 gene associated with QT (rs74608430; P = 2.8 x 10(-4), minor allele frequency = 0.019). Heritability analysis demonstrated that the SNP explained 2.42% of the trait's genetic variability in ERF (P = 0.02). Pathway analysis suggested that the gene is involved in cytosolic Ca2+ levels (P = 3.3 x 10(-3)) and AMPK stimulated fatty acid oxidation in muscle (P = 4.1 x 10(-3)). Look-ups in bioinformatics resources showed that expression of FCRL2 is associated with ARHGAP24 and SETBP1 expression. This finding was not replicated in the Rotterdam study. Combining the bioinformatics information with the association and linkage analyses, FCRL2 emerges as a strong candidate gene for QT interval

Maastricht University Research Portal

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Frontiers - Publisher Connector

PubMed Central

EUR Research Repository

Oxford University Research Archive

edocUR

A combined linkage, microarray and exome analysis suggests MAP3K11 as a candidate gene for left ventricular hypertrophy

Author: Amin Najaf
Axenovich Tatiana I.
Demirkan Ayşe
Duijn Cornelia M. van
Iglesias Adriana I.
Isaacs Aaron
Kirichenko Anatoly V.
Kors Jan A.
Leeuwen Elisa van
Niemeijer Maartje N.
Oostra Ben A.
Piñeros-Hernández Laura B.
Restrepo Fernández Carlos Martín
Stricker Bruno H.
Tamar Silva Claudia
Uitterlinden André G.
van den Berg Marten E.
Willemsen Rob
Zorkoltseva Irina V.
Publication venue
Publication date: 03/10/2019
Field of study

Background: Electrocardiographic measures of left ventricular hypertrophy (LVH) are used as predictors of cardiovascular risk. We combined linkage and association analyses to discover novel rare genetic variants involved in three such measures and two principal components derived from them. Methods: The study was conducted among participants from the Erasmus Rucphen Family Study (ERF), a Dutch family-based sample from the southwestern Netherlands. Variance components linkage analyses were performed using Merlin. Regions of interest (LOD > 1.9) were fine-mapped using microarray and exome sequence data. Results: We observed one significant LOD score for the second principal component on chromosome 15 (LOD score = 3.01) and 12 suggestive LOD scores. Several loci contained variants identified in GWAS for these traits; however, these did not explain the linkage peaks, nor did other common variants. Exome sequence data identified two associated variants after multiple testing corrections were applied. Conclusions: We did not find common SNPs explaining these linkage signals. Exome sequencing uncovered a relatively rare variant in MAPK3K11 on chromosome 11 (MAF = 0.01) that helped account for the suggestive linkage peak observed for the first principal component. Conditional analysis revealed a drop in LOD from 2.01 to 0.88 for MAP3K11, suggesting that this variant may partially explain the linkage signal at this chromosomal location. MAP3K11 is related to the JNK pathway and is a pro-apoptotic kinase that plays an important role in the induction of cardiomyocyte apoptosis in various pathologies, including LVH. © 2018 The Author(s)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Genetic Architecture of Plasma Adiponectin Overlaps With the Genetics of Metabolic Syndrome–Related Traits

Author: Aulchenko
B. A. Oostra
Brame
C. M. van Duijn
Chen
Fain
Fischer-Posovszky
Giannessi
Heid
I. V. Zorkoltseva
K. W. van Dijk
Kadowaki
Ling
M. C. Zillikens
M. Frolich
M hlig
O'Connell
P. Henneman
Pardo
Patel
Pilia
R. R. Frants
Rasmussen-Torvik
Rasmussen-Torvik
Wolf
Y. S. Aulchenko
Yamauchi
Zillikens
Publication venue: American Diabetes Association
Publication date: 01/01/2010
Field of study

OBJECTIVE - Adiponectin, a hormone secreted by adipose tissue, is of particular interest in metabolic syndrome, because it is inversely correlated with obesity and insulin sensitivity. However, it is not known to what extent the genetics of plasma adiponectin and the genetics of obesity and insulin sensitivity are interrelated. We aimed to evaluate the heritability of plasma adiponectin and its genetic correlation with the metabolic syndrome and metabolic syndrome-related traits and the association between these traits and 10 ADIPOQ single nucleotide polymorphisms (SNPs). RESEARCH DESIGN AND METHODS - We made use of a family-based population, the Erasmus Rucphen Family study (1,258 women and 967 men). Heritability analysis was performed using a polygenic model. Genetic correlations were estimated using bivariate heritability analyses. Genetic association analysis was performed using a mixed model. RESULTS - Plasma adiponectin showed a heritability of 55.1%. Genetic correlations between plasma adiponectin HDL cholesterol and plasma insulin ranged from 15 to 24% but were not significant for fasting glucose, triglycerides, blood pressure, homeostasis model assessment of insulin resistance (HOMA-IR), and C-reactive protein. A significant association with plasma adiponectin was found for ADIPOQ variants rs17300539 and rs182052. A nominally significant association was found with plasma insulin and HOMA-IR and ADIPOQ variant rs17300539 after adjustment for plasma adiponectin. CONCLUSIONS - The significant genetic correlation between plasma adiponectin and HDL cholesterol and plasma insulin should be taken into account in the interpretation of genome-wide association studies. Association of ADIPOQ SNPs with plasma adiponectin was replicated, and we showed association between one ADIPOQ SNP and plasma insulin and HOMA-IR

Crossref

PubMed Central

EUR Research Repository

Leiden University Scholary Publications

Erasmus University Digital Repository

Genetic Determinants of Circulating Sphingolipid Concentrations in European Populations

Author: A Caspi
A Schulz
A. CecileJ.W. Janssens
Aaron Isaacs
AE Bielawska
Alan Wright
Anatoly V. Kirichenko
Andrew A. Hicks
Annette Peters
AR Morgan
Ayse Demirkan
B Devlin
Ben A. Oostra
C Gieger
C Pattaro
Caroline Hayward
Carsten Gnewuch
Christian Hengstenberg
Christine Schwienbacher
Christopher S. Franklin
Cornelia M. van Duijn
Cristian Pattaro
E Wang
F Dudbridge
Fabio Marroni
G Liebisch
G Liebisch
Gerd Schmitz
Gerhard Liebisch
Ghazal Zaboli
Greg Gibson
H.-Erich Wichmann
Harry Campbell
Heribert Schunkert
I Rudan
Igor Rudan
Inger Jonasson
Inke R. König
Irene Pichler
Irina V. Zorkoltseva
Ivana Kolcic
J Bras
J Erdmann
Jacqueline C. M. Witteman
James F. Wilson
Jeanette Erdmann
JL Dawkins
KJ Brookes
L Schaeffer
LN Clark
M Mehrabian
MA Simpson
N Amin
N Martinelli
Nick Hastie
NJ Samani
Ozren Polasek
P Wiesner
Peter P. Pramstaller
Peter Ugocsai
S Flamant
S Kathiresan
S Narayan
Sarah H. Wild
SL Schissel
ST Pruett
Stefan Schreiber
Susan Campbell
T Kolter
T Tanaka
Tatiana I. Axenovich
Thomas Meitinger
Ulf Gyllensten
V Vitart
Veronique Vitart
W Zheng
WC Nichols
Wilmar Igl
WL Holland
WM Chen
YS Aulchenko
YS Aulchenko
Yurii Aulchenko
Zrinka Biloglav
Åsa Johansson
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009
Field of study

Sphingolipids have essential roles as structural components of cell membranes and in cell signalling, and disruption of their metabolism causes several diseases, with diverse neurological, psychiatric, and metabolic consequences. Increasingly, variants within a few of the genes that encode enzymes involved in sphingolipid metabolism are being associated with complex disease phenotypes. Direct experimental evidence supports a role of specific sphingolipid species in several common complex chronic disease processes including atherosclerotic plaque formation, myocardial infarction (MI), cardiomyopathy, pancreatic beta-cell failure, insulin resistance, and type 2 diabetes mellitus. Therefore, sphingolipids represent novel and important intermediate phenotypes for genetic analysis, yet little is known about the major genetic variants that influence their circulating levels in the general population. We performed a genome-wide association study (GWAS) between 318,237 single-nucleotide polymorphisms (SNPs) and levels of circulating sphingomyelin (SM), dihydrosphingomyelin (Dih-SM), ceramide (Cer), and glucosylceramide (GluCer) single lipid species (33 traits); and 43 matched metabolite ratios measured in 4,400 subjects from five diverse European populations. Associated variants (32) in five genomic regions were identified with genome-wide significant corrected p-values ranging down to 9.08 x 10(-66). The strongest associations were observed in or near 7 genes functionally involved in ceramide biosynthesis and trafficking: SPTLC3, LASS4, SGPP1, ATP10D, and FADS1-3. Variants in 3 loci (ATP10D, FADS3, and SPTLC3) associate with MI in a series of three German MI studies. An additional 70 variants across 23 candidate genes involved in sphingolipid-metabolizing pathways also demonstrate association (p = 10(-4) or less). Circulating concentrations of several key components in sphingolipid metabolism are thus under strong genetic control, and variants in these loci can be tested for a role in the development of common cardiovascular, metabolic, neurological, and psychiatric diseases

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

Directory of Open Access Journals

Open Access LMU ( Ludwig-Maximilians-Univ. München)

PubMed Central

EUR Research Repository

Edinburgh Research Explorer

Erasmus University Digital Repository

PuSH

Automated workflow-based exploitation of pathway databases provides new insights into genetic associations of metabolite profiles

Author: Adamski J. (Jerzy)
Aulchenko Y.S. (Yurii)
Axenovich T.I. (Tatiana)
Biloglav Z. (Zrinka)
Broer L. (Linda)
Campbell H. (Harry)
Campbell S. (Susan)
Demirkan A. (Ayşe)
Dharuri H. (Harish)
Domingues I. (Inês)
Duijn C.M. (Cornelia) van
Floyd J. (Jamie)
Franke L. (Lude)
Franklin C.S. (Christopher)
Gieger C. (Christian)
Gnewuch C. (Carsten)
Gyllensten U. (Ulf)
Hastie N. (Nick)
Hayward C. (Caroline)
Henneman P. (Peter)
Hettne K.M. (Kristina)
Hicks A.A. (Andrew)
Hoen P.A.C. (Peter) 't
Hofman A. (Albert)
Huffman J.E. (Jennifer)
Igl W. (Wilmar)
Isaacs A.J. (Aaron)
Jansen R.C. (Ritsert)
Janssens A.C.J.W. (Cécile)
Johansson A. (Åsa)
Jonasson I. (Inger)
Karssen L.C. (Lennart)
Kirichenko A.V. (Anatoly)
Klinken J.B. (Jan) van
Kolcic I. (Ivana)
Liebisch G. (Gerhard)
Meitinger T. (Thomas)
Mook-Kanamori D.O. (Dennis)
Oostra B.A. (Ben)
Pattaro C. (Cristian)
Pfeufer A. (Arne)
Pichler I. (Irene)
Polasek O. (Ozren)
Pramstaller P.P. (Peter Paul)
Rivadeneira Ramirez F. (Fernando)
Roos M. (Marco)
Rudan I. (Igor)
Schmitz G. (Gerd)
Struchalin M.V. (Maksim)
Suhre K. (Karsten)
Ugocsai P. (Peter)
Uitterlinden A.G. (André)
Vitart V. (Veronique)
Wang-Sattler R. (Rui)
Wild S.H. (Sarah)
Willems van Dijk J.A.P. (Ko)
Wilson J.F. (James F)
Witteman J.C.M. (Jacqueline)
Wright A.F. (Alan)
Zaboli G. (Ghazal)
Zorkoltseva I.V. (Irina)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/12/2013
Field of study

Background: Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) that associate with clinical phenotypes, but these SNPs usually explain just a small part of the heritability and have relatively modest effect sizes. In contrast, SNPs that associate with metabolite levels generally explain a higher percentage of the genetic variation and demonstrate larger effect sizes. Still, the discovery of SNPs associated with metabolite levels is challenging since testing all metabolites measured in typical metabolomics studies with all SNPs comes with a severe multiple testing penalty. We have developed an automated workflow approach that utilizes prior knowledge of biochemical pathways present in databases like KEGG and BioCyc to generate a smaller SNP set relevant to the metabolite. This paper explores the opportunities and challenges in the analysis of GWAS of metabolomic phenotypes and provides novel insights into the genetic basis of metabolic variation through the re-analysis of published GWAS datasets. Results: Re-analysis of the published GWAS dataset from Illig et al. (Nature Genetics, 2010) using a pathway-based workflow (http://www.myexperiment.org/packs/319.html), confirmed previously identified hits and identified a new locus of human metabolic individuality, associating Aldehyde dehydrogenase family1 L1 (ALDH1L1) with serine/glycine ratios in blood. Replication in an independent GWAS dataset of phospholipids (Demirkan et al., PLoS Genetics, 2012) identified two novel loci supported by additional literature evidence: GPAM (Glycerol-3 phosphate acyltransferase) and CBS (Cystathionine beta-synthase). In addition, the workflow approach provided novel insight into the affected pathways and relevance of some of these gene-metabolite pairs in disease development and progression. Conclusions: We demonstrate the utility of automated exploitation of background knowledge present in pathway databases for the analysis of GWAS datasets of metabolomic phenotypes. We report novel loci and potential biochemical mechanisms that contribute to our understanding of the genetic basis of metabolic variation and its relationship to disease development and progression

Erasmus University Digital Repository

A combined linkage, microarray and exome analysis suggests MAP3K11 as a candidate gene for left ventricular hypertrophy

Author: A Hofman
A Sironen
Aaron Isaacs
Adriana I. Iglesias
Anatoly V. Kirichenko
André G. Uitterlinden
AV Kirichenko
Ayşe Demirkan
Ben A. Oostra
BL Fridley
BM Mayosi
Bruno H. Stricker
Carlos M. Restrepo
Claudia Tamar Silva
Cornelia M. van Duijn
CT Silva
D Zhi
DF Gudbjartsson
DK Arnett
DK Arnett
DN Chadee
EE Eichler
EJ Benjamin
Elisa van Leeuwen
FA Sayed-Tabatabaei
G Schillaci
GR Abecasis
H Li
Irina V. Zorkoltseva
Jan A. Kors
JD Storey
JH van Bemmel
JL Willems
JL Willems
JN Bella
JR O'Connell
L Wang
Laura B. Piñeros-Hernández
LM Pardo
M Eijgelsheim
M Foppa
M Gorski
M Nothnagel
Maartje N. Niemeijer
Marten E. van den Berg
MC de Bruyne
MJ Leening
Najaf Amin
P van der Harst
RJ Gelpi
Rob Willemsen
RW Brouwer
S Mutikainen
S Shah
T Rui
Tatiana I. Axenovich
V Barrios
WS Post
Y Du
Y Li
YS Aulchenko
YS Aulchenko
YS Aulchenko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Maastricht University Research Portal

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

EUR Research Repository

edocUR

A genome-wide linkage study of individuals with high scores on NEO personality traits

Author: A C J W Janssens
A Canuto
A Isaacs
A Kong
A Terracciano
A Terracciano
A V Kirichenko
AE Poropat
AJ Rosellini
B A Oostra
B Zhou
BA Shipley
BM Neale
C M van Duijn
C Yau
CM Mazzanti
CR Cloninger
CR Cloninger
CR Cloninger
DA Robertson
DJ Winstow
DS Verbeek
E S Gusareva
E Sobel
E Sobel
EJ van den Oord
F Clerget-Darpoux
FC Calboli
G Andelfinger
GR Abecasis
GT Capone
HH Goring
HW Deng
I V Zorkoltseva
J Fullerton
J Van Os
JC Biesanz
JC Loehlin
JE Lonnqvist
JM Hettema
JM Hettema
KL Jang
KL Jang
KP Lesch
KS Kendler
L Almasy
LK Conlin
LM Pardo
M Crowe
M Durner
M Karayiorgou
M Menza
M Schuur
MA Menza
MS McPeek
MW Nash
MW Nash
N Amin
N Waller
NA Gillespie
NM Laird
NR Wray
PH Kuo
R Riemann
RD Goodwin
RR McCrae
RS Wilson
S Horvath
S Sen
S Van Gestel
SA Miller
SBG Eysenck
T I Axenovich
V Kaasinen
W Meins
WM van der Flier
X Altafaj
X Luo
X Zhu
Y S Aulchenko
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/08/2011
Field of study

peer reviewedThe NEO-Five-Factor Inventory divides human personality traits into five dimensions: neuroticism, extraversion, openness, conscientiousness and agreeableness. In this study, we sought to identify regions harboring genes with large effects on the five NEO personality traits by performing genome-wide linkage analysis of individuals scoring in the extremes of these traits ( > 90th percentile). Affected-only linkage analysis was performed using an Illumina 6K linkage array in a family-based study, the Erasmus Rucphen Family study. We subsequently determined whether distinct, segregating haplotypes found with linkage analysis were associated with the trait of interest in the population. Finally, a dense single-nucleotide polymorphism genotyping array (Illumina 318K) was used to search for copy number variations (CNVs) in the associated regions. In the families with extreme phenotype scores, we found significant evidence of linkage for conscientiousness to 20p13 (rs1434789, log of odds (LOD) = 5.86) and suggestive evidence of linkage (LOD > 2.8) for neuroticism to 19q, 21q and 22q, extraversion to 1p, 1q, 9p and12q, openness to 12q and 19q, and agreeableness to 2p, 6q, 17q and 21q. Further analysis determined haplotypes in 21q22 for neuroticism (P-values = 0.009, 0.007), in 17q24 for agreeableness (marginal P-value = 0.018) and in 20p13 for conscientiousness (marginal P-values = 0.058, 0.038) segregating in families with large contributions to the LOD scores. No evidence for CNVs in any of the associated regions was found. Our findings imply that there may be genes with relatively large effects involved in personality traits, which may be identified with next-generation sequencing techniques

Crossref

Open Repository and Bibliography - Liège

sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.

Author: Anatoly V Kirichenko
Gulnara R Svishcheva
Irina V Zorkoltseva
Nadezhda M Belonogova
Tatiana I Axenovich
Yakov A Tsepilov
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2022
Field of study

Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data

Directory of Open Access Journals