298 research outputs found
Sequence-based GWAS using thousands Sardinian genomes: an application to quantitative traits
In the last ten years, genome-wide association studies (GWAS) have been extensively used to dissect the genetic architecture of complex quantitative traits.Despite these findings, much of the genetic contribution to complex traits remains largely unexplained, even in diseases for which large GWAS meta-analyses have been undertaken.It is plausible that, using whole genome sequencing data and enlarging the spectrum of variants, these could explain additional disease risk or trait variability.Here I present how two different applications of sequence-based GWAS improve the current knowledge of genetic variation associated to important human traits. In particular, I show how whole-genome sequencing integrated with genotyping arrays by statistical inference led to the identification of novel common, low frequency and rare variants associated with levels of five inflammatory biomarkers and with two parameters related to thyroid function.This work highlights not only advantages but also current pitfalls of the sequence-based GWAS approach, such as the statistical methods utilized and the difficulty in replication of the association results and in estimation of variance explained.Nevertheless, sequence-based GWAS will enlarge our current knowledge of genes associated to complex traits, highlight novel biological pathways and elucidate underlying mechanisms, suggesting critical points and issues to be considered in further developments and improvements of existing statistical methods
An Initial Investigation of Mental Well-being Monitoring through Personal Healthcare Devices
Smart sensory devices, such as smart watches, scales, and blood pressure gauges, are increasingly adopted by individuals aiming to improve their health and fitness. Those devices gather extensive data about cardiovascular parameters, physical activities, sleep quality, and behavior. Thanks to data analytics and artificial intelligence algorithms, they provide insights into the health status of individuals. Derived data is used to support self-care interventions and to provide practitioners with additional health information acquired on a continuous basis. However, most of the current solutions focus on the physical dimension of health, while the mental dimension is often neglected. In this paper, we present the initial investigation of a system to recognize a wide range of psychological parameters, including behavioral inhibition/activation, anxiety, and stress, leveraging data acquired from personal healthcare devices. We experimented with the application of different supervised learning algorithms on features extracted from heart, sleep, and inertial sensor data acquired from a cohort of 21 individuals over 24 hours each. Our preliminary findings suggest that our method may yield promising outcomes in recognizing different aspects of mental well-being. However, due to the limited size of the used dataset, a more comprehensive experimental evaluation, with a broader number of participants and carried out over an extended monitoring period, is imperative to substantiate the results
Participation bias in the UK Biobank distorts genetic associations and downstream analyses
While volunteer-based studies such as the UK Biobank have become the cornerstone of genetic epidemiology, the participating individuals are rarely representative of their target population. To evaluate the impact of selective participation, here we derived UK Biobank participation probabilities on the basis of 14 variables harmonized across the UK Biobank and a representative sample. We then conducted weighted genome-wide association analyses on 19 traits. Comparing the output from weighted genome-wide association analyses (neffective = 94,643 to 102,215) with that from standard genome-wide association analyses (n = 263,464 to 283,749), we found that increasing representativeness led to changes in SNP effect sizes and identified novel SNP associations for 12 traits. While heritability estimates were less impacted by weighting (maximum change in h2, 5%), we found substantial discrepancies for genetic correlations (maximum change in rg, 0.31) and Mendelian randomization estimates (maximum change in βSTD, 0.15) for socio-behavioural traits. We urge the field to increase representativeness in biobank samples, especially when studying genetic correlates of behaviour, lifestyles and social outcomes
Genome-wide association study of susceptibility loci for breast cancer in Sardinian population
Abstract
Background
Despite progress in identifying genes associated with breast cancer, many more risk loci exist. Genome-wide association analyses in genetically-homogeneous populations, such as that of Sardinia (Italy), could represent an additional approach to detect low penetrance alleles.
Methods
We performed a genome-wide association study comparing 1431 Sardinian patients with non-familial, BRCA1/2-mutation-negative breast cancer to 2171 healthy Sardinian blood donors. DNA was genotyped using GeneChip Human Mapping 500 K Arrays or Genome-Wide Human SNP Arrays 6.0. To increase genomic coverage, genotypes of additional SNPs were imputed using data from HapMap Phase II. After quality control filtering of genotype data, 1367 cases (9 men) and 1658 controls (1156 men) were analyzed on a total of 2,067,645 SNPs.
Results
Overall, 33 genomic regions (67 candidate SNPs) were associated with breast cancer risk at the p < 10−6 level. Twenty of these regions contained defined genes, including one already associated with breast cancer risk: TOX3. With a lower threshold for preliminary significance to p < 10−5, we identified 11 additional SNPs in FGFR2, a well-established breast cancer-associated gene. Ten candidate SNPs were selected, excluding those already associated with breast cancer, for technical validation as well as replication in 1668 samples from the same population. Only SNP rs345299, located in intron 1 of VAV3, remained suggestively associated (p-value, 1.16x10−5), but it did not associate with breast cancer risk in pooled data from two large, mixed-population cohorts.
Conclusions
This study indicated the role of TOX3 and FGFR2 as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinian population
Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche.
Age at menarche is a marker of timing of puberty in females. It varies widely between individuals, is a heritable trait and is associated with risks for obesity, type 2 diabetes, cardiovascular disease, breast cancer and all-cause mortality. Studies of rare human disorders of puberty and animal models point to a complex hypothalamic-pituitary-hormonal regulation, but the mechanisms that determine pubertal timing and underlie its links to disease risk remain unclear. Here, using genome-wide and custom-genotyping arrays in up to 182,416 women of European descent from 57 studies, we found robust evidence (P < 5 × 10(-8)) for 123 signals at 106 genomic loci associated with age at menarche. Many loci were associated with other pubertal traits in both sexes, and there was substantial overlap with genes implicated in body mass index and various diseases, including rare disorders of puberty. Menarche signals were enriched in imprinted regions, with three loci (DLK1-WDR25, MKRN3-MAGEL2 and KCNK9) demonstrating parent-of-origin-specific associations concordant with known parental expression patterns. Pathway analyses implicated nuclear hormone receptors, particularly retinoic acid and γ-aminobutyric acid-B2 receptor signalling, among novel mechanisms that regulate pubertal timing in humans. Our findings suggest a genetic architecture involving at least hundreds of common variants in the coordinated timing of the pubertal transition
Genome-wide association study of susceptibility loci for breast cancer in Sardinian population.
BACKGROUND: Despite progress in identifying genes associated with breast cancer, many more risk loci exist. Genome-wide association analyses in genetically-homogeneous populations, such as that of Sardinia (Italy), could represent an additional approach to detect low penetrance alleles. METHODS: We performed a genome-wide association study comparing 1431 Sardinian patients with non-familial, BRCA1/2-mutation-negative breast cancer to 2171 healthy Sardinian blood donors. DNA was genotyped using GeneChip Human Mapping 500 K Arrays or Genome-Wide Human SNP Arrays 6.0. To increase genomic coverage, genotypes of additional SNPs were imputed using data from HapMap Phase II. After quality control filtering of genotype data, 1367 cases (9 men) and 1658 controls (1156 men) were analyzed on a total of 2,067,645 SNPs. RESULTS: Overall, 33 genomic regions (67 candidate SNPs) were associated with breast cancer risk at the p < 0(-6) level. Twenty of these regions contained defined genes, including one already associated with breast cancer risk: TOX3. With a lower threshold for preliminary significance to p < 10(-5), we identified 11 additional SNPs in FGFR2, a well-established breast cancer-associated gene. Ten candidate SNPs were selected, excluding those already associated with breast cancer, for technical validation as well as replication in 1668 samples from the same population. Only SNP rs345299, located in intron 1 of VAV3, remained suggestively associated (p-value, 1.16 x 10(-5)), but it did not associate with breast cancer risk in pooled data from two large, mixed-population cohorts. CONCLUSIONS: This study indicated the role of TOX3 and FGFR2 as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinian population
Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits
Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene-trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2, known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits
Rare coding variants and X-linked loci associated with age at menarche.
More than 100 loci have been identified for age at menarche by genome-wide association studies; however, collectively these explain only ∼3% of the trait variance. Here we test two overlooked sources of variation in 192,974 European ancestry women: low-frequency protein-coding variants and X-chromosome variants. Five missense/nonsense variants (in ALMS1/LAMB2/TNRC6A/TACR3/PRKAG1) are associated with age at menarche (minor allele frequencies 0.08-4.6%; effect sizes 0.08-1.25 years per allele; P<5 × 10(-8)). In addition, we identify common X-chromosome loci at IGSF1 (rs762080, P=9.4 × 10(-13)) and FAAH2 (rs5914101, P=4.9 × 10(-10)). Highlighted genes implicate cellular energy homeostasis, post-transcriptional gene silencing and fatty-acid amide signalling. A frequently reported mutation in TACR3 for idiopathic hypogonatrophic hypogonadism (p.W275X) is associated with 1.25-year-later menarche (P=2.8 × 10(-11)), illustrating the utility of population studies to estimate the penetrance of reportedly pathogenic mutations. Collectively, these novel variants explain ∼0.5% variance, indicating that these overlooked sources of variation do not substantially explain the 'missing heritability' of this complex trait.UK sponsors (see article for overseas ones):
This work made use of data and samples generated by the 1958 Birth Cohort (NCDS). Access to these resources was enabled via the 58READIE Project funded by Wellcome Trust and Medical Research Council (grant numbers WT095219MA and G1001799). A full list of the financial, institutional and personal contributions to the development of the 1958 Birth Cohort Biomedical resource is available at http://www2.le.ac.uk/projects/birthcohort. Genotyping was undertaken as part of the Wellcome Trust Case-Control Consortium (WTCCC) under Wellcome Trust award 076113, and a full list of the investigators who contributed to the generation of the data is available at www.wtccc.org.uk
...
The Fenland Study is funded by the Wellcome Trust and the Medical Research Council, as well as by the Support for Science Funding programme and CamStrad.
...
SIBS - CRUK ref: C1287/A8459 SEARCH - CRUK ref: A490/A10124 EMBRACE is supported by Cancer Research UK Grants C1287/A10118, C1287/A16563 and C1287/A17523. Genotyping was supported by Cancer Research - UK grant C12292/A11174D
and C8197/A16565. Gareth Evans and Fiona Lalloo are supported by an NIHR grant to the Biomedical Research Centre, Manchester.
The Investigators at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust are supported by an NIHR grant to the Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. Ros Eeles and Elizabeth Bancroft are supported by Cancer Research UK Grant C5047/A8385.
...
Generation Scotland - Scottish Executive Health Department, Chief Scientist Office, grant number CZD/16/6. Exome array genotyping for GS:SFHS was funded by the Medical Research Council UK. 23andMe - This work was supported in part by NIH Award 2R44HG006981-02 from the National Human Genome Research Institute.This is the final version of the article. It first appeared from NPG via http://dx.doi.org/10.1038/ncomms875
Mendelian randomization integrating GWAS and eQTL data reveals genetic determinants of complex and clinical traits
Genome-wide association studies (GWAS) have identified thousands of variants associated with complex traits, but their biological interpretation often remains unclear. Most of these variants overlap with expression QTLs, indicating their potential involvement in regulation of gene expression. Here, we propose a transcriptome-wide summary statistics-based Mendelian Randomization approach (TWMR) that uses multiple SNPs as instruments and multiple gene expression traits as exposures, simultaneously. Applied to 43 human phenotypes, it uncovers 3,913 putatively causal gene-trait associations, 36% of which have no genome-wide significant SNP nearby in previous GWAS. Using independent association summary statistics, we find that the majority of these loci were missed by GWAS due to power issues. Noteworthy among these links is educational attainment-associated BSCL2, known to carry mutations leading to a Mendelian form of encephalopathy. We also find pleiotropic causal effects suggestive of mechanistic connections. TWMR better accounts for pleiotropy and has the potential to identify biological mechanisms underlying complex traits.Peer reviewe
Limited evidence for blood eQTLs in human sexual dimorphism
The genetic underpinning of sexual dimorphism is very poorly understood. The prevalence of many diseases differs between men and women, which could be in part caused by sex-specific genetic effects. Nevertheless, only a few published genome-wide association studies (GWAS) were performed separately in each sex. The reported enrichment of expression quantitative trait loci (eQTLs) among GWAS-associated SNPs suggests a potential role of sex-specific eQTLs in the sex-specific genetic mechanism underlying complex traits.
To explore this scenario, we combined sex-specific whole blood RNA-seq eQTL data from 3447 European individuals included in BIOS Consortium and GWAS data from UK Biobank. Next, to test the presence of sex-biased causal effect of gene expression on complex traits, we performed sex-specific transcriptome-wide Mendelian randomization (TWMR) analyses on the two most sexually dimorphic traits, waist-to-hip ratio (WHR) and testosterone levels. Finally, we performed power analysis to calculate the GWAS sample size needed to observe sex-specific trait associations driven by sex-biased eQTLs.
Among 9 million SNP-gene pairs showing sex-combined associations, we found 18 genes with significant sex-biased cis-eQTLs (FDR 5%). Our phenome-wide association study of the 18 top sex-biased eQTLs on >700 traits unraveled that these eQTLs do not systematically translate into detectable sex-biased trait-associations. In addition, we observed that sex-specific causal effects of gene expression on complex traits are not driven by sex-specific eQTLs. Power analyses using real eQTL- and causal-effect sizes showed that millions of samples would be necessary to observe sex-biased trait associations that are fully driven by sex-biased cis-eQTLs. Compensatory effects may further hamper their detection.
Our results suggest that sex-specific eQTLs in whole blood do not translate to detectable sex-specific trait associations of complex diseases, and vice versa that the observed sex-specific trait associations cannot be explained by sex-specific eQTLs
- …
