7 research outputs found
NRLiSt BDB, the Manually Curated Nuclear Receptors Ligands and Structures Benchmarking Database
Nuclear receptors (NRs) constitute
an important class of drug targets.
We created the most exhaustive NR-focused benchmarking database to
date, the NRLiSt BDB (NRs ligands and structures benchmarking database).
The 9905 compounds and 339 structures of the NRLiSt BDB are ready
for structure-based and ligand-based virtual screening. In the present
study, we detail the protocol used to generate the NRLiSt BDB and
its features. We also give some examples of the errors that we found
in ChEMBL that convinced us to manually review all original papers.
Since extensive and manually curated experimental data about NR ligands
and structures are provided in the NRLiSt BDB, it should become a
powerful tool to assess the performance of virtual screening methods
on NRs, to assist the understanding of NRâs function and modulation,
and to support the discovery of new drugs targeting NRs. NRLiSt BDB
is freely available online at http://nrlist.drugdesign.fr
List of SNP/gene pairs associated with AIDS progression.
<p>(*) Note that rs3749971 is in linkage disequilibrium with rs3130350 and is therefore not considered an independent finding (see text).</p><p>The genetic association is linked back to its association with gene expression levels to provide an association between transcription levels (we use the word âregulationâ for convenienceâs sake) and AIDS progression.</p
List of the significant associations (<i>p</i> †0.05) with slow and non-progression.
<p>(*) Note that rs3749971 is in linkage disequilibrium with rs3130350 and is therefore not considered an independent finding in our statistics (see text).</p><p>Alleles, allele frequencies (AF), positional data and genetic modes are provided with the results of the statistical inferences. Opposite signs for the ÎČ coefficients are required for an association to be replicated in the GRIV (non-progression) and ACS cohorts (time to AIDS93).</p
Statistical significance of our associations.
<p>Histogram of the number of SNPs that pass the significance criterion for this study using phenotype and SNP randomisations. These results provide us with a way to estimate the sensitivity of our study (diamond): it would be extremely unlikely for our eight independent findings to arise by chance alone (<i>p</i> = 0.001).</p
Schematic summary of our methodology.
<p>The data from three databases are integrated to provide us with functional SNPs likely to be associated with changes in gene transcription in the tissue of interest. Using the SNAP Pairwise LD server, we only kept independent SNPs by removing superfluous SNPs that were in linkage disequilibrium (<i>r</i><sup>2</sup> â„ 0.2). Among those SNPs, associations with slow and non-progression towards AIDS are sought and replicated. Randomisations are carried out in order to evaluate the statistical robustness of our results. Finally, the genetic associations are used to link progression to AIDS and gene expression in candidate genes.</p
DataSheet1_Multi-omics insights into the biological mechanisms underlying statistical gene-by-lifestyle interactions with smoking and alcohol consumption.pdf
Though both genetic and lifestyle factors are known to influence cardiometabolic outcomes, less attention has been given to whether lifestyle exposures can alter the association between a genetic variant and these outcomes. The Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortiumâs Gene-Lifestyle Interactions Working Group has recently published investigations of genome-wide gene-environment interactions in large multi-ancestry meta-analyses with a focus on cigarette smoking and alcohol consumption as lifestyle factors and blood pressure and serum lipids as outcomes. Further description of the biological mechanisms underlying these statistical interactions would represent a significant advance in our understanding of gene-environment interactions, yet accessing and harmonizing individual-level genetic and âomics data is challenging. Here, we demonstrate the coordinated use of summary-level data for gene-lifestyle interaction associations on up to 600,000 individuals, differential methylation data, and gene expression data for the characterization and prioritization of loci for future follow-up analyses. Using this approach, we identify 48 genes for which there are multiple sources of functional support for the identified gene-lifestyle interaction. We also identified five genes for which differential expression was observed by the same lifestyle factor for which a gene-lifestyle interaction was found. For instance, in gene-lifestyle interaction analysis, the T allele of rs6490056 (ALDH2) was associated with higher systolic blood pressure, and a larger effect was observed in smokers compared to non-smokers. In gene expression studies, this allele is associated with decreased expression of ALDH2, which is part of a major oxidative pathway. Other results show increased expression of ALDH2 among smokers. Oxidative stress is known to contribute to worsening blood pressure. Together these data support the hypothesis that rs6490056 reduces expression of ALDH2, which raises oxidative stress, leading to an increase in blood pressure, with a stronger effect among smokers, in whom the burden of oxidative stress is greater. Other genes for which the aggregation of data types suggest a potential mechanism include: GCNT4Ăcurrent smoking (HDL), PTPRZ1Ăever-smoking (HDL), SYN2Ăcurrent smoking (pulse pressure), and TMEM116Ăever-smoking (mean arterial pressure). This work demonstrates the utility of careful curation of summary-level data from a variety of sources to prioritize gene-lifestyle interaction loci for follow-up analyses.</p
DataSheet2_Multi-omics insights into the biological mechanisms underlying statistical gene-by-lifestyle interactions with smoking and alcohol consumption.pdf
Though both genetic and lifestyle factors are known to influence cardiometabolic outcomes, less attention has been given to whether lifestyle exposures can alter the association between a genetic variant and these outcomes. The Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortiumâs Gene-Lifestyle Interactions Working Group has recently published investigations of genome-wide gene-environment interactions in large multi-ancestry meta-analyses with a focus on cigarette smoking and alcohol consumption as lifestyle factors and blood pressure and serum lipids as outcomes. Further description of the biological mechanisms underlying these statistical interactions would represent a significant advance in our understanding of gene-environment interactions, yet accessing and harmonizing individual-level genetic and âomics data is challenging. Here, we demonstrate the coordinated use of summary-level data for gene-lifestyle interaction associations on up to 600,000 individuals, differential methylation data, and gene expression data for the characterization and prioritization of loci for future follow-up analyses. Using this approach, we identify 48 genes for which there are multiple sources of functional support for the identified gene-lifestyle interaction. We also identified five genes for which differential expression was observed by the same lifestyle factor for which a gene-lifestyle interaction was found. For instance, in gene-lifestyle interaction analysis, the T allele of rs6490056 (ALDH2) was associated with higher systolic blood pressure, and a larger effect was observed in smokers compared to non-smokers. In gene expression studies, this allele is associated with decreased expression of ALDH2, which is part of a major oxidative pathway. Other results show increased expression of ALDH2 among smokers. Oxidative stress is known to contribute to worsening blood pressure. Together these data support the hypothesis that rs6490056 reduces expression of ALDH2, which raises oxidative stress, leading to an increase in blood pressure, with a stronger effect among smokers, in whom the burden of oxidative stress is greater. Other genes for which the aggregation of data types suggest a potential mechanism include: GCNT4Ăcurrent smoking (HDL), PTPRZ1Ăever-smoking (HDL), SYN2Ăcurrent smoking (pulse pressure), and TMEM116Ăever-smoking (mean arterial pressure). This work demonstrates the utility of careful curation of summary-level data from a variety of sources to prioritize gene-lifestyle interaction loci for follow-up analyses.</p