16 research outputs found

    Drug prescription clusters in the UK Biobank: An assessment of drug-drug interactions and patient outcomes in a large patient cohort

    Full text link
    In recent decades, there has been an increase in polypharmacy, the concurrent administration of multiple drugs per patient. Studies have shown that polypharmacy is linked to adverse patient outcomes and there is interest in elucidating the exact causes behind this observation. In this paper, we are studying the relationship between drug prescriptions, drug-drug interactions (DDIs) and patient mortality. Our focus is not so much on the number of prescribed drugs, the typical metric in polypharmacy research, but rather on the specific combinations of drugs leading to a DDI. To learn the space of real-world drug combinations, we first assessed the drug prescription landscape of the UK Biobank, a large patient data registry. We observed distinct drug constellation patterns driven by the UK Biobank participants' disease status. We show that these drug prescription clusters matter in terms of the number and types of expected DDIs, and may possibly explain observed differences in health outcomes

    Drug prescription clusters in the UK Biobank: An assessment of drug-drug interactions and patient outcomes in a large patient cohort

    Full text link
    In recent decades, there has been an increase in polypharmacy, the concurrent administration of multiple drugs per patient. Studies have shown that polypharmacy is linked to adverse patient outcomes and there is interest in elucidating the exact causes behind this observation. In this paper, we are studying the relationship between drug prescriptions, drug-drug interactions (DDIs) and patient mortality. Our focus is not so much on the number of prescribed drugs, the typical metric in polypharmacy research, but rather on the specific combinations of drugs leading to a DDI. To learn the space of real-world drug combinations, we first assessed the drug prescription landscape of the UK Biobank, a large patient data registry. We observed distinct drug constellation patterns driven by the UK Biobank participants' disease status. We show that these drug prescription clusters matter in terms of the number and types of expected DDIs, and may possibly explain observed differences in health outcomes

    Genomic architecture and prediction of censored time-to-event phenotypes with a Bayesian genome-wide analysis

    Get PDF
    While recent advancements in computation and modelling have improved the analysis of complex traits, our understanding of the genetic basis of the time at symptom onset remains limited. Here, we develop a Bayesian approach (BayesW) that provides probabilistic inference of the genetic architecture of age-at-onset phenotypes in a sampling scheme that facilitates biobank-scale time-to-event analyses. We show in extensive simulation work the benefits BayesW provides in terms of number of discoveries, model performance and genomic prediction. In the UK Biobank, we find many thousands of common genomic regions underlying the age-at-onset of high blood pressure (HBP), cardiac disease (CAD), and type-2 diabetes (T2D), and for the genetic basis of onset reflecting the underlying genetic liability to disease. Age-at-menopause and age-at-menarche are also highly polygenic, but with higher variance contributed by low frequency variants. Genomic prediction into the Estonian Biobank data shows that BayesW gives higher prediction accuracy than other approaches

    A Bayesian approach for structure learning in oscillating regulatory networks

    Get PDF
    Oscillations lie at the core of many biological processes, from the cell cycle, to circadian oscillations and developmental processes. Time-keeping mechanisms are essential to enable organisms to adapt to varying conditions in environmental cycles, from day/night to seasonal. Transcriptional regulatory networks are one of the mechanisms behind these biological oscillations. However, while identifying cyclically expressed genes from time series measurements is relatively easy, determining the structure of the interaction network underpinning the oscillation is a far more challenging problem. Here, we explicitly leverage the oscillatory nature of the transcriptional signals and present a method for reconstructing network interactions tailored to this special but important class of genetic circuits. Our method is based on projecting the signal onto a set of oscillatory basis functions using a Discrete Fourier Transform. We build a Bayesian Hierarchical model within a frequency domain linear model in order to enforce sparsity and incorporate prior knowledge about the network structure. Experiments on real and simulated data show that the method can lead to substantial improvements over competing approaches if the oscillatory assumption is met, and remains competitive also in cases it is not

    Multi-method genome- and epigenome-wide studies of inflammatory protein levels in healthy older adults

    Get PDF
    The molecular factors which control circulating levels of inflammatory proteins are not well understood. Furthermore, association studies between molecular probes and human traits are often performed by linear model-based methods which may fail to account for complex structure and interrelationships within molecular datasets.In this study, we perform genome- and epigenome-wide association studies (GWAS/EWAS) on the levels of 70 plasma-derived inflammatory protein biomarkers in healthy older adults (Lothian Birth Cohort 1936; n = 876; Olink® inflammation panel). We employ a Bayesian framework (BayesR+) which can account for issues pertaining to data structure and unknown confounding variables (with sensitivity analyses using ordinary least squares- (OLS) and mixed model-based approaches). We identified 13 SNPs associated with 13 proteins (n = 1 SNP each) concordant across OLS and Bayesian methods. We identified 3 CpG sites spread across 3 proteins (n = 1 CpG each) that were concordant across OLS, mixed-model and Bayesian analyses. Tagged genetic variants accounted for up to 45% of variance in protein levels (for MCP2, 36% of variance alone attributable to 1 polymorphism). Methylation data accounted for up to 46% of variation in protein levels (for CXCL10). Up to 66% of variation in protein levels (for VEGFA) was explained using genetic and epigenetic data combined. We demonstrated putative causal relationships between CD6 and IL18R1 with inflammatory bowel disease and between IL12B and Crohn’s disease. Our data may aid understanding of the molecular regulation of the circulating inflammatory proteome as well as causal relationships between inflammatory mediators and disease

    Blood-based epigenome-wide analyses of cognitive abilities

    Get PDF
    BACKGROUND: Blood-based markers of cognitive functioning might provide an accessible way to track neurodegeneration years prior to clinical manifestation of cognitive impairment and dementia. RESULTS: Using blood-based epigenome-wide analyses of general cognitive function, we show that individual differences in DNA methylation (DNAm) explain 35.0% of the variance in general cognitive function (g). A DNAm predictor explains ~4% of the variance, independently of a polygenic score, in two external cohorts. It also associates with circulating levels of neurology- and inflammation-related proteins, global brain imaging metrics, and regional cortical volumes. CONCLUSIONS: As sample sizes increase, the ability to assess cognitive function from DNAm data may be informative in settings where cognitive testing is unreliable or unavailable. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-021-02596-5

    Integrating transcriptional activity in genome-scale models of metabolism

    No full text
    Background: Genome-scale metabolic models provide an opportunity for rational approaches to studies of the different reactions taking place inside the cell. The integration of these models with gene regulatory networks is a hot topic in systems biology. The methods developed to date focus mostly on resolving the metabolic elements and use fairly straightforward approaches to assess the impact of genome expression on the metabolic phenotype.[br/] Results: We present here a method for integrating the reverse engineering of gene regulatory networks into these metabolic models. We applied our method to a high-dimensional gene expression data set to infer a background gene regulatory network. We then compared the resulting phenotype simulations with those obtained by other relevant methods.[br/] Conclusions: Our method outperformed the other approaches tested and was more robust to noise. We also illustrate the utility of this method for studies of a complex biological phenomenon, the diauxic shift in yeast

    Improving GWAS discovery and genomic prediction accuracy in biobank data

    No full text
    Genetically informed, deep-phenotyped biobanks are an important research resource and it is imperative that the most powerful, versatile, and efficient analysis approaches are used. Here, we apply our recently developed Bayesian grouped mixture of regressions model (GMRM) in the UK and Estonian Biobanks and obtain the highest genomic prediction accuracy reported to date across 21 heritable traits. When compared to other approaches, GMRM accuracy was greater than annotation prediction models run in the LDAK or LDPred-funct software by 15% (SE 7%) and 14% (SE 2%), respectively, and was 18% (SE 3%) greater than a baseline BayesR model without single-nucleotide polymorphism (SNP) markers grouped into minor allele frequency–linkage disequilibrium (MAF-LD) annotation categories. For height, the prediction accuracy R2 was 47% in a UK Biobank holdout sample, which was 76% of the estimated h2SNP. We then extend our GMRM prediction model to provide mixed-linear model association (MLMA) SNP marker estimates for genome-wide association (GWAS) discovery, which increased the independent loci detected to 16,162 in unrelated UK Biobank individuals, compared to 10,550 from BoltLMM and 10,095 from Regenie, a 62 and 65% increase, respectively. The average χ2 value of the leading markers increased by 15.24 (SE 0.41) for every 1% increase in prediction accuracy gained over a baseline BayesR model across the traits. Thus, we show that modeling genetic associations accounting for MAF and LD differences among SNP markers, and incorporating prior knowledge of genomic function, is important for both genomic prediction and discovery in large-scale individual-level studies

    A versatile computational pipeline for the preprocessing of cell-free DNA fragmentation data

    Full text link
    Cell-free DNA (cfDNA) emerges as a promising liquid biopsy biomarker for cancer diagnosis and patient monitoring. Complementing mutation-based assays, cfDNA carries information about epigenetic modifications from decaying cells. This information is encoded in the shape of the cfDNA fragments. Specifically, fragments from cancer tend to be shorter than those originating from other adult cells, enabling a distinction between cancer patients and healthy individuals. Additional cfDNA features such as fragment end motifs and information on nucleosome positioning provide further insight into cancer biology. These cfDNA measures are typically inferred from low-pass whole genome sequencing and subsequent bioinformatics processing. A key bioinformatics step is the alignment of DNA sequencing reads to the reference genome, which critically depends on preprocessing steps such as read trimming and
    corecore