40 research outputs found

    An eScience-Bayes strategy for analyzing omics data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in <it>ad hoc </it>approaches to address specific problems.</p> <p>Results</p> <p>We present a general approach to omics data analysis that alleviates these problems. By combining eScience and Bayesian methods, we retrieve scientific information and data from multiple sources and coherently incorporate them into large models. These models improve the accuracy of predictions and offer new insights into the underlying mechanisms. This "eScience-Bayes" approach is demonstrated in two proof-of-principle applications, one for breast cancer prognosis prediction from transcriptomic data and one for protein-protein interaction studies based on proteomic data.</p> <p>Conclusions</p> <p>Bayesian statistics provide the flexibility to tailor statistical models to the complex data structures in omics biology as well as permitting coherent integration of multiple data sources. However, Bayesian methods are in general computationally demanding and require specification of possibly thousands of prior distributions. eScience can help us overcome these difficulties. The eScience-Bayes thus approach permits us to fully leverage on the advantages of Bayesian methods, resulting in models with improved predictive performance that gives more information about the underlying biological system.</p

    Multiomics links global surfactant dysregulation with airflow obstruction and emphysema in COPD

    Get PDF
    RATIONALE: Pulmonary surfactant is vital for lung homeostasis as it reduces surface tension to prevent alveolar collapse and provides essential immune-regulatory and antipathogenic functions. Previous studies demonstrated dysregulation of some individual surfactant components in COPD. We investigated relationships between COPD disease measures and dysregulation of surfactant components to gain new insights into potential disease mechanisms. METHODS: Bronchoalveolar lavage proteome and lipidome were characterised in ex-smoking mild/moderate COPD subjects (n=26) and healthy ex-smoking (n=20) and never-smoking (n=16) controls using mass spectrometry. Serum surfactant protein analysis was performed. RESULTS: Total phosphatidylcholine, phosphatidylglycerol, phosphatidylinositol, surfactant protein (SP)-B, SP-A and SP-D concentrations were lower in COPD versus controls (log2 fold change (log2FC) -2.0, -2.2, -1.5, -0.5, -0.7 and -0.5 (adjusted p<0.02), respectively) and correlated with lung function. Total phosphatidylcholine, phosphatidylglycerol, phosphatidylinositol, SP-A, SP-B, SP-D, napsin A and CD44 inversely correlated with computed tomography small airways disease measures (expiratory to inspiratory mean lung density) (r= -0.56, r= -0.58, r= -0.45, r= -0.36, r= -0.44, r= -0.37, r= -0.40 and r= -0.39 (adjusted p<0.05)). Total phosphatidylcholine, phosphatidylglycerol, phosphatidylinositol, SP-A, SP-B, SP-D and NAPSA inversely correlated with emphysema (% low-attenuation areas): r= -0.55, r= -0.61, r= -0.48, r= -0.51, r= -0.41, r= -0.31 and r= -0.34, respectively (adjusted p<0.05). Neutrophil elastase, known to degrade SP-A and SP-D, was elevated in COPD versus controls (log2FC 0.40, adjusted p=0.0390), and inversely correlated with SP-A and SP-D. Serum SP-D was increased in COPD versus healthy ex-smoking volunteers, and predicted COPD status (area under the curve 0.85). CONCLUSIONS: Using a multiomics approach, we demonstrate, for the first time, global surfactant dysregulation in COPD that was associated with emphysema, giving new insights into potential mechanisms underlying the cause or consequence of disease

    Unexplored therapeutic opportunities in the human genome

    Get PDF
    A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development. © 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved

    Evaluation of FOXO1 target engagement using a single-cell microfluidic platform

    No full text
    The cellular thermal shift assay (CETSA) has been used extensively since its introduction to study drug–target engagement within both live cells and cellular lysate. This has proven to be a useful tool in early stage drug discovery and is used to study a wide range of protein classes. We describe the application of a single-cell CETSA workflow within a microfluidic affinity capture (MAC) chip. This has enabled us to quantitatively determine the active FOXO1 single-molecule count and observe FOXO1 stabilization and destabilization in the presence of three small molecule inhibitors, including demonstrating the determination of EC50. The successful use of the MAC chip for single-cell CETSA paves the way for the study of precious clinical samples owing to the low number of cells needed by the chip. It also provides a useful tool for studying any underlying population heterogeneity that exists within a cellular system, a feature that is usually masked when conducting ensemble measurements

    Identification of a missense variant in SPDL1 associated with idiopathic pulmonary fibrosis.

    Get PDF
    Idiopathic pulmonary fibrosis (IPF) is a fatal disorder characterised by progressive, destructive lung scarring. Despite substantial progress, the genetic determinants of this disease remain incompletely defined. Using whole genome and whole exome sequencing data from 752 individuals with sporadic IPF and 119,055 UK Biobank controls, we performed a variant-level exome-wide association study (ExWAS) and gene-level collapsing analyses. Our variant-level analysis revealed a novel association between a rare missense variant in SPDL1 and IPF (NM_017785.5:g.169588475 G > A p.Arg20Gln; p = 2.4 × 10-7, odds ratio = 2.87, 95% confidence interval: 2.03-4.07). This signal was independently replicated in the FinnGen cohort, which contains 1028 cases and 196,986 controls (combined p = 2.2 × 10-20), firmly associating this variant as an IPF risk allele. SPDL1 encodes Spindly, a protein involved in mitotic checkpoint signalling during cell division that has not been previously described in fibrosis. To the best of our knowledge, these results highlight a novel mechanism underlying IPF, providing the potential for new therapeutic discoveries in a disease of great unmet need

    Rare variant contribution to human disease in 281,104 UK Biobank exomes

    No full text
    Genome-wide association studies have uncovered thousands of common variants associated with human disease, but the contribution of rare variants to common disease remains relatively unexplored. The UK Biobank contains detailed phenotypic data linked to medical records for approximately 500,000 participants, offering an unprecedented opportunity to evaluate the effect of rare variation on a broad collection of traits1,2. Here we study the relationships between rare protein-coding variants and 17,361 binary and 1,419 quantitative phenotypes using exome sequencing data from 269,171 UK Biobank participants of European ancestry. Gene-based collapsing analyses revealed 1,703 statistically significant gene-phenotype associations for binary traits, with a median odds ratio of 12.4. Furthermore, 83% of these associations were undetectable via single-variant association tests, emphasizing the power of gene-based collapsing analysis in the setting of high allelic heterogeneity. Gene-phenotype associations were also significantly enriched for loss-of-function-mediated traits and approved drug targets. Finally, we performed ancestry-specific and pan-ancestry collapsing analyses using exome sequencing data from 11,933 UK Biobank participants of African, East Asian or South Asian ancestry. Our results highlight a significant contribution of rare variants to common disease. Summary statistics are publicly available through an interactive portal ( http://azphewas.com/ )
    corecore