11 research outputs found

    República: Año III Número 346 - (10/08/33)

    Get PDF
    BACKGROUND: Biomedical knowledge graphs have become important tools to computationally analyse the comprehensive body of biomedical knowledge. They represent knowledge as subject-predicate-object triples, in which the predicate indicates the relationship between subject and object. A triple can also contain provenance information, which consists of references to the sources of the triple (e.g. scientific publications or database entries). Knowledge graphs have been used to classify drug-disease pairs for drug efficacy screening, but existing computational methods have often ignored predicate and provenance information. Using this information, we aimed to develop a supervised machine learning classifier and determine the added value of predicate and provenance information for drug efficacy screening. To ensure the biological plausibility of our method we performed our research on the protein level, where drugs are represented by their drug target proteins, and diseases by their disease proteins. RESULTS: Using random forests with repeated 10-fold cross-validation, our method achieved an area under the ROC curve (AUC) of 78.1% and 74.3% for two reference sets. We benchmarked against a state-of-the-art knowledge-graph technique that does not use predicate and provenance information, obtaining AUCs of 65.6% and 64.6%, respectively. Classifiers that only used predicate information performed superior to classifiers that only used provenance information, but using both performed best. CONCLUSION: We conclude that both predicate and provenance information provide added value for drug efficacy screening

    Discovering information from an integrated graph database

    Get PDF
    The information explosion in science has become a different problem, not the sheer amount per se, but the multiplicity and heterogeneity of massive sets of data sources. Relations mined from these heterogeneous sources, namely texts, database records, and ontologies have been mapped to Resource Description Framework (RDF) triples in an integrated database. The subject and object resources are expressed as references to concepts in a biomedical ontology consisting of the Unified Medical Language System (UMLS), UniProt and EntrezGene and for the predicate resource to a predicate thesaurus. All RDF triples have been stored in a graph database, including provenance. For evaluation we used an actual formal PRISMA literature study identifying 61 cerebral spinal fluid biomarkers and 200 blood biomarkers for migraine. These biomarkers sets could be retrieved with weighted mean average precision values of 0.32 and 0.59, respectively, and can be used as a first reference for further refinements

    Automated extraction of potential migraine biomarkers using a semantic graph

    Get PDF
    Problem Biomedical literature and databases contain important clues for the identification of potential disease biomarkers. However, searching these enormous knowledge reservoirs and integrating findings across heterogeneous sources is costly and difficult. Here we demonstrate how semantically integrated knowledge, extracted from biomedical literature and structured databases, can be used to automatically identify potential migraine biomarkers. Method We used a knowledge graph containing more than 3.5 million biomedical concepts and 68.4 million relationships. Biochemical compound concepts were filtered and ranked by their potential as biomarkers based on their connections to a subgraph of migraine-related concepts. The ranked results were evaluated against the results of a systematic literature review that was performed manually by migraine researchers. Weight points were assigned to these reference compounds to indicate their relative importance. Results Ranked results automatically generated by the knowledge graph were highly consistent with results from the manual literature review. Out of 222 reference compounds, 163 (73%) ranked in the top 2000, with 547 out of the 644 (85%) weight points assigned to the reference compounds. For reference compounds that were not in the top of the list, an extensive error analysis has been performed. When evaluating the overall performance, we obtained a ROC-AUC of 0.974. Discussion Semantic knowledge graphs composed of information integrated from multiple and varying sources can assist researchers in identifying potential disease biomarkers

    Drug prioritization using the semantic properties of a knowledge graph

    Get PDF
    Abstract Compounds that are candidates for drug repurposing can be ranked by leveraging knowledge available in the biomedical literature and databases. This knowledge, spread across a variety of sources, can be integrated within a knowledge graph, which thereby comprehensively describes known relationships between biomedical concepts, such as drugs, diseases, genes, etc. Our work uses the semantic information between drug and disease concepts as features, which are extracted from an existing knowledge graph that integrates 200 different biological knowledge sources. RepoDB, a standard drug repurposing database which describes drug-disease combinations that were approved or that failed in clinical trials, is used to train a random forest classifier. The 10-times repeated 10-fold cross-validation performance of the classifier achieves a mean area under the receiver operating characteristic curve (AUC) of 92.2%. We apply the classifier to prioritize 21 preclinical drug repurposing candidates that have been suggested for Autosomal Dominant Polycystic Kidney Disease (ADPKD). Mozavaptan, a vasopressin V2 receptor antagonist is predicted to be the drug most likely to be approved after a clinical trial, and belongs to the same drug class as tolvaptan, the only treatment for ADPKD that is currently approved. We conclude that semantic properties of concepts in a knowledge graph can be exploited to prioritize drug repurposing candidates for testing in clinical trials

    From camp to city

    No full text
    A long term housing solution for refugee camps using a digitally fabricated building system.Architectural Engineering and TechnologyArchitecture and The Built Environmen

    Proteomic analysis of HIV–T cell interaction: an update

    Get PDF
    This mini-review summarizes techniques applied in, and results obtained with, proteomic studies of human immunodeficiency virus type 1 (HIV-1)–T cell interaction. Our group previously reported on the use of two-dimensional differential gel electrophoresis (2D-DIGE) coupled to matrix assisted laser-desorption time of flight peptide mass fingerprint analysis, to study T cell responses upon HIV-1 infection. Only one in three differentially expressed proteins could be identified using this experimental setup. Here we report on our latest efforts to test models generated by this data set and extend its analysis by using novel bioinformatic algorithms. The 2D-DIGE results are compared with other studies including a pilot study using one-dimensional peptide separation coupled to MS(E), a novel mass spectrometric approach. It can be concluded that although the latter method detects fewer proteins, it is much faster and less labor intensive. Last but not least, recent developments and remaining challenges in the field of proteomic studies of HIV-1 infection and proteomics in general are discussed

    Candidate prioritization for low-abundant differentially expressed proteins in 2D-DIGE datasets

    Get PDF
    BACKGROUND: Two-dimensional differential gel electrophoresis (2D-DIGE) provides a powerful technique to separate proteins on their isoelectric point and apparent molecular mass and quantify changes in protein expression. Abundantly available proteins in spots can be identified using mass spectrometry-based approaches. However, identification is often not possible for low-abundant proteins. RESULTS: We present a novel computational approach to prioritize candidate proteins for unidentified spots. Our approach exploits noisy information on the isoelectric point and apparent molecular mass of a protein spot in combination with functional similarities of candidate proteins to already identified proteins to select and rank candidates. We evaluated our method on a 2D-DIGE dataset comparing protein expression in uninfected and HIV-1 infected T-cells. Using leave-one-out cross-validation, we show that the true-positive rate for the top-5 ranked proteins is 43.8%. CONCLUSIONS: Our approach shows good performance on a 2D-DIGE dataset comparing protein expression in uninfected and HIV-1 infected T-cells. We expect our method to be highly useful in (re-)mining other 2D-DIGE experiments in which especially the low-abundant protein spots remain to be identified

    Drug prioritization using the semantic properties of a knowledge graph

    Get PDF
    Contains fulltext : 203466.pdf (publisher's version ) (Open Access

    Predicting the development of anti-drug antibodies against recombinant alpha-galactosidase a in male patients with classical fabry disease

    Get PDF
    Fabry Disease (FD) is a rare, X-linked, lysosomal storage disease that mainly causes renal, cardiac and cerebral complications. Enzyme replacement therapy (ERT) with recombinant alphagalactosidase A is available, but approximately 50% of male patients with classical FD develop inhibiting anti-drug antibodies (iADAs) that lead to reduced biochemical responses and an accelerated loss of renal function. Once immunization has occurred, iADAs tend to persist and tolerization is hard to achieve. Here we developed a pre-treatment prediction model for iADA development in FD using existing data from 120 classical male FD patients from three European centers, treated with ERT. We found that nonsense and frameshift mutations in the α-galactosidase A gene (p = 0.05), higher plasma lysoGb3 at baseline (p < 0.001) and agalsidase beta as first treatment (p = 0.006) were significantly associated with iADA de

    Cellular and Molecular Signatures of Androgen Ablation of Prostate Cancer

    No full text
    corecore