Search CORE

1,808 research outputs found

Generation and Applications of Knowledge Graphs in Systems and Networks Biology

Author: Hoyt Charles Tapley
Publication venue: Universitäts- und Landesbibliothek Bonn
Publication date
Field of study

The acceleration in the generation of data in the biomedical domain has necessitated the use of computational approaches to assist in its interpretation. However, these approaches rely on the availability of high quality, structured, formalized biomedical knowledge. This thesis has the two goals to improve methods for curation and semantic data integration to generate high granularity biological knowledge graphs and to develop novel methods for using prior biological knowledge to propose new biological hypotheses. The first two publications describe an ecosystem for handling biological knowledge graphs encoded in the Biological Expression Language throughout the stages of curation, visualization, and analysis. Further, the second two publications describe the reproducible acquisition and integration of high-granularity knowledge with low contextual specificity from structured biological data sources on a massive scale and support the semi-automated curation of new content at high speed and precision. After building the ecosystem and acquiring content, the last three publications in this thesis demonstrate three different applications of biological knowledge graphs in modeling and simulation. The first demonstrates the use of agent-based modeling for simulation of neurodegenerative disease biomarker trajectories using biological knowledge graphs as priors. The second applies network representation learning to prioritize nodes in biological knowledge graphs based on corresponding experimental measurements to identify novel targets. Finally, the third uses biological knowledge graphs and develops algorithmics to deconvolute the mechanism of action of drugs, that could also serve to identify drug repositioning candidates. Ultimately, the this thesis lays the groundwork for production-level applications of drug repositioning algorithms and other knowledge-driven approaches to analyzing biomedical experiments

bonndoc – Der Publikationsserver der Universität Bonn

Discovering lesser known molecular players and mechanistic patterns in Alzheimer's disease using an integrative disease modelling approach

Author: Kawalia Shweta Bagewadi
Publication venue: Universitäts- und Landesbibliothek Bonn
Publication date
Field of study

Convergence of exponentially advancing technologies is driving medical research with life changing discoveries. On the contrary, repeated failures of high-profile drugs to battle Alzheimer's disease (AD) has made it one of the least successful therapeutic area. This failure pattern has provoked researchers to grapple with their beliefs about Alzheimer's aetiology. Thus, growing realisation that Amyloid-β and tau are not 'the' but rather 'one of the' factors necessitates the reassessment of pre-existing data to add new perspectives. To enable a holistic view of the disease, integrative modelling approaches are emerging as a powerful technique. Combining data at different scales and modes could considerably increase the predictive power of the integrative model by filling biological knowledge gaps. However, the reliability of the derived hypotheses largely depends on the completeness, quality, consistency, and context-specificity of the data. Thus, there is a need for agile methods and approaches that efficiently interrogate and utilise existing public data. This thesis presents the development of novel approaches and methods that address intrinsic issues of data integration and analysis in AD research. It aims to prioritise lesser-known AD candidates using highly curated and precise knowledge derived from integrated data. Here much of the emphasis is put on quality, reliability, and context-specificity. This thesis work showcases the benefit of integrating well-curated and disease-specific heterogeneous data in a semantic web-based framework for mining actionable knowledge. Furthermore, it introduces to the challenges encountered while harvesting information from literature and transcriptomic resources. State-of-the-art text-mining methodology is developed to extract miRNAs and its regulatory role in diseases and genes from the biomedical literature. To enable meta-analysis of biologically related transcriptomic data, a highly-curated metadata database has been developed, which explicates annotations specific to human and animal models. Finally, to corroborate common mechanistic patterns — embedded with novel candidates — across large-scale AD transcriptomic data, a new approach to generate gene regulatory networks has been developed. The work presented here has demonstrated its capability in identifying testable mechanistic hypotheses containing previously unknown or emerging knowledge from public data in two major publicly funded projects for Alzheimer's, Parkinson's and Epilepsy diseases

bonndoc – Der Publikationsserver der Universität Bonn

Setting the basis of best practices and standards for curation and annotation of logical models in biology

Author: Calzone Laurence
Casals-Casas Cristina
Chaouiya Claudine
Freeman Tom
Helikar Tomás
Kuiper Martin
Naldi Aurélien
Niarakis Anna
Noel Vincent
Oshurko Eugenia
Ostaszewski Marek
Saez-Rodriguez Julio
Sheriff Rahuman S Malik
Soliman Sylvain
Stoll Gautier
Thieffry Denis
Thomas Paul
Toure Vasundra
Xenarios Ioannis
Publication venue: 'Oxford University Press (OUP)'
Publication date: 21/04/2020
Field of study

International audienceThe fast accumulation of biological data calls for their integration, analysis and exploitation through more systematic approaches. The generation of novel, relevant hypotheses from this enormous quantity of data remains challenging. Logical models have long been used to answer a variety of questions regarding the dynamical behaviours of regulatory networks. As the number of published logical models increases, there is a pressing need for systematic model annotation, referencing and curation in community-supported and standardised formats. This article summarises the key topics and future directions of a meeting entitled ‘Annotation and curation of computational models in biology’, organised as part of the 2019 [BC]2 conference. The purpose of the meeting was to develop and drive forward a plan towards the standardised annotation of logical models, review and connect various ongoing projects of experts from different communities involved in the modelling and annotation of molecular biological entities, interactions, pathways and models. This article defines a roadmap towards the annotation and curation of logical models, including milestones for best practices and minimum standard requirements

INRIA a CCSD electronic archive server

HAL Descartes

Edinburgh Research Explorer

HAL-MINES ParisTech

Hal-Diderot

Recent advances in biocuration: Meeting report from the Fifth International Biocuration Conference

Author: +17 additional authors
Arighi Cecilia
Bastian Frederic
Bateman Alex
Blake Judith A.
Gaudet Pascale
Mazumder Raja
Publication venue: Health Sciences Research Commons
Publication date: 01/01/2012
Field of study

The 5th International Biocuration Conference brought together over 300 scientists to exchange on their work, as well as discuss issues relevant to the International Society for Biocuration’s (ISB) mission. Recurring themes this year included the creation and promotion of gold standards, the need for more ontologies, and more formal interactions with journals. The conference is an essential part of the ISB\u27s goal to support exchanges among members of the biocuration community. Next year\u27s conference will be held in Cambridge, UK, from 7 to 10 April 2013. In the meanwhile, the ISB website provides information about the society\u27s activities (http://biocurator.org), as well as related events of interest

George Washington University: Health Sciences Research Commons (HSRC)

Overview of the gene ontology task at BioCreative IV

Author: Mao Yuqing
Van Auken Kimberly
Publication venue: 'Oxford University Press (OUP)'
Publication date: 25/08/2014
Field of study

Gene Ontology (GO) annotation is a common task among model organism databases (MODs) for capturing gene function data from journal articles. It is a time-consuming and labor-intensive task, and is thus often considered as one of the bottlenecks in literature curation. There is a growing need for semiautomated or fully automated GO curation techniques that will help database curators to rapidly and accurately identify gene function information in full-length articles. Despite multiple attempts in the past, few studies have proven to be useful with regard to assisting real-world GO curation. The shortage of sentence-level training data and opportunities for interaction between text-mining developers and GO curators has limited the advances in algorithm development and corresponding use in practical circumstances. To this end, we organized a text-mining challenge task for literature-based GO annotation in BioCreative IV. More specifically, we developed two subtasks: (i) to automatically locate text passages that contain GO-relevant information (a text retrieval task) and (ii) to automatically identify relevant GO terms for the genes in a given article (a concept-recognition task). With the support from five MODs, we provided teams with >4000 unique text passages that served as the basis for each GO annotation in our task data. Such evidence text information has long been recognized as critical for text-mining algorithm development but was never made available because of the high cost of curation. In total, seven teams participated in the challenge task. From the team results, we conclude that the state of the art in automatically mining GO terms from literature has improved over the past decade while much progress is still needed for computer-assisted GO curation. Future work should focus on addressing remaining technical challenges for improved performance of automatic GO concept recognition and incorporating practical benefits of text-mining tools into real-world GO annotation

Caltech Authors

Construction of antimicrobial peptide-drug combination networks from scientific literature based on a semi-automated curation workflow

Author: Abi-Haidar
Anália Lourenço
Arias
Florentino Fdez-Riverola
Gael Pérez Rodríguez
Jorge
Kolchinsky
Lourenço
Maria Olívia Pereira
Martín Pérez-Pérez
Masadeh
NCBI Resource Coordinators and Coordinators N. R
Paula Jorge
Pérez-Pérez
Saito
Sousa
Strempel
Tiikkainen
Tong
Wagner
Wozniak
Zhou
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2016
Field of study

Considerable research efforts are being invested in the development of novel antimicrobial therapies effective against the growing number of multi-drug resistant (MDR) pathogens. Notably, the combination of different agents is increasingly explored as means to exploit and improve individual agent actions while minimising microorganism resistance. Although there are several databases on antimicrobial agents, scientific literature is the primary source of information on experimental antimicrobial combination testing. This work presents a semi-automated database curation workflow that supports the mining of scientific literature and enables the reconstruction of recently documented antimicrobial combinations. Currently, the database contains data on antimicrobial combinations that have been experimentally tested against Pseudomonas aeruginosa, Staphylococcus aureus, Escherichia coli, Listeria monocytogenes and Candida albicans, which are prominent pathogenic organisms and are well-known for their wide and growing resistance to conventional antimicrobials. Researchers are able to explore the experimental results for a single organism or across organisms. Likewise, researchers may look into indirect network associations and identify new potential combinations to be tested. The database is available without charges. Database URL: http://sing.ei.uvigo.es/antimicrobialCombination/This study was supported by the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UID/BIO/04469/2013 unit and COMPETE 2020 (POCI-01-0145FEDER-006684), and support by FCT and the European Community fund FEDER, through the Programme COMPETE, under the scope of the Projects AntiPep PTDC/SAU-SAP/113196/2009 (FCOMP-01-0124-FEDER-016012) and RECI/BBB-EBI/0179/2012 (FCOMP-01-0124-FEDER-027462). Authors acknowledge the PhD Grant of Paula Jorge, funded by FCT Ref. SFRH/BD/ 88192/2012, and the PhD grants of Martin Pérez-Pérez and Gael Pe´rez-Rodriguez, funded by the University of Vigo. Finally, this study was partially funded by the [15VI013] Contract-Programme from the University of Vigo and the Agrupamento INBIOMED from DXPCTSUG-FEDER unha maneira de facer Europa (2012/273). This document reflects only the authors views and the European Union is not liable for any use that may be made of the information contained herein

Universidade do Minho: RepositoriUM

Crossref

PubMed Central

Literature Based Discovery (LBD): Towards Hypothesis Generation and Knowledge Discovery in Biomedical Text Mining

Author: Bhasuran Balu
Murugesan Gurusamy
Natarajan Jeyakumar
Publication venue
Publication date: 03/10/2023
Field of study

Biomedical knowledge is growing in an astounding pace with a majority of this knowledge is represented as scientific publications. Text mining tools and methods represents automatic approaches for extracting hidden patterns and trends from this semi structured and unstructured data. In Biomedical Text mining, Literature Based Discovery (LBD) is the process of automatically discovering novel associations between medical terms otherwise mentioned in disjoint literature sets. LBD approaches proven to be successfully reducing the discovery time of potential associations that are hidden in the vast amount of scientific literature. The process focuses on creating concept profiles for medical terms such as a disease or symptom and connecting it with a drug and treatment based on the statistical significance of the shared profiles. This knowledge discovery approach introduced in 1989 still remains as a core task in text mining. Currently the ABC principle based two approaches namely open discovery and closed discovery are mostly explored in LBD process. This review starts with general introduction about text mining followed by biomedical text mining and introduces various literature resources such as MEDLINE, UMLS, MESH, and SemMedDB. This is followed by brief introduction of the core ABC principle and its associated two approaches open discovery and closed discovery in LBD process. This review also discusses the deep learning applications in LBD by reviewing the role of transformer models and neural networks based LBD models and its future aspects. Finally, reviews the key biomedical discoveries generated through LBD approaches in biomedicine and conclude with the current limitations and future directions of LBD.Comment: 43 Pages, 5 Figures, 4 Table

arXiv.org e-Print Archive

Knowledge-based Biomedical Data Science 2019

Author: Callahan Tiffany J.
Hunter Lawrence E.
Pielke-Lombardo Harrison
Tripodi Ignacio J.
Publication venue
Publication date: 08/10/2019
Field of study

Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing, and the expansion of knowledge-based approaches to novel domains, such as Chinese Traditional Medicine and biodiversity.Comment: Manuscript 43 pages with 3 tables; Supplemental material 43 pages with 3 table

arXiv.org e-Print Archive

The role of ontologies in biological and biomedical research: a functional perspective.

Author: Gkoutos Georgios V
Hoehndorf Robert
Schofield Paul N
Publication venue: Brief Bioinform
Publication date: 10/04/2015
Field of study

Ontologies are widely used in biological and biomedical research. Their success lies in their combination of four main features present in almost all ontologies: provision of standard identifiers for classes and relations that represent the phenomena within a domain; provision of a vocabulary for a domain; provision of metadata that describes the intended meaning of the classes and relations in ontologies; and the provision of machine-readable axioms and definitions that enable computational access to some aspects of the meaning of classes and relations. While each of these features enables applications that facilitate data integration, data access and analysis, a great potential lies in the possibility of combining these four features to support integrative analysis and interpretation of multimodal data. Here, we provide a functional perspective on ontologies in biology and biomedicine, focusing on what ontologies can do and describing how they can be used in support of integrative research. We also outline perspectives for using ontologies in data-driven science, in particular their application in structured data mining and machine learning applications.This is the final version of the article. It first appeared from Oxford University Press via http://dx.doi.org/10.1093/bib/bbv01

CiteSeerX

University of Birmingham Research Portal

PubMed Central

Apollo (Cambridge)