Search CORE

29 research outputs found

Scenario driven data modelling: a method for integrating diverse sources of data and data streams

Author: Brettin Thomas S
Cottingham Robert W
Griffith Shelton D
Quest Daniel J
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Enabling Web-scale data integration in biomedicine through Linked Open Data

Author: Fernandez Garcia Javier David
Kamdar Maulik R.
Musen Mark A.
Polleres Axel
Tudorache Tania
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The biomedical data landscape is fragmented with several isolated, heterogeneous data and knowledge sources, which use varying formats, syntaxes, schemas, and entity notations, existing on the Web. Biomedical researchers face severe logistical and technical challenges to query, integrate, analyze, and visualize data from multiple diverse sources in the context of available biomedical knowledge. Semantic Web technologies and Linked Data principles may aid toward Web-scale semantic processing and data integration in biomedicine. The biomedical research community has been one of the earliest adopters of these technologies and principles to publish data and knowledge on the Web as linked graphs and ontologies, hence creating the Life Sciences Linked Open Data (LSLOD) cloud. In this paper, we provide our perspective on some opportunities proffered by the use of LSLOD to integrate biomedical data and knowledge in three domains: (1) pharmacology, (2) cancer research, and (3) infectious diseases. We will discuss some of the major challenges that hinder the wide-spread use and consumption of LSLOD by the biomedical research community. Finally, we provide a few technical solutions and insights that can address these challenges. Eventually, LSLOD can enable the development of scalable, intelligent infrastructures that support artificial intelligence methods for augmenting human intelligence to achieve better clinical outcomes for patients, to enhance the quality of biomedical research, and to improve our understanding of living systems

Elektronische Publikationen der Wirtschaftsuniversität Wien

Reply to: Soils need to be considered when assessing the impacts of land-use change on carbon sequestration

Author: Bruckner Martin
Canelas Joana
Eisenmenger Nina
Erb Karl-Heinz
Hilbers Jelle P.
Huijbregts Mark A. J.
Kastner Thomas
Marques Alexandra
Martins Inês S.
Pereira Henrique M.
Plutzar Christoph
Stadler Konstantin
Theurl Michaela C.
Tukker Arnold
Wood Richard
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/11/2019
Field of study

Industrial Ecolog

Elektronische Publikationen der Wirtschaftsuniversität Wien

Leiden University Scholary Publications

Adventures in Semantic Publishing: Exemplar Semantic Enhancements of a Research Article

Author: Klyne Graham
Miles Alistair
Portwin Katie
Shotton David
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Scientific innovation depends on finding, integrating, and re-using the products of previous research. Here we explore how recent developments in Web technology, particularly those related to the publication of data and metadata, might assist that process by providing semantic enhancements to journal articles within the mainstream process of scholarly journal publishing. We exemplify this by describing semantic enhancements we have made to a recent biomedical research article taken from PLoS Neglected Tropical Diseases, providing enrichment to its content and increased access to datasets within it. These semantic enhancements include provision of live DOIs and hyperlinks; semantic markup of textual terms, with links to relevant third-party information resources; interactive figures; a re-orderable reference list; a document summary containing a study summary, a tag cloud, and a citation analysis; and two novel types of semantic enrichment: the first, a Supporting Claims Tooltip to permit “Citations in Context”, and the second, Tag Trees that bring together semantically related terms. In addition, we have published downloadable spreadsheets containing data from within tables and figures, have enriched these with provenance information, and have demonstrated various types of data fusion (mashups) with results from other research articles and with Google Maps. We have also published machine-readable RDF metadata both about the article and about the references it cites, for which we developed a Citation Typing Ontology, CiTO (http://purl.org/net/cito/). The enhanced article, which is available at http://dx.doi.org/10.1371/journal.pntd.0000228.x001, presents a compelling existence proof of the possibilities of semantic publication. We hope the showcase of examples and ideas it contains, described in this paper, will excite the imaginations of researchers and publishers, stimulating them to explore the possibilities of semantic publishing for their own research articles, and thereby break down present barriers to the discovery and re-use of information within traditional modes of scholarly communication

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

Linking the Resource Description Framework to cheminformatics and proteochemometrics

Author: Alvarsson Jonathan
Andersson Annsofie
Eklund Martin
Lampa Samuel
Lapins Maris
Spjuth Ola
Wikberg Jarl ES
Willighagen Egon L
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Semantic web technologies are finding their way into the life sciences. Ontologies and semantic markup have already been used for more than a decade in molecular sciences, but have not found widespread use yet. The semantic web technology Resource Description Framework (RDF) and related methods show to be sufficiently versatile to change that situation. Results The work presented here focuses on linking RDF approaches to existing molecular chemometrics fields, including cheminformatics, QSAR modeling and proteochemometrics. Applications are presented that link RDF technologies to methods from statistics and cheminformatics, including data aggregation, visualization, chemical identification, and property prediction. They demonstrate how this can be done using various existing RDF standards and cheminformatics libraries. For example, we show how IC50 and K<it>i</it> values are modeled for a number of biological targets using data from the ChEMBL database. Conclusions We have shown that existing RDF standards can suitably be integrated into existing molecular chemometrics methods. Platforms that unite these technologies, like Bioclipse, makes this even simpler and more transparent. Being able to create and share workflows that integrate data aggregation and analysis (visual and statistical) is beneficial to interoperability and reproducibility. The current work shows that RDF approaches are sufficiently powerful to support molecular chemometrics workflows.</p

Maastricht University Research Portal

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Juelich Shared Electronic Resources

Finding Complex Biological Relationships in Recent PubMed Articles Using Bio-LDA

Author: Ding Ying
Dong Xiao
He Bing
Qiu Judy
Tang Jie
Wang Huijun
Wild David J.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

The overwhelming amount of available scholarly literature in the life sciences poses significant challenges to scientists wishing to keep up with important developments related to their research, but also provides a useful resource for the discovery of recent information concerning genes, diseases, compounds and the interactions between them. In this paper, we describe an algorithm called Bio-LDA that uses extracted biological terminology to automatically identify latent topics, and provides a variety of measures to uncover putative relations among topics and bio-terms. Relationships identified using those approaches are combined with existing data in life science datasets to provide additional insight. Three case studies demonstrate the utility of the Bio-LDA model, including association predication, association search and connectivity map generation. This combined approach offers new opportunities for knowledge discovery in many areas of biology including target identification, lead hopping and drug repurposing.Comment: 14 pages, 8 figures, 10 table

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Directory of Open Access Journals

PubMed Central

RegenBase: a knowledge base of spinal cord injury biology for translational research.

Author: Abeyruwan Saminda W
Al-Ali Hassan
Bixby John L
Callahan Alison
Ferguson Adam R
Lemmon Vance P
Popovich Phillip G
Sakurai Kunie
Shah Nigam H
Visser Ubbo
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

Spinal cord injury (SCI) research is a data-rich field that aims to identify the biological mechanisms resulting in loss of function and mobility after SCI, as well as develop therapies that promote recovery after injury. SCI experimental methods, data and domain knowledge are locked in the largely unstructured text of scientific publications, making large scale integration with existing bioinformatics resources and subsequent analysis infeasible. The lack of standard reporting for experiment variables and results also makes experiment replicability a significant challenge. To address these challenges, we have developed RegenBase, a knowledge base of SCI biology. RegenBase integrates curated literature-sourced facts and experimental details, raw assay data profiling the effect of compounds on enzyme activity and cell growth, and structured SCI domain knowledge in the form of the first ontology for SCI, using Semantic Web representation languages and frameworks. RegenBase uses consistent identifier schemes and data representations that enable automated linking among RegenBase statements and also to other biological databases and electronic resources. By querying RegenBase, we have identified novel biological hypotheses linking the effects of perturbagens to observed behavioral outcomes after SCI. RegenBase is publicly available for browsing, querying and download.Database URL:http://regenbase.org

Crossref

PubMed Central

eScholarship - University of California

University of Miami: Scholarship Miami

Gode -- Integrating Biochemical Knowledge Graph into Pre-training Molecule Graph Neural Network

Author: Jiang Pengcheng
Publication venue
Publication date: 02/06/2023
Field of study

The precise prediction of molecular properties holds paramount importance in facilitating the development of innovative treatments and comprehending the intricate interplay between chemicals and biological systems. In this study, we propose a novel approach that integrates graph representations of individual molecular structures with multi-domain information from biomedical knowledge graphs (KGs). Integrating information from both levels, we can pre-train a more extensive and robust representation for both molecule-level and KG-level prediction tasks with our novel self-supervision strategy. For performance evaluation, we fine-tune our pre-trained model on 11 challenging chemical property prediction tasks. Results from our framework demonstrate our fine-tuned models outperform existing state-of-the-art models.Comment: It's an ongoing work. We're exploring the ability of Gode on other task

arXiv.org e-Print Archive

Distributed Conversion of RDF Data to the Relational Model

Author: Příhoda David
Publication venue: Czech Technical University in Prague. Computing and Information Centre.
Publication date
Field of study

Ve formátu RDF je ukládán rostoucí objem hodnotných informací. Relační databáze však stále přinášejí výhody z hlediska výkonu a množství podporovaných nástrojů. Představujeme RDF2X, nástroj pro automatický distribuovaný převod RDF dat do relačního modelu. Poskytujeme srovnání souvisejících přístupů, analyzujeme měření převodu 8.4 miliard RDF trojic a ilustrujeme přínos našeho nástroje na dvou případových studiích.The Resource Description Framework (RDF) stores a growing volume of valuable information. However, relational databases still provide advantages in terms of performance, familiarity and the number of supported tools. We present RDF2X, a tool for automatic distributed conversion of RDF datasets to the relational model. We provide a comparison of related approaches, report on the conversion of 8.4 billion RDF triples and demonstrate the contribution of our tool on case studies from two different domains

Digital Library of the Czech Technical University in Prague

OPPL-Galaxy, a Galaxy tool for enhancing ontology exploitation as part of bioinformatics workflows

Author: Antezana E.
Egaña Aranguren Mikel
Fernández-Breis Jesualdo Tomás
Mungall C.
Rodríguez González A.
Wilkinson M.
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2013
Field of study

Biomedical ontologies are key elements for building up the Life Sciences Semantic Web. Reusing and building biomedical ontologies requires flexible and versatile tools to manipulate them efficiently, in particular for enriching their axiomatic content. The Ontology Pre Processor Language (OPPL) is an OWL-based language for automating the changes to be performed in an ontology. OPPL augments the ontologists’ toolbox by providing a more efficient, and less error-prone, mechanism for enriching a biomedical ontology than that obtained by a manual treatment. Results We present OPPL-Galaxy, a wrapper for using OPPL within Galaxy. The functionality delivered by OPPL (i.e. automated ontology manipulation) can be combined with the tools and workflows devised within the Galaxy framework, resulting in an enhancement of OPPL. Use cases are provided in order to demonstrate OPPL-Galaxy’s capability for enriching, modifying and querying biomedical ontologies. Conclusions Coupling OPPL-Galaxy with other bioinformatics tools of the Galaxy framework results in a system that is more than the sum of its parts. OPPL-Galaxy opens a new dimension of analyses and exploitation of biomedical ontologies, including automated reasoning, paving the way towards advanced biological data analyses

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

PubMed Central

Archivo Digital UPM