
4,260 research outputs found

Ontology of core data mining entities

Author: A Bernstein
A Golbraikh
A Karalic
B Smith
C Silla
C Vens
D Demšar
D Kocev
D Qi
D Young
DJ Hand
F Serban
G Madjarov
G Tsoumakas
GH Bakir
H Mannila
HP Kriegel
I Slavkov
J Vanschoren
K Button
Larisa Soldatova
LN Soldatova
M Courtot
M Ford
M Žáková
MA Avery
MF López
O Spjuth
P Robinson
Panče Panov
Q Yang
R Caruana
R Guha
RD King
RR Brinkman
Sašo Džeroski
T Dietterich
V Podpečan
Publication venue: Springer Science and Business Media LLC
Publication date: 05/07/2014

In this article, we present OntoDM-core, an ontology of core data mining entities. OntoDM-core defines the most essential data mining entities in a three-layered ontological structure comprising a specification, an implementation and an application layer. It provides a representational framework for the description of mining structured data, and in addition provides taxonomies of datasets, data mining tasks, generalizations, data mining algorithms and constraints, based on the type of data. OntoDM-core is designed to support a wide range of applications/use cases, such as semantic annotation of data mining algorithms, datasets and results; annotation of QSAR studies in the context of drug discovery investigations; and disambiguation of terms in text mining. The ontology has been thoroughly assessed following the practices in ontology engineering, is fully interoperable with many domain resources and is easy to extend.

Crossref

Brunel University Research Archive

GI Systems for public health with an ontology based approach

Author: Gür Nurefşan
Publication date: 05/03/2012

Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies. Health is an indispensable attribute of human life. In the modern age, utilizing technologies for health is one of the emergent concepts in several applied fields. Computer science and (geographic) information systems are among the interdisciplinary fields that motivate this thesis. The inspiring idea of the study originates from a rhetorical disease, DbHd (Database Hugging Disorder), defined by Hans Rosling in a World Bank Open Data speech in May 2010. The cure for this disease can be offered as linked open data, which contains ontologies for health science, diseases, genes, drugs, GEO species etc. Linked Open Data (LOD) provides the systematic application of information by publishing and connecting structured data on the Web. In the context of this study, we aimed to reduce the boundaries between the semantic web and the geo web. For this reason, use case data from the Valencia CSISP (Research Center of Public Health), in which the mortality rates for particular diseases are represented spatio-temporally, is studied. The use case data is divided into three conceptual domains (health, spatial, statistical) and enhanced with semantic relations and descriptions by following Linked Data Principles. Finally, in order to convey complex health-related information, we offer an infrastructure integrating the geo web and the semantic web. Based on the established outcome, user access methods are introduced and future research is outlined.

Repositório da Universidade Nova de Lisboa

Generation of open biomedical datasets through ontology-driven transformation and integration processes

Publication venue: BioMed Central
Publication date: 03/06/2016

Springer - Publisher Connector

Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines

Author: Aloia N.
Beran B.
Borgman C.L.
Carlson J.
Fielding N.G.
Honor L.B.
Ingwersen P.
Maier D.
Meyer E.T.
Pasquetto I.V.
Zimmerman A.S.
Publication venue: Wiley
Publication date: 03/04/2019

A cross-disciplinary examination of the user behaviours involved in seeking and evaluating data is surprisingly absent from the research data discussion. This review explores the data retrieval literature to identify commonalities in how users search for and evaluate observational research data. Two analytical frameworks rooted in information retrieval and science technology studies are used to identify key similarities in practices as a first step toward developing a model describing data retrieval.

arXiv.org e-Print Archive

Maastricht University Research Portal

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Linked Data based Health Information Representation, Visualization and Retrieval System on the Semantic Web

Author: Tilahun Binyam Chakilu
Publication date: 30/01/2013

Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies. To better facilitate health information dissemination, using flexible ways to represent, query and visualize health data becomes increasingly important. Semantic Web technologies, which provide a common framework by allowing data to be shared and reused between applications, can be applied to the management of health data. Linked open data, a new semantic web standard to publish and link heterogeneous data, allows not only humans but also machines to browse data in an unlimited way. Through a use case of World Health Organization HIV data for sub-Saharan Africa, which is severely affected by the HIV epidemic, this thesis built a linked data based health information representation, querying and visualization system. All the data was represented in RDF and interlinked with other related datasets that are already on the cloud. Overall, the system has more than 21,000 triples, with a SPARQL endpoint where users can download and use the data and a SPARQL query interface where users can submit different types of queries and retrieve the results. Additionally, it has a visualization interface where users can visualize the SPARQL results with a tool of their preference. Users who are not familiar with SPARQL queries can use the linked data search engine interface to search and browse the data. From this system we can see that current linked open data technologies have great potential to represent heterogeneous health data in a flexible and reusable manner and that they can serve intelligent queries, which can support decision-making. However, in order to get the best from these technologies, improvements are needed both at the level of triple store performance and domain-specific ontological vocabularies.

Repositório da Universidade Nova de Lisboa

Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models

Author: Melamud Oren
Shivade Chaitanya
Publication date: 01/01/2019

Large-scale clinical data is invaluable to driving many computational scientific advances today. However, understandable concerns regarding patient privacy hinder the open dissemination of such data and give rise to suboptimal siloed research. De-identification methods attempt to address these concerns but were shown to be susceptible to adversarial attacks. In this work, we focus on the vast amounts of unstructured natural language data stored in clinical notes and propose to automatically generate synthetic clinical notes that are more amenable to sharing using generative models trained on real de-identified records. To evaluate the merit of such notes, we measure both their privacy preservation properties as well as utility in training clinical NLP models. Experiments using neural language models yield notes whose utility is close to that of the real ones in some clinical NLP tasks, yet leave ample room for future improvements. Comment: Clinical NLP Workshop 201

arXiv.org e-Print Archive

Crossref

Building Semantic Knowledge Graphs from (Semi-)Structured Data: A Review

Author: Roman Dumitru
Soylu Ahmet
Vetle Ryen
Publication venue: MDPI AG
Publication date: 01/01/2022

Knowledge graphs have, for the past decade, been a hot topic both in public and private domains, typically used for large-scale integration and analysis of data using graph-based data models. One of the central concepts in this area is the Semantic Web, with the vision of providing a well-defined meaning to information and services on the Web through a set of standards. Particularly, linked data and ontologies have been quite essential for data sharing, discovery, integration, and reuse. In this paper, we provide a systematic literature review on knowledge graph creation from structured and semi-structured data sources using Semantic Web technologies. The review takes into account four prominent publication venues, namely, Extended Semantic Web Conference, International Semantic Web Conference, Journal of Web Semantics, and Semantic Web Journal. The review highlights the tools, methods, types of data sources, ontologies, and publication methods, together with the challenges, limitations, and lessons learned in the knowledge graph creation processes.

SINTEF Open

NORA - Norwegian Open Research Archives

Generation of open biomedical datasets through ontology-driven transformation and integration processes

Author: A Abello
A Tapuria
C Bizer
C Goble
C Lange
C Martínez-Costa
C Tao
C Tsinaraki
CA Knoblock
D Pérez-Rey
E Antezana
E Sirin
F Belleau
F Breitling
F Chen
FC Bernstein
G Būmans
JA Miñarro-Gimenez
JA Miñarro-Giménez
JF Sequeda
JJ Irwin
JJ Saleem
JT Fernández-Breis
K Degtyarenko
K Janowicz
M Ashburner
M Legaz-García
M Mesiti
M Pellegrini
M Remm
MY Galperin
N Juty
NF Noy
NH Shah
O Bodenreider
R Stevens
S Jupp
T Attwood
T Berners-Lee
T Schmitt
TR Gruber
VA McKusick
Publication venue: Springer Science and Business Media LLC

Crossref

Overview of ImageCLEF 2018: Challenges, Datasets and Evaluation

Author: Andrearczyk Vincent
Dang-Nguyen Duc-Tien
Dicente Cid Yashin
Eickhoff Carsten
Farri Oladimeji
Garcia Seco De Herrera Alba
Gurrin Cathal
Hasan Sadid A
Ionescu Bogdan
Kovalev Vassili
Liauchuk Vitali
Ling Yuan
Liu Joey
Lungren Matthew
Lux Mathias
Müller Henning
Piras Luca
Riegler Michael
Villegas Mauricio
Zhou Liting
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2018

This paper presents an overview of the ImageCLEF 2018 evaluation campaign, an event that was organized as part of the CLEF (Conference and Labs of the Evaluation Forum) Labs 2018. ImageCLEF is an ongoing initiative (it started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval with the aim of providing information access to collections of images in various usage scenarios and domains. In 2018, the 16th edition of ImageCLEF ran three main tasks and a pilot task: (1) a caption prediction task that aims at predicting the caption of a figure from the biomedical literature based only on the figure image; (2) a tuberculosis task that aims at detecting the tuberculosis type, severity and drug resistance from CT (Computed Tomography) volumes of the lung; (3) a LifeLog task (videos, images and other sources) about daily activities understanding and moment retrieval, and (4) a pilot task on visual question answering where systems are tasked with answering medical questions. The strong participation, with over 100 research groups registering and 31 submitting results for the tasks, shows an increasing interest in this benchmarking campaign.

University of Essex Research Repository

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Archivio istituzionale della ricerca - Università di Cagliari

DCU Online Research Access Service