24 research outputs found

    Ontology of core data mining entities

    In this article, we present OntoDM-core, an ontology of core data mining entities. OntoDM-core defines the most essential data mining entities in a three-layered ontological structure comprising a specification, an implementation and an application layer. It provides a representational framework for the description of mining structured data, and in addition provides taxonomies of datasets, data mining tasks, generalizations, data mining algorithms and constraints, based on the type of data. OntoDM-core is designed to support a wide range of applications/use cases, such as semantic annotation of data mining algorithms, datasets and results; annotation of QSAR studies in the context of drug discovery investigations; and disambiguation of terms in text mining. The ontology has been thoroughly assessed following the practices in ontology engineering, is fully interoperable with many domain resources and is easy to extend.

    Ontologies to Enable Interoperability of Multi-Agent Electricity Markets Simulation and Decision Support

    This paper presents the AiD-EM Ontology, which provides a semantic representation of the concepts required to enable interoperability between multi-agent-based decision support systems, namely AiD-EM, and the market agents that participate in electricity market simulations. Electricity markets’ constant changes, brought about by the increasing need to adequately integrate renewable energy sources, make them complex and dynamic environments with very particular characteristics. Several modeling tools directed at the study of, and decision support in, restructured wholesale electricity markets have emerged. However, a common limitation is identified: the lack of interoperability between the various systems. This gap makes it impossible to exchange information and knowledge between them, test different market models, enable players from heterogeneous systems to interact in common market environments, and take full advantage of decision support tools. To overcome this gap, the AiD-EM Ontology includes the concepts related to the AiD-EM multi-agent decision support system needed to enable interoperability, easier cooperation and adequate communication between AiD-EM and simulated market agents wishing to take advantage of this decision support tool. This work has received funding from the EU Horizon 2020 research and innovation program under project TradeRES (grant agreement No 864276), from FEDER Funds through the COMPETE program, and from National Funds through FCT under projects CEECIND/01811/2017 and UID/EEA/00760/2019. Gabriel Santos was supported by PhD grant SFRH/BD/118487/2016 from National Funds through FCT.

    Ontologies for Predictive Maintenance with Time-Sensitive Data

    Manufacturing companies must ensure a continuous production process to remain competitive and supply manufactured goods on time and with the quality customers expect. Any disruption in the manufacturing chain may have serious consequences, reducing output and interrupting the supply chain. Manufacturing processes are composed of chains of industrial machines operating in stages: each machine has a specific task to complete, and the result of each stage is forwarded to the next. An unpredicted malfunction of one machine tends to interrupt the whole production chain. Scheduled preventive maintenance aims to avoid the causes of faults, but relies on parameters such as Mean Time Before Failure (MTBF), which represents the average expected life span of individual components based on statistical data. A maintenance task may require a period of downtime and, consequently, a production halt. Because maintenance is scheduled and executed routinely, components are replaced according to the scheduling cycle rather than according to the effective need for their replacement. This is where predictive maintenance is applicable. By collecting sensor data from industrial equipment, anomalies can be detected through reasoning and inference processes applied to the data, leading to early fault detection and time-to-failure prediction. This enables maintenance timing optimization, avoidance of unexpected failures, cost savings and improved productivity compared to preventive maintenance.
    Data supplied by sensors is time-sensitive: variations and fluctuations occur over time and must be analysed with respect to the period in which they occur. This dissertation aims to develop an ontology for predictive maintenance that describes its scope and field of application. The applicability of the ontology will be demonstrated with a tool, also to be developed, that transforms time-sensitive data collected in real time from sensors of industrial machines, provided via web services, into individuals of that ontology, representing the temporal factor of the data.

    OpenTox predictive toxicology framework: toxicological ontology and semantic media wiki-based OpenToxipedia

    Background: The OpenTox Framework, developed by the partners in the OpenTox project (http://www.opentox.org), aims at providing unified access to toxicity data, predictive models and validation procedures. Interoperability of resources is achieved using a common information model, based on the OpenTox ontologies, describing predictive algorithms, models and toxicity data. As toxicological data may come from different, heterogeneous sources, a deployed ontology, unifying the terminology and the resources, is critical for the rational and reliable organization of the data and its automatic processing.
    Results: The following related ontologies have been developed for OpenTox: a) Toxicological ontology, listing the toxicological endpoints; b) Organs system and Effects ontology, addressing organs, targets/examinations and effects observed in in vivo studies; c) ToxML ontology, representing a semi-automatic conversion of the ToxML schema; d) OpenTox ontology, representing the OpenTox framework components: chemical compounds, datasets, types of algorithms, models and validation web services; e) ToxLink, a ToxCast assays ontology; and f) OpenToxipedia, a community knowledge resource on toxicology terminology.
    OpenTox components are made available through standardized REST web services, where every compound, dataset and predictive method has a unique resolvable address (URI), used to retrieve its Resource Description Framework (RDF) representation or to initiate the associated calculations and generate new RDF-based resources. The services support the integration of toxicity and chemical data from various sources, the generation and validation of computer models for toxic effects, and seamless integration of new algorithms and scientifically sound validation routines, and they provide a flexible framework that allows building an arbitrary number of applications tailored to solving different problems by end users (e.g. toxicologists).
    Availability: The OpenTox toxicological ontology projects may be accessed via the OpenTox ontology development page http://www.opentox.org/dev/ontology; the OpenTox ontology is available as OWL at http://opentox.org/api/1.1/opentox.owl, and the ToxML-OWL conversion utility is an open source resource available at http://ambit.svn.sourceforge.net/viewvc/ambit/branches/toxml-utils/.

    The Ontology of Biological and Clinical Statistics (OBCS) for standardized and reproducible statistical analysis

    Statistics play a critical role in biological and clinical research. However, most reports of scientific results in the published literature make it difficult for the reader to reproduce the statistical analyses performed in achieving those results, because they provide inadequate documentation of the statistical tests and algorithms applied. The Ontology of Biological and Clinical Statistics (OBCS) is put forward here as a step towards solving this problem. Terms in OBCS, including ‘data collection’, ‘data transformation in statistics’, ‘data visualization’, ‘statistical data analysis’, and ‘drawing a conclusion based on data’, cover the major types of statistical processes used in basic biological research and clinical outcome studies. OBCS is aligned with the Basic Formal Ontology (BFO) and extends the Ontology of Biomedical Investigations (OBI), an OBO (Open Biological and Biomedical Ontologies) Foundry ontology supported by over 20 research communities. We discuss two examples illustrating how the ontology is being applied. In the first (biological) use case, we describe how OBCS was applied to represent the high-throughput microarray data analysis of immunological transcriptional profiles in human subjects vaccinated with an influenza vaccine. In the second (clinical outcomes) use case, we applied OBCS to represent the processing of electronic health care data to determine the associations between hospital staffing levels and patient mortality. Our case studies were designed to show how OBCS can be used for the consistent representation of statistical analysis pipelines under two different research paradigms. By representing statistics-related terms and their relations in a rigorous fashion, OBCS facilitates standard data analysis and integration, and supports reproducible biological and clinical research.

    The Data Mining OPtimization Ontology

    The Data Mining OPtimization Ontology (DMOP) has been developed to support informed decision-making at various choice points of the data mining process. The ontology can be used by data miners and deployed in ontology-driven information systems. The primary purpose for which DMOP has been developed is the automation of algorithm and model selection through semantic meta-mining: an ontology-based meta-analysis of complete data mining processes aimed at extracting patterns associated with mining performance. To this end, DMOP contains detailed descriptions of data mining tasks (e.g., learning, feature selection), data, algorithms, hypotheses such as mined models or patterns, and workflows. A development methodology was used for DMOP, including items such as competency questions and foundational ontology reuse. Several non-trivial modeling problems were encountered, and due to the complexity of the data mining details, the ontology requires the use of the OWL 2 DL profile. DMOP was successfully evaluated for semantic meta-mining and used in constructing the Intelligent Discovery Assistant, deployed in the popular data mining environment RapidMiner.

    Exploiting semantic web knowledge graphs in data mining

    Data Mining and Knowledge Discovery in Databases (KDD) is a research field concerned with deriving higher-level insights from data. The tasks performed in this field are knowledge-intensive and can often benefit from additional knowledge from various sources. Therefore, many approaches have been proposed that combine Semantic Web data with the data mining and knowledge discovery process. Semantic Web knowledge graphs are a backbone of many information systems that require access to structured knowledge. Such knowledge graphs contain factual knowledge about real-world entities and the relations between them, which can be utilized in various natural language processing, information retrieval, and data mining applications. Following the principles of the Semantic Web, Semantic Web knowledge graphs are publicly available as Linked Open Data: an open, interlinked collection of datasets in machine-interpretable form, covering most real-world domains. In this thesis, we investigate the hypothesis that Semantic Web knowledge graphs can be exploited as background knowledge in different steps of the knowledge discovery process and in different data mining tasks. More precisely, we aim to show that Semantic Web knowledge graphs can be utilized to generate valuable data mining features for use in various data mining tasks. Identifying, collecting and integrating useful background knowledge for a given data mining application can be a tedious and time-consuming task. Furthermore, most data mining tools require features in propositional form, i.e., binary, nominal or numerical features associated with an instance, while Linked Open Data sources are usually graphs by nature. Therefore, in Part I, we evaluate unsupervised feature generation strategies from types and relations in knowledge graphs, which are used in different data mining tasks, i.e., classification, regression, and outlier detection.
    As the number of generated features grows rapidly with the number of instances in the dataset, we provide a strategy for feature selection in hierarchical feature space, in order to select only the most informative and most representative features for a given dataset. Furthermore, we provide an end-to-end tool for mining the Web of Linked Data, which provides functionalities for each step of the knowledge discovery process, i.e., linking local data to a Semantic Web knowledge graph, integrating features from multiple knowledge graphs, feature generation and selection, and building machine learning models. However, we show that such feature generation strategies often lead to high-dimensional feature vectors even after dimensionality reduction, and that the reusability of such feature vectors across different datasets is limited. In Part II, we propose an approach that circumvents these shortcomings. More precisely, we develop an approach that embeds complete Semantic Web knowledge graphs in a low-dimensional feature space, where each entity and relation in the knowledge graph is represented as a numerical vector. Projecting such latent representations of entities into a lower-dimensional feature space shows that semantically similar entities appear closer to each other. We use several Semantic Web knowledge graphs to show that such latent representations of entities have high relevance for different data mining tasks. Furthermore, we show that such features can easily be reused for different datasets and different tasks. In Part III, we describe a list of applications that exploit Semantic Web knowledge graphs beyond the standard data mining tasks, like classification and regression. We show that the approaches developed in Part I and Part II can be used in applications in various domains.
    More precisely, we show that Semantic Web knowledge graphs can be exploited for analyzing statistics, building recommender systems, entity and document modeling, and taxonomy induction. Finally, we focus on semantic annotations in HTML pages, which are another realization of the Semantic Web vision. Semantic annotations are integrated into the code of HTML pages using markup languages, like Microformats, RDFa, and Microdata. While such data covers various domains and topics, and can be useful for developing various data mining applications, additional steps of cleaning and integrating the data need to be performed. In this thesis, we describe a set of approaches for processing long literals and images extracted from semantic annotations in HTML pages. We showcase the approaches in the e-commerce domain. Such approaches contribute to building and consuming Semantic Web knowledge graphs.
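A toy version of the Part I propositionalization idea: each (predicate, object) pair observed in a small invented knowledge graph becomes one binary feature per entity, yielding the propositional form most data mining tools expect. The graph and feature names below are fabricated for illustration only.

```python
# A miniature knowledge graph as (subject, predicate, object) triples
kg = [
    ("Berlin", "rdf:type", "City"),
    ("Berlin", "capitalOf", "Germany"),
    ("Munich", "rdf:type", "City"),
    ("Germany", "rdf:type", "Country"),
]

def propositionalize(entities, triples):
    """One binary feature per (predicate, object) pair seen anywhere in the graph."""
    features = sorted({(p, o) for _, p, o in triples})
    vectors = {}
    for e in entities:
        facts = {(p, o) for s, p, o in triples if s == e}
        vectors[e] = [1 if f in facts else 0 for f in features]
    return features, vectors

features, vectors = propositionalize(["Berlin", "Munich"], kg)
# features: [('capitalOf', 'Germany'), ('rdf:type', 'City'), ('rdf:type', 'Country')]
print(vectors["Berlin"])  # [1, 1, 0]
print(vectors["Munich"])  # [0, 1, 0]
```

The feature space grows with the number of distinct (predicate, object) pairs, which is exactly the dimensionality problem that motivates the hierarchical feature selection and, later, the graph embeddings of Part II.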