51 research outputs found
Aspects of dealing with imperfect data in temporal databases
In reality, some objects or concepts have properties with a time-variant or time-related nature. Modelling these kinds of objects or concepts in a (relational) database schema is possible, but time-variant and time-related attributes have an impact on the consistency of the entire database. Therefore, temporal database models have been proposed to deal with this. Time itself can be at the source of imprecision, vagueness and uncertainty, since existing time measuring devices are inherently imperfect. Accordingly, human beings manage time using temporal indications and temporal notions, which may contain imprecision, vagueness and uncertainty. However, the imperfection in human-used temporal indications is supported by human interpretation, whereas information systems need extraordinary support for this. Several proposals for dealing with such imperfections when modelling temporal aspects exist. Some of these proposals consider the basis of the system to be the conversion of the specificity of temporal notions between used temporal expressions. Other proposals consider the temporal indications in the used temporal expressions to be the source of imperfection. In this chapter, an overview is given, concerning the basic concepts and issues related to the modelling of time as such or in (relational) database models and the imperfections that may arise during or as a result of this modelling. Next to this, a novel and currently researched technique for handling some of these imperfections is presented
Enabling Complex Semantic Queries to Bioinformatics Databases through Intuitive Search Over Data
Data integration promises to be one of the main catalysts in enabling new insights to be drawn from the wealth of biological data already available publicly. However, the heterogene- ity of the existing data sources still poses significant challenges for achieving interoperability among biological databases. Furthermore, merely solving the technical challenges of data in- tegration, for example through the use of common data representation formats, leaves open the larger problem. Namely, the steep learning curve required for understanding the data models of each public source, as well as the technical language through which the sources can be queried and joined. As a consequence, most of the available biological data remain practically unexplored today.
In this thesis, we address these problems jointly, by first introducing an ontology-based data integration solution in order to mitigate the data source heterogeneity problem. We illustrate through the concrete example of Bgee, a gene expression data source, how relational databases can be exposed as virtual Resource Description Framework (RDF) graphs, through relational-to-RDF mappings. This has the important advantage that the original data source can remain unmodified, while still becoming interoperable with external RDF sources.
We complement our methods with applied case studies designed to guide domain experts in formulating expressive federated queries targeting the integrated data across the domains of evolutionary relationships and gene expression. More precisely, we introduce two com- parative analyses, first within the same domain (using orthology data from multiple, inter- operable, data sources) and second across domains, in order to study the relation between expression change and evolution rate following a duplication event.
Finally, in order to bridge the semantic gap between users and data, we design and im- plement Bio-SODA, a question answering system over domain knowledge graphs, that does not require training data for translating user questions to SPARQL. Bio-SODA uses a novel ranking approach that combines syntactic and semantic similarity, while also incorporating node centrality metrics to rank candidate matches for a given user question. Our results in testing Bio-SODA across several real-world databases that span multiple domains (both within and outside bioinformatics) show that it can answer complex, multi-fact queries, be- yond the current state-of-the-art in the more well-studied open-domain question answering.
--
L’intégration des données promet d’être l’un des principaux catalyseurs permettant d’extraire des nouveaux aperçus de la richesse des données biologiques déjà disponibles publiquement. Cependant, l’hétérogénéité des sources de données existantes pose encore des défis importants pour parvenir à l’interopérabilité des bases de données biologiques. De plus, en surmontant seulement les défis techniques de l’intégration des données, par exemple grâce à l’utilisation de formats standard de représentation de données, on laisse ouvert un problème encore plus grand. À savoir, la courbe d’apprentissage abrupte nécessaire pour comprendre la modéli- sation des données choisie par chaque source publique, ainsi que le langage technique par lequel les sources peuvent être interrogés et jointes. Par conséquent, la plupart des données biologiques publiquement disponibles restent pratiquement inexplorés aujourd’hui.
Dans cette thèse, nous abordons l’ensemble des deux problèmes, en introduisant d’abord une solution d’intégration de données basée sur ontologies, afin d’atténuer le problème d’hété- rogénéité des sources de données. Nous montrons, à travers l’exemple de Bgee, une base de données d’expression de gènes, une approche permettant les bases de données relationnelles d’être publiés sous forme de graphes RDF (Resource Description Framework) virtuels, via des correspondances relationnel-vers-RDF (« relational-to-RDF mappings »). Cela présente l’important avantage que la source de données d’origine peut rester inchangé, tout en de- venant interopérable avec les sources RDF externes.
Nous complétons nos méthodes avec des études de cas appliquées, conçues pour guider les experts du domaine dans la formulation de requêtes fédérées expressives, ciblant les don- nées intégrées dans les domaines des relations évolutionnaires et de l’expression des gènes. Plus précisément, nous introduisons deux analyses comparatives, d’abord dans le même do- maine (en utilisant des données d’orthologie provenant de plusieurs sources de données in- teropérables) et ensuite à travers des domaines interconnectés, afin d’étudier la relation entre le changement d’expression et le taux d’évolution suite à une duplication de gène.
Enfin, afin de mitiger le décalage sémantique entre les utilisateurs et les données, nous concevons et implémentons Bio-SODA, un système de réponse aux questions sur des graphes de connaissances domaine-spécifique, qui ne nécessite pas de données de formation pour traduire les questions des utilisateurs vers SPARQL. Bio-SODA utilise une nouvelle ap- proche de classement qui combine la similarité syntactique et sémantique, tout en incorporant des métriques de centralité des nœuds, pour classer les possibles candidats en réponse à une question utilisateur donnée. Nos résultats suite aux tests effectués en utilisant Bio-SODA sur plusieurs bases de données à travers plusieurs domaines (tantôt liés à la bioinformatique qu’extérieurs) montrent que Bio-SODA réussit à répondre à des questions complexes, en- gendrant multiples entités, au-delà de l’état actuel de la technique en matière de systèmes de réponses aux questions sur les données structures, en particulier graphes de connaissances
Treatment of imprecision in data repositories with the aid of KNOLAP
Traditional data repositories introduced for the needs of business
processing, typically focus on the storage and querying of crisp
domains of data. As a result, current commercial data repositories
have no facilities for either storing or querying imprecise/
approximate data.
No significant attempt has been made for a generic and applicationindependent
representation of value imprecision mainly as a
property of axes of analysis and also as part of dynamic
environment, where potential users may wish to define their “own”
axes of analysis for querying either precise or imprecise facts. In
such cases, measured values and facts are characterised by
descriptive values drawn from a number of dimensions, whereas
values of a dimension are organised as hierarchical levels.
A solution named H-IFS is presented that allows the representation
of flexible hierarchies as part of the dimension structures. An
extended multidimensional model named IF-Cube is put forward,
which allows the representation of imprecision in facts and
dimensions and answering of queries based on imprecise
hierarchical preferences. Based on the H-IFS and IF-Cube
concepts, a post relational OLAP environment is delivered, the
implementation of which is DBMS independent and its performance
solely dependent on the underlying DBMS engine
Intelligent Information Systems for Web Product Search
Over the last few years, we have experienced an increase in online shopping. Consequently, there is a need for efficient and effective product search engines. The rapid growth of e-commerce, however, has also introduced some challenges. Studies show that users can get overwhelmed by the information and offerings presented online while searching for products. In an attempt to lighten this information overload burden on consumers, there are several product search engines that aggregate product descriptions and price information from the Web and allow the user to easily query this information. Most of these search engines expect to receive the data from the participating Web shops in a specific format, which means Web shops need to transform their data more than once, as each product search engine requires a different format. Because currently most product information aggregation services rely on Web shops to send them their data, there is a big opportunity for solutions that aim to tackle this problem using a more automated approach. This dissertation addresses key aspects of implementing such a system, including hierarchical product classification, entity resolution, ontology population and schema mapping, and lastly, the optimization of faceted user interfaces. The findings of this work show us how one can design Web product search engines that automatically aggregate product information while allowing users to perform effective and efficient queries
JURI SAYS:An Automatic Judgement Prediction System for the European Court of Human Rights
In this paper we present the web platform JURI SAYS that automatically predicts decisions of the European Court of Human Rights based on communicated cases, which are published by the court early in the proceedings and are often available many years before the final decision is made. Our system therefore predicts future judgements of the court. The platform is available at jurisays.com and shows the predictions compared to the actual decisions of the court. It is automatically updated every month by including the prediction for the new cases. Additionally, the system highlights the sentences and paragraphs that are most important for the prediction (i.e. violation vs. no violation of human rights)
The Role of preferences in logic programming: nonmonotonic reasoning, user preferences, decision under uncertainty
Intelligent systems that assist users in fulfilling complex tasks need a concise and processable representation of incomplete and
uncertain information. In order to be able to choose among different options, these systems also need a compact and processable
representation of the concept of preference.
Preferences can provide an effective way to choose the best solutions to a given problem. These solutions can represent the most
plausible states of the world when we model incomplete information, the most satisfactory states of the world when we express
user preferences, or optimal decisions when we make decisions under uncertainty.
Several domains, such as, reasoning under incomplete and uncertain information, user preference modeling, and qualitative
decision making under uncertainty, have benefited from advances on preference representation. In the literature, several symbolic
approaches of nonclassical reasoning have been proposed. Among them, logic programming under answer set semantics offers a
good compromise between symbolic representation and computation of knowledge and several extensions for handling
preferences.
Nevertheless, there are still some open issues to be considered in logic programming. In nonmonotonic reasoning, first, most
approaches assume that exceptions to logic program rules are already specified. However, sometimes, it is possible to consider
implicit preferences based on the specificity of the rules to handle incomplete information. Secondly, the joint handling of
exceptions and uncertainty has received little attention: when information is uncertain, the selection of default rules can be a matter
of explicit preferences and uncertainty. In user preference modeling, although existing logic programming specifications allow to
express user preferences which depend both on incomplete and contextual information, in some applications, some preferences in
some context may be more important than others. Furthermore, more complex preference expressions need to be supported. In
qualitative decision making under uncertainty, existing logic programming-based methodologies for making decisions seem to lack
a satisfactory handling of preferences and uncertainty.
The aim of this dissertation is twofold: 1) to tackle the role played by preferences in logic programming from different perspectives,
and 2) to contribute to this novel field by proposing several frameworks and methods able to address the above issues. To this
end, we will first show how preferences can be used to select default rules in logic programs in an implicit and explicit way. In
particular, we propose (i) a method for selecting logic program rules based on specificity, and (ii) a framework for selecting
uncertain default rules based on explicit preferences and the certainty of the rules. Then, we will see how user preferences can be
modeled and processed in terms of a logic program (iii) in order to manage user profiles in a context-aware system and (iv) in order
to propose a framework for the specification of nested (non-flat) preference expressions. Finally, in the attempt to bridge the gap
between logic programming and qualitative decision under uncertainty, (v) we propose a classical- and a possibilistic-based logic
programming methodology to compute an optimal decision when uncertainty and preferences are matters of degrees.Els sistemes intel.ligents que assisteixen a usuaris en la realització de tasques complexes necessiten
una representació concisa i formal de la informació que permeti un raonament nomonòton
en condicions d’incertesa. Per a poder escollir entre les diferents opcions, aquests
sistemes solen necessitar una representació del concepte de preferència.
Les preferències poden proporcionar una manera efectiva de triar entre les millors solucions
a un problema. Aquestes solucions poden representar els estats del món més plausibles
quan es tracta de modelar informació incompleta, els estats del món més satisfactori
quan expressem preferències de l’usuari, o decisions òptimes quan estem parlant de presa
de decisió incorporant incertesa.
L’ús de les preferències ha beneficiat diferents dominis, com, el raonament en presència
d’informació incompleta i incerta, el modelat de preferències d’usuari, i la presa de decisió
sota incertesa. En la literatura, s’hi troben diferents aproximacions al raonament no clàssic
basades en una representació simbòlica de la informació. Entre elles, l’enfocament de programació
lògica, utilitzant la semàntica de answer set, ofereix una bona aproximació entre
representació i processament simbòlic del coneixement, i diferents extensions per gestionar
les preferències.
No obstant això, en programació lògica es poden identificar diferents problemes pel
que fa a la gestió de les preferències. Per exemple, en la majoria d’enfocaments de raonament
no-monòton s’assumeix que les excepcions a default rules d’un programa lògic ja
estan expressades. Però de vegades es poden considerar preferències implícites basades en
l’especificitat de les regles per gestionar la informació incompleta. A més, quan la informació
és també incerta, la selecció de default rules pot dependre de preferències explícites i de la
incertesa. En el modelatge de preferències del usuari, encara que els formalismes existents
basats en programació lògica permetin expressar preferències que depenen d’informació
contextual i incompleta, en algunes aplicacions, donat un context, algunes preferències
poden ser més importants que unes altres. Per tant, resulta d’interès un llenguatge que
permeti capturar preferències més complexes. En la presa de decisions sota incertesa, les
metodologies basades en programació lògica creades fins ara no ofereixen una solució del
tot satisfactòria pel que fa a la gestió de les preferències i la incertesa.
L’objectiu d’aquesta tesi és doble: 1) estudiar el paper de les preferències en la programació
lògica des de diferents perspectives, i 2) contribuir a aquesta jove àrea d’investigació
proposant diferents marcs teòrics i mètodes per abordar els problemes anteriorment citats.
Per a aquest propòsit veurem com les preferències es poden utilitzar de manera implícita i
explícita per a la selecció de default rules proposant: (i) un mètode basat en l’especificitat
de les regles, que permeti seleccionar regles en un programa lògic; (ii) un marc teòric per a
la selecció de default rules incertes basat en preferències explícites i la incertesa de les regles.
També veurem com les preferències de l’usuari poden ser modelades i processades usant
un enfocament de programació lògica (iii) que suporti la creació d’un mecanisme de gestió
dels perfils dels usuaris en un sistema amb reconeixement del context; (iv) que permeti
proposar un marc teòric capaç d’expressar preferències amb fòrmules imbricades. Per últim,
amb l’objectiu de disminuir la distància entre programació lògica i la presa de decisió
amb incertesa proposem (v) una metodologia basada en programació lògica clàssica i en
una extensió de la programació lògica que incorpora lògica possibilística per modelar un
problema de presa de decisions i per inferir una decisió òptima.Los sistemas inteligentes que asisten a usuarios en tareas complejas necesitan una representación
concisa y procesable de la información que permita un razonamiento nomonótono
e incierto. Para poder escoger entre las diferentes opciones, estos sistemas suelen
necesitar una representación del concepto de preferencia.
Las preferencias pueden proporcionar una manera efectiva para elegir entre las mejores
soluciones a un problema. Dichas soluciones pueden representar los estados del mundo
más plausibles cuando hablamos de representación de información incompleta, los estados
del mundo más satisfactorios cuando hablamos de preferencias del usuario, o decisiones
óptimas cuando estamos hablando de toma de decisión con incertidumbre.
El uso de las preferencias ha beneficiado diferentes dominios, como, razonamiento en
presencia de información incompleta e incierta, modelado de preferencias de usuario, y
toma de decisión con incertidumbre. En la literatura, distintos enfoques simbólicos de razonamiento
no clásico han sido creados. Entre ellos, la programación lógica con la semántica
de answer set ofrece un buen acercamiento entre representación y procesamiento simbólico
del conocimiento, y diferentes extensiones para manejar las preferencias.
Sin embargo, en programación lógica se pueden identificar diferentes problemas con
respecto al manejo de las preferencias. Por ejemplo, en la mayoría de enfoques de razonamiento
no-monótono se asume que las excepciones a default rules de un programa lógico
ya están expresadas. Pero, a veces se pueden considerar preferencias implícitas basadas en
la especificidad de las reglas para manejar la información incompleta. Además, cuando la
información es también incierta, la selección de default rules pueden depender de preferencias
explícitas y de la incertidumbre. En el modelado de preferencias, aunque los formalismos
existentes basados en programación lógica permitan expresar preferencias que
dependen de información contextual e incompleta, in algunas aplicaciones, algunas preferencias
en un contexto puede ser más importantes que otras. Por lo tanto, un lenguaje
que permita capturar preferencias más complejas es deseable. En la toma de decisiones con
incertidumbre, las metodologías basadas en programación lógica creadas hasta ahora no
ofrecen una solución del todo satisfactoria al manejo de las preferencias y la incertidumbre.
El objectivo de esta tesis es doble: 1) estudiar el rol de las preferencias en programación
lógica desde diferentes perspectivas, y 2) contribuir a esta joven área de investigación proponiendo
diferentes marcos teóricos y métodos para abordar los problemas anteriormente
citados. Para este propósito veremos como las preferencias pueden ser usadas de manera implícita y explícita para la selección de default rules proponiendo: (i) un método para
seleccionar reglas en un programa basado en la especificad de las reglas; (ii) un marco
teórico para la selección de default rules basado en preferencias explícitas y incertidumbre.
También veremos como las preferencias del usuario pueden ser modeladas y procesadas
usando un enfoque de programación lógica (iii) para crear un mecanismo de manejo de
los perfiles de los usuarios en un sistema con reconocimiento del contexto; (iv) para crear
un marco teórico capaz de expresar preferencias con formulas anidadas. Por último, con
el objetivo de disminuir la distancia entre programación lógica y la toma de decisión con
incertidumbre proponemos (v) una metodología para modelar un problema de toma de
decisiones y para inferir una decisión óptima usando un enfoque de programación lógica
clásica y uno de programación lógica extendida con lógica posibilística.Sistemi intelligenti, destinati a fornire supporto agli utenti in processi decisionali complessi,
richiedono una rappresentazione dell’informazione concisa, formale e che permetta
di ragionare in maniera non monotona e incerta. Per poter scegliere tra le diverse opzioni,
tali sistemi hanno bisogno di disporre di una rappresentazione del concetto di preferenza
altrettanto concisa e formale.
Le preferenze offrono una maniera efficace per scegliere le miglior soluzioni di un problema.
Tali soluzioni possono rappresentare gli stati del mondo più credibili quando si tratta
di ragionamento non monotono, gli stati del mondo più soddisfacenti quando si tratta delle
preferenze degli utenti, o le decisioni migliori quando prendiamo una decisione in condizioni
di incertezza.
Diversi domini come ad esempio il ragionamento non monotono e incerto, la strutturazione
del profilo utente, e i modelli di decisione in condizioni d’incertezza hanno tratto
beneficio dalla rappresentazione delle preferenze. Nella bibliografia disponibile si possono
incontrare diversi approcci simbolici al ragionamento non classico. Tra questi, la programmazione
logica con answer set semantics offre un buon compromesso tra rappresentazione
simbolica e processamento dell’informazione, e diversi estensioni per la gestione delle preferenze
sono state proposti in tal senso.
Nonostante ció, nella programmazione logica esistono ancora delle problematiche aperte.
Prima di tutto, nella maggior parte degli approcci al ragionamento non monotono, si suppone
che nel programma le eccezioni alle regole siano già specificate. Tuttavia, a volte per
trattare l’informazione incompleta è possibile prendere in considerazione preferenze implicite
basate sulla specificità delle regole. In secondo luogo, la gestione congiunta di eccezioni
e incertezza ha avuto scarsa attenzione: quando l’informazione è incerta, la scelta
di default rule può essere una questione di preferenze esplicite e d’incertezza allo stesso
tempo. Nella creazione di preferenze dell’utente, anche se le specifiche di programmazione
logica esistenti permettono di esprimere preferenze che dipendono sia da un’informazione
incompleta che da una contestuale, in alcune applicazioni talune preferenze possono essere
più importanti di altre, o espressioni più complesse devono essere supportate. In un processo
decisionale con incertezza, le metodologie basate sulla programmazione logica viste
sinora, non offrono una gestione soddisfacente delle preferenze e dell’incertezza.
Lo scopo di questa dissertazione è doppio: 1) chiarire il ruolo che le preferenze giocano
nella programmazione logica da diverse prospettive e 2) contribuire proponendo in questo nuovo settore di ricerca, diversi framework e metodi in grado di affrontare le citate
problematiche. Per prima cosa, dimostreremo come le preferenze possono essere usate per
selezionare default rule in un programma in maniera implicita ed esplicita. In particolare
proporremo: (i) un metodo per la selezione delle regole di un programma logico basato
sulla specificità dell’informazione; (ii) un framework per la selezione di default rule basato
sulle preferenze esplicite e sull’incertezza associata alle regole del programma. Poi, vedremo
come le preferenze degli utenti possono essere modellate attraverso un programma
logico, (iii) per creare il profilo dell’utente in un sistema context-aware, e (iv) per proporre
un framework che supporti la definizione di preferenze complesse. Infine, per colmare le
lacune in programmazione logica applicata a un processo di decisione con incertezza (v)
proporremo una metodologia basata sulla programmazione logica classica e una metodologia
basata su un’estensione della programmazione logica con logica possibilistica
Enhanced Living Environments
This open access book was prepared as a Final Publication of the COST Action IC1303 “Algorithms, Architectures and Platforms for Enhanced Living Environments (AAPELE)”. The concept of Enhanced Living Environments (ELE) refers to the area of Ambient Assisted Living (AAL) that is more related with Information and Communication Technologies (ICT). Effective ELE solutions require appropriate ICT algorithms, architectures, platforms, and systems, having in view the advance of science and technology in this area and the development of new and innovative solutions that can provide improvements in the quality of life for people in their homes and can reduce the financial burden on the budgets of the healthcare providers. The aim of this book is to become a state-of-the-art reference, discussing progress made, as well as prompting future directions on theories, practices, standards, and strategies related to the ELE area. The book contains 12 chapters and can serve as a valuable reference for undergraduate students, post-graduate students, educators, faculty members, researchers, engineers, medical doctors, healthcare organizations, insurance companies, and research strategists working in this area
- …