Search CORE

97 research outputs found

Automatic case acquisition from texts for process-oriented case-based reasoning

Author: Ber Florence Le
Dufour-Lussier Valmi
Lieber Jean
Nauer Emmanuel
Publication venue: 'Elsevier BV'
Publication date: 20/12/2012
Field of study

This paper introduces a method for the automatic acquisition of a rich case representation from free text for process-oriented case-based reasoning. Case engineering is among the most complicated and costly tasks in implementing a case-based reasoning system. This is especially so for process-oriented case-based reasoning, where more expressive case representations are generally used and, in our opinion, actually required for satisfactory case adaptation. In this context, the ability to acquire cases automatically from procedural texts is a major step forward in order to reason on processes. We therefore detail a methodology that makes case acquisition from processes described as free text possible, with special attention given to assembly instruction texts. This methodology extends the techniques we used to extract actions from cooking recipes. We argue that techniques taken from natural language processing are required for this task, and that they give satisfactory results. An evaluation based on our implemented prototype extracting workflows from recipe texts is provided.Comment: Sous presse, publication pr\'evue en 201

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

De l'importance de la normalisation en bibliométrie

Author: Nauer Emmanuel
Publication venue: HAL CCSD
Publication date: 01/01/1999
Field of study

Colloque avec actes et comité de lectureSi la consultation de multiples sources d'information garantit une meilleure couverture du sujet, elle nécessite -en parallèle- de prendre en compte l'hétérogénéité des données. En effet, comme chaque producteur d'information possède son propre modèle de document, on obtient des représentations différentes d'une même information. Cette hétérogénéité occasionne deux problèmes majeurs pour la bibliométrie qui faussent statistiques et analyses. Ces difficultés concernent : • la présence de variations pour représenter une même information (auteurs, descripteurs, affiliation, etc.) ; • la présence de doublons (c'est-à-dire plusieurs représentations de la même référence bibliographique). Dans cet article, nous proposons une approche visant à pallier ces problèmes. Nous discutons tout d'abord les choix retenus en terme de normalisation et de dédoublonnage

INRIA a CCSD electronic archive server

IntoWeb : une plate forme hypertexte d'extraction de connaissances et de recherche d'information

Author: Nauer Emmanuel
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

National audienceDans cet article, nous présentons un système hypertexte, nommé IntoWeb, qui fournit aux chercheurs ou spécialistes de l'information scientifique les moyens d'exploiter les données structurées sur leur domaine et des données - textuelles - du web pour des besoins de recherche d'information, d'analyse de leur domaine ou de veille. IntoWeb est un système générique d'exploitation de données qui implémente un processus complet et itératif d'extraction de connaissances à partir de données. Le système permet de manipuler différents types d'objets (documents structurés, documents textuels, vecteurs, classifications, etc.). Des opérateurs (génération d'un vecteur à partir d'un document textuel, classification de documents structurés, etc.) permettent d'exploiter chacun des différents types d'objets à des fins d'analyses ou de recherche d'information. L'application d'un opérateur sur un ensemble d'objets produit de nouveaux objets, à leur tour exploitable dans le système. La résolution complète d'un problème d'extraction de connaissances ou de recherche d'information prend la forme d'une succession d'opérations appliquées à des objets. Le choix des objets à exploiter et des opérations à appliquer à ces objets est à la charge de l'utilisateur et dépend du problème à résoudre ; l'enchaînement des opérations est grandement facilité par IntoWeb grâce à la mise en place d'une interface web simple à utiliser

INRIA a CCSD electronic archive server

Les problèmes de variations terminologiques dans l'indexation de références bibliographiques

Author: Nauer Emmanuel
Publication venue: HAL CCSD
Publication date: 01/01/1999
Field of study

Colloque avec actes et comité de lectureUne activité de la terminologie est de spécifier le vocabulaire d'un domaine, en associant aux notions du domaine, les unités terminologiques les plus adéquates. On pourrait croire que le vocabulaire associé à un domaine est un ensemble limité de termes. Or, ce n'est pas le cas, car chaque acteur du domaine utilise son propre vocabulaire. C'est notamment le cas des fournisseurs d'informations -les producteurs de bases de données bibliographiques notamment- qui mettent chacun en place un système d'indexation des références propre. Les variations concernent tant par l'étendue sémantique couverte par les concepts pris en compte, que le vocabulaire associé. Nous montrons à partir d'exemples concrets que les variations terminologiques des concepts d'un domaine sont courantes dans une même langue, et qui plus est à travers différentes indexations

INRIA a CCSD electronic archive server

Semi-automatic annotation process for procedural texts: An application on cooking recipes

Author: Ber Florence Le
Dufour-Lussier Valmi
Lieber Jean
Meilender Thomas
Nauer Emmanuel
Publication venue
Publication date: 27/08/2012
Field of study

Taaable is a case-based reasoning system that adapts cooking recipes to user constraints. Within it, the preparation part of recipes is formalised as a graph. This graph is a semantic representation of the sequence of instructions composing the cooking process and is used to compute the procedure adaptation, conjointly with the textual adaptation. It is composed of cooking actions and ingredients, among others, represented as vertices, and semantic relations between those, shown as arcs, and is built automatically thanks to natural language processing. The results of the automatic annotation process is often a disconnected graph, representing an incomplete annotation, or may contain errors. Therefore, a validating and correcting step is required. In this paper, we present an existing graphic tool named \kcatos, conceived for representing and editing decision trees, and show how it has been adapted and integrated in WikiTaaable, the semantic wiki in which the knowledge used by Taaable is stored. This interface provides the wiki users with a way to correct the case representation of the cooking process, improving at the same time the quality of the knowledge about cooking procedures stored in WikiTaaable

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL-INSU

Adaptation knowledge discovery using positive and negative cases

Author: Lieber Jean
Nauer Emmanuel
Publication venue: HAL CCSD
Publication date: 13/09/2021
Field of study

International audienceCase-based reasoning usually exploits positive source cases, each of them consisting in a problem and a correct solution to this problem. Now, the general issue of exploiting also negative cases-i.e., problem-solution pairs where the solution answers incorrectly the problem-can be raised. Indeed, such cases are "naturally" generated by a CBR system as long as it sometimes proposes incorrect solutions. This paper aims at addressing this issue for adaptation knowledge (AK) discovery: how positive and negative cases can be used for this purpose. The idea is that positive cases are used to propose adaptation rules and that negative cases are used to filter out some of these rules. In a preliminary work, this kind of AK discovery has been applied using frequent closed itemset (FCI) extraction on variations within the case base and tested on a toy Boolean use case, with promising first results. This paper resumes this study and evaluates it on 4 benchmarks, which confirms the benefit of exploiting negative cases for AK discovery. This involves some adjustments in the data preparation and in adaptation rule filtering, in particular because FCI extraction works only with Boolean features, hence some methodology lessons learned for AK discovery with positive and negative cases

INRIA a CCSD electronic archive server

Collaborative Knowledge Acquisition under Control of a Non-Regression Test System

Author: Canals Gérôme
Cordier Amélie
Desmontils Emmanuel
Infante-Blanco Laura
Nauer Emmanuel
Publication venue: HAL CCSD
Publication date: 27/05/2013
Field of study

14 pagesInternational audienceThis paper introduces BeGoood, a generic system for man- aging non-regression tests on knowledge-bases. BeGoood is a system al- lowing to define test plans in order to monitor the evolution of knowledge- bases. Any system answering queries by providing results in the form of set of strings can be tested with BeGoood. BeGoood has been devel- oped following a REST architecture and is independent of any applica- tion domain. This paper describes the architecture of the system and gives a use case to illustrate how BeGoood is able to manage a collab- orative knowledge evolution in the framework of a case-based reasoning system

Classification dynamique par treillis de concepts pour la recherche d'information sur le web.

Author: Nauer Emmanuel
Toussaint Yannick
Publication venue: HAL CCSD
Publication date: 01/03/2008
Field of study

National audienceL'analyse de concepts formels (ACF) permet d'organiser des objets en fonction de leurs propriétés. Des travaux récents ont utilisé l'ACF pour réorganiser, sous la forme d'un treillis de concepts, les réponses fournies par un moteur de recherche du web. L'utilisateur navigue dans le treillis pour explorer un résultat structuré et synthétique. Or, un tel treillis contient des concepts qui sont pertinents par rapport à une tâche de recherche d'information donnée et d'autres qui ne le sont pas. Pour que l'utilisateur puisse se focaliser sur ce qui l'intéresse et éliminer ce qui ne l'intéresse pas, nous proposons un système interactif dans lequel il va exprimer son intérêt (positif ou négatif) pour certains concepts du treillis. Ce contrôle de pertinence est exploité dans la classification pour faire évoluer le treillis et ainsi mieux l'adapter au besoin de l'utilisateur

INRIA a CCSD electronic archive server

Quand l'adaptation des cas par révision des croyances et l'extrapolation analogique se rencontrent

Author: Lieber Jean
Nauer Emmanuel
Prade Henri
Publication venue: HAL CCSD
Publication date: 28/06/2021
Field of study

National audienceCase-based reasoning, where cases are described in terms of problem-solution pairs case =

(x,y)

, amounts to propose a solution to a new problem on the basis of past experience made of stored cases. On the one hand, the building of the solution to a new problem may be viewed as a form of belief revision of the solution of a retrieved case (whose problem part is similar to the new problem) constrained by domain knowledge. On the other hand, an extrapolation mechanism based on analogical proportions has been proposed. It exploits triplets of cases (

case^a

case^b

case^c

) whose descriptions of problem parts

x^a

x^b

x^c

form an analogical proportion with the new problem xtgt, in such a way that “xa is to xb as xc is to

x^{tgt}

”. Then, the analogical inference amounts to compute a solution

y^{tgt}

of xtgt by solving (when possible) an equation expressing that “

y^a

is to

y^b

y^c

is to

y^{tgt}

” (where

y^a

y^b

and

y^c

are respectively the solution parts of

case^a

case^b

case^c

). The paper investigates how the belief revision view and analogical extrapolation relate. Besides that it constitutes an unexpected bridge between areas which ignore each other, it casts some light on the adaptation mechanism in case-based reasoning.The paper is illustrated by a running exampleLe raisonnement à partir de cas vise à résoudre des problèmes en s'appuyant sur une base de cas, où un cas est un couple problème-solution (x, y). D'un côté, l'élaboration d'une solution au problème à résoudre (ou problème cible) peut être vue comme une forme de révision des croyances de la solution du cas remémoré (dont la partie problème est similaire au problème cible), contrainte par les connaissances du domaine. D'un autre côté, un mécanisme d'extrapolation fondé sur les proportions analogiques a été proposé récemment. Une proportion analogique est une relation entre 4 termes a, b, c, d qui se lit « a est à b ce que c est à d ». L'extrapolation analogique exploite des triplets de cas dont les parties problèmes forment une proportion analogique avec le problème cible. Puis, l'inférence consiste à résoudre une équation analogique entre les trois solutions des cas du triplet et l'inconnue de cette équation. Cet article étudie une relation entre ces deux approches de l'adaptation. En plus de constituer une passerelle inattendue entre ces deux domaines, il propose un nouvel éclairage sur le mécanisme d'adaptation en raisonnement à partir de cas. L'article est illustré à l'aide d'un exemple suivi

INRIA a CCSD electronic archive server

Man-Machine Collaboration to Acquire Cooking Adaptation Knowledge for the TAAABLE Case-Based Reasoning System

Author: Cordier Amélie
Gaillard Emmanuelle
Nauer Emmanuel
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 17/04/2012
Field of study

International audienceThis paper shows how humans and machines can better collaborate to acquire adaptation knowledge (AK) in the framework of a case-based reasoning (CBR) system whose knowledge is encoded in a semantic wiki. Automatic processes like the CBR reasoning process itself, or speci c tools for acquiring AK are integrated as wiki extensions. These tools and processes are combined on purpose to collect AK. Users are at the center of our approach, as they are in a classical wiki, but they will now bene t from automatic tools for helping them to feed the wiki. In particular, the CBR system, which is currently only a consumer for the knowledge encoded in the semantic wiki, will also be used for producing knowledge for the wiki. A use case in the domain of cooking is given to exemplify the man-machine collaboration

INRIA a CCSD electronic archive server