Search CORE

10 research outputs found

OBIRS-feedback, une méthode de reformulation utilisant une ontologie de domaine [OBIRS-feedback, a query reformulation method using a domain ontology]

Author: Montmain Jacky
Ranwez Sylvie
Ranwez Vincent
Sy Mohameth-François
Publication venue: HAL CCSD
Publication date: 21/03/2012

National audience. The performance of an information retrieval system (IRS) can be degraded in terms of precision because users find it difficult to formulate their information needs precisely. Query reformulation or expansion is one response to this problem in IRSs. In this article, we propose a new method for reformulating conceptual queries which, starting from documents judged relevant by the user and a domain ontology, searches for a set of concepts that maximizes the performance of the IRS. This performance is evaluated, in an original way, using indicators for which a formalization is proposed. The method was evaluated using our OBIRS engine, the MeSH domain ontology, and the MuCHMORE test collection.

HAL Descartes

HAL-CIRAD

How ontology based information retrieval systems may benefit from lexical text analysis

Author: Augereau Patrick
Duthil Benjamin
Montmain Jacky
Ranwez Sylvie
Ranwez Vincent
Sy Mohameth-François
Publication venue: Springer Fachmedien Wiesbaden GmbH
Publication date: 01/01/2013

International audience. The exponential growth of available electronic data is almost useless without efficient tools to retrieve the right information at the right time. It is now widely acknowledged that information retrieval systems (IRSs) need to take semantics into account to enhance the use of available information. However, there is still a gap between the amount of relevant information that can be accessed through optimized IRSs on the one hand, and users' ability to grasp and process a handful of relevant data at once on the other. This chapter shows how conceptual and lexical approaches may be jointly used to enrich document description. After a survey of semantic-based methodologies designed to efficiently retrieve and exploit information, hybrid approaches are discussed. The original approach presented here benefits from both lexical and ontological document description, and combines them in a software architecture dedicated to information retrieval and rendering in specific domains.

HAL Descartes

HAL-CIRAD

Hal-Diderot

ONTOLOGY BASED INFORMATION RETRIEVAL

Author: Sy Mohameth François
Publication date: 11/12/2012

Ontologies model the knowledge of a domain as a hierarchy of that domain's key concepts. Using them in Information Retrieval Systems (IRSs), both to index documents and to express queries, avoids in particular the natural-language ambiguities that penalize classical IRSs. The work in this thesis focuses on using ontologies during the matching process, in which an IRS ranks the documents of a collection by their relevance to a user query. We propose to compute this relevance with a strategy that aggregates elementary scores between each document and each concept of the query. This simple and intuitive aggregation integrates a user-dependent preference model and a semantic similarity measure associated with the ontology. The main benefit of this approach is that it makes it possible to explain to the user why our IRS, OBIRS, considers the documents it has selected to be relevant. We propose to reinforce this justification with an original visualization in which results are represented by pictograms that summarize their elementary relevance scores and are then laid out on a semantic map according to their overall relevance. Since information retrieval is an iterative process, users must be able to interact with the IRS, to understand and evaluate the results, and to be guided in reformulating their query. We propose a strategy for reformulating conceptual queries based on transposing a method proven in vector-space IRSs. Reformulation then becomes an optimization problem that uses the user's feedback on the first results as a training base.
We developed a heuristic that approaches an optimal query while testing only a subspace of the possible conceptual queries. We show that the efficient identification of the concepts in this subspace follows from two properties that many semantic similarity measures satisfy, and that suffice to guarantee the connectivity of a concept's semantic neighborhood. The proposed models are validated both on performance obtained on standard test collections and on case studies involving expert biologists.

Domain ontologies provide a knowledge model in which the main concepts of a domain are organized through hierarchical relationships. In conceptual Information Retrieval Systems (IRSs), where they are used to index documents as well as to formulate queries, they help overcome some of the ambiguities of classical IRSs based on natural language processing. One of the contributions of this study is the use of ontologies within IRSs, in particular to assess the relevance of documents with respect to a given query. For this matching process, a simple and intuitive aggregation approach is proposed that incorporates a user-dependent preference model on the one hand, and semantic similarity measures attached to a domain ontology on the other. This matching strategy makes it possible to justify the relevance of the results to the user. To complete this explanation, semantic maps are built to help the user grasp the results at a glance. Documents are displayed as icons that detail their elementary scores. They are organized so that their graphical distance on the map reflects their relevance to a query represented as a probe. As information retrieval is an iterative process, it is necessary to involve users in the control loop of result relevance so that they can better specify their information needs.
Inspired by proven strategies in vector models, we propose, in the context of conceptual IRSs, to formalize ontology-based relevance feedback. This strategy consists in searching for a conceptual query that optimizes a tradeoff between the closeness of relevant documents and the remoteness of irrelevant documents, modeled through an objective function. From a set of concepts of interest, a heuristic is proposed that efficiently builds a near-optimal query. This heuristic relies on two simple properties of semantic similarities that are proved to ensure semantic neighborhood connectivity. Hence, only an excerpt of the ontology's DAG structure is explored during query reformulation. These approaches have been implemented in OBIRS, our ontology-based IRS, and validated in two ways: automatic assessment based on standard test collections, and case studies involving experts from the biomedical domain.
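
The matching step summarized in this abstract (elementary scores between a document and each query concept, aggregated with user preferences) can be illustrated with a minimal Python sketch. This is not the OBIRS implementation: the toy ontology, the path-based similarity, the weighted-mean aggregation, and every name used here (`PARENT`, `similarity`, `document_score`) are assumptions chosen only for illustration.

```python
# Hypothetical toy ontology: child -> parent (a tree, for simplicity).
PARENT = {
    "cell": "entity",
    "neuron": "cell",
    "interneuron": "neuron",
    "gene": "entity",
}

def ancestors(concept):
    """Return the concept followed by its ancestors up to the root."""
    chain = [concept]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain

def similarity(a, b):
    """Assumed path-based similarity in (0, 1]: 1 / (1 + edge distance)."""
    up_a, up_b = ancestors(a), ancestors(b)
    common = next(c for c in up_a if c in up_b)  # lowest common ancestor
    distance = up_a.index(common) + up_b.index(common)
    return 1.0 / (1.0 + distance)

def elementary_scores(doc_concepts, query):
    """Best similarity between the document's annotations and each query concept."""
    return {q: max(similarity(q, d) for d in doc_concepts) for q in query}

def document_score(doc_concepts, query_weights):
    """Aggregate elementary scores with user preference weights (weighted mean)."""
    scores = elementary_scores(doc_concepts, query_weights)
    total = sum(query_weights.values())
    return sum(query_weights[q] * scores[q] for q in scores) / total

# Hypothetical annotated documents and a weighted conceptual query.
docs = {"d1": ["interneuron"], "d2": ["gene"], "d3": ["neuron", "gene"]}
query = {"neuron": 2.0, "gene": 1.0}  # the user cares more about "neuron"
ranking = sorted(docs, key=lambda d: document_score(docs[d], query), reverse=True)
```

Because the elementary scores are kept per query concept, a system built along these lines could show the user which concept contributed what to a document's overall score, which is the kind of justification the abstract describes.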

Theses.fr

Utilisation d'ontologies comme support à la recherche et à la navigation dans une collection de documents [Using ontologies to support search and navigation in a document collection]

Author: Sy Mohameth-François
Publication venue: HAL CCSD
Publication date: 11/12/2012

Domain ontologies provide a conceptual formalization of domain knowledge. One contribution of this study consists in using them in conceptual Information Retrieval Systems (IRSs), in particular to assess the relevance of documents with respect to a given query. For this matching process, a model is proposed that incorporates both user preferences and semantic similarity measures attached to a domain ontology. Our approach makes it possible to justify the relevance of the results to the user, using visualization tools. As information retrieval is an iterative process, users may be involved in the control loop of result relevance to better specify their information needs. We propose to formalize ontology-based relevance feedback using an objective function and a heuristic that efficiently builds a near-optimal query. These approaches have been validated in two ways: automatic assessment based on standard test collections, and case studies involving experts from the biomedical domain.

Ontologies model the knowledge of a domain as a hierarchy of concepts. This thesis addresses their use in Information Retrieval Systems (IRSs) to estimate the relevance of documents with respect to a query. We compute this relevance using a model of the user's preferences and a semantic similarity measure associated with the ontology. This approach makes it possible to explain to the user, through an original visualization, why the selected documents are relevant. Since IR is an iterative process, the user must be guided in reformulating the query. A strategy for reformulating conceptual queries is formalized as an optimization problem that uses the user's feedback on the first results as a training base. Our models are validated on performance obtained on standard test collections and on case studies involving expert biologists.
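
The relevance-feedback idea in this abstract (an objective function plus a heuristic that builds a near-optimal query) can be sketched as a simple greedy search. The sketch below is an illustration under stated assumptions, not the thesis's heuristic: the similarity table `SIM`, the max-based document score, the objective (mean score of relevant documents minus mean score of irrelevant ones), and the function names are all hypothetical.

```python
# Hypothetical precomputed similarities between candidate concepts and documents.
SIM = {
    "neuron":  {"d1": 0.9, "d2": 0.3, "d3": 0.2},
    "synapse": {"d1": 0.2, "d2": 0.9, "d3": 0.1},
    "gene":    {"d1": 0.3, "d2": 0.4, "d3": 0.9},
}

def score(query, doc):
    """Assumed document score: best similarity to any concept of the query."""
    return max(SIM[c][doc] for c in query)

def objective(query, relevant, irrelevant):
    """Assumed objective: mean score of relevant docs minus mean score of irrelevant docs."""
    rel = sum(score(query, d) for d in relevant) / len(relevant)
    irr = sum(score(query, d) for d in irrelevant) / len(irrelevant) if irrelevant else 0.0
    return rel - irr

def greedy_reformulation(candidates, relevant, irrelevant):
    """Add the concept that most improves the objective until no concept helps."""
    query, best = [], float("-inf")
    while True:
        gains = {c: objective(query + [c], relevant, irrelevant)
                 for c in candidates if c not in query}
        if not gains:
            return query, best
        concept = max(gains, key=gains.get)
        if gains[concept] <= best:
            return query, best
        query.append(concept)
        best = gains[concept]

# User feedback on first results: d1 and d2 judged relevant, d3 irrelevant.
new_query, value = greedy_reformulation(list(SIM), relevant=["d1", "d2"], irrelevant=["d3"])
```

A greedy search only examines a small part of the space of possible concept sets, which mirrors in spirit the abstract's point about building a near-optimal query efficiently; it offers no optimality guarantee.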

Thèses en Ligne

Utilisation d'ontologies comme support à la recherche et à la navigation dans une collection de documents [Using ontologies to support search and navigation in a document collection]

Author: CRAMPES Michel
RANWEZ Sylvie
RANWEZ Vincent
SY Mohameth François
Publication date: 01/01/2012

OpenGrey Repository

Utilisation de proximités sémantiques pour améliorer la recherche et le rendu d'information [Using semantic proximities to improve information retrieval and rendering]

Author: Crampes Michel
Montmain Jacky
Ranwez Sylvie
Ranwez Vincent
Sy Mohameth-François
Publication venue: Presse des Mines
Publication date: 09/06/2010

12 pages. National audience. To make effective use of ever-larger document collections, search engines must evolve. Their current limitations mainly concern the fact that the measure of a document's relevance to a query is often not explicit and that interaction with the list of results is limited. We propose an ontology-based querying method and environment that use aggregation operators to compute an overall relevance measure, which depends on the semantic proximity of the corpus documents to each concept of the query on the one hand, and on the user's preferences on the other. We then build a semantic map that reflects the relevance of the selected documents and makes explicit how well they match the query. This human-machine interface opens the way to an iterative and interactive querying process.

HAL-IRD

HAL-CIRAD

Hal-Diderot

User centered and ontology based information retrieval system for life science - OBIRS

Author: Crampes Michel
Montmain Jacky
Ranwez Sylvie
Ranwez Vincent
Sy Mohameth-François
Publication venue: HAL CCSD
Publication date: 09/12/2010

International audience. Because of the increasing volume of electronic data, designing efficient tools to retrieve and exploit documents is a major challenge. Current search engines suffer from two main drawbacks: there is limited interaction with the list of retrieved documents and no explanation for their adequacy to the query. Users may thus be confused by the selection and have no idea how to adapt their query so that the results match their expectations. This paper describes a querying method and an environment based on aggregating models to assess the relevance of documents annotated by concepts of an ontology. The selection of documents is then displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive exploration of the data corpus.

HAL-IRD

HAL-CIRAD

Hal-Diderot

User centered and ontology based information retrieval system for life sciences

Author: Crampes Michel
Montmain Jacky
Ranwez Sylvie
Ranwez Vincent
Regnault Armelle
Sy Mohameth-François
Publication venue: BMC
Publication date: 01/01/2011

Background: Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Some improvements have been offered by semantic Web technologies and applications based on domain ontologies. In life science, for instance, the Gene Ontology is widely exploited in genomic applications, and the Medical Subject Headings (MeSH) thesaurus is the basis of the biomedical publication indexing and information retrieval process offered by PubMed. However, current search engines suffer from two main drawbacks: there is limited user interaction with the list of retrieved resources, and no explanation of their adequacy to the query is provided. Users may thus be confused by the selection and have no idea how to adapt their queries so that the results match their expectations.

Results: This paper describes an information retrieval system that relies on a domain ontology to widen the set of relevant documents retrieved and that uses a graphical rendering of query results to favor user interaction. Semantic proximities between ontology concepts and aggregating models are used to assess the adequacy of documents with respect to a query. The selection of documents is displayed in a semantic map to provide graphical indications that make explicit to what extent they match the user's query; this man/machine interface favors a more interactive and iterative exploration of the data corpus by facilitating the weighting of query concepts and visual explanation. We illustrate the benefit of using this information retrieval system on two case studies, one of which aims at collecting human genes related to transcription factors involved in the hemopoiesis pathway.

Conclusions: The ontology-based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user-centred application in which the system highlights relevant information to support decision making.

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

The Neuron Phenotype Ontology: A FAIR Approach to Proposing and Classifying Neuronal Types.

Author: Gillespie Thomas H
Hill Sean L
Martone Maryann E
Sy Mohameth François
Tripathy Shreejoy J
Publication venue: eScholarship, University of California
Publication date: 01/01/2022

The challenge of defining and cataloging the building blocks of the brain requires a standardized approach to naming neurons and organizing knowledge about their properties. The US Brain Initiative Cell Census Network, Human Cell Atlas, Blue Brain Project, and others are generating vast amounts of data and characterizing large numbers of neurons throughout the nervous system. The neuroscientific literature contains many neuron names (e.g. parvalbumin-positive interneuron or layer 5 pyramidal cell) that are commonly used and generally accepted. However, it is often unclear how such common usage types relate to many evidence-based types that are proposed based on the results of new techniques. Further, comparing different types across labs remains a significant challenge. Here, we propose an interoperable knowledge representation, the Neuron Phenotype Ontology (NPO), that provides a standardized and automatable approach for naming cell types and normalizing their constituent phenotypes using identifiers from community ontologies as a common language. The NPO provides a framework for systematically organizing knowledge about cellular properties and enables interoperability with existing neuron naming schemes. We evaluate the NPO by populating a knowledge base with three independent cortical neuron classifications derived from published data sets that describe neurons according to molecular, morphological, electrophysiological, and synaptic properties. Competency queries to this knowledge base demonstrate that the NPO knowledge model enables interoperability between the three test cases and neuron names commonly used in the literature

Infoscience - École polytechnique fédérale de Lausanne

PubMed Central

eScholarship - University of California