6 research outputs found

    Using ontology to build news network

    One of the main activities in knowledge sharing is searching for and retrieving textual documents. Traditional search methods use user-specified keywords to find documents. A common problem with this approach is that the retrieved documents are often not the ones the user is actually looking for, even though the search is based on user-defined keywords. This research work proposes building a well-defined domain in which semantic relationships can be defined among the text documents in the repository, to enhance search and retrieval performance. Reuters news is chosen as the domain, and an ontology that defines these relationships is established to address the synonymy and polysemy problems. The ontology uses keywords to quantify relationship strengths and labels the qualitative semantics. The ontology structure is a hierarchically arranged network of documents. This paper discusses the implementation of the document ontology as applied to the Reuters news corpus, where retrieval performance is measured in terms of recall and precision.
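Since the abstract above evaluates retrieval by recall and precision, the following minimal sketch shows how those two measures are conventionally computed for a single query. It is not taken from the paper: the function name `precision_recall` and the document IDs are hypothetical, chosen only for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    precision = |retrieved ∩ relevant| / |retrieved|
    recall    = |retrieved ∩ relevant| / |relevant|
    Empty denominators yield 0.0 (a convention assumed here).
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    hits = retrieved & relevant
    precision = len(hits) / len(retrieved) if retrieved else 0.0
    recall = len(hits) / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical document IDs, for illustration only.
retrieved_docs = ["d1", "d2", "d3", "d4"]  # what the system returned
relevant_docs = ["d2", "d4", "d5"]         # what the user actually wanted

precision, recall = precision_recall(retrieved_docs, relevant_docs)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here two of the four retrieved documents are relevant, and two of the three relevant documents were retrieved, so precision and recall differ even though the number of hits is the same.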

    Syntactic Features and Word Similarity for Supervised Metonymy Resolution

    We present a supervised machine learning algorithm for metonymy resolution, which exploits the similarity between examples of conventional metonymy. We show that syntactic head-modifier relations are a high-precision feature for metonymy recognition but suffer from data sparseness.

    English sentence retrieval system based on dependency structure and its evaluation


    Coreferential Definite and Demonstrative Descriptions in French: A Corpus Study for Text Generation

    Peer-reviewed international conference with proceedings. This paper presents a new classification for the use of definite and demonstrative descriptions, its application in a corpus analysis, and the results of this analysis. The proposed classification is based on existing literature and extended to support the generation of definite and demonstrative NPs. The corpus analysis shows, in particular, that subsequent mentions of a referent can perform two functions (repeating given information and/or introducing new information). The comparison between definite and demonstrative determiners leads to preliminary data for generation algorithms.

    Une analyse des emplois du démonstratif en corpus

    Peer-reviewed national conference with proceedings. This article proposes a new classification of the uses of demonstratives, applies this classification in a corpus analysis, and presents the results of that analysis. The proposed classification is based on those existing in the literature, extended to support the generation of demonstrative noun phrases. The corpus analysis shows in particular that the "reclassifying" nature of the demonstrative allows it to fulfil two functions (an anaphoric function and a function of supporting new information), and that there are various means of realizing these functions.

    THE LINGUIST'S SEARCH ENGINE: GETTING STARTED GUIDE

    The World Wide Web can be viewed as a naturally occurring resource that embodies the rich and dynamic nature of language, a data repository of unparalleled size and diversity. However, current Web search methods are oriented more toward shallow information retrieval techniques than toward the more sophisticated needs of linguists. Using the Web in linguistic research is not easy. It will, however, be getting easier. This report introduces the Linguist's Search Engine, a new linguist-friendly tool that makes it possible to retrieve naturally occurring sentences from the World Wide Web on the basis of lexical content and syntactic structure. Its aim is to help linguists of all stripes in conducting more thoroughly empirical exploration of evidence, with particular attention to variability and the role of context. LAMP-TR-108 UMIACS-TR-2003-10