33 research outputs found

Chunk Tagger - Statistical Recognition of Noun Phrases

Author: Brants Thorsten
Skut Wojciech
Publication date: 01/01/1998

We describe a stochastic approach to partial parsing, i.e., the recognition of syntactic structures of limited depth. The technique utilises Markov Models, but goes beyond usual bracketing approaches, since it is capable of recognising not only the boundaries, but also the internal structure and syntactic category of simple as well as complex NP's, PP's, AP's and adverbials. We compare tagging accuracy for different applications and encoding schemes.

arXiv.org e-Print Archive

CiteSeerX

Enrichment of Renaissance texts with proper names

Author: Eshkol-Taravella Iris
Friburger Nathalie
Maurel Denis
Publication venue: Serbian Academic Library Association
Publication date: 01/09/2014

International audience. The Renom project proposes to enrich Renaissance texts with proper names. These texts present two new challenges: great diversity due to variant spellings of words, and numerous XML-TEI tags that preserve the exact format of the original edition. The task consisted of adding Named Entity tags to this format, generally using the left context and sometimes the right context of a name. To do that, we improved the free and open-source program CasSys to parse texts with Unitex graph cascades, and we built dictionaries and specific cascades. The slot error rate was 6.1%. Proper Names and maps. were to allow navigating into. So, this paper deals with Named Entity Recognition in Renaissance texts

HAL Université de Tours

Automated Proof Reading of Clinical Notes

Author: Nguyen Dung
Patrick Jon
Publication venue: Institute of Digital Enhancement of Cognitive Processing, Waseda University
Publication date: 01/01/2011

Waseda University Repository

Case-based reasoning for identifying semantic roles in natural-language statements (Le raisonnement à partir de cas pour l'identification de rôles sémantiques dans des énoncés en langue naturelle)

Author: Chakkour Fairouz
Napoli Amedeo
Toussaint Yannick
Publication venue: HAL CCSD
Publication date: 01/05/2000

Conference with proceedings, without a review committee; national audience. Natural-language statements in technical domains exhibit recurring syntactic constructions. We propose to implement a case-based reasoning system that lets us move from the syntactic analysis of a sentence to its conceptual representation

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

DFKI finite-state machine toolkit

Author: Piskorski Jakub
Publication venue: Other institutions. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/2002

Finite-state devices such as finite-state automata and finite-state transducers have been known since the emergence of computer science and have recently been used extensively in many areas of language technology. Their use is mainly motivated by their time and space efficiency. In this paper we present the Finite-State Machine Toolkit for building, combining and optimizing finite-state machines, developed at the Language Technology Lab of the German Research Center for Artificial Intelligence

Universaar

Acronym

Finding answers on the Web and in a closed collection (Trouver des réponses dans le web et dans une collection fermée)

Author: Berthelin Jean-Baptiste
De Chalendar Gaël
Elkateb-Gara Faïza
Ferret Olivier
Grau Brigitte
Hurault-Plantet Martine
Illouz Gabriel
Monceaux Laura
Robba Isabelle
Vilnat Anne
Publication venue: HAL CCSD
Publication date: 01/01/2003

National audience. The task of question answering, as defined in the TREC-11 evaluation, may rely on a Web search. However, this strategy alone is not sufficient, since Web results are not certified. Our system, QALC, therefore searches both the Web and the AQUAINT text base. This implies that the system exists in two versions, each dealing with one kind of resource. In particular, Web queries may be extremely precise and still succeed. Relying on results common to both kinds of search yields a better ranking of the answers, and hence better performance of the QALC system.

Hal-Diderot