Search CORE

148 research outputs found

Qualitative terminology extraction: Identifying relational adjectives

Author: Daille Béatrice
Publication venue: 'John Benjamins Publishing Company'
Publication date: 01/01/2001
Field of study

International audienceThis paper presents the identification in corpora of French relational adjectives, phenomena considered by linguists as highly informative. The approach uses a termer which is applied on a tagged and lemmatized corpus. Relational adjectives and nominal compounds which include a relational adjective are then quantified and their informative status is evaluated thanks to a thesaurus of the domain. We conclude with a discussion of the interesting status of such adjectives and nominal compounds for terminology extraction and other automatic terminology tasks

Hal-Diderot

Identification of Fertile Translations in Medical Comparable Corpora: a Morpho-Compositional Approach

Author: Daille Béatrice
Delpech Estelle
Lemaire Claire
Morin Emmanuel
Publication venue
Publication date: 11/09/2012
Field of study

This paper defines a method for lexicon in the biomedical domain from comparable corpora. The method is based on compositional translation and exploits morpheme-level translation equivalences. It can generate translations for a large variety of morphologically constructed words and can also generate 'fertile' translations. We show that fertile translations increase the overall quality of the extracted lexicon for English to French translation

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

Influence des domaines de spécialité dans l'extraction de termes-clés

Author: Boudin Florian
Bougouin Adrien
Daille Béatrice
Publication venue: HAL CCSD
Publication date: 01/07/2014
Field of study

National audienceLes termes-clés sont les mots ou les expressions polylexicales qui représentent le contenu principal d'un document. Ils sont utiles pour diverses applications, telles que l'indexation automatique ou le résumé automatique, mais ne sont pas toujours disponibles. De ce fait, nous nous intéressons à l'extraction automatique de termes-clés et, plus particulièrement, à la difficulté de cette tâche lors du traitement de documents appartenant à certaines disciplines scientifiques. Au moyen de cinq corpus représentant cinq disciplines différentes (archéologie, linguistique, sciences de l'information, psychologie et chimie), nous déduisons une échelle de difficulté disciplinaire et analysons les facteurs qui influent sur cette difficulté

Hal-Diderot

Tools for Terminology Processing

Author: Daille Béatrice
Enguehard Chantal
Morin Emmanuel
Publication venue: Tata McGraw-Hill
Publication date: 01/06/2002
Field of study

International audienceAutomatic terminology processing appeared 10 years ago when electronic corpora became widely available. Such processing may be statistically or linguistically based and produces terminology resources that can be used in a number of applications : indexing, information retrieval, technology watch, etc. We present the tools that have been developed in the IRIN Institute. They all take as input texts (or collection of texts) and reflect different states of terminology processing: term acquisition, term recognition and term structuring

6th International Workshop on Computational Terminology (COMPUTERM 2020), Proceedings

Author: Daille Béatrice
Kageura Kyo
Rigouts Terryn Ayla
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2020
Field of study

Ghent University Academic Bibliography

Comparability measurement for terminology extraction

Author: Blancafort Helena
Daille Béatrice
Jacquin Christine
Monceaux Laura
Morin Emmanuel
Poulard Fabien
Publication venue
Publication date: 08/05/2011
Field of study

Proceedings of the Workshop CHAT 2011: Creation, Harmonization and Application of Terminology Resources. Editors: Tatiana Gornostay and Andrejs Vasiļjevs. NEALT Proceedings Series, Vol. 12 (2011), 3-10. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/16956

DSpace at Tartu University Library

Extraction d'expressions-cibles de l'opinion : de l'anglais au français

Author: Claveau Vincent
Daille Béatrice
Jadi Grégoire
Monceaux Laura
Publication venue: HAL CCSD
Publication date: 04/07/2016
Field of study

National audienceIn this paper, we present the development of an Opinion Target Extraction system in english and transpose it to french. In addition, we realize an analysis of the features and their effectiveness in english and french which suggest that it is possible to build an Opinion Target Extraction system independant of the domain. Finally, we propose a comparative study of the errors of our systems in both english and french and propose several solutions to these problems.Dans cet article, nous présentons le développement d'un système d'extraction d'expressions-cibles pour l'anglais et sa transposition au français. En complément, nous avons réalisé une étude de l'efficacité des traits en anglais et en français qui tend à montrer qu'il est possible de réaliser un système d'extraction d'expressions-cibles indépendant du domaine. Pour finir, nous proposons une analyse comparative des erreurs commises par nos systèmes en anglais et français et envisageons différentes solutions à ces problèmes

INRIA a CCSD electronic archive server

Evaluating Lexical Similarity to build Sentiment Similarity

Author: Claveau Vincent
Daille Béatrice
Jadi Grégoire
Monceaux-Cachard Laura
Publication venue: HAL CCSD
Publication date: 23/05/2016
Field of study

International audienceIn this article, we propose to evaluate the lexical similarity information provided by word representations against several opinion resourcesusing traditional Information Retrieval tools. Word representation have been used to build and to extend opinion resources such aslexicon, and ontology and their performance have been evaluated on sentiment analysis tasks. We question this method by measuring thecorrelation between the sentiment proximity provided by opinion resources and the semantic similarity provided by word representationsusing different correlation coefficients. We also compare the neighbors found in word representations and list of similar opinion words.Our results show that the proximity of words in state-of-the-art word representations is not very effective to build sentiment similarity

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL-Rennes 1