Improving Term Extraction with Terminological Resources

C.G. Chute; T.G.O. Consortium; T.G.O. Consortium; Y. Tsuruoka

research

Improving Term Extraction with Terminological Resources

Authors: C.G. Chute
T.G.O. Consortium
T.G.O. Consortium
Y. Tsuruoka
Publication date: 1 January 2006
Publisher
Doi

Abstract

Studies of different term extractors on a corpus of the biomedical domain revealed decreasing performances when applied to highly technical texts. The difficulty or impossibility of customising them to new domains is an additional limitation. In this paper, we propose to use external terminologies to influence generic linguistic data in order to augment the quality of the extraction. The tool we implemented exploits testified terms at different steps of the process: chunking, parsing and extraction of term candidates. Experiments reported here show that, using this method, more term candidates can be acquired with a higher level of reliability. We further describe the extraction process involving endogenous disambiguation implemented in the term extractor YaTeA

Similar works

Full text

Available Versions

Hal-Diderot

oai:HAL:hal-00091444v1

Last time updated on 14/04/2021

HAL-Paris 13

oai:HAL:hal-00091444v1

Last time updated on 11/11/2016

HAL Descartes

oai:HAL:hal-00091444v1

Last time updated on 14/04/2021

Crossref

Last time updated on 22/03/2019