research

Enriching a Portuguese WordNet using synonyms from a monolingual dictionary

Abstract

In this article we present an exploratory approach to enrich a WordNet-like lexical ontology with the synonyms present in a standard monolingual Portuguese dictionary. The dictionary was converted from PDF into XML and senses were automatically identified and annotated. This allowed us to extract them, independently of definitions, and to create sets of synonyms (synsets). These synsets were then aligned with WordNet synsets, both in the same language (Portuguese) and projecting the Portuguese terms into English, Spanish and Galician. This process allowed both the addition of new term variants to existing synsets, as to create new synsets for Portuguese.This work has been supported by COMPETE: POCI01-0145-FEDER-007043 and FCT – Fundação para a Ciência e Tecnologia within the Project Scope: UID/CEC/00319/2013; and thanks to the Project SKATeR (TIN2012-38584-C06-04) supported by the Ministry of Economy and Competitiveness of the Spanish Government

    Similar works