Search CORE

247 research outputs found

An overview of portuguese wordnets

Author: Freitas Cláudia
Oliveira Hugo Gonçalo
Paiva Valeria de
Rademaker Alexandre
Real Livy
Simões Alberto
Publication venue: Global Wordnet Association
Publication date: 01/01/2016
Field of study

Semantic relations between words are key to building systems that aim to understand and manipulate language. For En- glish, the “de facto” standard for representing this kind of knowledge is Princeton’s WordNet. Here, we describe the wordnet-like resources currently available for Portuguese: their origins, methods of creation, sizes, and usage restrictions. We start tackling the problem of comparing them, but only in quantitative terms. Finally, we sketch ideas for potential collaboration between some of the projects.(undefined

Universidade do Minho: RepositoriUM

Predicate Matrix: an interoperable lexical knowledge base for predicates

Author: López de Lacalle Maddalen
Publication venue
Publication date: 10/07/2023
Field of study

183 p.La Matriz de Predicados (Predicate Matrix en inglés) es un nuevo recurso léxico-semántico resultado de la integración de múltiples fuentes de conocimiento, entre las cuales se encuentran FrameNet, VerbNet, PropBank y WordNet. La Matriz de Predicados proporciona un léxico extenso y robusto que permite mejorar la interoperabilidad entre los recursos semánticos mencionados anteriormente. La creación de la Matriz de Predicados se basa en la integración de Semlink y nuevos mappings obtenidos utilizando métodos automáticos que enlazan el conocimiento semántico a nivel léxico y de roles. Asimismo, hemos ampliado la Predicate Matrix para cubrir los predicados nominales (inglés, español) y predicados en otros idiomas (castellano, catalán y vasco). Como resultado, la Matriz de predicados proporciona un léxico multilingüe que permite el análisis semántico interoperable en múltiples idiomas

Archivo Digital para la Docencia y la Investigación

On the Utility of Word Embeddings for Enriching OpenWordNet-PT

Author: Aguiar Fredson Silva de Souza
Rademaker Alexandre
Publication venue: OASIcs - OpenAccess Series in Informatics. 3rd Conference on Language, Data and Knowledge (LDK 2021)
Publication date: 01/01/2021
Field of study

The maintenance of wordnets and lexical knwoledge bases typically relies on time-consuming manual effort. In order to minimise this issue, we propose the exploitation of models of distributional semantics, namely word embeddings learned from corpora, in the automatic identification of relation instances missing in a wordnet. Analogy-solving methods are first used for learning a set of relations from analogy tests focused on each relation. Despite their low accuracy, we noted that a portion of the top-given answers are good suggestions of relation instances that could be included in the wordnet. This procedure is applied to the enrichment of OpenWordNet-PT, a public Portuguese wordnet. Relations are learned from data acquired from this resource, and illustrative examples are provided. Results are promising for accelerating the identification of missing relation instances, as we estimate that about 17% of the potential suggestions are good, a proportion that almost doubles if some are automatically invalidated

Dagstuhl Research Online Publication Server

Proceedings

Author: Dickinson Markus
Müürisep Kaili
Passarotti Marco
Publication venue
Publication date: 01/12/2010
Field of study

Proceedings of the Ninth International Workshop on Treebanks and Linguistic Theories. Editors: Markus Dickinson, Kaili Müürisep and Marco Passarotti. NEALT Proceedings Series, Vol. 9 (2010), 268 pages. © 2010 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/15891

DSpace at Tartu University Library

Action nominalizations in Early Modern scientific English

Author: Vázquez López Vera
Publication venue
Publication date: 01/01/2013
Field of study

The present dissertation, Action nominalizations in Early Modern scientific English, was conceived as a contribution to the literature on nominalizations. Its point of departure was the assertion that action nominalizations are the result of a word-formation process which aims at filling gaps in the vocabulary of a particular language, English in this case. Action nominalizations are clear cases of grammatical metaphor (Halliday 2004 [1985]), since they are nouns, but they refer to actions as verbs do. For this reason, attention is given to the evolution and use of action nominalizations in the Early Modern English period (henceforth EModE), the time which sees the greatest increase of vocabulary in the history of the English language. Given that nouns prototypically refer to objects rather than actions, the question arises as to how they behave when they denote actions, and what the consequences of this use are

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio Institucional da Universidade de Santiago de Compostela

As Wordnets do Português

Author: Freitas Cláudia
Oliveira Hugo Gonçalo
Paiva Valéria de
Rademaker Alexandre
Real Livy
Simões Alberto
Publication venue: 'University of Oslo Library'
Publication date: 01/01/2015
Field of study

Series: "Oslo Studies in Language". ISSN 1890-9639. 7(1), 2015.Not many years ago it was usual to comment on the lack of an open lexical- semantic knowledge base, following the lines of Princeton WordNet, but for Portuguese. Today, the landscape has changed significantly, and re- searchers that need access to this specific kind of resource have not one, but several alternatives to choose from. The present article describes the wordnet-like resources currently available for Portuguese. It provides some context on their origin, creation approach, size and license for utilization. Apart from being an obvious starting point for those looking for a computational resource with information on the meaning of Portuguese words, this article describes the resources available, compares them and lists some plans for future work, sketching ideas for potential collaboration between the projects described.CLUPFundação para a Ciência e a Tecnologia (FCT

Universidade do Minho: RepositoriUM

Repositório Comum

University of Oslo (UiO): FRITT (E-Journals)

Texts as image-schemas: A crosslinguistic analysis of discourse in the light of cognitive linguistics

Author: Salies Tania Mara Gastao
Publication venue
Publication date: 01/12/1997
Field of study

SHAREOK repository

Semi-automatic approaches for exploiting shifter patterns in domain-specific sentiment analysis

Author: Brazdil Pavel
Cordeiro João
Leal António
Muhammad Shamsuddeen Hassan
Oliveira Fátima
Silva Maria de Fátima Henriques da
Silvano Maria da Purificação
Publication venue: 'MDPI AG'
Publication date: 01/01/2022
Field of study

This paper describes two different approaches to sentiment analysis. The first is a form of symbolic approach that exploits a sentiment lexicon together with a set of shifter patterns and rules. The sentiment lexicon includes single words (unigrams) and is developed automatically by exploiting labeled examples. The shifter patterns include intensification, attenuation/downtoning and inversion/reversal and are developed manually. The second approach exploits a deep neural network, which uses a pre-trained language model. Both approaches were applied to texts on economics and finance domains from newspapers in European Portuguese. We show that the symbolic approach achieves virtually the same performance as the deep neural network. In addition, the symbolic approach provides understandable explanations, and the acquired knowledge can be communicated to others. We release the shifter patterns to motivate future research in this direction

Repositório Aberto da Universidade do Porto

Visualización del lenguaje a través de corpus

Author: Crespo Begoña
Lareo Inés
Lojo Sandino Paula
Moskowich Isabel
Publication venue: Universidade da Coruña, Servizo de Publicacións
Publication date: 01/01/2021
Field of study

Digital version of the print publication, published in A Coruña: Universidade da Coruña, Servizo de Publicacións, 2010 (ISBN 978-84-9749-401-4)This book contains the papers presented at the Second International Conference on Corpus Linguistics held at the University of A Coruña in 2010 and organised by the MuStE group. The essays deal with different aspects of corpus linguistics both as a methodology and as a branch of Linguistics.[Abstract] The collection of essays we are presenting here are just a mere sample of the interest the topics relating to Corpus Linguistics have arisen everywhere. Such different topics as those related to Computational Linguistics found in “Obtaining computational resources for languages with scarce resources from closely related computationally-developed languages. The Galician and Portuguese case“ or “Corpus-Based Modelling of Lexical Changes in Manic Depression Disorders: The Case of Edgar Allan Poe” belonging to the field of Corpus and Literary Studies can be found in the ensuing pages. Almost all research areas can nowadays be investigated using Corpus Linguistics as a valid methodology. This is reason why Language Windowing through Corpora gathers papers dealing with discourse, variation and change, grammatical studies, lexicology and lexicography, corpus design, contrastive analyses, language acquisition and learning or translation. This work’s title aims at reflecting not only the great variety of topics gathered in it but also the worldwide interest awaken by the computer processing of language. In fact, researchers from many different institutions all over the world have contributed to this book. Apart from the twenty-two Spanish Universities, people from other Higher Education Institutions have authored and co-authored the essays contained here, namely, Russia, Venezuela, Brazil, UK, Finland, Portugal, Poland, Austria, Mexico, Thailand, Iran, the Netherlands, Belgium, Japan, Turkey, China, Italy, Malaysia, Romania and Sweden. All these essays have been alphabetically arranged, by the names of their authors, in two parts. Part 1 contains the papers by authors from A to K and Part 2, those of authors from L to Z

Repositorio da Universidade da Coruña