Search CORE

65 research outputs found

word etymology as native language interference

Author: Carlo Strapparava
Vivi Nastase
Publication venue
Publication date: 01/01/2017
Field of study

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Open Access Repository

Recommended from our members

Finding Semantic Associations on Express Lane

Author: Mihalcea Rada, 1974-
Nastase Vivi
Publication venue: European Language Resources Association
Publication date: 01/05/2004
Field of study

This paper introduces a new codification scheme for efficient computation of measures in semantic networks. The scheme is particularly useful for fast computation of semantic associations between words and implementation of an informational retrieval operator for efficient search in semantic spaces. Other applications may also be possible

UNT Digital Library

anglicized words and misspelled cognates in native language identification

Author: Carlo Strapparava
Ilia Markov
Vivi Nastase
Publication venue
Publication date: 01/01/2019
Field of study

VU Research Portal

Archivio della ricerca - Fondazione Bruno Kessler

Institutional Repository Universiteit Antwerpen

Open Access Repository

Taxonomy Induction using Hypernym Subsequences

Author: Biemann Chris
Cram Damien
Grefenstette Gregory
Gupta Amit
Kozareva Zornitsa
Nastase Vivi
Oakes Michael P
Ponzetto S.
Ponzetto Simone Paolo
Snow Rion
Publication venue
Publication date: 05/05/2017
Field of study

We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms. Unlike all previous approaches, which typically extract direct hypernym edges for terms, our approach utilizes a novel probabilistic framework to extract hypernym subsequences. Taxonomy induction from extracted subsequences is cast as an instance of the minimumcost flow problem on a carefully designed directed graph. Through experiments, we demonstrate that our approach outperforms stateof- the-art taxonomy induction approaches across four languages. Importantly, we also show that our approach is robust to the presence of noise in the input vocabulary. To the best of our knowledge, no previous approaches have been empirically proven to manifest noise-robustness in the input vocabulary

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

CoSyne: A Framework for Multilingual Content Synchronization of Wikis

Author: Angela Fahrni
Christof Monz
Matteo Negri
Michael Strube
Vivi Nastase
Yashar Mehdad
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

Wikis allow a large base of contributors easy access to shared content, and freedom in editing it. One of the side-effects of this freedom was the emergence of parallel and independently evolving versions in a variety of languages, reflecting the multilingual background of the pool of contributors. For the Wiki to properly represent the user-added content, this should be fully available in all its languages. Working on parallel Wikis in several European languages, we investigate the possibility to “synchronize” different language versions of the same document, by: i) pinpointing topically related pieces of information in the different languages, ii) identifying information that is missing or less detailed in one of the two versions,iii) translating this in the appropriate language, iv) inserting it in the appropriate place. Progress along such directions will allow users to share more easily content across language boundaries

Archivio della ricerca - Fondazione Bruno Kessler

multi level alignments as an extensible representation basis for textual entailment algorithms

Author: Ido Dagan
Kathrin Eichler
Lili Kotlerman
Meni Adler
Sebastian Padó
Tae-Gil Noh
Vered Shwartz
Vivi Nastase
Publication venue
Publication date: 01/01/2015
Field of study

A major problem in research on Textual Entailment (TE) is the high implementation effort for TE systems. Recently, interoperable standards for annotation and preprocessing have been proposed. In contrast, the algorithmic level remains unstandardized, which makes component re-use in this area very difficult in practice. In this paper, we introduce multi-level alignments as a central, powerful representation for TE algorithms that encourages modular, reusable, multilingual algorithm development. We demonstrate that a pilot open-source implementation of multi-level alignment with minimal features competes with state-of-theart open-source TE engines in three languages

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

Open Access Repository