Synonym acquisition from translation graph
We present a language-independent method for leveraging synonyms from a large translation graph. A new WordNet-based precision-like measure is introduced.
Building basic vocabulary across 40 languages
The paper explores options for building bilingual dictionaries by automated methods. We define the notion of a 'basic vocabulary' and investigate how well the conceptual units that make up this language-independent vocabulary are covered by language-specific bindings in 40 languages.
Automatic punctuation restoration with BERT models
We present an approach to automatic punctuation restoration with BERT models for English and Hungarian. For English, we conduct our experiments on TED Talks, a commonly used benchmark for punctuation restoration, while for Hungarian we evaluate our models on the Szeged Treebank dataset. Our best models achieve a macro-averaged F1-score of 79.8 in English and 82.2 in Hungarian. Our code is publicly available.
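The macro-averaged F1-score reported above is the unweighted mean of per-class F1 scores over the punctuation labels. A minimal illustrative sketch of this metric (not the authors' evaluation code; the label set shown is hypothetical):

```python
def macro_f1(gold, pred, labels):
    """Macro-averaged F1: unweighted mean of per-class F1 scores.

    gold, pred: sequences of predicted/reference labels, aligned by position.
    labels: the set of punctuation classes to average over.
    """
    scores = []
    for lab in labels:
        # Count true positives, false positives, false negatives per class.
        tp = sum(1 for g, p in zip(gold, pred) if g == lab and p == lab)
        fp = sum(1 for g, p in zip(gold, pred) if g != lab and p == lab)
        fn = sum(1 for g, p in zip(gold, pred) if g == lab and p != lab)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        scores.append(2 * prec * rec / (prec + rec) if prec + rec else 0.0)
    # Each class contributes equally, regardless of its frequency.
    return sum(scores) / len(scores)
```

Because every class weighs equally, rare punctuation marks (e.g. question marks) affect the score as much as frequent ones (periods, commas), which is why macro F1 is the usual choice for this imbalanced task.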
Entity-oriented opinion mining for Hungarian (Entitásorientált véleménykinyerés magyar nyelven)
The amount of unstructured data available in digital form is growing steadily, which makes the automated analysis of the polarity of opinions about the entities mentioned in it increasingly important. In this paper, we present an application that extracts fine-grained author attitudes towards personal names, geographic names, and company names from Hungarian texts. We have released both the source code and the solution in virtualized form.
The Role of Interpretable Patterns in Deep Learning for Morphology
We examine the role of character patterns in three tasks: morphological analysis, lemmatization, and copying. We use a modified version of the standard sequence-to-sequence model, where the encoder is a pattern-matching network. Each pattern scores all possible N-character-long subwords (substrings) on the source side, and the highest-scoring subword's score is used to initialize the decoder as well as the input to the attention mechanism. This method allows learning which subwords of the input are important for generating the output. By training the models on the same source but different targets, we can compare which subwords are important for different tasks and how they relate to each other. We define a similarity metric, a generalized form of the Jaccard similarity, and assign a similarity score to each pair of the three tasks that work on the same source but may differ in target. We examine how these three tasks are related to each other in 12 languages. Our code is publicly available.
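The abstract does not spell out its generalized Jaccard similarity; a common generalization to non-negative weights (here over subword scores) takes the ratio of element-wise minima to element-wise maxima. A minimal sketch under that assumption, with hypothetical weight dictionaries:

```python
def weighted_jaccard(a, b):
    """Weighted (generalized) Jaccard similarity between two dicts
    mapping items (e.g. subwords) to non-negative weights.

    Standard generalization: sum of element-wise minima divided by
    sum of element-wise maxima. The paper's exact variant may differ.
    """
    keys = set(a) | set(b)
    num = sum(min(a.get(k, 0.0), b.get(k, 0.0)) for k in keys)
    den = sum(max(a.get(k, 0.0), b.get(k, 0.0)) for k in keys)
    # Two empty weight vectors are conventionally treated as identical.
    return num / den if den else 1.0
```

With binary weights this reduces to the classic set-based Jaccard index, so it interpolates cleanly between hard subword overlap and soft, score-weighted overlap.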