Search CORE

5 research outputs found

Data Augmentation for Machine Translation via Dependency Subtree Swapping

Author: Barta Botond
Lakatos Dorina Petra
Nagy Attila
Nanys Patrick
Ács Judit
Publication venue
Publication date: 13/07/2023
Field of study

We present a generic framework for data augmentation via dependency subtree swapping that is applicable to machine translation. We extract corresponding subtrees from the dependency parse trees of the source and target sentences and swap these across bisentences to create augmented samples. We perform thorough filtering based on graphbased similarities of the dependency trees and additional heuristics to ensure that extracted subtrees correspond to the same meaning. We conduct resource-constrained experiments on 4 language pairs in both directions using the IWSLT text translation datasets and the Hunglish2 corpus. The results demonstrate consistent improvements in BLEU score over our baseline models in 3 out of 4 language pairs. Our code is available on GitHub

arXiv.org e-Print Archive

Data augmentation for machine translation via dependency subtree swapping

Author: Barta Botond
Lakatos Dorina Petra
Nagy Attila
Nanys Patrick
Ács Judit
Publication venue
Publication date: 01/01/2023
Field of study

University of Szeged

Data Augmentation for Machine Translation via Dependency Subtree Swapping

Author: Barta Botond
Lakatos Dorina Petra
Nagy A
Nanys P
Ács Judit
Publication venue: 'SZTE Hungarian Scientific Society of the Silicate Industry'
Publication date: 01/01/2023
Field of study

SZTAKI Publication Repository

HunSum-1: an Abstractive Summarization Dataset for Hungarian

Author: Barta Botond
Lakatos Dorina Petra
Nagy A
Nyist M K
Ács Judit
Publication venue: 'SZTE Hungarian Scientific Society of the Silicate Industry'
Publication date: 01/01/2023
Field of study

SZTAKI Publication Repository

Bírósági határozatok automatikus mondatszegmentálásának hatékonyságmérése

Author: Csányi Gergely
Fülöp Anna
Lakatos Dorina Petra
Megyeri Andrea
Nagy Dániel
Vadász János Pál
Vági Renátó
Üveges István
Publication venue: Universití of Szeged
Publication date: 01/01/2024
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications