547 research outputs found

From Document Retrieval to Question Answering

Author: Monz C.
Publication venue: ILLC
Publication date: 01/01/2003

CiteSeerX

University of Twente Research Information

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Improving Statistical Machine Translation Performance by Oracle-BLEU Model Re-estimation

Authors: Dakwale P., Monz C.
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2016

International Migration, Integration and Social Cohesion online publications

Back-Translation Sampling by Targeting Difficult Words in Neural Machine Translation

Authors: Fadaee M., Monz C.
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2018

International Migration, Integration and Social Cohesion online publications

Power-Law Distributions for Paraphrases Extracted from Bilingual Corpora

Authors: Martzoukos S., Monz C.
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2012

International Migration, Integration and Social Cohesion online publications

UvA-DARE

NonFactS: NonFactual Summary Generation for Factuality Evaluation in Document Summarization

Authors: Monz C., Soleimani A., Worring M.
Publication date: 01/01/2023

Pre-trained abstractive summarization models can generate fluent summaries and achieve high ROUGE scores. Previous research has found that these models often generate summaries that are inconsistent with their context document and contain nonfactual information. To evaluate factuality in document summarization, a document-level Natural Language Inference (NLI) classifier can be used. However, training such a classifier requires large-scale high-quality factual and nonfactual samples. To that end, we introduce NonFactS, a data generation model, to synthesize nonfactual summaries given a context document and a human-annotated (reference) factual summary. Compared to previous methods, our nonfactual samples are more abstractive and more similar to their corresponding factual samples, resulting in state-of-the-art performance on two factuality evaluation benchmarks, FALSESUM and SUMMAC. Our experiments demonstrate that even without human-annotated summaries, NonFactS can use random sentences to generate nonfactual summaries, and a classifier trained on these samples generalizes to out-of-domain documents.

International Migration, Integration and Social Cohesion online publications

UvA-DARE

NLQuAD: A Non-Factoid Long Question Answering Data Set

Authors: Monz C., Soleimani A., Worring M.
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2021

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Joint Dropout: Improving Generalizability in Low-Resource Neural Machine Translation through Phrase Pair Variables

Authors: Araabi A., Monz C., Niculae V.
Publication venue: Asia-Pacific Association for Machine Translation
Publication date: 01/01/2023

Despite the tremendous success of Neural Machine Translation (NMT), its performance on low-resource language pairs remains subpar, partly due to the limited ability to handle previously unseen inputs, i.e., generalization. In this paper, we propose a method called Joint Dropout that addresses the challenge of low-resource neural machine translation by substituting phrases with variables, resulting in a significant enhancement of compositionality, which is a key aspect of generalization. We observe a substantial improvement in translation quality for language pairs with minimal resources, as seen in BLEU and Direct Assessment scores. Furthermore, we conduct an error analysis and find that Joint Dropout also enhances the generalizability of low-resource NMT in terms of robustness and adaptability across different domains.

International Migration, Integration and Social Cohesion online publications
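
The phrase-to-variable substitution described in the Joint Dropout abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function name `substitute_phrase_pairs`, the `drop_prob` parameter, the `X1`, `X2` variable naming, and the premise that aligned, non-overlapping phrase pairs are already available as token spans.

```python
import random
from typing import List, Optional, Tuple

Span = Tuple[int, int]  # half-open [start, end) token indices (assumed format)


def substitute_phrase_pairs(
    src: List[str],
    tgt: List[str],
    phrase_pairs: List[Tuple[Span, Span]],
    drop_prob: float = 0.5,
    rng: Optional[random.Random] = None,
) -> Tuple[List[str], List[str]]:
    """Replace randomly chosen aligned phrase pairs with shared variable tokens.

    Hypothetical sketch: assumes the spans do not overlap on either side.
    """
    rng = rng or random.Random()
    # Keep each aligned pair with probability drop_prob.
    chosen = [pair for pair in phrase_pairs if rng.random() < drop_prob]
    # Number the variables by their order of appearance in the source.
    chosen.sort(key=lambda pair: pair[0][0])

    src_out, tgt_out = list(src), list(tgt)
    src_edits = [(s, f"X{k}") for k, (s, _) in enumerate(chosen, 1)]
    tgt_edits = [(t, f"X{k}") for k, (_, t) in enumerate(chosen, 1)]
    for edits, out in ((src_edits, src_out), (tgt_edits, tgt_out)):
        # Replace right to left so earlier indices stay valid.
        for (start, end), var in sorted(edits, key=lambda e: e[0][0], reverse=True):
            out[start:end] = [var]
    return src_out, tgt_out
```

For instance, with `drop_prob=1.0` the pair covering "black cat" and "chat noir" is replaced by the same variable token on both sides, so the model sees the sentence frame without the specific phrase.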

BERT for Evidence Retrieval and Claim Verification

Authors: Monz C., Soleimani A., Worring M.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2020

International Migration, Integration and Social Cohesion online publications

Examining the Tip of the Iceberg: A Data Set for Idiom Translation

Authors: Bisazza A., Fadaee M., Monz C.
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2018

International Migration, Integration and Social Cohesion online publications

Aligning Predictive Uncertainty with Clarification Questions in Grounded Dialog

Authors: Manggala P., Monz C., Naszádi K.
Publication date: 01/01/2023

Asking for clarification is fundamental to effective collaboration. An interactive artificial agent must know when to ask a human instructor for more information in order to ascertain their goals. Previous work bases the timing of questions on supervised models learned from interactions between humans. Instead of a supervised classification task, we wish to ground the need for questions in the acting agent’s predictive uncertainty. In this work, we investigate whether ambiguous linguistic instructions can be aligned with uncertainty in neural models. We train an agent using the T5 encoder-decoder architecture to solve the Minecraft Collaborative Building Task and identify uncertainty metrics that achieve better distributional separation between clear and ambiguous instructions. We further show that well-calibrated prediction probabilities benefit the detection of ambiguous instructions. Lastly, we provide a novel empirical analysis on the relationship between uncertainty and dialog history length and highlight an important property that poses a difficulty for detection.

International Migration, Integration and Social Cohesion online publications

UvA-DARE
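
The idea in the grounded-dialog abstract above, triggering a clarification question from the agent's predictive uncertainty, can be sketched as follows. The abstract does not name the metrics it evaluates, so mean predictive entropy is used here purely as an assumed example; the function names and the `threshold` value are hypothetical.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one step's logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy in nats; zero-probability outcomes contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def sequence_uncertainty(step_logits: Sequence[Sequence[float]]) -> float:
    """Mean per-step predictive entropy over a decoded sequence."""
    return sum(entropy(softmax(l)) for l in step_logits) / len(step_logits)


def should_ask(step_logits: Sequence[Sequence[float]], threshold: float = 0.5) -> bool:
    """Ask for clarification when uncertainty exceeds a (hypothetical) threshold."""
    return sequence_uncertainty(step_logits) > threshold
```

Under this sketch, a sharply peaked distribution such as logits `[10, 0, 0]` yields low entropy and no question, while a uniform distribution yields the maximum entropy for its size and triggers one. In practice the threshold would need tuning on held-out clear and ambiguous instructions.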