15 research outputs found

NLP for writing: What has changed?

Author: De Smedt Koenraad
Publication date: 17/02/2009

Proceedings of the Workshop on NLP for Reading and Writing – Resources, Algorithms and Tools (SLTC 2008). Editors: Rickard Domeij, Sofie Johansson Kokkinakis, Ola Knutsson and Sylvana Sofkova Hashemi. NEALT Proceedings Series, Vol. 3 (2009), 1-11. © 2009 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/4116

DSpace at Tartu University Library

Part of Speech Tagging for Text Clustering in Swedish

Author: Rosell Magnus
Publication date: 13/05/2009

Proceedings of the 17th Nordic Conference of Computational Linguistics NODALIDA 2009. Editors: Kristiina Jokinen and Eckhard Bick. NEALT Proceedings Series, Vol. 4 (2009), 150-157. © 2009 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/9206

DSpace at Tartu University Library

Evaluation of really good grammatical error correction

Author: Gillholm Katarina
Kurfalı Murathan
Mattson Marie
Wirén Mats
Östling Robert
Publication date: 17/08/2023

Although rarely stated, in practice, Grammatical Error Correction (GEC) encompasses various models with distinct objectives, ranging from grammatical error detection to improving fluency. Traditional evaluation methods fail to capture the full range of system capabilities and objectives. Reference-based evaluations are limited in capturing the wide variety of possible corrections, are subject to biases introduced during reference creation, and are prone to favor fixing local errors over overall text improvement. The emergence of large language models (LLMs) has further highlighted the shortcomings of these evaluation strategies, emphasizing the need for a paradigm shift in evaluation methodology. In the current study, we perform a comprehensive evaluation of various GEC systems using a recently published dataset of Swedish learner texts. The evaluation is performed using established evaluation metrics as well as human judges. We find that GPT-3 in a few-shot setting by far outperforms previous grammatical error correction systems for Swedish, a language comprising only 0.11% of its training data. We also find that current evaluation methods contain undesirable biases that a human evaluation is able to reveal. We suggest using human post-editing of GEC system outputs to analyze the amount of change required to reach native-level human performance on the task, and provide a dataset annotated with human post-edits and assessments of grammaticality, fluency and meaning preservation of GEC system outputs.

arXiv.org e-Print Archive

Linguistically Fuelled Text Similarity

Author: Andrist Björn
Hassel Martin
Publication date: 23/05/2007

Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit. University of Tartu, Tartu, 2007. ISBN 978-9985-4-0513-0 (online) ISBN 978-9985-4-0514-7 (CD-ROM) pp. 207-211

DSpace at Tartu University Library

Recreating Humorous Split Compound Errors in Swedish by Using Grammaticality

Author: Araki Kenji
Sjöbergh Jonas
Publication date: 23/05/2007

CiteSeerX

DSpace at Tartu University Library

Developing and Evaluating a Searchable Swedish-Thai Lexicon

Author: Khanaraksombat Wanwisa
Sjöbergh Jonas
Publication date: 23/05/2007

CiteSeerX

DSpace at Tartu University Library

Erroreak automatikoki detektatzeko tekniken azterlana eta euskararentzako aplikazioak

Author: Díaz de Ilarraza Sánchez Arantza
Gojenola Galletebeitia Koldobika
Oronoz Anchordoqui Maite
Publication venue: Servicio Editorial de la Universidad del País Vasco/Euskal Herriko Unibertsitatearen Argitalpen Zerbitzua
Publication date: 01/01/2009

In this article, we study the techniques used for detecting errors in Natural Language Processing (NLP). We classify the techniques according to their approach (symbolic or empirical), and then we describe them in depth. Following that, we describe the systems we have developed for detecting syntactic errors in Basque, classifying those systems by the technique they use and illustrating them with examples.

Archivo Digital para la Docencia y la Investigación

Universidad del País Vasco / Euskal Herriko Unibertsitatea: Ciencia - Portal de revistas digitales de la UPV/EHU

Nodalida 2005 - proceedings of the 15th NODALIDA conference

Publication venue: University of Joensuu

UEF Electronic Publications

Proceedings (all articles)

Author: Domeij Rickard
Johansson Kokkinakis Sofie
Knutsson Ola
Sofkova Hashemi Sylvana
Publication date: 17/02/2009

Proceedings of the Workshop on NLP for Reading and Writing – Resources, Algorithms and Tools (SLTC 2008). Editors: Rickard Domeij, Sofie Johansson Kokkinakis, Ola Knutsson and Sylvana Sofkova Hashemi. NEALT Proceedings Series, Vol. 3 (2009), v+23 pp. © 2009 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/4116

DSpace at Tartu University Library

Proceedings of the 24th Scandinavian Conference of Linguistics

Author: Anttikoski Esa
Tirkkonen Jani-Matti
Publication venue: University of Eastern Finland

UEF Electronic Publications