Search CORE

4,295 research outputs found

From Task Classification Towards Similarity Measures for Recommendation in Crowdsourcing Systems

Author: Neitzel Svenja
Rensing Christoph
Schnitzer Steffen
Publication venue
Publication date: 01/01/2017
Field of study

Task selection in micro-task markets can be supported by recommender systems to help individuals to find appropriate tasks. Previous work showed that for the selection process of a micro-task the semantic aspects, such as the required action and the comprehensibility, are rated more important than factual aspects, such as the payment or the required completion time. This work gives a foundation to create such similarity measures. Therefore, we show that an automatic classification based on task descriptions is possible. Additionally, we propose similarity measures to cluster micro-tasks according to semantic aspects.Comment: Work in Progress Paper at HCOMP 201

arXiv.org e-Print Archive

TUbiblio

Comprehension and trust in crises: investigating the impact of machine translation and post-editing

Author: Cadwell Patrick
O'Brien Sharon
Rossetti Alessandra
Publication venue: European Association for Machine Translation
Publication date: 01/01/2020
Field of study

We conducted a survey to understand the impact of machine translation and postediting awareness on comprehension of and trust in messages disseminated to prepare the public for a weather-related crisis, i.e. flooding. The translation direction was English–Italian. Sixty-one participants—all native Italian speakers with different English proficiency levels— answered our survey. Each participant read and evaluated between three and six crisis messages using ratings and openended questions on comprehensibility and trust. The messages were in English and Italian. All the Italian messages had been machine translated and post-edited. Nevertheless, participants were told that only half had been post-edited, so that we could test the impact of post-editing awareness. We could not draw firm conclusions when comparing the scores for trust and comprehensibility assigned to the three types of messages—English, post-edits, and purported raw outputs. However, when scores were triangulated with open-ended answers, stronger patterns were observed, such as the impact of fluency of the translations on their comprehensibility and trustworthiness. We found correlations between comprehensibility and trustworthiness, and identified other factors influencing these aspects, such as the clarity and soundness of the messages. We conclude by outlining implications for crisis preparedness, limitations, and areas for future research

Irish Universities

Institutional Repository Universiteit Antwerpen

DCU Online Research Access Service

Text Simplification for Scientific Information Access:CLEF 2021 SimpleText Workshop

Author: Bellot P.
Braslavski P.
Ermakova L.
Kamps J.
Mothe J.
Nurbakova D.
Ovchinnikova I.
San-Juan E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

Community-based post-editing of machine-translated content: monolingual vs. bilingual

Author: Mitchell Linda
O'Brien Sharon
Roturier Johann
Publication venue: European Association for Machine Translation
Publication date: 01/01/2013
Field of study

We carried out a machine-translation postediting pilot study with users of an IT support forum community. For both language pairs (English to German, English to French), 4 native speakers for each language were recruited. They performed monolingual and bilingual postediting tasks on machine-translated forum content. The post-edited content was evaluated using human evaluation (fluency, comprehensibility, fidelity). We found that monolingual post-editing can lead to improved fluency and comprehensibility scores similar to those achieved through bilingual post-editing, while we found that fidelity improved considerably more for the bilingual set-up. Furthermore, the performance across post-editors varied greatly and it was found that some post-editors are able to produce better quality in a monolingual set-up than others

Irish Universities

DCU Online Research Access Service

Google Translate Error Analysis for Mental Healthcare Information: Evaluating Accuracy, Comprehensibility, and Implications for Multilingual Healthcare Communication

Author: Braun Sabine
Delfani Jaleh
Kanojia Diptesh
Orasan Constantin
Saadany Hadeel
Schouten Barbara
Taylor-Stilgoe Eleanor
Temizoz Ozlem
Publication venue
Publication date: 06/02/2024
Field of study

This study explores the use of Google Translate (GT) for translating mental healthcare (MHealth) information and evaluates its accuracy, comprehensibility, and implications for multilingual healthcare communication through analysing GT output in the MHealth domain from English to Persian, Arabic, Turkish, Romanian, and Spanish. Two datasets comprising MHealth information from the UK National Health Service website and information leaflets from The Royal College of Psychiatrists were used. Native speakers of the target languages manually assessed the GT translations, focusing on medical terminology accuracy, comprehensibility, and critical syntactic/semantic errors. GT output analysis revealed challenges in accurately translating medical terminology, particularly in Arabic, Romanian, and Persian. Fluency issues were prevalent across various languages, affecting comprehension, mainly in Arabic and Spanish. Critical errors arose in specific contexts, such as bullet-point formatting, specifically in Persian, Turkish, and Romanian. Although improvements are seen in longer-text translations, there remains a need to enhance accuracy in medical and mental health terminology and fluency, whilst also addressing formatting issues for a more seamless user experience. The findings highlight the need to use customised translation engines for Mhealth translation and the challenges when relying solely on machine-translated medical content, emphasising the crucial role of human reviewers in multilingual healthcare communication

arXiv.org e-Print Archive

Easy-to-read Meets Accessible Web in the E-government Context

Author: Geest Thea van der
Velleman Eric
Publication venue: Elsevier
Publication date: 01/01/2014
Field of study

In the e-government context, content of information and service systems needs to be accessible and easy-to-read. E-government systems are increasingly self-service systems. If the content of these systems is incomprehensible, citizens are not able to exercise their rights or fulfill their duties. Comprehensibility, however, is more than just providing text that is easy to read. The ease-of-understanding of a text is a result of the interplay between content characteristics, reader characteristics and task/context characteristics, as is the case for usability. This multi-faceted form of accessibility cannot be assessed and evaluated with just the existing easy-to-read guidelines. Measuring ease-of-understanding, which is a legal requirement for e-government systems and other public services, requires a process-oriented approach besides the currently available product-oriented easy-to-read guidelines

Elsevier - Publisher Connector

Crossref

University of Twente Research Information

Overview of SimpleText CLEF 2021 workshop and pilot tasks

Author: Bellot P.
Braslavski P.
Ermakova L.
Kamps J.
Mothe J.
Nurbakova D.
Ovchinnikova I.
San-Juan E.
Publication venue: CEUR-WS
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

Understanding Website Privacy Policies—A Longitudinal Analysis Using Natural Language Processing

Author: Belcheva Veronika
Ermakova Tatiana
Fabian Benjamin
Publication venue: Humboldt-Universität zu Berlin
Publication date: 19/11/2023
Field of study

Privacy policies are the main method for informing Internet users of how their data are collected and shared. This study aims to analyze the deficiencies of privacy policies in terms of readability, vague statements, and the use of pacifying phrases concerning privacy. This represents the undertaking of a step forward in the literature on this topic through a comprehensive analysis encompassing both time and website coverage. It characterizes trends across website categories, top-level domains, and popularity ranks. Furthermore, studying the development in the context of the General Data Protection Regulation (GDPR) offers insights into the impact of regulations on policy comprehensibility. The findings reveal a concerning trend: privacy policies have grown longer and more ambiguous, making it challenging for users to comprehend them. Notably, there is an increased proportion of vague statements, while clear statements have seen a decrease. Despite this, the study highlights a steady rise in the inclusion of reassuring statements aimed at alleviating readers’ privacy concerns.Peer Reviewe

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin

Overview of SimpleText CLEF 2021 workshop and pilot tasks

Author: Bellot P.
Braslavski P.
Ermakova L.
Kamps J.
Mothe J.
Nurbakova D.
Ovchinnikova I.
San-Juan E.
Publication venue: CEUR-WS
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications

Overview of SimpleText 2021 - CLEF Workshop on Text Simplification for Scientific Information Access

Author: Bellot P.
Braslavski P.
Ermakova L.
Kamps J.
Mothe J.
Nurbakova D.
Ovchinnikova I.
SanJuan E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications