4,295 research outputs found
From Task Classification Towards Similarity Measures for Recommendation in Crowdsourcing Systems
Task selection in micro-task markets can be supported by recommender systems
to help individuals to find appropriate tasks. Previous work showed that for
the selection process of a micro-task the semantic aspects, such as the
required action and the comprehensibility, are rated more important than
factual aspects, such as the payment or the required completion time. This work
gives a foundation to create such similarity measures. Therefore, we show that
an automatic classification based on task descriptions is possible.
Additionally, we propose similarity measures to cluster micro-tasks according
to semantic aspects.Comment: Work in Progress Paper at HCOMP 201
Comprehension and trust in crises: investigating the impact of machine translation and post-editing
We conducted a survey to understand the impact of machine translation and postediting awareness on comprehension of and trust in messages disseminated to prepare the public for a weather-related crisis, i.e. flooding. The translation direction was EnglishâItalian. Sixty-one participantsâall native Italian speakers with different English proficiency levelsâ answered our survey. Each participant read and evaluated between three and six crisis messages using ratings and openended questions on comprehensibility and trust. The messages were in English
and Italian. All the Italian messages had been machine translated and post-edited.
Nevertheless, participants were told that only half had been post-edited, so that we could test the impact of post-editing awareness. We could not draw firm conclusions when comparing the scores for trust and comprehensibility assigned to the three types of messagesâEnglish,
post-edits, and purported raw outputs.
However, when scores were triangulated with open-ended answers, stronger patterns were observed, such as the impact of fluency of the translations on their comprehensibility and trustworthiness.
We found correlations between comprehensibility and trustworthiness, and identified other factors influencing these aspects, such as the clarity and soundness of the messages. We conclude by outlining implications for crisis preparedness, limitations, and areas for future research
Community-based post-editing of machine-translated content: monolingual vs. bilingual
We carried out a machine-translation postediting pilot study with users of an IT support forum community. For both language pairs (English to German, English to French), 4 native speakers for each language were recruited. They performed monolingual and bilingual postediting tasks on machine-translated forum content. The post-edited content was evaluated using human evaluation (fluency, comprehensibility, fidelity). We found that monolingual post-editing can lead to improved fluency and comprehensibility scores similar to those achieved through bilingual post-editing, while we found that fidelity improved considerably more for the bilingual set-up. Furthermore, the performance across post-editors varied greatly and it was found that some post-editors are able to produce better quality in a monolingual set-up than others
Google Translate Error Analysis for Mental Healthcare Information: Evaluating Accuracy, Comprehensibility, and Implications for Multilingual Healthcare Communication
This study explores the use of Google Translate (GT) for translating mental
healthcare (MHealth) information and evaluates its accuracy, comprehensibility,
and implications for multilingual healthcare communication through analysing GT
output in the MHealth domain from English to Persian, Arabic, Turkish,
Romanian, and Spanish. Two datasets comprising MHealth information from the UK
National Health Service website and information leaflets from The Royal College
of Psychiatrists were used. Native speakers of the target languages manually
assessed the GT translations, focusing on medical terminology accuracy,
comprehensibility, and critical syntactic/semantic errors. GT output analysis
revealed challenges in accurately translating medical terminology, particularly
in Arabic, Romanian, and Persian. Fluency issues were prevalent across various
languages, affecting comprehension, mainly in Arabic and Spanish. Critical
errors arose in specific contexts, such as bullet-point formatting,
specifically in Persian, Turkish, and Romanian. Although improvements are seen
in longer-text translations, there remains a need to enhance accuracy in
medical and mental health terminology and fluency, whilst also addressing
formatting issues for a more seamless user experience. The findings highlight
the need to use customised translation engines for Mhealth translation and the
challenges when relying solely on machine-translated medical content,
emphasising the crucial role of human reviewers in multilingual healthcare
communication
Easy-to-read Meets Accessible Web in the E-government Context
In the e-government context, content of information and service systems needs to be accessible and easy-to-read. E-government systems are increasingly self-service systems. If the content of these systems is incomprehensible, citizens are not able to exercise their rights or fulfill their duties. Comprehensibility, however, is more than just providing text that is easy to read. The ease-of-understanding of a text is a result of the interplay between content characteristics, reader characteristics and task/context characteristics, as is the case for usability. This multi-faceted form of accessibility cannot be assessed and evaluated with just the existing easy-to-read guidelines. Measuring ease-of-understanding, which is a legal requirement for e-government systems and other public services, requires a process-oriented approach besides the currently available product-oriented easy-to-read guidelines
Understanding Website Privacy PoliciesâA Longitudinal Analysis Using Natural Language Processing
Privacy policies are the main method for informing Internet users of how their data are collected and shared. This study aims to analyze the deficiencies of privacy policies in terms of readability, vague statements, and the use of pacifying phrases concerning privacy. This represents the undertaking of a step forward in the literature on this topic through a comprehensive analysis encompassing both time and website coverage. It characterizes trends across website categories, top-level domains, and popularity ranks. Furthermore, studying the development in the context of the General Data Protection Regulation (GDPR) offers insights into the impact of regulations on policy comprehensibility. The findings reveal a concerning trend: privacy policies have grown longer and more ambiguous, making it challenging for users to comprehend them. Notably, there is an increased proportion of vague statements, while clear statements have seen a decrease. Despite this, the study highlights a steady rise in the inclusion of reassuring statements aimed at alleviating readersâ privacy concerns.Peer Reviewe
Overview of SimpleText 2021 - CLEF Workshop on Text Simplification for Scientific Information Access
- âŠ