25,960 research outputs found

    Counter-intuitive moral judgement following traumatic brain injury

    Get PDF
    Several neurological patient populations, including Traumatic Brain Injury (TBI), appear to produce an abnormally ‘utilitarian’ pattern of judgements to moral dilemmas; they tend to make judgements that maximise the welfare of the majority, rather than deontological judgements based on the following of moral rules (e.g., do not harm others). However, this patient research has always used extreme dilemmas with highly valued moral rules (e.g., do not kill). Data from healthy participants, however, suggests that when a wider range of dilemmas are employed, involving less valued moral rules (e.g., do not lie), moral judgements demonstrate sensitivity to the psychological intuitiveness of the judgements, rather than their deontological or utilitarian content (Kahane et al., 2011). We sought the moral judgements of 30 TBI participants and 30 controls on moral dilemmas where content (utilitarian/deontological) and intuition (intuitive/counterintuitive) were measured concurrently. Overall TBI participants made utilitarian judgements in equal proportions to controls; disproportionately favouring utilitarian judgements only when they were counterintuitive, and deontological judgements only when they were counterintuitive. These results speak against the view that TBI causes a specific utilitarian bias, suggesting instead that moral intuition is broadly disrupted following TBI

    Assessing ELLs in New Zealand primary schools: Gaps between the literature, policy, and practice

    Get PDF
    In February 2015, all New Zealand schools moved to assessing English Language Learners (ELLs) using the English Language Learning Progressions (ELLP) to determine eligibility for additional funding to support these learners. This paper firstly provides the background to the current assessment situation, and summarises the literature regarding key principles of assessment. It then describes key guidelines made available to schools by the Ministry of Education for using the new assessment system, particularly the use of Overall Teacher Judgements (OTJs). The paper then presents findings from interviews with three primary school English language specialist teachers regarding their experiences with using the new system, known as ‘ELLP assessment’. The gaps that exist between the literature, Ministry guidelines, and ESOL teacher practice are described, and recommendations are made for bridging these gaps. Currently little is known regarding teacher practice in regard to ELLP assessment, so this study fills a gap in the literature relating to the assessment of young ELLs in the New Zealand context

    RankME: Reliable Human Ratings for Natural Language Generation

    Full text link
    Human evaluation for natural language generation (NLG) often suffers from inconsistent user ratings. While previous research tends to attribute this problem to individual user preferences, we show that the quality of human judgements can also be improved by experimental design. We present a novel rank-based magnitude estimation method (RankME), which combines the use of continuous scales and relative assessments. We show that RankME significantly improves the reliability and consistency of human ratings compared to traditional evaluation methods. In addition, we show that it is possible to evaluate NLG systems according to multiple, distinct criteria, which is important for error analysis. Finally, we demonstrate that RankME, in combination with Bayesian estimation of system quality, is a cost-effective alternative for ranking multiple NLG systems.Comment: Accepted to NAACL 2018 (The 2018 Conference of the North American Chapter of the Association for Computational Linguistics

    Multi-criteria analysis: a manual

    Get PDF

    Closing the loop: assisting archival appraisal and information retrieval in one sweep

    Get PDF
    In this article, we examine the similarities between the concept of appraisal, a process that takes place within the archives, and the concept of relevance judgement, a process fundamental to the evaluation of information retrieval systems. More specifically, we revisit selection criteria proposed as result of archival research, and work within the digital curation communities, and, compare them to relevance criteria as discussed within information retrieval's literature based discovery. We illustrate how closely these criteria relate to each other and discuss how understanding the relationships between the these disciplines could form a basis for proposing automated selection for archival processes and initiating multi-objective learning with respect to information retrieval

    Assessment and reporting arrangements: Early years foundation stage, 2014

    Get PDF

    Creating a test collection to evaluate diversity in image retrieval

    Get PDF
    This paper describes the adaptation of an existing test collection for image retrieval to enable diversity in the results set to be measured. Previous research has shown that a more diverse set of results often satisfies the needs of more users better than standard document rankings. To enable diversity to be quantified, it is necessary to classify images relevant to a given theme to one or more sub-topics or clusters. We describe the challenges in building (as far as we are aware) the first test collection for evaluating diversity in image retrieval. This includes selecting appropriate topics, creating sub-topics, and quantifying the overall effectiveness of a retrieval system. A total of 39 topics were augmented for cluster-based relevance and we also provide an initial analysis of assessor agreement for grouping relevant images into sub-topics or clusters

    Understanding customers' holistic perception of switches in automotive human–machine interfaces

    Get PDF
    For successful new product development, it is necessary to understand the customers' holistic experience of the product beyond traditional task completion, and acceptance measures. This paper describes research in which ninety-eight UK owners of luxury saloons assessed the feel of push-switches in five luxury saloon cars both in context (in-car) and out of context (on a bench). A combination of hedonic data (i.e. a measure of ‘liking’), qualitative data and semantic differential data was collected. It was found that customers are clearly able to differentiate between switches based on the degree of liking for the samples' perceived haptic qualities, and that the assessment environment had a statistically significant effect, but that it was not universal. A factor analysis has shown that perceived characteristics of switch haptics can be explained by three independent factors defined as ‘Image’, ‘Build Quality’, and ‘Clickiness’. Preliminary steps have also been taken towards identifying whether existing theoretical frameworks for user experience may be applicable to automotive human–machine interfaces
    • …
    corecore