27 research outputs found

    ZARA: Improving Few-Shot Self-Rationalization for Small Language Models

    Full text link
    Language models (LMs) that jointly generate end-task answers as well as free-text rationales are known as self-rationalization models. Recent works demonstrate great performance gain for self-rationalization by few-shot prompting LMs with rationale-augmented exemplars. However, the ability to benefit from explanations only emerges with large-scale LMs, which have poor accessibility. In this work, we explore the less-studied setting of leveraging explanations for small LMs to improve few-shot self-rationalization. We first revisit the relationship between rationales and answers. Inspired by the implicit mental process of how human beings assess explanations, we present a novel approach, Zero-shot Augmentation of Rationale-Answer pairs (ZARA), to automatically construct pseudo-parallel data for self-training by reducing the problem of plausibility judgement to natural language inference. Experimental results show ZARA achieves SOTA performance on the FEB benchmark, for both the task accuracy and the explanation metric. In addition, we conduct human and quantitative evaluation validating ZARA's ability to automatically identify plausible and accurate rationale-answer pairs.Comment: Accepted as a long paper at EMNLP Findings 202

    Wizundry: A Cooperative Wizard of Oz Platform for Simulating Future Speech-based Interfaces with Multiple Wizards

    Full text link
    Wizard of Oz (WoZ) as a prototyping method has been used to simulate intelligent user interfaces, particularly for speech-based systems. However, as our societies' expectations on artificial intelligence (AI) grows, the question remains whether a single Wizard is sufficient for it to simulate smarter systems and more complex interactions. Optimistic visions of 'what artificial intelligence (AI) can do' places demands on WoZ platforms to simulate smarter systems and more complex interactions. This raises the question of whether the typical approach of employing a single Wizard is sufficient. Moreover, while existing work has employed multiple Wizards in WoZ studies, a multi-Wizard approach has not been systematically studied in terms of feasibility, effectiveness, and challenges. We offer Wizundry, a real-time, web-based WoZ platform that allows multiple Wizards to collaboratively operate a speech-to-text based system remotely. We outline the design and technical specifications of our open-source platform, which we iterated over two design phases. We report on two studies in which participant-Wizards were tasked with negotiating how to cooperatively simulate an interface that can handle natural speech for dictation and text editing as well as other intelligent text processing tasks. We offer qualitative findings on the Multi-Wizard experience for Dyads and Triads of Wizards. Our findings reveal the promises and challenges of the multi-Wizard approach and open up new research questions.Comment: 34 page

    Extremity Exercise Program in Breast Cancer Survivors Suffering from Chemotherapy-Induced Peripheral Neuropathy: A Feasibility Pilot Study

    Get PDF
    Objectives: To evaluate the feasibility of implementation of an extremity exercise program and to examine its preliminary effects in breast cancer survivors suffering from chemotherapy-induced peripheral neuropathy (CIPN). Sample & Setting: Thirteen breast cancer survivors from one hospital in northern Taiwan. Methods and Variables: A single group with repeated measures, and a quasi-experimental design. The intervention program was a four week, home-based extremity exercise program that was comprised of 10 skilled hand exercises and Buerger-Allen exercises. The Total Neuropathy Scale (clinical version), Functional Assessment of Cancer Therapy/Gynecologic Oncology Group, Neurotoxicity (13-Item Version), Identification Pain Questionnaire, and pain Visual Analogue Scale were used to measure CIPN before exercise (T1), during (T2~T4), and after exercise (T5). Qualitative data were also collected at each time point. Data were analyzed by using descriptive statistics, generalized estimating equations, and directed content analysis. Results: None of the participants reported adverse events during the study period. The extremity exercise program significantly improved patient-reported CIPN after intervention at T4 or T5 but was insignificant on clinician-assessed CIPN. The qualitative data of participant experience indicated that this program is feasible and easy to follow. Conclusion: The extremity exercise program is feasible but needs to increase the sample size and prolong the intervention period for confirmation

    Unanswerable Question Correction in Question Answering over Personal Knowledge Base

    No full text
    People often encounter situations where they need to recall past experiences from their daily life. In this paper, we aim to construct a question answering system that enables human to query their past experiences over personal knowledge base. Previous works on knowledge base question answering focus on finding answers for answerable questions. In the real world applications, however, people often muddle up facts and ask those questions that cannot be answered with knowledge base. This work presents a novel system consisting of question answering model and question generation model. It not only answers answerable questions, but also corrects unanswerable questions if necessary. Our question answering model recognizes the question that is inconsistent with the state of the personal knowledge base and suggests facts that can form a feasible question. Then, the facts are converted to an answerable question by the question generation model. For refining question, we propose a question generation model based on the reinforcement learning (RL) with question editing mechanism. Experimental results show that our proposed system is effective for correcting unanswerable questions in personal knowledge base question answering

    Characteristics of Traditional Chinese Medicine Use in Pediatric Dislocations, Sprains and Strains

    No full text
    Background and Objectives: Dislocations, sprains and strains are common childhood musculoskeletal injuries, requiring medical attention. We investigated the characteristics associated with using traditional Chinese medicine (TCM) for children suffering from these injuries. Methods: From a nationwide representative insurance database of Taiwan, this cross-sectional study identified 50,769 children with dislocations, sprains and strains under 18 years of age, newly diagnosed between 1999 and 2009, without previous TCM experience. Children who initiated treatment with TCM (n = 24,063, 47.4%) were defined as TCM users, others were in the non-TCM group. Multivariable logistic regression models estimated odds ratios (ORs) of TCM use. Results: Girls and children living in central Taiwan (vs. northern) were associated with higher TCM use. The adjusted ORs (95% confidence interval (CI)) of TCM uses were 1.60 (1.42–1.79) for patients of 3–5 years, 2.20 (1.99–2.42) of 6–12 years and 1.82 (1.64–2.01) of 13–17 years, compared with those of the <2 years group. TCM users were less likely to have outpatient visits for Western medicine care and hospitalizations in the previous year. The TCM group was nearly twice more likely than the non-user group to receive treatments at local clinics (99.1% vs. 53.3%, p < 0.001). Conclusions: This study reveals important demographic and medical factors associated with TCM uses for children with dislocations, sprains and strains. Interestingly, local clinics are the main healthcare facilities providing TCM services. Further studies are needed to evaluate the outcomes of TCM treatment for these musculoskeletal injuries

    Extremity Exercise Program in Breast Cancer Survivors Suffering from Chemotherapy-Induced Peripheral Neuropathy: A Feasibility Pilot Study

    No full text
    Objectives: To evaluate the feasibility of implementation of an extremity exercise program and to examine its preliminary effects in breast cancer survivors suffering from chemotherapy-induced peripheral neuropathy (CIPN). Sample & Setting: Thirteen breast cancer survivors from one hospital in northern Taiwan. Methods and Variables: A single group with repeated measures, and a quasi-experimental design. The intervention program was a four week, home-based extremity exercise program that was comprised of 10 skilled hand exercises and Buerger-Allen exercises. The Total Neuropathy Scale (clinical version), Functional Assessment of Cancer Therapy/Gynecologic Oncology Group, Neurotoxicity (13-Item Version), Identification Pain Questionnaire, and pain Visual Analogue Scale were used to measure CIPN before exercise (T1), during (T2~T4), and after exercise (T5). Qualitative data were also collected at each time point. Data were analyzed by using descriptive statistics, generalized estimating equations, and directed content analysis. Results: None of the participants reported adverse events during the study period. The extremity exercise program significantly improved patient-reported CIPN after intervention at T4 or T5 but was insignificant on clinician-assessed CIPN. The qualitative data of participant experience indicated that this program is feasible and easy to follow. Conclusion: The extremity exercise program is feasible but needs to increase the sample size and prolong the intervention period for confirmation

    Design of a Multi-Sensor System for Exploring the Relation between Finger Spasticity and Voluntary Movement in Patients with Stroke

    No full text
    A novel wearable multi-sensor data glove system is developed to explore the relation between finger spasticity and voluntary movement in patients with stroke. Many stroke patients suffer from finger spasticity, which is detrimental to their manual dexterity. Diagnosing and assessing the degrees of spasticity require neurological testing performed by trained professionals to estimate finger spasticity scores via the modified Ashworth scale (MAS). The proposed system offers an objective, quantitative solution to assess the finger spasticity of patients with stroke and complements the manual neurological test. In this work, the hardware and software components of this system are described. By requiring patients to perform five designated tasks, biomechanical measurements including linear and angular speed, acceleration, and pressure at every finger joint and upper limb are recorded, making up more than 1000 features for each task. We conducted a preliminary clinical test with 14 subjects using this system. Statistical analysis is performed on the acquired measurements to identify a small subset of features that are most likely to discriminate a healthy patient from patients suffering from finger spasticity. This encouraging result validates the feasibility of this proposed system to quantitatively and objectively assess finger spasticity
    corecore