40 research outputs found

    RACE: Large-scale ReAding Comprehension Dataset From Examinations

    Full text link
    We present RACE, a new dataset for benchmark evaluation of methods in the reading comprehension task. Collected from the English exams for middle and high school Chinese students in the age range between 12 to 18, RACE consists of near 28,000 passages and near 100,000 questions generated by human experts (English instructors), and covers a variety of topics which are carefully designed for evaluating the students' ability in understanding and reasoning. In particular, the proportion of questions that requires reasoning is much larger in RACE than that in other benchmark datasets for reading comprehension, and there is a significant gap between the performance of the state-of-the-art models (43%) and the ceiling human performance (95%). We hope this new dataset can serve as a valuable resource for research and evaluation in machine comprehension. The dataset is freely available at http://www.cs.cmu.edu/~glai1/data/race/ and the code is available at https://github.com/qizhex/RACE_AR_baselines.Comment: EMNLP 201

    A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

    Full text link
    Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources, and the adaptive training of a chatbot with domain-specific inquiries. Our two-step approach starts from training a knowledge miner, namely LLMiner, which autonomously extracts Question-Answer pairs from relevant documents through a chain-of-thought reasoning process. Subsequently, we blend the mined QA pairs with a conversational dataset to fine-tune the LLM as a chatbot, thereby enriching its domain-specific expertise and conversational capabilities. We also developed a new evaluation benchmark which comprises four domain-specific text corpora and associated human-crafted QA pairs for testing. Our model shows remarkable performance improvement over generally aligned LLM and surpasses domain-adapted models directly fine-tuned on domain corpus. In particular, LLMiner achieves this with minimal human intervention, requiring only 600 seed instances, thereby providing a pathway towards self-improvement of LLMs through model-synthesized training data.Comment: Work in progres

    MCR-ALS-based muscle synergy extraction method combined with LSTM neural network for motion intention detection

    Get PDF
    IntroductionThe time-varying and individual variability of surface electromyographic signals (sEMG) can lead to poorer motor intention detection results from different subjects and longer temporal intervals between training and testing datasets. The consistency of using muscle synergy between the same tasks may be beneficial to improve the detection accuracy over long time ranges. However, the conventional muscle synergy extraction methods, such as non-negative matrix factorization (NMF) and principal component analysis (PCA) have some limitations in the field of motor intention detection, especially in the continuous estimation of upper limb joint angles.MethodsIn this study, we proposed a reliable multivariate curve-resolved-alternating least squares (MCR-ALS) muscle synergy extraction method combined with long-short term memory neural network (LSTM) to estimate continuous elbow joint motion by using the sEMG datasets from different subjects and different days. The pre-processed sEMG signals were then decomposed into muscle synergies by MCR-ALS, NMF and PCA methods, and the decomposed muscle activation matrices were used as sEMG features. The sEMG features and elbow joint angular signals were input to LSTM to establish a neural network model. Finally, the established neural network models were tested by using sEMG dataset from different subjects and different days, and the detection accuracy was measured by correlation coefficient.ResultsThe detection accuracy of elbow joint angle was more than 85% by using the proposed method. This result was significantly higher than the detection accuracies obtained by using NMF and PCA methods. The results showed that the proposed method can improve the accuracy of motor intention detection results from different subjects and different acquisition timepoints.DiscussionThis study successfully improves the robustness of sEMG signals in neural network applications using an innovative muscle synergy extraction method. It contributes to the application of human physiological signals in human-machine interaction

    Hardware-In-the-Loop Simulation System In the Development of Temperature Controller of Blood Glucose Meter

    No full text
    You should leave 8 mm of space above the abstract and 10 mm after the abstract. The heading Abstract should be typed in bold 8,5-point Times. The body of the abstract should be typed in normal 8,5-point Times in a single paragraph, immediately following the heading. The text should be set to 1.15 line spacing. The abstract should be centred across the page, indented 15 mm from the left and right page margins and justified. It should not normally exceed 200 words

    Hardware-In-the-Loop Simulation System In the Development of Temperature Controller of Blood Glucose Meter

    No full text
    You should leave 8 mm of space above the abstract and 10 mm after the abstract. The heading Abstract should be typed in bold 8,5-point Times. The body of the abstract should be typed in normal 8,5 -point Times in a single paragraph, immediately following the heading. The text should be set to 1.15 line spacing. The abstract should be centred across the page, indented 15 mm from the left and right page margins and justified. It should not normally exceed 200 words
    corecore