40 research outputs found

RACE: Large-scale ReAding Comprehension Dataset From Examinations

Author: Hovy Eduard
Lai Guokun
Liu Hanxiao
Xie Qizhe
Yang Yiming
Publication date: 01/01/2017

We present RACE, a new dataset for benchmark evaluation of methods in the reading comprehension task. Collected from English exams for Chinese middle and high school students aged 12 to 18, RACE consists of nearly 28,000 passages and nearly 100,000 questions generated by human experts (English instructors), and covers a variety of topics carefully designed to evaluate the students' ability in understanding and reasoning. In particular, the proportion of questions that require reasoning is much larger in RACE than in other benchmark datasets for reading comprehension, and there is a significant gap between the performance of the state-of-the-art models (43%) and the ceiling human performance (95%). We hope this new dataset can serve as a valuable resource for research and evaluation in machine comprehension. The dataset is freely available at http://www.cs.cmu.edu/~glai1/data/race/ and the code is available at https://github.com/qizhex/RACE_AR_baselines. Comment: EMNLP 201

arXiv.org e-Print Archive

Crossref

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Author: Ai Fangzhou
Fan Zhen
Gao Luyu
Lai Guokun
Yang Hongxia
Yang Yiming
Zhang Ruohong
Zhang Zheng
Zheng Chen
Publication date: 17/11/2023

Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources and adaptively training a chatbot with domain-specific inquiries. Our two-step approach starts by training a knowledge miner, namely LLMiner, which autonomously extracts Question-Answer pairs from relevant documents through a chain-of-thought reasoning process. Subsequently, we blend the mined QA pairs with a conversational dataset to fine-tune the LLM as a chatbot, thereby enriching its domain-specific expertise and conversational capabilities. We also developed a new evaluation benchmark comprising four domain-specific text corpora and associated human-crafted QA pairs for testing. Our model shows remarkable performance improvement over a generally aligned LLM and surpasses domain-adapted models directly fine-tuned on the domain corpus. In particular, LLMiner achieves this with minimal human intervention, requiring only 600 seed instances, thereby providing a pathway towards self-improvement of LLMs through model-synthesized training data. Comment: Work in progres

arXiv.org e-Print Archive

MCR-ALS-based muscle synergy extraction method combined with LSTM neural network for motion intention detection

Author: Changcheng Shi
Dazheng Zhao
Guokun Zuo
Jiaji Zhang
Jingyan Meng
Mengqi Hong
Xiao Lv
Yang Hu
Yehao Ma
Yunfeng Liu
Publication venue: 'Frontiers Media SA'
Publication date: 01/06/2023

Introduction: The time variance and individual variability of surface electromyographic (sEMG) signals can lead to poorer motor intention detection across different subjects and over longer intervals between training and testing datasets. The consistency of muscle synergies across repetitions of the same task may help improve detection accuracy over long time ranges. However, conventional muscle synergy extraction methods, such as non-negative matrix factorization (NMF) and principal component analysis (PCA), have some limitations in the field of motor intention detection, especially in the continuous estimation of upper limb joint angles. Methods: In this study, we proposed a reliable multivariate curve resolution-alternating least squares (MCR-ALS) muscle synergy extraction method combined with a long short-term memory (LSTM) neural network to estimate continuous elbow joint motion using sEMG datasets from different subjects and different days. The pre-processed sEMG signals were decomposed into muscle synergies by the MCR-ALS, NMF and PCA methods, and the decomposed muscle activation matrices were used as sEMG features. The sEMG features and elbow joint angular signals were input to the LSTM to establish a neural network model. Finally, the established neural network models were tested using sEMG datasets from different subjects and different days, and the detection accuracy was measured by the correlation coefficient. Results: The detection accuracy of the elbow joint angle was more than 85% using the proposed method. This result was significantly higher than the detection accuracies obtained using the NMF and PCA methods. The results showed that the proposed method can improve the accuracy of motor intention detection across different subjects and different acquisition timepoints. Discussion: This study successfully improves the robustness of sEMG signals in neural network applications using an innovative muscle synergy extraction method. It contributes to the application of human physiological signals in human-machine interaction.

Directory of Open Access Journals

Hardware-In-the-Loop Simulation System In the Development of Temperature Controller of Blood Glucose Meter

Author: Xiao Shangqing
Xu Jialin
Yang Zhongzhu
Zuo Guokun
Publication venue: 'EDP Sciences'
Publication date: 01/01/2015


Directory of Open Access Journals

Hardware-In-the-Loop Simulation System In the Development of Temperature Controller of Blood Glucose Meter

Author: Xiao Shangqing
Xu Jialin
Xu JL
Yang Zhongzhu
Zuo Guokun
Publication venue: 2015 7TH INTERNATIONAL CONFERENCE ON MECHANICAL AND ELECTRONICS ENGINEERING (ICMEE 2015)
Publication date: 01/01/2015


Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Institutional Repository of Ningbo Institute of Material Technology & Engineering, CAS