    Active learning with deep pre-trained models for sequence tagging of clinical and biomedical texts

    Active learning is a technique that helps to minimize the annotation budget required for the creation of a labeled dataset while maximizing the performance of a model trained on this dataset. It has been shown that active learning can be successfully applied to sequence tagging tasks in text processing in conjunction with deep learning models, even when only a limited amount of labeled data is available. Recent advances in transfer learning methods for natural language processing based on deep pre-trained models such as ELMo and BERT offer a much better ability to generalize from small annotated datasets than their shallow counterparts. The combination of deep pre-trained models and active learning is a powerful approach to dealing with annotation scarcity. In this work, we investigate the potential of this approach on clinical and biomedical data. The experimental evaluation shows that the combination of active learning and deep pre-trained models outperforms standard active learning methods. We also suggest a modification to a standard uncertainty sampling strategy and show empirically that it can be beneficial for the annotation of very skewed datasets. Finally, we propose an annotation tool, powered by active learning and deep pre-trained models, that can be used for entity annotation directly from the Jupyter IDE.
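
    The abstract does not include an implementation, but the loop it describes (scoring unlabeled sentences by model uncertainty and routing the least-confident ones to an annotator) is straightforward to sketch. Below is a minimal, illustrative example of uncertainty sampling for sequence tagging with a BERT token-classification model. The model name, label count, and scoring rule (negative mean log-probability of the predicted tags, in the spirit of least-confidence/MNLP scoring) are assumptions for illustration, not the authors' exact method or their modified strategy.

```python
# Minimal sketch of one uncertainty-sampling step for sequence tagging.
# Assumptions (not from the paper): a Hugging Face token-classification
# model and a least-confidence-style score (negative mean log-probability
# of the predicted tag sequence). In a real active learning loop the model
# would be re-fine-tuned on the labeled pool after every annotation round.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=9  # e.g. BIO tags for 4 entity types + O
)
model.eval()

def uncertainty(sentence: str) -> float:
    """Higher score = model is less confident about its predicted tags."""
    inputs = tokenizer(sentence, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits        # (1, seq_len, num_labels)
    log_probs = logits.log_softmax(dim=-1)
    best = log_probs.max(dim=-1).values        # log-prob of the argmax tag
    return -best.mean().item()                 # averaged over tokens

def select_for_annotation(unlabeled_pool, k=10):
    """Pick the k sentences the current model is least confident about."""
    return sorted(unlabeled_pool, key=uncertainty, reverse=True)[:k]
```

    The selected sentences would then be labeled by the annotator (for instance, inside the Jupyter-based tool the paper proposes), added to the training set, and the model retrained before the next selection round.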
