Search CORE

13 research outputs found

An Improved Baseline for Sentence-level Relation Extraction

Author: Chen Muhao
Zhou Wenxuan
Publication venue
Publication date: 02/06/2021
Field of study

Sentence-level relation extraction (RE) aims at identifying the relationship between two entities in a sentence. Many efforts have been devoted to this problem, while the best performing methods are still far from perfect. In this paper, we revisit two problems that affect the performance of existing RE models, namely entity representation and noisy or ill-defined labels. Our improved baseline model, incorporated with entity representations with typed markers, achieves an F1 of 74.6% on TACRED, significantly outperforms previous SOTA methods. Furthermore, the presented new baseline achieves an F1 of 91.1% on the refined Re-TACRED dataset, demonstrating that the pre-trained language models achieve unexpectedly high performance on this task. We release our code to the community for future research.Comment: Code available at https://github.com/wzhouad/RE_improved_baselin

arXiv.org e-Print Archive

On Robustness and Bias Analysis of BERT-based Relation Extraction

Author: Bi Zhen
Chen Huajun
Chen Xiang
Deng Shumin
Li Luoqiu
Ye Hongbin
Zhang Ningyu
Publication venue
Publication date: 06/04/2021
Field of study

Fine-tuning pre-trained models have achieved impressive performance on standard natural language processing benchmarks. However, the resultant model generalizability remains poorly understood. We do not know, for example, how excellent performance can lead to the perfection of generalization models. In this study, we analyze a fine-tuned BERT model from different perspectives using relation extraction. We also characterize the differences in generalization techniques according to our proposed improvements. From empirical experimentation, we find that BERT suffers a bottleneck in terms of robustness by way of randomizations, adversarial and counterfactual tests, and biases (i.e., selection and semantic). These findings highlight opportunities for future improvements. Our open-sourced testbed DiagnoseRE is available in \url{https://github.com/zjunlp/DiagnoseRE}.Comment: work in progres

arXiv.org e-Print Archive

SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues

Author: Liang Yuan
Lu Pan
Peng Baolin
Qiu Liang
Wu Ying Nian
Yu Zhou
Zhao Yizhou
Zhu Song-Chun
Publication venue
Publication date: 24/06/2021
Field of study

Inferring social relations from dialogues is vital for building emotionally intelligent robots to interpret human language better and act accordingly. We model the social network as an And-or Graph, named SocAoG, for the consistency of relations among a group and leveraging attributes as inference cues. Moreover, we formulate a sequential structure prediction task, and propose an

\alpha

\beta

\gamma

strategy to incrementally parse SocAoG for the dynamic inference upon any incoming utterance: (i) an

\alpha

process predicting attributes and relations conditioned on the semantics of dialogues, (ii) a

\beta

process updating the social relations based on related attributes, and (iii) a

\gamma

process updating individual's attributes based on interpersonal social relations. Empirical results on DialogRE and MovieGraph show that our model infers social relations more accurately than the state-of-the-art methods. Moreover, the ablation study shows the three processes complement each other, and the case study demonstrates the dynamic relational inference.Comment: Long paper (oral) accepted by ACL-IJCNLP 202

arXiv.org e-Print Archive

Chemical-protein relation extraction with ensembles of carefully tuned pretrained language models

Author: Alt C.
Barth F.
Garda S.
Leser U.
Sänger M.
Weber L.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 18/11/2022
Field of study

The identification of chemical-protein interactions described in the literature is an important task with applications in drug design, precision medicine and biotechnology. Manual extraction of such relationships from the biomedical literature is costly and often prohibitively time-consuming. The BioCreative VII DrugProt shared task provides a benchmark for methods for the automated extraction of chemical-protein relations from scientific text. Here we describe our contribution to the shared task and report on the achieved results. We define the task as a relation classification problem, which we approach with pretrained transformer language models. Upon this basic architecture, we experiment with utilizing textual and embedded side information from knowledge bases as well as additional training data to improve extraction performance. We perform a comprehensive evaluation of the proposed model and the individual extensions including an extensive hyperparameter search leading to 2647 different runs. We find that ensembling and choosing the right pretrained language model are crucial for optimal performance, whereas adding additional data and embedded side information did not improve results. Our best model is based on an ensemble of 10 pretrained transformers and additional textual descriptions of chemicals taken from the Comparative Toxicogenomics Database. The model reaches an F1 score of 79.73% on the hidden DrugProt test set and achieves the first rank out of 107 submitted runs in the official evaluation. Database URL: https://github.com/leonweber/drugprot

PubMed Central

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin

MDC Repository