Construction of cardiovascular information extraction corpus based on electronic medical records

Bingfei Zhao; Hongyang Chang; Hongying Zan; Kunli Zhang; Shuai Zhang

Publication date: 1 June 2023
Publisher: 'American Institute of Mathematical Sciences (AIMS)'

Abstract

Cardiovascular disease places a heavy burden on both patients and society, which motivates knowledge-based research on it, such as work on knowledge graphs and automated question answering. However, few corpora have been constructed for cardiovascular disease, and this has hindered further knowledge-based research on the disease. Electronic medical records contain patient data spanning the entire diagnosis and treatment process and include a large amount of reliable medical information. We therefore collected electronic medical record data related to cardiovascular disease and, drawing on these data and on relevant work experience, developed a standard for labeling entities and entity relations in cardiovascular electronic medical records. Using a rule-based semi-automatic method to build a sentence-level dictionary of labeling results, we constructed a cardiovascular electronic medical record entity and entity relation labeling corpus (CVDEMRC). The CVDEMRC contains 7,691 entities and 11,185 entity relation triples. The consistency examination yielded 93.51% for entity annotations and 84.02% for entity relation annotations, indicating good annotation consistency. The CVDEMRC is expected to serve as a data resource for information extraction research related to cardiovascular diseases.
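
The abstract does not say how the sentence-level dictionary of labeling results is built or applied. As a minimal sketch of one possible reading, the Python below stores the entity spans of sentences that have already been annotated and reuses them whenever the same sentence recurs, leaving unseen sentences for manual labeling. The function names, the span format (start, end, label) and the exact-match rule are all assumptions made for illustration, not the authors' implementation.

```python
from typing import Dict, List, Tuple

# Hypothetical span format: (start, end, label), character offsets in the sentence.
Span = Tuple[int, int, str]


def build_sentence_dictionary(
    labeled: List[Tuple[str, List[Span]]]
) -> Dict[str, List[Span]]:
    """Map each already-annotated sentence to its entity spans."""
    dictionary: Dict[str, List[Span]] = {}
    for sentence, spans in labeled:
        # Later annotations of the same sentence overwrite earlier ones.
        dictionary[sentence.strip()] = sorted(spans)
    return dictionary


def pre_label(
    sentences: List[str], dictionary: Dict[str, List[Span]]
) -> Tuple[Dict[str, List[Span]], List[str]]:
    """Reuse stored labels on exact sentence matches; queue the rest for annotators."""
    auto: Dict[str, List[Span]] = {}
    manual: List[str] = []
    for sentence in sentences:
        key = sentence.strip()
        if key in dictionary:
            auto[key] = dictionary[key]
        else:
            manual.append(key)
    return auto, manual
```

Exact matching is the simplest rule that fits the phrase "sentence-level"; under this assumption, templated sentences that recur across records would be labeled once and then reused, which is where a semi-automatic method would save annotation effort.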


Available Versions

Directory of Open Access Journals

oai:doaj.org/article:f9a7e1928...

Last time updated on 30/06/2023