CORE
🇺🇦
make metadata, not war
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Community governance
Advisory Board
Board of supporters
Research network
About
About us
Our mission
Team
Blog
FAQs
Contact us
Identification, Expansion, And Disambiguation Of Acronyms In Biomedical Texts
Authors
David B. Bracewell
Scott Russell
Annie S. Wu
Publication date
1 December 2005
Publisher
'Information Bulletin on Variable Stars (IBVS)'
Abstract
With the ever growing amount of biomedical literature there is an increasing desire to use sophisticated language processing algorithms to mine these texts. In order to use these algorithms we must first deal with acronyms, abbreviations, and misspellings.In this paper we look at identifying, expanding, and disambiguating acronyms in biomedical texts. We break the task up into three modular steps: Identification, Expansion, and Disambiguation. For Identification we use a hybrid approach that is composed of a naive Bayesian classifier and a couple of handcrafted rules. We are able to achieve results of 99.96% accuracy with a small training set. We break the expansion up into two categories, local and global expansion. For local expansion we use windowing and longest common subsequence to generate the possible expansions. Global expansion requires an acronym database. To disambiguate the different candidate expansions we use WordNet and semantic similarity. Overall we obtain a recall and precision of over 91%. © Springer-Verlag Berlin Heidelberg 2005
Similar works
Full text
Available Versions
University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:stars.library.ucf.edu:scop...
Last time updated on 19/07/2022