
A Web Service for Biomedical Term Look-Up

By Henk Harkema, Ian Roberts, Rob Gaizauskas and Mark Hepple

Abstract

Recent years have seen a huge increase in the amount of biomedical information that is available in electronic format. Consequently, for biomedical researchers wishing to relate their experimental results to relevant data lurking somewhere within this expanding universe of on-line information, the ability to access and navigate biomedical information sources in an efficient manner has become increasingly important. Natural language and text processing techniques can facilitate this task by making the information contained in textual resources such as MEDLINE more readily accessible and amenable to computational processing. Names of biological entities such as genes and proteins provide critical links between different biomedical information sources and researchers' experimental data. Therefore, automatic identification and classification of these terms in text is an essential capability of any natural language processing system aimed at managing the wealth of biomedical information that is available electronically. To support term recognition in the biomedical domain, we have developed Termino, a large-scale terminological resource for text processing applications, which has two main components: first, a database into which very large numbers of terms can be loaded from resources such as UMLS and stored together with various kinds of relevant information; second, a finite-state recognizer for fast and efficient identification and mark-up of terms within text. Since many biomedical applications require this functionality, we have made Termino available to the community as a web service, which allows for its integration into larger applications as a remotely located component, accessed through a standardized interface over the web.
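
The two components named in the abstract, a term store and a finite-state recognizer, can be illustrated with a minimal sketch. This is not Termino's implementation: it assumes a token-level trie used as the automaton, a greedy longest-match strategy, whitespace tokenization with no punctuation handling, and case-insensitive matching. The names `Node`, `build_recognizer` and `mark_up`, as well as the sample terms and their classes, are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    """One state of the recognizer; edges are labelled with lower-cased tokens."""
    children: Dict[str, "Node"] = field(default_factory=dict)
    term_class: Optional[str] = None  # set when a complete term ends in this state


def build_recognizer(terms: Dict[str, str]) -> Node:
    """Compile a term -> class mapping (a stand-in for the term database) into a trie."""
    root = Node()
    for term, term_class in terms.items():
        node = root
        for token in term.lower().split():
            node = node.children.setdefault(token, Node())
        node.term_class = term_class
    return root


def mark_up(text: str, root: Node) -> List[Tuple[str, str]]:
    """Return (matched text, class) pairs, preferring the longest match at each position."""
    tokens = text.split()
    found: List[Tuple[str, str]] = []
    i = 0
    while i < len(tokens):
        node, j, best = root, i, None
        # Follow transitions as far as the input allows, remembering the last accepting state.
        while j < len(tokens) and tokens[j].lower() in node.children:
            node = node.children[tokens[j].lower()]
            j += 1
            if node.term_class is not None:
                best = (j, node.term_class)
        if best is not None:
            end, term_class = best
            found.append((" ".join(tokens[i:end]), term_class))
            i = end  # resume scanning after the matched term
        else:
            i += 1
    return found


# Hypothetical sample data, for illustration only.
sample_terms = {
    "tumor necrosis factor": "protein",
    "tumor": "disease",
    "BRCA1": "gene",
}
recognizer = build_recognizer(sample_terms)
matches = mark_up("Expression of tumor necrosis factor and BRCA1 in tumor cells", recognizer)
print(matches)
```

Compiling the terms once and scanning each text in a single left-to-right pass is what keeps look-up fast for large term lists; a service wrapper would expose a function like `mark_up` behind a remote interface.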

Topics: Research Article
Publisher: Hindawi Publishing Corporation
OAI identifier: oai:pubmedcentral.nih.gov:2448598
Provided by: PubMed Central