Skip to main content
Article thumbnail
Location of Repository

Embedding Metadata and Other Semantics In Word-Processing Documents

By Peter Sefton, Ian Barnes, Ron Ward and Jim Downing


This paper was presented at the International Digital Curation Conference in Edinburgh in Dec 2008.This paper describes a technique for embedding document metadata, and potentially other semantic references inline in word processing documents, which the authors have implemented with the help of a software development team. Several assumptions underly the approach; It must be available across computing platforms and work with both Microsoft Word (because of its user base) and (because of its free availability). Further the application needs to be acceptable to and usable by users, so the initial implementation covers only small number of features, which will only be extended after user-testing. Within these constraints the system provides a mechanism for encoding not only simple metadata, but for inferring hierarchical relationships between metadata elements from a "flat" word processing file. The paper includes links to open source code implementing the techniques as part of a broader suite of tools for academic writing. This addresses tools and software, semantic web and data curation, integrating curation into research workflows and will provide a platform for integrating work on ontologies, vocabularies and folksonomies into word processing tools

Topics: metadata, wordprocessing, microsformat
Year: 2008
OAI identifier:
Provided by: Apollo

Suggested articles

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.