Towards Linked Data for Oceanographic Science: The R2R Eventlogger Project, Controlled Vocabularies, and Ontologies at The MBLWHOI Library

Abstract

Objective: Research vessels coordinated by the United States University-National Oceanographic Laboratory System (US-UNOLS) collect data which are considered important oceanographic science research products. The NSF-funded Rolling Deck to Repository (R2R) project aims to improve access to these data and diminish barriers to their use. One aspect of the R2R project has been to develop a shipboard scientific event logging system, Eventlogger, which incorporates best practice guidelines, controlled vocabularies, a cruise metadata schema, and a scientific event log. Eventlogger facilitates the eventual ingestion of datasets into oceanographic data repositories for subsequent integration and synthesis by investigators. The careful use of controlled vocabularies and ontologies is an important feature of this system, as the use of internationally-informed, consensus-driven controlled vocabularies will make data sets more interoperable, discoverable and reusable. Methods: The R2R Eventlogger project is led by Woods Hole Oceanographic Institution (WHOI), and the management of the controlled vocabularies is led by the Data Librarian in the Marine Biological Laboratory/Woods Hole Oceanographic Institution (MBLWHOI) Library. The first target vocabulary has been one for oceanographic instruments. Management of this vocabulary has thus far consisted of reconciling project vocabulary terms with the more widely used community vocabularies served by the NERC Vocabulary Server v2.0 (NVS2.0): terms included in the SeaDataNet Device Catalogue (L22) and the SeaDataNet Device Category vocabularies (L05). Rather than adopt existing community terms, it is more often the case that local terms are mapped by the Data Managers in the NSF-funded Biological and Chemical Oceanographic Data Management Office (BCO-DMO) to community terms, which preserves any important information and meaning investigators impart through the process of assigning these local terms, and has less impact on researchers. New terms, those that cannot be mapped to the existing community vocabularies (often custom, or modified instruments), are submitted for review to the SeaVOX governance process for addition to the community vocabularies. These vocabularies and their mappings are an important part of the aforementioned Eventlogger system. Before a research cruise, investigators configure the instruments they intend to use for their science activities. The instruments available for selection are provided by the MBLWHOI Data Librarian, who curates UNOLS ship-specific lists of standard shipboard instruments using terms for instruments from the R2R Eventlogger Project Vocabulary. Nonstandard shipboard instruments a researcher or investigator wishes to use can also be added, and these instrument terms will eventually be inducted into the R2R Eventlogger Project Vocabulary. Results: Eventlogger is currently being tested across the UNOLS fleet. A large submission of suggested instrument terms to the SeaDataNet community listserv is currently in progress. New tools for facilitating the management, mapping, and use of these controlled vocabularies are being developed, and new projects with eager partners are envisioned. Ideas for future controlled vocabularies for the ocean science community include: Cruise IDs, Persons, and Ships. Conclusions: The promotion and use of controlled vocabularies and ontologies will pave the way for linked data in oceanographic science. By mapping local terms to authoritative and community-accepted terms, links are created whereby related data sets can be better discovered, and utilized. Librarians have an established history of working with controlled vocabularies and metadata. Libraries, have and will continue to, serve as centers for information discovery as well as a natural home for the management of standards

    Similar works