19 research outputs found
Recommended from our members
WW1LOD: an application of CIDOC-CRM to World War 1 linked data
The CIDOC-CRM standard indicates that common events, actors, places and timeframes are important in linking together cultural material, and provides a framework for describing them. However, merely describing entities in this way in two datasets does not yet interlink them. To do that, the identities of instances still need to be either reconciled, or be based on a shared vocabulary.
The WW1LOD dataset presented in this paper was created to facilitate both of these approaches for collections dealing with the First World War. For this purpose, the dataset includes events, places, agents, times, keywords, and themes related to the war, based on over ten different authoritative data sources from providers such as the Imperial War Museum. The content is harmonized into RDF, and published as a Linked Open Data service.
While generally basing on CIDOC-CRM, some modeling choices used also deviate from it where our experience dictated such. In the article, these deviations are discussed in the hope that they may serve as examples where CIDOC-CRM itself may warrant further examination.
As a demonstration of use, the dataset and online service have been used to create a contextual reader application that is able link together and pull in information related to WW1 from e.g. 1914â1918 Online, Wikipedia, WW1 Discovery, Europeana and the Digital Public Library of America
Recommended from our members
Leveraging Linked Data to Enhance Subject Access - A Case Study of the University of Colorado Boulder's World War I Collection Online
Academic users often find work with online primary sources both rewarding and challenging. Improving subject access in these sources is essential as digital collections propagate and work with primary sources becomes increasingly important in humanities curricula. A user needs assessment was conducted with humanities users at the University of Colorado Boulder to facilitate engagement with these sources. Two of the major user needs identified were improving findability and context, particularly for historical subjects. Linked Data can help meet these needs by linking related concepts in the sources using a specialized vocabulary, enriching them with outside resources, and enabling semantically rich services that empower users. This paper discusses a project the authors undertook to enhance subject access in CUâs WWI Collection Online by deep linking historical data on the civilian experience in occupied Belgium. This work is intended to lead to a richer understanding of forces shaping the WWI period
Beacon v2 and Beacon networks: A "lingua franca" for federated data discovery in biomedical genomics, and beyond
Beacon is a basic data discovery protocol issued by the Global Alliance for Genomics and Health (GA4GH). The main goal addressed by version 1 of the Beacon protocol was to test the feasibility of broadly sharing human genomic data, through providing simple "yes" or "no" responses to queries about the presence of a given variant in datasets hosted by Beacon providers. The popularity of this concept has fostered the design of a version 2, that better serves real-world requirements and addresses the needs of clinical genomics research and healthcare, as assessed by several contributing projects and organizations. Particularly, rare disease genetics and cancer research will benefit from new case level and genomic variant level requests and the enabling of richer phenotype and clinical queries as well as support for fuzzy searches. Beacon is designed as a "lingua franca" to bridge data collections hosted in software solutions with different and rich interfaces. Beacon version 2 works alongside popular standards like Phenopackets, OMOP, or FHIR, allowing implementing consortia to return matches in beacon responses and provide a handover to their preferred data exchange format. The protocol is being explored by other research domains and is being tested in several international projects
GA4GH: International policies and standards for data sharing across genomic research and healthcare.
The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits
Ontologiaperusteisten tapahtumien tunnistus piilevÀn semantiikan analyysillÀ
Perinteinen tekstihaku vertaa toisiinsa tekstistÀ löytyviÀ merkkijonoja, jolloin esimerkiksi hakusanalla 'Nokia' voidaan tulokseksi saada dokumentteja matkapuhelinvalmistajasta, Nokian kaupungista tai F.E SillanpÀÀn Ihmiset suviyössÀ teoksen pÀÀhenkilöstÀ. TÀssÀ tutkielmassa esitetÀÀn informaation haussa (engl. Information Retrieval, IR) kÀytettÀvÀ menetelmÀ, jolla on mahdollista hakea tekstidokumentteja tarkasti mÀÀritellyllÀ kÀsitteellÀ. Tarkasti mÀÀritellyllÀ kÀsitteellÀ tarkoitetaan ontologiassa, koneymmÀrrettÀvÀssÀ sanastossa, mÀÀriteltyÀ kÀsitettÀ. TÀssÀ tutkielmassa keskitytÀÀn erityisesti historiaontologiassa mÀÀriteltyihin tapahtumiin. Tutkielmassa esitetty menetelmÀ pyrkii tunnistamaan dokumentissa esiintyvÀt kÀsitteet sanoja ympÀröivÀn semantiikan perusteella. TÀsmÀllisesti sanaa ympÀröivÀ semantiikka saadaan niin kutsutusta semanttisesta avaruudesta, joka muodostetaan piilevÀn semantiikan analyysiksi (engl. Latent Semantic Analysis, LSA) kutsutulla matemaattisella menetelmÀllÀ, ja ympÀröivÀÀ semantiikkaa sovelletaan ontologiseen kyselyn laajentamiseen.
Mallin toimivuutta pyrittiin arvioimaan koejÀrjestelyllÀ, jossa aineistona kÀytetÀÀn Suomalaista historiaontologiaa ja suomenkielisen Wikipedia-tietosanakirjan artikkeleita. KoejÀrjestelyssÀ ilmenneiden vaikeuksien vuoksi toimivuuden arviointi jÀi puutteelliseksi. Tutkielman lopussa on pohdittu menetelmÀn merkitystÀ informaation haussa yleisesti, sillÀ tutkielmassa kuvattu menetelmÀ ontologiassa mÀÀriteltyjen kÀsitteiden kuvaamisesta tekstidokumenttien mÀÀrÀÀmÀÀn semanttiseen avaruuteen on uusi, eikÀ aiempaa tutkimusta menetelmÀn toiminnasta tai kehittÀmisestÀ ole tehty
Recommended from our members
WW1LOD
The CIDOC-CRM standard indicates that common events, actors, places and timeframes are important in linking together cultural material, and provides a framework for describing them. However, merely describing entities in this way in two datasets does not yet interlink them. To do that, the identities of instances still need to be either reconciled, or be based on a shared vocabulary. The WW1LOD dataset presented in this paper was created to facilitate both of these approaches for collections dealing with the First World War. For this purpose, the dataset includes events, places, agents, times, keywords, and themes related to the war, based on over ten different authoritative data sources from providers such as the Imperial War Museum. The content is harmonized into RDF, and published as a Linked Open Data service. While generally based on CIDOC-CRM, some modeling choices used also deviate from it where our experience dictated such. In the article, these deviations are discussed in the hope that they may serveas examples where CIDOC-CRM itself may warrant further examination. As a demonstration of use, the dataset and online service have been used to create a contextual reader application that is able to link together and pull in information related to WW1 from, e.g., 1914â1918 Online, Wikipedia, WW1 Discovery, Europeana and the Digital Public Library of America.Peer reviewe
Recommended from our members
History on the Semantic Web as Linked Data - An Event Gazetteer and Timeline for World War I
Events are an essential component of cultural heritage Linked Data: they link actors, places, times, objects, and other events into larger narrative structures, providing a rich basis for semantic searching, recommending, analysis, and visualization of cultural heritage data. This paper argues that shared vocabularies (gazetteers, ontologies) of events, such as the âBattle of Normandyâ or âCrucifixion of Jesusâ, are necessary to facilitate the aggregation and linking of heterogeneous content from various collections. For example, biographies, histories, photos, and paintings often reference or depict events. A set of general requirements for an event gazetteer is presented, based on the needs of publishing, aggregating, and reusing cultural heritage content as Linked Data. After this, a metadata model addressing the presented requirements for representing historical events is outlined. The model is being applied in a case study aimed at developing an event ontology for World War I (WWI). Our goals from an end-user perspective are twofold: 1) Facilitate event-based cataloging for curators in memory organizations; 2) Utilize semantic event descriptions and narrative event structures in end-user applications for searching and linking documents and other content about WWI, and for structuring and visualizing them
Modular Pre-Ingest Tool for Diverse Needs of Producers
We introduce an open-source pre-ingest tool that assists the generation of Submission Information Packages (SIPs) that are to be submitted to the national digital preservation service in Finland. The pre-ingest tool consists of several independent components that produce the parts of a METS document required by the national preservation service. These components are easy to modify when developing services for different user demands or for different repositories. Users of the tool provide the necessary information as parameters for the tool, which produces the structure and descriptions for the SIP. The pre-ingest tool reduces the need to deeply understand either METS, PREMIS or other metadata formats to be able to preserve digital assets
IOS Press World War 1 as Linked Open Data
Abstract. The WW1LOD dataset is primarily a reference dataset meant to bind together collections dealing with the First World War. For this purpose, the dataset gathers events, places and agents related to the war from various authoritative sources. These are then made available for indexing and other use through a variety of interfaces and APIs. Additional information on the entities is also collected, in order to be able to answer more complex questions relating to them. The approach is being evaluated using a concrete WW1 online collection