Search CORE

3 research outputs found

Adaptive Low-level Storage of Very Large Knowledge Graphs

Author: Azzam A.
Baolin Liu
Bordes Antoine
Chong Eugene Inseok
DATASTAX
Fan Jing
Gonzalez E.
Gonzalez E.
Gray Jim
Guha R.
Harris Steve
Kim Jinha
L.
McBride Brian
Modoni E.
Motik Boris
Rietveld Laurens
Urbani Jacopo
Urbani Jacopo
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 24/01/2020
Field of study

The increasing availability and usage of Knowledge Graphs (KGs) on the Web calls for scalable and general-purpose solutions to store this type of data structures. We propose Trident, a novel storage architecture for very large KGs on centralized systems. Trident uses several interlinked data structures to provide fast access to nodes and edges, with the physical storage changing depending on the topology of the graph to reduce the memory footprint. In contrast to single architectures designed for single tasks, our approach offers an interface with few low-level and general-purpose primitives that can be used to implement tasks like SPARQL query answering, reasoning, or graph analytics. Our experiments show that Trident can handle graphs with 10^11 edges using inexpensive hardware, delivering competitive performance on multiple workloads.Comment: Accepted WWW 202

arXiv.org e-Print Archive

VU Research Portal

Crossref

The health care and life sciences community profile for dataset descriptions

Access to consistent, high-quality metadata is critical to finding, understanding, and reusing scientific data. However, while there are many relevant vocabularies for the annotation of a dataset, none sufficiently captures all the necessary metadata. This prevents uniform indexing and querying of dataset repositories. Towards providing a practical guide for producing a high quality description of biomedical datasets, the W3C Semantic Web for Health Care and the Life Sciences Interest Group (HCLSIG) identified Resource Description Framework (RDF) vocabularies that could be used to specify common metadata elements and their value sets. The resulting guideline covers elements of description, identification, attribution, versioning, provenance, and content summarization. This guideline reuses existing vocabularies, and is intended to meet key functional requirements including indexing, discovery, exchange, query, and retrieval of datasets, thereby enabling the publication of FAIR data. The resulting metadata profile is generic and could be used by other domains with an interest in providing machine readable descriptions of versioned datasets

Carleton University's Institutional Repository

The dataLegend Ecosystem for Historical Statistics

Crossref