A file-based linked data fragments approach to prefix search

Colpaert, Pieter; Dedecker, Ruben; Delva, Harm; Verborgh, Ruben

A file-based linked data fragments approach to prefix search

Authors: Pieter Colpaert
Ruben Dedecker
Harm Delva
Ruben Verborgh
Publication date: 1 January 2021
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

Text-fields that need to look up specific entities in a dataset can be equipped with autocompletion functionality. When a dataset becomes too large to be embedded in the page, setting up a full-text search API is not the only alternative. Alternate API designs that balance different trade-offs such as archivability, cacheability and privacy, may not require setting up a new back-end architecture. In this paper, we propose to perform prefix search over a fragmentation of the dataset, enabling the client to take part in the query execution by navigating through the fragmented dataset. Our proposal consists of (i) a self-describing fragmentation strategy, (ii) a client search algorithm, and (iii) an evaluation of the proposed solution, based on a small dataset of 73k entities and a large dataset of 3.87 m entities. We found that the server cache hit ratio is three times higher compared to a server-side prefix search API, at the cost of a higher bandwidth consumption. Nevertheless, an acceptable user-perceived performance has been measured: assuming 150 ms as an acceptable waiting time between keystrokes, this approach allows 15 entities per prefix to be retrieved in this interval. We conclude that an alternate set of trade-offs has been established for specific prefix search use cases: having added more choice to the spectrum of Web APIs for autocompletion, a file-based approach enables more datasets to afford prefix search

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Ghent University Academic Bibliography

oai:archive.ugent.be:8709018

Last time updated on 25/06/2021