Non-Overlapping Indexing - Cache Obliviously

Abedin, Paniz; Hooshmand, Sahar; Thankachan, Sharma V.

Non-Overlapping Indexing - Cache Obliviously

Authors: Paniz Abedin
Sahar Hooshmand
Sharma V. Thankachan
Publication date: 1 January 2018
Publisher: LIPIcs - Leibniz International Proceedings in Informatics. Annual Symposium on Combinatorial Pattern Matching (CPM 2018)
Doi

Abstract

The non-overlapping indexing problem is defined as follows: pre-process a given text T[1,n] of length n into a data structure such that whenever a pattern P[1,p] comes as an input, we can efficiently report the largest set of non-overlapping occurrences of P in T. The best known solution is by Cohen and Porat [ISAAC, 2009]. Their index size is O(n) words and query time is optimal O(p+nocc), where nocc is the output size. We study this problem in the cache-oblivious model and present a new data structure of size O(n log n) words. It can answer queries in optimal O(p/(B)+log_B n+nocc/B) I/Os, where B is the block size

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

oai:stars.library.ucf.edu:scop...

Last time updated on 18/10/2022

Dagstuhl Research Online Publication Server

oai:drops-oai.dagstuhl.de:8700

Last time updated on 19/06/2018