7 research outputs found

    Optimally Computing Compressed Indexing Arrays Based on the Compact Directed Acyclic Word Graph

    Full text link
    In this paper, we present the first study of the computational complexity of converting an automata-based text index structure, called the Compact Directed Acyclic Word Graph (CDAWG), of size ee for a text TT of length nn into other text indexing structures for the same text, suitable for highly repetitive texts: the run-length BWT of size rr, the irreducible PLCP array of size rr, and the quasi-irreducible LPF array of size ee, as well as the lex-parse of size O(r)O(r) and the LZ77-parse of size zz, where r,z≤er, z \le e. As main results, we showed that the above structures can be optimally computed from either the CDAWG for TT stored in read-only memory or its self-index version of size ee without a text in O(e)O(e) worst-case time and words of working space. To obtain the above results, we devised techniques for enumerating a particular subset of suffixes in the lexicographic and text orders using the forward and backward search on the CDAWG by extending the results by Belazzougui et al. in 2015.Comment: The short version of this paper will appear in SPIRE 2023, Pisa, Italy, September 26-28, 2023, Lecture Notes in Computer Science, Springe

    LIPIcs, Volume 244, ESA 2022, Complete Volume

    Get PDF
    LIPIcs, Volume 244, ESA 2022, Complete Volum
    corecore