Optimized Indexes for Data Structured Retrieval

Aponte Báez, Yosvanys; Marco Such, Manuel; Sánchez, Alexander

research

Optimized Indexes for Data Structured Retrieval

Authors: Yosvanys Aponte Báez
Manuel Marco Such
Alexander Sánchez
Publication date: 1 January 2015
Publisher: IJARCSSE

Abstract

The aim of this work is to show the novel index structure based suffix array and ternary search tree with rank and select succinct data structure. Suffix arrays were originally developed to reduce memory consumption compared to a suffix tree and ternary search tree combine the time efficiency of digital tries with the space efficiency of binary search trees. Rank of a symbol at a given position equals the number of times the symbol appears in the corresponding prefix of the sequence. Select is the inverse, retrieving the positions of the symbol occurrences. These operations are widely used in information retrieval and management, being the base of several data structures and algorithms for text collections, graphs, trees, etc. The resulting structure is faster than hashing for many typical search problems, and supports a broader range of useful problems and operations. There for we implement a path index based on those data structures that shown to be highly efficient when dealing with digital collection consist in structured documents. We describe how the index architecture works and we compare the searching algorithms with others, and finally experiments show the outperforms with earlier approaches

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Repositorio Institucional de la Universidad de Alicante

oai:rua.ua.es:10045/58374

Last time updated on 01/03/2017

RUA

oai:rua.ua.es:10045/58374

Last time updated on 09/04/2020