62 research outputs found

Practical Evaluation of Lempel-Ziv-78 and Lempel-Ziv-Welch Tries

Author: A Poyias
D Arroyuelo
D Lemire
G Marsaglia
GH Gonnet
H Bannai
H Luan
J Fischer
J Jansson
J Kärkkäinen
J Ziv
JA Feldman
JG Cleary
K Chung
L Carter
P Tchebychev
RM Karp
RM Robinson
TA Welch
Y Nakashima
Publication date: 09/06/2017

We present the first thorough practical study of Lempel-Ziv-78 and Lempel-Ziv-Welch computation based on trie data structures. With a careful selection of trie representations, we can beat well-tuned popular trie data structures like Judy, m-Bonsai or Cedar.
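As a rough illustration of what "Lempel-Ziv-78 computation based on a trie" means, the sketch below factorizes a string while storing the LZ78 trie in a plain Python dictionary keyed by (node id, character). This is only a minimal hash-based trie chosen for brevity; it is not the representation evaluated in the paper, and the names `lz78_factorize` and `lz78_decode` are hypothetical.

```python
def lz78_factorize(text):
    """Return the LZ78 factors of text as (parent node id, character) pairs."""
    trie = {}      # (node id, character) -> child node id; node 0 is the root
    factors = []
    node = 0       # current trie node; the root stands for the empty phrase
    next_id = 1    # id given to the next newly created trie node
    for ch in text:
        child = trie.get((node, ch))
        if child is not None:
            node = child                 # keep extending the current phrase
        else:
            trie[(node, ch)] = next_id   # new phrase = known phrase + ch
            next_id += 1
            factors.append((node, ch))
            node = 0                     # start the next phrase at the root
    if node != 0:
        # The text ended inside a phrase that is already in the trie.
        factors.append((node, ""))
    return factors


def lz78_decode(factors):
    """Rebuild the original text from the (parent, character) factors."""
    phrases = [""]                       # phrases[i] is the string of node i
    out = []
    for parent, ch in factors:
        phrase = phrases[parent] + ch
        phrases.append(phrase)
        out.append(phrase)
    return "".join(out)


factors = lz78_factorize("abababab")
print(factors)
print(lz78_decode(factors))
```

Each factor names an earlier phrase by its trie node and appends one character, so the running time is dominated by the child lookups in the trie, which is why the choice of trie representation matters in practice.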

arXiv.org e-Print Archive

Crossref

On An Improved Parallel Construction Of Suffix Arrays For Low Bandwidth Pc-Cluster.

Author: Abdul Rashid Nur'Aini
Abdullah Rosni
Kok Jun Lee
Md. Ali Norhashidah
Publication date: 01/10/2003

An algorithm is presented for the parallel construction of suffix arrays for any text with a larger alphabet size on a distributed-memory architecture

Repository@USM

Optimized Indexes for Data Structured Retrieval

Author: Aponte Báez Yosvanys
Marco Such Manuel
Sánchez Alexander
Publication venue: IJARCSSE
Publication date: 01/01/2015

This work presents a novel index structure based on a suffix array and a ternary search tree, combined with rank and select succinct data structures. Suffix arrays were originally developed to reduce memory consumption compared with suffix trees, and ternary search trees combine the time efficiency of digital tries with the space efficiency of binary search trees. The rank of a symbol at a given position is the number of times the symbol appears in the corresponding prefix of the sequence. Select is the inverse: it retrieves the positions of the symbol's occurrences. These operations are widely used in information retrieval and management and form the basis of several data structures and algorithms for text collections, graphs, trees, etc. The resulting structure is faster than hashing for many typical search problems and supports a broader range of useful problems and operations. We therefore implement a path index based on these data structures, which proves highly efficient for digital collections consisting of structured documents. We describe how the index architecture works, compare its search algorithms with others, and report experiments showing that it outperforms earlier approaches.
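The rank and select operations defined in this abstract can be made concrete with a naive sketch. The functions below scan the sequence directly, so they take linear time; they only illustrate the definitions and are not the succinct structures the paper builds on. The function names and the 0-based position convention are assumptions made for the example.

```python
def rank(seq, symbol, i):
    """Number of occurrences of symbol in the prefix seq[:i]."""
    return seq[:i].count(symbol)


def select(seq, symbol, j):
    """0-based position of the j-th occurrence of symbol (j >= 1), or -1."""
    pos = -1
    for _ in range(j):
        pos = seq.find(symbol, pos + 1)  # search after the previous hit
        if pos == -1:
            return -1                    # fewer than j occurrences exist
    return pos


seq = "abracadabra"
print(rank(seq, "a", 5))    # occurrences of "a" in "abrac"
print(select(seq, "a", 3))  # position of the third "a"
```

The two operations are inverses in the sense that `rank(seq, c, select(seq, c, j) + 1) == j` whenever the j-th occurrence of `c` exists.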

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

CLB-trees: a new method for indexing large text collections

Author: Веретенников А. Б.
Лукач Ю. С.
Publication date: 01/01/2006

A new hybrid data structure for working with large collections of textual information is proposed: the CLB-tree. This structure combines the high search speed characteristic of inverted files with the high update speed of B-trees

Institutional repository of Ural Federal University named after the first President of Russia B.N.Yeltsin