Prospects and limitations of full-text index structures in genome analysis

Dawyndt, Peter; De Baets, Bernard; Fack, Veerle; Vyverman, Michaël

research

Prospects and limitations of full-text index structures in genome analysis

Authors: Peter Dawyndt
Bernard De Baets
Veerle Fack
Michaël Vyverman
Publication date: 1 January 2012
Publisher: 'Oxford University Press (OUP)'
Doi

Abstract

The combination of incessant advances in sequencing technology producing large amounts of data and innovative bioinformatics approaches, designed to cope with this data flood, has led to new interesting results in the life sciences. Given the magnitude of sequence data to be processed, many bioinformatics tools rely on efficient solutions to a variety of complex string problems. These solutions include fast heuristic algorithms and advanced data structures, generally referred to as index structures. Although the importance of index structures is generally known to the bioinformatics community, the design and potency of these data structures, as well as their properties and limitations, are less understood. Moreover, the last decade has seen a boom in the number of variant index structures featuring complex and diverse memory-time trade-offs. This article brings a comprehensive state-of-the-art overview of the most popular index structures and their recently developed variants. Their features, interrelationships, the trade-offs they impose, but also their practical limitations, are explained and compared

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Ghent University Academic Bibliography

oai:archive.ugent.be:2974977

Last time updated on 12/11/2016

Crossref

info:doi/10.1093%2Fnar%2Fgks40...

Last time updated on 01/04/2019