Multiple sequence alignment based on set covers

A. Bahr; B. Manthey; B. Morgenstern; B. Morgenstern; C. Notredame; D. Gusfield; G. Vogt; J.D. Thompson; K. Katoh; O. Gotoh; P. Zhao; R.E. Green; R.F. Smith; S. Henikoff; T. Müller; T.P. Li

research

Multiple sequence alignment based on set covers

Authors: A. Bahr
B. Manthey
B. Morgenstern
B. Morgenstern
C. Notredame
D. Gusfield
G. Vogt
J.D. Thompson
K. Katoh
O. Gotoh
P. Zhao
R.E. Green
R.F. Smith
S. Henikoff
T. Müller
T.P. Li
Publication date: 1 January 2004
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

We introduce a new heuristic for the multiple alignment of a set of sequences. The heuristic is based on a set cover of the residue alphabet of the sequences, and also on the determination of a significant set of blocks comprising subsequences of the sequences to be aligned. These blocks are obtained with the aid of a new data structure, called a suffix-set tree, which is constructed from the input sequences with the guidance of the residue-alphabet set cover and generalizes the well-known suffix tree of the sequence set. We provide performance results on selected BAliBASE amino-acid sequences and compare them with those yielded by some prominent approaches

Similar works

Full text

Available Versions

Crossref

Last time updated on 03/01/2020