Searching protein sequence databases using BRBMH matching algorithm

Abstract

Huge amounts of data are stored in linear files, and biological data are no exception: proteins, for example, are sequences of amino acids. The volume of such data grows year on year. Efficient string-matching algorithms are therefore needed, ones that use minimal computer storage and minimize search response time. In this paper, we propose a new algorithm, called BRBMH, which reduces both the number of comparisons and the elapsed search time compared with well-known algorithms such as BF, KMP, BM, BMH, KR, ZT, QS, BR, FS, SSABS, TVSBS, ZTBMH and BRFS. The experimental results show that the new algorithm is faster and performs fewer comparisons than the compared algorithms for any alphabet size and pattern length. The proposed algorithm is therefore applicable to searching protein sequence databases as well as to other string-searching applications.
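To illustrate the kind of measurement the abstract refers to, the following is a minimal sketch of BMH (Boyer-Moore-Horspool), one of the baseline algorithms listed above, instrumented to count character comparisons. It is not the BRBMH algorithm, whose details are not given in this abstract. The function name `horspool_search` and the sample amino-acid string are assumptions chosen for illustration only.

```python
def horspool_search(text, pattern):
    """Return (match positions, number of character comparisons).

    Illustrative BMH baseline only; this is NOT the BRBMH algorithm.
    """
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return [], 0

    # Bad-character shift table: for each character in pattern[:-1],
    # the distance from its rightmost occurrence to the pattern's end.
    shift = {c: m - 1 - i for i, c in enumerate(pattern[:-1])}

    matches, comparisons = [], 0
    pos = 0
    while pos <= n - m:
        # Compare the window against the pattern from right to left.
        j = m - 1
        while j >= 0:
            comparisons += 1
            if text[pos + j] != pattern[j]:
                break
            j -= 1
        else:
            # Inner loop finished without a mismatch: full match.
            matches.append(pos)
        # Shift by the entry for the window's last character;
        # characters absent from the table shift by the full length m.
        pos += shift.get(text[pos + m - 1], m)
    return matches, comparisons


if __name__ == "__main__":
    # Hypothetical protein fragment in one-letter amino-acid codes.
    sequence = "MKVLAAGIVKVLA"
    positions, count = horspool_search(sequence, "KVLA")
    print(positions, count)
```

Returning the comparison count alongside the match positions allows different algorithms to be compared on the same text and pattern independently of wall-clock timing, which is the first of the two metrics the abstract reports.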

This paper was published in University of Huddersfield Repository.
