Linear-time off-line text compression by longest-first substitution

By Shunsuke Inenaga, Takashi Funamoto, Masayuki Takeda and Ayumi Shinohara

Abstract

Given a text, grammar-based compression constructs a grammar that generates the text. There are many text compression techniques of this type. Each compression scheme is categorized as either off-line or on-line, according to how the text is processed. One representative tactic for off-line compression is to substitute the longest repeated factors of a text with a production rule. In this paper, we present an algorithm that compresses a text based on this longest-first principle, in linear time. The algorithm employs a suitable index structure for the text and relies on efficient operations on that structure.
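
To make the longest-first principle concrete, the following is a minimal, naive sketch in Python. It is not the linear-time algorithm of the paper: it uses no index structure and rescans the whole grammar on every round. The function names, the nonterminal names (`N1`, `N2`, ...), and the requirement that repeated occurrences do not overlap are assumptions made for illustration only.

```python
def longest_repeated_factor(seqs):
    """Return the longest factor (as a tuple) that has at least two
    non-overlapping occurrences across seqs, or None if no factor of
    length >= 2 repeats. Brute force, for illustration only."""
    max_len = max((len(s) for s in seqs), default=0)
    for length in range(max_len, 1, -1):
        first_seen = {}  # factor -> (sequence index, start) of its first occurrence
        for si, s in enumerate(seqs):
            for i in range(len(s) - length + 1):
                factor = tuple(s[i:i + length])
                if factor not in first_seen:
                    first_seen[factor] = (si, i)
                    continue
                sj, j = first_seen[factor]
                # Accept a repeat in another sequence, or a later one
                # in the same sequence that does not overlap the first.
                if sj != si or j + length <= i:
                    return factor
    return None


def replace_all(seq, factor, symbol):
    """Replace non-overlapping occurrences of factor, scanning left to right."""
    out, i, n = [], 0, len(factor)
    while i < len(seq):
        if tuple(seq[i:i + n]) == factor:
            out.append(symbol)
            i += n
        else:
            out.append(seq[i])
            i += 1
    return out


def longest_first_compress(text):
    """Build a grammar (start sequence, rules) by longest-first substitution."""
    start = list(text)
    rules = {}  # hypothetical nonterminal name -> right-hand side
    while True:
        factor = longest_repeated_factor([start] + list(rules.values()))
        if factor is None:
            return start, rules
        # Multi-character names cannot collide with single-character terminals.
        name = f"N{len(rules) + 1}"
        start = replace_all(start, factor, name)
        rules = {k: replace_all(v, factor, name) for k, v in rules.items()}
        rules[name] = list(factor)


def expand(seq, rules):
    """Decompress: recursively expand nonterminals back into the text."""
    return "".join(expand(rules[s], rules) if s in rules else s for s in seq)
```

For example, `longest_first_compress("abcabcabc")` replaces the factor `abc` with a single nonterminal, and `expand` applied to the resulting start sequence and rules reproduces the original text. Each round strictly adds one rule without increasing the total grammar size, so the loop terminates, but the running time of this sketch is far from linear.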

Publisher: Springer-Verlag
Year: 2003
OAI identifier: oai:CiteSeerX.psu:10.1.1.134.2194
Provided by: CiteSeerX
Full text may be available at the following location(s):
  • http://citeseerx.ist.psu.edu/v...
  • http://www.i.kyushu-u.ac.jp/~t...