A Succinct Grammar Compression

Tabei, Yasuo; Takabatake, Yoshimasa; Sakamoto, Hiroshi

arXiv:1304.0917 (cs)

[Submitted on 3 Apr 2013 (v1), last revised 15 Jun 2013 (this version, v3)]

Abstract: We solve an open problem related to the optimal encoding of a straight-line program (SLP), a canonical form of grammar compression that deterministically derives a single string. We show that the information-theoretic lower bound for representing an SLP with $n$ symbols is $2n + \log n! + o(n)$ bits. We then present a succinct representation of an SLP that is asymptotically equivalent to this lower bound. The space is at most $2n \log \rho\,(1 + o(1))$ bits for $\rho \leq 2\sqrt{n}$, while supporting random access to any production rule of the SLP in $O(\log \log n)$ time. In addition, we present a novel dynamic data structure that associates a digram with a unique symbol. Such a data structure is called a naming function and has been implemented using a hash table, which has a space-time tradeoff; as a result, memory is mainly occupied by the hash table while production rules are being built. As an alternative, we build a dynamic data structure for the naming function by leveraging the idea behind the wavelet tree. Its space is strictly bounded by $2n \log n\,(1 + o(1))$ bits, while supporting $O(\log n)$ query and update time.
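
To make the terms in the abstract concrete, the following is a minimal Python sketch of the hash-table naming function that the abstract describes as the usual baseline. It is not the authors' wavelet-tree structure or their succinct encoding. The names `NamingFunction`, `build_slp`, `expand`, and `lower_bound_bits` are hypothetical, the level-by-level pairing used to build the grammar is a simple stand-in chosen for illustration, and the logarithm in the bound is assumed to be base 2.

```python
import math
from typing import Dict, List, Tuple, Union

# Assumption for this sketch: terminals are 1-character strings,
# nonterminals are integers 0, 1, 2, ... in order of creation.
Symbol = Union[str, int]
Digram = Tuple[Symbol, Symbol]


class NamingFunction:
    """Hash-table naming function: each distinct digram gets one unique symbol."""

    def __init__(self) -> None:
        self.table: Dict[Digram, int] = {}
        self.rules: List[Digram] = []  # rules[i] is the right-hand side of X_i

    def name(self, left: Symbol, right: Symbol) -> int:
        key = (left, right)
        if key not in self.table:
            # First time this digram is seen: create a fresh nonterminal.
            self.table[key] = len(self.rules)
            self.rules.append(key)
        return self.table[key]


def build_slp(text: str, naming: NamingFunction) -> Symbol:
    """Replace adjacent pairs level by level until one start symbol remains.

    This pairing scheme is an illustrative stand-in, not the paper's algorithm.
    """
    if not text:
        raise ValueError("text must be non-empty")
    seq: List[Symbol] = list(text)
    while len(seq) > 1:
        nxt: List[Symbol] = [
            naming.name(seq[i], seq[i + 1]) for i in range(0, len(seq) - 1, 2)
        ]
        if len(seq) % 2 == 1:
            nxt.append(seq[-1])  # an unpaired last symbol is carried up unchanged
        seq = nxt
    return seq[0]


def expand(symbol: Symbol, rules: List[Digram]) -> str:
    """Derive the single string that a symbol generates."""
    if isinstance(symbol, str):
        return symbol
    left, right = rules[symbol]
    return expand(left, rules) + expand(right, rules)


def lower_bound_bits(n: int) -> float:
    """2n + log2(n!) from the abstract, ignoring the o(n) term (base 2 assumed)."""
    return 2 * n + math.lgamma(n + 1) / math.log(2)
```

For example, calling `build_slp("abababab", naming)` on a fresh `NamingFunction` creates three production rules, because every repeated digram is mapped to the same symbol, and `expand` on the returned start symbol reproduces the input. The dictionary in this sketch is the part whose memory footprint the abstract identifies as dominant during construction.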

Comments:	Accepted to the 24th Annual Symposium on Combinatorial Pattern Matching (CPM 2013)
Subjects:	Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:1304.0917 [cs.DS]
	(or arXiv:1304.0917v3 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.1304.0917

Submission history

From: Yasuo Tabei
[v1] Wed, 3 Apr 2013 11:13:00 UTC (2,350 KB)
[v2] Thu, 4 Apr 2013 03:15:33 UTC (2,338 KB)
[v3] Sat, 15 Jun 2013 02:13:38 UTC (2,345 KB)
