Answering multiple queries in compressed texts

Bin Wang*, Minghe Yu, Xiaochun Yang, Guoren Wang

*此作品的通讯作者

科研成果: 会议稿件论文同行评审

摘要

With the exponential increment of data, compression technology becomes an important tool in the field of data management, especially in text management. An increasing pressing challenge is how to efficiently query these massive amounts of sequence data in their compressed format. In this paper we study the problem of answering subsequence-search queries on LZ78 format of texts. We propose the concept of conditional common sub strings of queries to improve query performance. We present a techniques to find minimal conditional common sub strings in compressed text and a local uncompressing technique to verify and locate positions of answers in text. Finally, the experimental results over real data demonstrate the efficiency of our algorithm.

源语言英语
61-66
页数6
DOI
出版状态已出版 - 2012
已对外发布
活动9th Web Information Systems and Applications Conference, WISA 2012 - Haikou, Hainan, 中国
期限: 16 11月 201218 11月 2012

会议

会议9th Web Information Systems and Applications Conference, WISA 2012
国家/地区中国
Haikou, Hainan
时期16/11/1218/11/12

指纹

探究 'Answering multiple queries in compressed texts' 的科研主题。它们共同构成独一无二的指纹。

引用此