Finding repetitions in DNA sequences based on a new index-succeeding unit array

Di Wang*, Baichen Chen, Qingquan Wu, Yi Zhao, Changyong Yu, Guoren Wang

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Because repetitions in a DNA sequence are of great biological significance, searching for them has naturally become an important topic in gene analysis. This paper proposes two new concepts of repetitions: LPR for perfect repetitions and TSAR for approximate repetitions. A lightweight index structure, the Succeeding Unit Array (SUA), is designed based on the pattern unit. The SUA efficiently reduces space consumption and solves the space bottleneck in searching for repetitions. All LPRs and TSARs can be detected on the SUA. Theoretical analysis and experimental results show that both the space and time complexity of the algorithms are satisfactory.

Original language: English
Pages (from-to): 1371-1378
Number of pages: 8
Journal: Journal of Computational Information Systems
Volume: 2
Issue number: 4
Publication status: Published - Nov 2006
Externally published: Yes

Keywords

  • LPRs
  • Repetitions
  • Succeeding unit array
  • TSARs
