Abstract
Since the repetitions in a DNA sequence are of great biological significance searching for the repetitions has naturally been an important topic in gene analysis. This paper proposes two new concepts of repetitions-LPR for perfect repetitions and TSAR for approximate repetitions. A lightweight index structure, namely, the Succeeding Unit Array (SUA) is designed based on pattern unit. The SUA decreases the space consumption efficiently and solves the space bottleneck in search of repetitions. On the SUA all the LPRs and TSARs can be detected. The theoretical analysis and experimental results show that both space and time complexity of the algorithms is satisfying.
| Original language | English |
|---|---|
| Pages (from-to) | 1371-1378 |
| Number of pages | 8 |
| Journal | Journal of Computational Information Systems |
| Volume | 2 |
| Issue number | 4 |
| Publication status | Published - Nov 2006 |
| Externally published | Yes |
Keywords
- LPRs
- Repetitions
- Succeeding unit array
- TSARs