A new method for finding approximate repetitions in DNA sequences

Di Wang*, Guoren Wang, Qingquan Wu, Baichen Chen, Yi Zhao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Searching for approximate repetitions in a DNA sequence is an important topic in gene analysis. One difficulty is that, because patterns vary in length, the similarity between patterns cannot be judged accurately using edit distance (ED) alone. In this paper we define a new similarity function that considers both the differences and the similarities between patterns at the same time. Given the computational cost involved, we also propose two new filter methods, based on frequency distance and Pearson correlation, that efficiently select a candidate set of approximate repetitions. We use SUA instead of a sliding window to extract fragments from a DNA sequence, so the patterns of an approximate repetition are not limited in length. The results show that our technique finds more approximate repetitions than tandem repeat finder.
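The abstract does not give the paper's definitions, so the following is only a minimal sketch of how a filter-then-verify step of this kind might look. It assumes the common definition of frequency distance over base-count vectors (the larger of the total surplus and total deficit between the two vectors), which never exceeds the edit distance, and it assumes Pearson correlation is applied to those same count vectors. The function names, the `max_edits` and `min_corr` thresholds, and the choice of count vectors are all hypothetical and are not taken from the paper; its new similarity function and SUA are not reproduced here.

```python
from collections import Counter
from math import sqrt

ALPHABET = "ACGT"  # assumed alphabet; other symbols are ignored


def freq_vector(s):
    """Count of each base in s, in ALPHABET order."""
    counts = Counter(s)
    return [counts.get(base, 0) for base in ALPHABET]


def frequency_distance(a, b):
    """Max of total surplus and total deficit between the count vectors.

    Each single edit changes this value by at most 1, so it is a lower
    bound on the edit distance between a and b.
    """
    fa, fb = freq_vector(a), freq_vector(b)
    surplus = sum(x - y for x, y in zip(fa, fb) if x > y)
    deficit = sum(y - x for x, y in zip(fa, fb) if y > x)
    return max(surplus, deficit)


def pearson(xs, ys):
    """Pearson correlation of two equal-length vectors.

    Returns 0.0 when either vector has zero variance (an assumption made
    here to avoid division by zero).
    """
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def edit_distance(a, b):
    """Classic dynamic-programming edit distance (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # match or substitution
        prev = cur
    return prev[-1]


def passes_filters(a, b, max_edits, min_corr):
    """Cheap pre-check before running the costly comparison.

    The frequency-distance test cannot discard a pair whose edit distance
    is within max_edits; the correlation test is a heuristic.
    """
    if frequency_distance(a, b) > max_edits:
        return False
    return pearson(freq_vector(a), freq_vector(b)) >= min_corr
```

In such a scheme, only the pairs for which `passes_filters` returns true would be handed to the more expensive similarity computation, which is what makes the candidate selection cheap.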

Original language: English
Title of host publication: Advances in Web-Age Information Management - 7th International Conference, WAIM 2006, Proceedings
Publisher: Springer Verlag
Pages: 397-409
Number of pages: 13
ISBN (Print): 3540352252, 9783540352259
Publication status: Published - 2006
Externally published: Yes
Event: 7th International Conference on Advances in Web-Age Information Management, WAIM 2006 - Hong Kong, China
Duration: 17 Jun 2006 to 19 Jun 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4016 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 7th International Conference on Advances in Web-Age Information Management, WAIM 2006
Country/Territory: China
City: Hong Kong
Period: 17/06/06 to 19/06/06
