Skip to main navigation Skip to search Skip to main content

A two-tire index structure for approximate string matching with block moves

  • Bin Wang*
  • , Long Xie
  • , Guoren Wang
  • *Corresponding author for this work
  • Northeastern University
  • Northeastern University China
  • Liaoning University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Many applications need to solve the problem of approximate string matching with block moves. It is an NP-Complete problem to compute block edit distance between two strings. Our goal is to filter non-candidate strings as much as possible. Based on the two matured filter strategies, frequency distance and positional q-gram, we propose a two-tire index structure to make the use of the two filters more efficiently. We give a full specification of the index structure, including how to choose character order to achieve a better filterability and how to balance number of strings in different clusters. We present our experiments on real data sets to evaluate our technique and show the proposed index structure can provide a good performance.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - DASFAA 2009 International Workshops
Subtitle of host publicationBenchmarX, MCIS, WDPP, PPDA, MBC, PhD
Pages197-211
Number of pages15
DOIs
Publication statusPublished - 2009
Externally publishedYes
EventInternational Workshops on Database Systems for Advanced Applications, DASFAA 2009: BenchmarX, MCIS, WDPP, PPDA, MBC, PhD - Brisbane, QLD, Australia
Duration: 20 Apr 200923 Apr 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5667 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Workshops on Database Systems for Advanced Applications, DASFAA 2009: BenchmarX, MCIS, WDPP, PPDA, MBC, PhD
Country/TerritoryAustralia
CityBrisbane, QLD
Period20/04/0923/04/09

Fingerprint

Dive into the research topics of 'A two-tire index structure for approximate string matching with block moves'. Together they form a unique fingerprint.

Cite this