Computing structural similarity of source XML schemas against domain XML schema

Jianxin Li*, Chengfei Liu, Jeffrey Xu Yu, Jixue Liu, Guoren Wang, Chi Yangt

*此作品的通讯作者

科研成果: 期刊稿件会议文章同行评审

2 引用 (Scopus)

摘要

In this paper, we study the problem of measuring structural similarities of large number of source schemas against a single domain schema, which is useful for enhancing the quality of searching and ranking big volume of source documents on the Web with the help of structural information. After analyzing the improperness of adopting existing edit-distance based methods, we propose a new similarity measure model that caters for the requirements of the problem. Given the asymmetric nature of the similarity comparisons of source schemas with a domain schema, similarity preserving rules and algorithm are designed to filter out uninteresting elements in source schemas for the purpose of optimizing the similarity computation. Based on the model, a basic algorithm and an improved algorithm are developed for structural similarity computation. The improved algorithm makes full use of a new coding scheme that is devised to reduce the number of comparisons. Complexities of both algorithms are analyzed and extensive experiments are conducted showing the significant performance gain achieved by the improved algorithm.

源语言英语
页(从-至)155-164
页数10
期刊Conferences in Research and Practice in Information Technology Series
75
出版状态已出版 - 2008
已对外发布
活动19th Australasian Database Conference, ADC 2008 - Wollongong, NSW, 澳大利亚
期限: 1 1月 20081 1月 2008

指纹

探究 'Computing structural similarity of source XML schemas against domain XML schema' 的科研主题。它们共同构成独一无二的指纹。

引用此