Top-K generation of mediated schemas over multiple data sources

Guohui Ding*, Guoren Wang, Bin Wang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Schema integration has been widely used in many database applications, such as Data Warehousing, Life Science and Ontology Merging. Though schema integration has been intensively studied in recent yeas, it is still a challenging issue, because it is almost impossible to find the perfect target schema. An automatic method to schema integration, which explores multiple possible integrated schemas over a set of source schemas from the same domain, is proposed in this paper. Firstly, the concept graph is introduced to represent the source schemas at a higher-level of abstraction. Secondly, we divide the similarity between concepts into intervals to generate three merging strategies for schemas. Finally, we design a novel top-k ranking algorithm for the automatic generation of the best candidate mediated schemas. The key component of our algorithm is the pruning technique which uses the ordered buffer and the threshold to filter out the candidates. The extensive experimental studies show that our algorithm is effective and runs in polynomial time.

源语言英语
主期刊名Database Systems for Advanced Applications - 15th International Conference, DASFAA 2010, International Workshops
主期刊副标题GDM, BenchmarX, MCIS, SNSMW, DIEW, UDM, Revised Selected Papers
143-155
页数13
DOI
出版状态已出版 - 2010
已对外发布
活动15th International Conference on Database Systems for Advanced Applications, DASFAA 2010 - Tsukuba, 日本
期限: 1 4月 20104 4月 2010

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
6193 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议15th International Conference on Database Systems for Advanced Applications, DASFAA 2010
国家/地区日本
Tsukuba
时期1/04/104/04/10

指纹

探究 'Top-K generation of mediated schemas over multiple data sources' 的科研主题。它们共同构成独一无二的指纹。

引用此