Outdated fact detection in knowledge bases

Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu

科研成果: 书/报告/会议事项章节会议稿件同行评审

13 引用 (Scopus)

摘要

Knowledge bases (KBs), which store high-quality information, are crucial for many applications, such as enhancing search results and serving as external sources for data cleaning. Not surprisingly, there exist outdated facts in most KBs due to the rapid change of information. Naturally, it is important to keep KBs up-to-date. Traditional wisdom has investigated the problem of using reference data (such as new facts extracted from the news) to detect outdated facts in KBs. However, existing approaches can only cover a small percentage of facts in KBs. In this paper, we propose a novel human-in-the-loop approach for outdated fact detection in KBs. It trains a binary classifier using features such as historical update frequency and existence time of a fact to compute the likelihood of a fact in a KB to be outdated. Then, it interacts with humans to verify whether a fact with high likelihood is indeed outdated. In addition, it also uses logical rules to detect more outdated facts based on human feedback. The outdated facts detected by the logical rules will also be fed back to train the ML model further for data augmentation. Extensive experiments on real-world KBs, such as Yago and DBpedia, show the effectiveness of our solution.

源语言英语
主期刊名Proceedings - 2020 IEEE 36th International Conference on Data Engineering, ICDE 2020
出版商IEEE Computer Society
1890-1893
页数4
ISBN(电子版)9781728129037
DOI
出版状态已出版 - 4月 2020
已对外发布
活动36th IEEE International Conference on Data Engineering, ICDE 2020 - Dallas, 美国
期限: 20 4月 202024 4月 2020

出版系列

姓名Proceedings - International Conference on Data Engineering
2020-April
ISSN(印刷版)1084-4627

会议

会议36th IEEE International Conference on Data Engineering, ICDE 2020
国家/地区美国
Dallas
时期20/04/2024/04/20

指纹

探究 'Outdated fact detection in knowledge bases' 的科研主题。它们共同构成独一无二的指纹。

引用此