HOFD: An Outdated Fact Detector for Knowledge Bases

Shuang Hao, Chengliang Chai*, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Knowledge bases (KBs), which store high-quality information, are crucial for many applications, such as enhancing search results and serving as external sources for data cleaning. Not surprisingly, there exist outdated facts in most KBs due to the rapid change of information. Naturally, it is important to keep KBs up-to-date. Traditional wisdom has investigated the problem of using reference data (such as new facts extracted from the news) to detect outdated facts in KBs. However, existing approaches can only cover a small percentage of facts in KBs. In this paper, we propose HOFD, a novel human-in-the-loop approach for outdated fact detection in KBs. HOFD trains a binary classifier using features such as historical update frequency and update time of a fact to compute the likelihood of a fact in a KB to be outdated. Then, HOFD interacts with humans to verify whether a fact with high likelihood is indeed outdated. In addition, HOFD also uses logical rules to detect more outdated facts based on human feedback. The outdated facts detected by the logical rules will also be fed back to train the ML model further for data augmentation. Extensive experiments on real-world KBs, such as Yago and DBpedia, show the effectiveness of our solution.

Original languageEnglish
Pages (from-to)10775-10789
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume35
Issue number10
DOIs
Publication statusPublished - 1 Oct 2023

Keywords

  • Fact prediction
  • human-in-the-loop
  • knowledge base
  • logical rule
  • outdated fact

Fingerprint

Dive into the research topics of 'HOFD: An Outdated Fact Detector for Knowledge Bases'. Together they form a unique fingerprint.

Cite this