Knowledge fusion framework based on web page texts

Sikang Hu*, Yuanda Cao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

With the proliferation of Web page texts, it is important to fuse these texts to useful documents that users need. However, there is still no complete and unified theoretical model for studying the research issues including redundancy, localization, and fuzziness existing in the process of fusing Web page texts. This paper proposes a fusion frame-work called Web Pages Knowledge Fusion Framework (WP-KFF) to synthesize the knowledge of Web page texts. First, sentences in Web page texts are extracted and transformed into triple semantic net as knowledge representation. Then a semantic description of attribute fusion rules, description information fusion rules and attribute-value and description information fusion rules are defined in WPKFF. These rules are used to fuse the attributes of same domain concepts in triple semantic net. The features of attributes include de-scription (string) and value data (number). The results of the experiments indicate that the fusion framework is a feasible model in terms of precision and recall.

Original languageEnglish
Pages (from-to)457-464
Number of pages8
JournalFrontiers of Computer Science in China
Volume3
Issue number4
DOIs
Publication statusPublished - 2009

Keywords

  • Fusion framework
  • Fusion rules
  • Knowledge acquisition
  • Web text formal semanteme

Fingerprint

Dive into the research topics of 'Knowledge fusion framework based on web page texts'. Together they form a unique fingerprint.

Cite this