Keyword Search over Probabilistic XML Documents Based on Node Classification

Yue Zhao*, Ye Yuan, Guoren Wang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

This paper describes a keyword search method for probabilistic XML data based on the extreme learning machine (ELM). Keyword search over a probabilistic XML document differs from search over a traditional XML document in that it must take possible-world semantics into account. A probabilistic XML document can be viewed as a set of nodes consisting of ordinary nodes and distributional nodes. ELM performs well in text classification applications. XML is typical semistructured data whose labels are self-describing, so the label and context of a node can be treated as the text data of that node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Once the nodes have been classified with ELM, the SLCA (smallest lowest common ancestor) can be computed quickly by set intersection over the classified node sets. In this paper, we adopt ELM to classify nodes and compute probabilities. We propose two algorithms, based on ELM and a probability threshold, to improve overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.
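
To make the role of set intersection in SLCA computation concrete, here is a minimal Python sketch on an ordinary (non-probabilistic) tree. It is not the paper's algorithm: it leaves out ELM classification, distributional nodes, and the probability threshold entirely. The Dewey-style tuple labels, the function names `ancestors_or_self` and `slca`, and the sample data are all assumptions made for illustration.

```python
def ancestors_or_self(dewey):
    """Return every prefix of a Dewey label, i.e. the node and all its ancestors.

    Example: (1, 2, 3) -> {(1,), (1, 2), (1, 2, 3)}
    """
    return {dewey[:i] for i in range(1, len(dewey) + 1)}


def slca(keyword_matches):
    """Compute SLCA nodes from per-keyword match sets (hypothetical helper).

    keyword_matches: list of sets of Dewey labels (tuples), one set per keyword.
    """
    # If any keyword has no match, no node can cover all keywords.
    if not keyword_matches or any(not m for m in keyword_matches):
        return set()

    # For each keyword, collect the nodes whose subtree contains a match.
    covers = []
    for matches in keyword_matches:
        covered = set()
        for label in matches:
            covered |= ancestors_or_self(label)
        covers.append(covered)

    # Set intersection: nodes whose subtree contains every keyword.
    common = set.intersection(*covers)

    # Keep only the "smallest" ones: no proper descendant is also in `common`.
    return {
        n for n in common
        if not any(len(m) > len(n) and m[:len(n)] == n for m in common)
    }


# Illustrative data (assumed): two keywords matched at these node labels.
xml_nodes = {(1, 1, 1), (1, 2, 1)}
search_nodes = {(1, 1, 2), (1, 2, 2, 1)}
result = slca([xml_nodes, search_nodes])
print(sorted(result))
```

In this sample, the root `(1,)` also covers both keywords but is excluded because its descendants `(1, 1)` and `(1, 2)` already do. A probabilistic variant would additionally have to weigh each candidate by the possible worlds in which it exists, which this sketch does not attempt.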

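Original language: English
Article number: 210961
Journal: Mathematical Problems in Engineering
Volume: 2015
DOI: https://doi.org/10.1155/2015/210961
Publication status: Published - 2015
Externally published: Yes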

Cite this

Zhao, Y., Yuan, Y., & Wang, G. (2015). Keyword Search over Probabilistic XML Documents Based on Node Classification. Mathematical Problems in Engineering, 2015, Article 210961. https://doi.org/10.1155/2015/210961