Keyword Search over Probabilistic XML Documents Based on Node Classification

Yue Zhao; Ye Yuan; Guoren Wang

doi:10.1155/2015/210961

Keyword Search over Probabilistic XML Documents Based on Node Classification

Yue Zhao^*, Ye Yuan, Guoren Wang

^*此作品的通讯作者

Northeastern University China

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.

源语言	英语
文章编号	210961
期刊	Mathematical Problems in Engineering
卷	2015
DOI	https://doi.org/10.1155/2015/210961
出版状态	已出版 - 2015
已对外发布	是

访问文件

10.1155/2015/210961

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhao, Y., Yuan, Y., & Wang, G. (2015). Keyword Search over Probabilistic XML Documents Based on Node Classification. Mathematical Problems in Engineering, 2015, 文章 210961. https://doi.org/10.1155/2015/210961

@article{f9d18acaaee14ae99254f89380845db9,

title = "Keyword Search over Probabilistic XML Documents Based on Node Classification",

abstract = "This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.",

author = "Yue Zhao and Ye Yuan and Guoren Wang",

note = "Publisher Copyright: {\textcopyright} 2015 Yue Zhao et al.",

year = "2015",

doi = "10.1155/2015/210961",

language = "English",

volume = "2015",

journal = "Mathematical Problems in Engineering",

issn = "1024-123X",

publisher = "John Wiley and Sons Ltd",

}

TY - JOUR

T1 - Keyword Search over Probabilistic XML Documents Based on Node Classification

AU - Zhao, Yue

AU - Yuan, Ye

AU - Wang, Guoren

PY - 2015

Y1 - 2015

N2 - This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.

AB - This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.

UR - http://www.scopus.com/inward/record.url?scp=84935836754&partnerID=8YFLogxK

U2 - 10.1155/2015/210961

DO - 10.1155/2015/210961

M3 - Article

AN - SCOPUS:84935836754

SN - 1024-123X

VL - 2015

JO - Mathematical Problems in Engineering

JF - Mathematical Problems in Engineering

M1 - 210961

ER -

Keyword Search over Probabilistic XML Documents Based on Node Classification

摘要

访问文件

其它文件与链接

指纹

引用此