基于深度学习的创新主题智能挖掘算法研究

Changlei Fu; Li Qian; Huaping Zhang; Huaming Zhao; Jing Xie

doi:10.11925/infotech.2096-3467.2018.1365

基于深度学习的创新主题智能挖掘算法研究

Translated title of the contribution: Mining Innovative Topics Based on Deep Learning

Changlei Fu, Li Qian^*, Huaping Zhang, Huaming Zhao, Jing Xie

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

[Objective] This paper aims to identify innovative topics from massive volumes of texts. [Methods] First, we extracted knowledge points with heavier weights from the data of scholarly knowledge graph. Then, these knowledge points were labeled as innovative seeds from the perspectives of “popularity”, “novelty” and “authority”. Third, we computed the knowledge correlation of the innovative seeds. Finally, the results were input to a deep learning model trained by large amounts of sci-tech papers to generate innovative topics. Note: the model is sequence to sequence with Bi-LSTM. [Results] We used Chinese research papers on artificial intelligence as the experimental data and found the average innovation score of the retrieved topics was 6.52, which were evaluated by experts manually. [Limitations] At present, contents of the knowledge graph and the training datasets need to be improved. [Conclusions] The proposed model, which identifies innovative topics from scholarly papers, could be optimized in the future.

Translated title of the contribution	Mining Innovative Topics Based on Deep Learning
Original language	Chinese (Traditional)
Pages (from-to)	46-54
Number of pages	9
Journal	Data Analysis and Knowledge Discovery
Volume	3
Issue number	1
DOIs	https://doi.org/10.11925/infotech.2096-3467.2018.1365
Publication status	Published - Jan 2019

Access to Document

10.11925/infotech.2096-3467.2018.1365

Cite this

@article{17b4046a37784d15aba9e61e7105be1a,

title = "基于深度学习的创新主题智能挖掘算法研究",

abstract = "[Objective] This paper aims to identify innovative topics from massive volumes of texts. [Methods] First, we extracted knowledge points with heavier weights from the data of scholarly knowledge graph. Then, these knowledge points were labeled as innovative seeds from the perspectives of “popularity”, “novelty” and “authority”. Third, we computed the knowledge correlation of the innovative seeds. Finally, the results were input to a deep learning model trained by large amounts of sci-tech papers to generate innovative topics. Note: the model is sequence to sequence with Bi-LSTM. [Results] We used Chinese research papers on artificial intelligence as the experimental data and found the average innovation score of the retrieved topics was 6.52, which were evaluated by experts manually. [Limitations] At present, contents of the knowledge graph and the training datasets need to be improved. [Conclusions] The proposed model, which identifies innovative topics from scholarly papers, could be optimized in the future.",

keywords = "Deep Learning, Innovative Topic, Intelligent Mining, Seq2Seq",

author = "Changlei Fu and Li Qian and Huaping Zhang and Huaming Zhao and Jing Xie",

year = "2019",

month = jan,

doi = "10.11925/infotech.2096-3467.2018.1365",

language = "繁体中文",

volume = "3",

pages = "46--54",

journal = "Data Analysis and Knowledge Discovery",

issn = "2096-3467",

publisher = "Chinese Academy of Sciences",

number = "1",

}

TY - JOUR

T1 - 基于深度学习的创新主题智能挖掘算法研究

AU - Fu, Changlei

AU - Qian, Li

AU - Zhang, Huaping

AU - Zhao, Huaming

AU - Xie, Jing

PY - 2019/1

Y1 - 2019/1

N2 - [Objective] This paper aims to identify innovative topics from massive volumes of texts. [Methods] First, we extracted knowledge points with heavier weights from the data of scholarly knowledge graph. Then, these knowledge points were labeled as innovative seeds from the perspectives of “popularity”, “novelty” and “authority”. Third, we computed the knowledge correlation of the innovative seeds. Finally, the results were input to a deep learning model trained by large amounts of sci-tech papers to generate innovative topics. Note: the model is sequence to sequence with Bi-LSTM. [Results] We used Chinese research papers on artificial intelligence as the experimental data and found the average innovation score of the retrieved topics was 6.52, which were evaluated by experts manually. [Limitations] At present, contents of the knowledge graph and the training datasets need to be improved. [Conclusions] The proposed model, which identifies innovative topics from scholarly papers, could be optimized in the future.

AB - [Objective] This paper aims to identify innovative topics from massive volumes of texts. [Methods] First, we extracted knowledge points with heavier weights from the data of scholarly knowledge graph. Then, these knowledge points were labeled as innovative seeds from the perspectives of “popularity”, “novelty” and “authority”. Third, we computed the knowledge correlation of the innovative seeds. Finally, the results were input to a deep learning model trained by large amounts of sci-tech papers to generate innovative topics. Note: the model is sequence to sequence with Bi-LSTM. [Results] We used Chinese research papers on artificial intelligence as the experimental data and found the average innovation score of the retrieved topics was 6.52, which were evaluated by experts manually. [Limitations] At present, contents of the knowledge graph and the training datasets need to be improved. [Conclusions] The proposed model, which identifies innovative topics from scholarly papers, could be optimized in the future.

KW - Deep Learning

KW - Innovative Topic

KW - Intelligent Mining

KW - Seq2Seq

UR - http://www.scopus.com/inward/record.url?scp=85166937660&partnerID=8YFLogxK

U2 - 10.11925/infotech.2096-3467.2018.1365

DO - 10.11925/infotech.2096-3467.2018.1365

M3 - 文章

AN - SCOPUS:85166937660

SN - 2096-3467

VL - 3

SP - 46

EP - 54

JO - Data Analysis and Knowledge Discovery

JF - Data Analysis and Knowledge Discovery

IS - 1

ER -

基于深度学习的创新主题智能挖掘算法研究

Abstract

Access to Document

Other files and links

Fingerprint

Cite this