A discretization algorithm of numerical attributes for digital library evaluation based on data mining technology

Yumin Zhao*, Zhendong Niu, Xueping Peng, Lin Dai

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

We present here a discretization algorithm for numerical attributes of digital collections. In our research data mining technology is imported into digital library evaluation to provide a better decision-making support. But data prediction algorithms work not well based on the traditional discretization method during the data mining process. The reason is that numerical attributes of digital collections are complicated and not in the same scale of distribution distance. We study the characteristic of numerical attributes and put forward a discretization method based on the Z-score idea of mathematical statistics. This algorithm can reflect the dynamic semantic distance for different numerical attributes and significantly enhance the precision rate and recall rate of data prediction algorithms. Furthermore a 'nonlinear conditional relationship' among attributes of digital collections is discovered during the study of discretization algorithm and impacts the actual application result of traditional data mining algorithms.

源语言英语
主期刊名Digital Libraries
主期刊副标题For Cultural Heritage, Knowledge Dissemination, and Future Creation - 13th International Conference on Asia-Pacific Digital Libraries, ICADL 2011, Proceedings
70-76
页数7
DOI
出版状态已出版 - 2011
活动13th International Conference on Asia-Pacific Digital Libraries, ICADL 2011 - Beijing, 中国
期限: 24 10月 201127 10月 2011

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
7008 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议13th International Conference on Asia-Pacific Digital Libraries, ICADL 2011
国家/地区中国
Beijing
时期24/10/1127/10/11

指纹

探究 'A discretization algorithm of numerical attributes for digital library evaluation based on data mining technology' 的科研主题。它们共同构成独一无二的指纹。

引用此