Mining frequent itemsets based on projection array

Hai Tao He*, Hai Yan Cao, Rui Xia Yao, Jia Dong Ren, Chang Zhen Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Frequent itemsets mining is a crucial problem in the field of data mining. Although many related studies have been suggested, these algorithms may suffer from high computation cost and spatial complexity in dense database, especially when mining long frequent itemsets or support threshold is very lower. To address this problem, a new data structure called PArray is proposed. PArray makes use of data horizontally and vertically like BitTableFI, and those itemsets that co-occurence with single frequent items are found by computing intersection in PArray. Then, a new algorithm, call MFIPA, is proposed based on PArray. Some frequent itemsets which have the same supports as single frequent item can be found firstly by connecting the single frequent item with every nonempty subsets of its projection, then all other frequent itemsets can be found by using depth-first search strategy. The experimental results show that the proposed algorithm is superior to BitTableFI in execution efficiency and memory requirement, especially for dense database.

Original languageEnglish
Title of host publication2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Pages454-459
Number of pages6
DOIs
Publication statusPublished - 2010
Event2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010 - Qingdao, China
Duration: 11 Jul 201014 Jul 2010

Publication series

Name2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Volume1

Conference

Conference2010 International Conference on Machine Learning and Cybernetics, ICMLC 2010
Country/TerritoryChina
CityQingdao
Period11/07/1014/07/10

Keywords

  • Depth-first search
  • Frequent itemsets
  • Projection array

Fingerprint

Dive into the research topics of 'Mining frequent itemsets based on projection array'. Together they form a unique fingerprint.

Cite this