Mining top-K frequent closed patterns without minimum support

Jiawei Han, Jianyong Wang, Ying Lu, Petre Tzvetkov

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min-l, where k is the desired number of frequent closed patterns to be mined, and min-l is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed-node-count and descendant-sum are proposed to effectively raise support threshold and prune FP-tree both during and after the construction of FP-tree. During the mining process, a novel top-down and bottom-up combined FP-tree mining strategy is developed to speed-up support-raising and closed frequent pattern discovering. In addition, a fast hash-based closed pattern verification scheme has been employed to check efficiently if a potential closed pattern is really closed. Our performance study shows that in most cases, TFP outperforms CLOSET and CHARM, two efficient frequent closed pattern mining algorithms, even when both are running with the best tuned min-support. Furthermore, the method can be extended to generate association rules and to incorporate user-specified constraints. Thus we conclude that for frequent pattern mining, mining top-k frequent closed patterns without min-support is more preferable than the traditional min-support-based mining.

Original languageEnglish (US)
Title of host publicationProceedings - 2002 IEEE International Conference on Data Mining, ICDM 2002
Pages211-218
Number of pages8
StatePublished - Dec 1 2002
Event2nd IEEE International Conference on Data Mining, ICDM '02 - Maebashi, Japan
Duration: Dec 9 2002Dec 12 2002

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Other

Other2nd IEEE International Conference on Data Mining, ICDM '02
Country/TerritoryJapan
CityMaebashi
Period12/9/0212/12/02

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint

Dive into the research topics of 'Mining top-K frequent closed patterns without minimum support'. Together they form a unique fingerprint.

Cite this