TY - GEN
T1 - TSP
T2 - 3rd IEEE International Conference on Data Mining, ICDM '03
AU - Tzvetkov, Petre
AU - Yan, Xifeng
AU - Han, Jiawei
PY - 2003
Y1 - 2003
N2 - Sequential pattern mining has been studied extensively in data mining community. Most previous studies require the specification of a minimum support threshold to perform the mining. However, it is difficult for users to provide an appropriate threshold in practice. To overcome this difficulty, we propose an alternative task: mining top-k frequent closed sequential patterns of length no less than min-ℓ, where k is the desired number of closed sequential patterns to be mined, and min-ℓ is the minimum length of each pattern. We mine closed patterns since they are compact representations of frequent patterns. We developed an efficient algorithm, called TSP, which makes use of the length constraint and the properties of top-k closed sequential patterns to perform dynamic support-raising and projected database-pruning. Our extensive performance study shows that TSP outperforms the closed sequential pattern mining algorithm even when the latter is running with the best tuned minimum support threshold.
AB - Sequential pattern mining has been studied extensively in data mining community. Most previous studies require the specification of a minimum support threshold to perform the mining. However, it is difficult for users to provide an appropriate threshold in practice. To overcome this difficulty, we propose an alternative task: mining top-k frequent closed sequential patterns of length no less than min-ℓ, where k is the desired number of closed sequential patterns to be mined, and min-ℓ is the minimum length of each pattern. We mine closed patterns since they are compact representations of frequent patterns. We developed an efficient algorithm, called TSP, which makes use of the length constraint and the properties of top-k closed sequential patterns to perform dynamic support-raising and projected database-pruning. Our extensive performance study shows that TSP outperforms the closed sequential pattern mining algorithm even when the latter is running with the best tuned minimum support threshold.
UR - http://www.scopus.com/inward/record.url?scp=78149328301&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78149328301&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:78149328301
SN - 0769519784
SN - 9780769519784
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 347
EP - 354
BT - Proceedings - 3rd IEEE International Conference on Data Mining, ICDM 2003
Y2 - 19 November 2003 through 22 November 2003
ER -