TY - JOUR
T1 - Efficient rule-based attribute-oriented induction for data mining
AU - Cheung, David W.
AU - Hwang, H. Y.
AU - Fu, Ada W.
AU - Han, Jiawei
N1 - Funding Information:
The research of the authors were supported in part by RGC (the Hong Kong Research Grants Council) grant 338/065/0026. Research of the fourth author was supported in part by grants from the Natural Sciences and Engineering Research Council of Canada the Centre for Systems Science of Simon Fraser University.
PY - 2000
Y1 - 2000
N2 - Data mining has become an important technique which has tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique and has been successfully implemented in the data mining system DBMiner. However, its induction capability is limited by the unconditional concept generalization. In this paper, we extend the concept generalization to rule-based concept hierarchy, which enhances greatly its induction power. When previously proposed induction algorithm is applied to the more general rule-based case, a problem of induction anomaly occurs which impacts its efficiency. We have developed an efficient algorithm to facilitate induction on the rule-based case which can avoid the anomaly. Performance studies have shown that the algorithm is superior than a previously proposed algorithm based on backtracking.
AB - Data mining has become an important technique which has tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique and has been successfully implemented in the data mining system DBMiner. However, its induction capability is limited by the unconditional concept generalization. In this paper, we extend the concept generalization to rule-based concept hierarchy, which enhances greatly its induction power. When previously proposed induction algorithm is applied to the more general rule-based case, a problem of induction anomaly occurs which impacts its efficiency. We have developed an efficient algorithm to facilitate induction on the rule-based case which can avoid the anomaly. Performance studies have shown that the algorithm is superior than a previously proposed algorithm based on backtracking.
UR - http://www.scopus.com/inward/record.url?scp=0034275407&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034275407&partnerID=8YFLogxK
U2 - 10.1023/A:1008778107391
DO - 10.1023/A:1008778107391
M3 - Article
AN - SCOPUS:0034275407
SN - 0925-9902
VL - 15
SP - 175
EP - 200
JO - Journal of Intelligent Information Systems
JF - Journal of Intelligent Information Systems
IS - 2
M1 - 267357
ER -