TY - JOUR
T1 - Generalization-based data mining in object-oriented databases using an object cube model
AU - Han, Jiawei
AU - Nishio, Shojiro
AU - Kawano, Hiroyuki
AU - Wang, Wei
N1 - This research was supported in part by the following: for the first author, the grants from the Natural Sciences and Engineering Research Council of Canada and NCE/IRIS; for the second author, the Ministry of Education, Science, Sports, Sports and Culture of Japan under Scientific Research Grant-in-Aid, for the first three authors, the Research Grant-in-Aid from the Artificial Intelligence Research Promotion Foundation, Japan, and for the third author, a scholarship from the Ministry of Education, Science, Sports and Culture of Japan and the work was done during his visit to Simon Fraser University.
PY - 1998/3
Y1 - 1998/3
N2 - Data mining is the discovery of knowledge and useful information from the large amounts of data stored in databases. With the increasing popularity of object-oriented database systems in advanced database applications, it is important to study the data mining methods for object-oriented databases because mining knowledge from such databases may improve understanding, organization, and utilization of the data stored there. In this paper, issues on generalization-based data mining in object-oriented databases are investigated in three aspects: (1) generalization of complex objects, (2) class-based generalization, and (3) extraction of different kinds of rules. An object cube model is proposed for class-based generalization, on-line analytical processing, and data mining. The study shows that (i) a set of sophisticated generalization operators can be constructed for generalization of complex data objects, (ii) a dimension-based class generalization mechanism can be developed for object cube construction, and (iii) sophisticated rule formation methods can be developed for extraction of different kinds of knowledge from data, including characteristic rules, discriminant rules, association rules, and classification rules. Furthermore, the application of such discovered knowledge may substantially enhance the power and flexibility of browsing databases, organizing databases and querying data and knowledge in object-oriented databases.
AB - Data mining is the discovery of knowledge and useful information from the large amounts of data stored in databases. With the increasing popularity of object-oriented database systems in advanced database applications, it is important to study the data mining methods for object-oriented databases because mining knowledge from such databases may improve understanding, organization, and utilization of the data stored there. In this paper, issues on generalization-based data mining in object-oriented databases are investigated in three aspects: (1) generalization of complex objects, (2) class-based generalization, and (3) extraction of different kinds of rules. An object cube model is proposed for class-based generalization, on-line analytical processing, and data mining. The study shows that (i) a set of sophisticated generalization operators can be constructed for generalization of complex data objects, (ii) a dimension-based class generalization mechanism can be developed for object cube construction, and (iii) sophisticated rule formation methods can be developed for extraction of different kinds of knowledge from data, including characteristic rules, discriminant rules, association rules, and classification rules. Furthermore, the application of such discovered knowledge may substantially enhance the power and flexibility of browsing databases, organizing databases and querying data and knowledge in object-oriented databases.
KW - Data mining
KW - Knowledge discovery in databases
KW - Object cube model
KW - Object-oriented databases
UR - https://www.scopus.com/pages/publications/0032022027
UR - https://www.scopus.com/pages/publications/0032022027#tab=citedBy
U2 - 10.1016/S0169-023X(97)00051-7
DO - 10.1016/S0169-023X(97)00051-7
M3 - Article
AN - SCOPUS:0032022027
SN - 0169-023X
VL - 25
SP - 55
EP - 97
JO - Data and Knowledge Engineering
JF - Data and Knowledge Engineering
IS - 1-2
ER -