TY - JOUR
T1 - Divide-and-approximate
T2 - A novel constraint push strategy for iceberg cube mining
AU - Wang, Ke
AU - Jiang, Yuelong
AU - Yu, Jeffrey Xu
AU - Dong, Guozhu
AU - Han, Jiawei
N1 - Funding Information:
The authors wish to thank the reviewers for their helpful comments. This work was supported in part by the Natural Sciences and Engineering Research Council of Canada, Networks of Centres of Excellence/ Institute for Robotic and Intelligent Systems, and the Research Grants Council of the Hong Kong Special Administrative region, China (CUHK4229/01E).
PY - 2005/3
Y1 - 2005/3
N2 - The iceberg cube mining computes all cells v, corresponding to GROUP BY partitions, that satisfy a given constraint on aggregated behaviors of the tuples in a GROUP BY partition. The number of cells often is so large that the result cannot be realistically searched without pushing the constraint into the search. Previous works have pushed antimonotone and monotone constraints. However, many useful constraints are neither antimonotone nor monotone. We consider a general class of aggregate constraints of the form f(v)0σ, where f is an arithmetic function of SQL-like aggregates and θ is one of <, ≤, ≥, >. We propose a novel pushing technique, called Divide-and-Approximate, to push such constraints. The idea is to recursively divide the search space and approximate the given constraint using antimonotone or monotone constraints in subspaces. This technique applies to a class called separable constraints, which properly contains all constraints built by an arithmetic function f of all SQL aggregates.
AB - The iceberg cube mining computes all cells v, corresponding to GROUP BY partitions, that satisfy a given constraint on aggregated behaviors of the tuples in a GROUP BY partition. The number of cells often is so large that the result cannot be realistically searched without pushing the constraint into the search. Previous works have pushed antimonotone and monotone constraints. However, many useful constraints are neither antimonotone nor monotone. We consider a general class of aggregate constraints of the form f(v)0σ, where f is an arithmetic function of SQL-like aggregates and θ is one of <, ≤, ≥, >. We propose a novel pushing technique, called Divide-and-Approximate, to push such constraints. The idea is to recursively divide the search space and approximate the given constraint using antimonotone or monotone constraints in subspaces. This technique applies to a class called separable constraints, which properly contains all constraints built by an arithmetic function f of all SQL aggregates.
KW - Aggregate constraint
KW - Constrained data mining
KW - Data cube
KW - Iceberg cube mining
KW - Iceberg query
UR - http://www.scopus.com/inward/record.url?scp=14644393625&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=14644393625&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2005.45
DO - 10.1109/TKDE.2005.45
M3 - Article
AN - SCOPUS:14644393625
SN - 1041-4347
VL - 17
SP - 354
EP - 367
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 3
ER -