TY - GEN
T1 - ARCube
T2 - 2008 ACM SIGMOD International Conference on Management of Data 2008, SIGMOD'08
AU - Wu, Tianyi
AU - Xin, Dong
AU - Han, Jiawei
PY - 2008
Y1 - 2008
N2 - Supporting ranking queries in database systems has been a popular research topic recently. However, there is a lack of study on supporting ranking queries in data warehouses where ranking is on multidimensional aggregates instead of on measures of base facts. To address this problem, we propose a query execution model to answer different types of ranking aggregate queries based on a unified, partial cube structure, ARCube. The query execution model follows a candidate generation and verification framework, where the most promising candidate cells are generated using a set of high-level guiding cells. We also identify a bounding principle for effective pruning: once a guiding cell is pruned, all of its children candidate cells can be pruned. We further address the problem of efficient online candidate aggregation and verification by developing a chunk-based execution model to verify a bulk of candidates within a bounded memory buffer. Our extensive performance study shows that the new framework not only leads to an order of magnitude performance improvements over the state-of-the-art method, but also is much more flexible in terms of the types of ranking aggregate queries supported.
AB - Supporting ranking queries in database systems has been a popular research topic recently. However, there is a lack of study on supporting ranking queries in data warehouses where ranking is on multidimensional aggregates instead of on measures of base facts. To address this problem, we propose a query execution model to answer different types of ranking aggregate queries based on a unified, partial cube structure, ARCube. The query execution model follows a candidate generation and verification framework, where the most promising candidate cells are generated using a set of high-level guiding cells. We also identify a bounding principle for effective pruning: once a guiding cell is pruned, all of its children candidate cells can be pruned. We further address the problem of efficient online candidate aggregation and verification by developing a chunk-based execution model to verify a bulk of candidates within a bounded memory buffer. Our extensive performance study shows that the new framework not only leads to an order of magnitude performance improvements over the state-of-the-art method, but also is much more flexible in terms of the types of ranking aggregate queries supported.
KW - Data cube
KW - Partial materialization
KW - Ranking aggregate queries
UR - http://www.scopus.com/inward/record.url?scp=57149127093&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=57149127093&partnerID=8YFLogxK
U2 - 10.1145/1376616.1376627
DO - 10.1145/1376616.1376627
M3 - Conference contribution
AN - SCOPUS:57149127093
SN - 9781605581026
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 79
EP - 91
BT - SIGMOD 2008
Y2 - 9 June 2008 through 12 June 2008
ER -