Abstract
This chapter proposes a novel approach for online analytical processing (OLAP) in high-dimensional datasets with a moderate number of tuples. Data cube is playing an essential role in fast OLAP in many multi-dimensional data warehouses. There exist data sets in applications, such as bioinformatics, statistics, and text processing, that are characterized by high dimensionality and moderate size. No feasible data cube can be constructed with such data sets. Data analysis tasks may involve a high dimensional space, but most OLAP operations are performed only on a small number of dimensions at a time. Using inverted indices and pre-aggregated results, OLAP queries are computed online by dynamically constructing cuboids from the fragment data cubes. With this design, for high-dimensional OLAPing, the total space that needs to store such shell-fragments is negligible in comparison with a high-dimensional cube, so is the online computation overhead. The investigations exhibit that the storage cost grows linearly with the number dimensions. Moreover, the query I/O costs for large data sets are reasonable and are comparable with solutions from a materialized data cube, if such a cube is available.
Original language | English (US) |
---|---|
Title of host publication | Proceedings 2004 VLDB Conference |
Subtitle of host publication | The 30th International Conference on Very Large Databases (VLDB) |
Publisher | Elsevier |
Pages | 528-539 |
Number of pages | 12 |
ISBN (Electronic) | 9780120884698 |
DOIs | |
State | Published - Jan 1 2004 |
ASJC Scopus subject areas
- General Computer Science