Text cube: Computing IR measures for multidimensional text database analysis

Cindy Xide Lin, Bolin Ding, Jiawei Han, Feida Zhu, Bo Zhao

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Since Jim Gray introduced the concept of the "data cube" in 1997, the data cube, associated with online analytical processing (OLAP), has become a driving engine in the data warehouse industry. Because the boom of the Internet has given rise to an ever-increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP and IR techniques for text. In this paper, we propose a Text-Cube model on multidimensional text databases and study effective OLAP over such data. Two kinds of hierarchies are distinguished within it: the dimensional hierarchy and the term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution, and query processing. Our performance study shows the high promise of our methods.

Original language: English (US)
Title of host publication: Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008
Pages: 905-910
Number of pages: 6
State: Published - 2008
Event: 8th IEEE International Conference on Data Mining, ICDM 2008 - Pisa, Italy
Duration: Dec 15 2008 → Dec 19 2008

Publication series

Name: Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print): 1550-4786

Other

Other: 8th IEEE International Conference on Data Mining, ICDM 2008
Country: Italy
City: Pisa
Period: 12/15/08 → 12/19/08

ASJC Scopus subject areas

  • Engineering (all)
