Error-adaptive and time-aware maintenance of frequency counts over data streams

Hongyan Liu, Ying Lu, Jiawei Han, Jun He

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Maintaining frequency counts for items over data stream has a wide range of applications such as web advertisement fraud detection. Study of this problem has attracted great attention from both researchers and practitioners. Many algorithms have been proposed. In this paper, we propose a new method, error-adaptive pruning method, to maintain frequency more accurately. We also propose a method called fractionization to record time information together with the frequency information. Using these two methods, we design three algorithms for finding frequent items and top-k frequent items. Experimental results show these methods are effective in terms of improving the maintenance accuracy.

Original languageEnglish (US)
Title of host publicationAdvances in Web-Age Information Management - 7th International Conference, WAIM 2006, Proceedings
PublisherSpringer-Verlag
Pages484-495
Number of pages12
ISBN (Print)3540352252, 9783540352259
DOIs
StatePublished - Jan 1 2006
Event7th International Conference on Advances in Web-Age Information Management, WAIM 2006 - Hong Kong, China
Duration: Jun 17 2006Jun 19 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4016 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other7th International Conference on Advances in Web-Age Information Management, WAIM 2006
CountryChina
CityHong Kong
Period6/17/066/19/06

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Error-adaptive and time-aware maintenance of frequency counts over data streams'. Together they form a unique fingerprint.

  • Cite this

    Liu, H., Lu, Y., Han, J., & He, J. (2006). Error-adaptive and time-aware maintenance of frequency counts over data streams. In Advances in Web-Age Information Management - 7th International Conference, WAIM 2006, Proceedings (pp. 484-495). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4016 LNCS). Springer-Verlag. https://doi.org/10.1007/11775300_41