Privacy-preserving data publishing: A constraint-based clustering approach

Anthony K.H. Tung, Jiawei Han, Laks V.S. Lakshmanan, Raymond T. Ng

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Abstract Privacy-preserving data publishing has drawn much research interest re-cently. In this chapter, we address this topic from the viewpoint of constrained clustering, i.e., the problem of finding clusters that satisfy certain user-specified constraints. More specifically, we begin with the problem of clustering under aggregate constraints (without privacy considerations) and explain how traditional algorithms for the unconstrained problem (e.g., the c-means algorithm) break down in the presence of constraints. From there, we develop scalable algorithms that overcome this problem and finally illustrate how our algorithm can also be used for privacy-preserving data publishing.

Original languageEnglish (US)
Title of host publicationConstrained Clustering
Subtitle of host publicationAdvances in Algorithms, Theory, and Applications
PublisherCRC Press
Pages375-396
Number of pages22
ISBN (Electronic)9781584889977
ISBN (Print)9781584889960
StatePublished - Jan 1 2008

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Computer Science(all)
  • Economics, Econometrics and Finance(all)

Fingerprint Dive into the research topics of 'Privacy-preserving data publishing: A constraint-based clustering approach'. Together they form a unique fingerprint.

  • Cite this

    Tung, A. K. H., Han, J., Lakshmanan, L. V. S., & Ng, R. T. (2008). Privacy-preserving data publishing: A constraint-based clustering approach. In Constrained Clustering: Advances in Algorithms, Theory, and Applications (pp. 375-396). CRC Press.