Privacy-preserving data publishing: A constraint-based clustering approach

Anthony K.H. Tung, Jiawei Han, Laks V.S. Lakshmanan, Raymond T. Ng

Research output: Chapter in Book/Report/Conference proceedingChapter


Abstract Privacy-preserving data publishing has drawn much research interest re-cently. In this chapter, we address this topic from the viewpoint of constrained clustering, i.e., the problem of finding clusters that satisfy certain user-specified constraints. More specifically, we begin with the problem of clustering under aggregate constraints (without privacy considerations) and explain how traditional algorithms for the unconstrained problem (e.g., the c-means algorithm) break down in the presence of constraints. From there, we develop scalable algorithms that overcome this problem and finally illustrate how our algorithm can also be used for privacy-preserving data publishing.

Original languageEnglish (US)
Title of host publicationConstrained Clustering
Subtitle of host publicationAdvances in Algorithms, Theory, and Applications
PublisherCRC Press
Number of pages22
ISBN (Electronic)9781584889977
ISBN (Print)9781584889960
StatePublished - Jan 1 2008

ASJC Scopus subject areas

  • General Business, Management and Accounting
  • General Computer Science
  • Economics, Econometrics and Finance(all)


Dive into the research topics of 'Privacy-preserving data publishing: A constraint-based clustering approach'. Together they form a unique fingerprint.

Cite this