Improved conic reformulations for k-means clustering

Madhushini Narayana Prasad, Grani A. Hanasusanto

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we show that the popular K-means clustering problem can equivalently be reformulated as a conic program of polynomial size. The arising convex optimization problem is NP-hard, but amenable to a tractable semidefinite programming (SDP) relaxation that is tighter than the current SDP relaxation schemes in the literature. In contrast to the existing schemes, our proposed SDP formulation gives rise to solutions that can be leveraged to identify the clusters. We devise a new approximation algorithm for K-means clustering that utilizes the improved formulation and empirically illustrate its superiority over the state-of-the-art solution schemes.

Original languageEnglish (US)
Pages (from-to)3105-3126
Number of pages22
JournalSIAM Journal on Optimization
Volume28
Issue number4
DOIs
StatePublished - 2018
Externally publishedYes

Keywords

  • Conic programming
  • Copositive programming
  • K-means
  • Semidefinite programming

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Improved conic reformulations for k-means clustering'. Together they form a unique fingerprint.

Cite this