Motif and Hypergraph Correlation Clustering

Pan Li, Gregory J. Puleo, Olgica Milenkovic

Research output: Contribution to journal › Article › peer-review

Abstract

Motivated by applications in social and biological network analysis, we introduce a new form of agnostic clustering termed motif correlation clustering, which aims to minimize the cost of clustering errors associated with both edges and higher-order network structures. The problem may be succinctly described as follows: Given a complete graph $G$, partition the vertices of the graph so that certain predetermined 'important' subgraphs mostly lie within the same cluster, while 'less relevant' subgraphs are allowed to lie across clusters. Our contributions are as follows: We first introduce several variants of motif correlation clustering and show that these clustering problems are NP-hard. We then describe polynomial-time clustering algorithms that provide constant approximation guarantees for the problems at hand. Although they follow the frequently used LP relaxation and rounding procedure, the algorithms involve a carefully designed neighborhood growing step that combines information about both edges and motifs. We conclude with several examples illustrating the performance of the developed algorithms on synthetic and real networks.
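To make the problem statement concrete, the following is a minimal sketch of how the cost of a given partition might be counted. It assumes a simple unweighted error model that the abstract does not spell out: an 'important' (positive) edge or motif counts as an error when its vertices are split across clusters, and a 'less relevant' (negative) one counts as an error when it lies entirely inside one cluster. The function name `motif_clustering_cost`, the toy data, and the error model itself are illustrative assumptions, not the formulation or algorithms of the article.

```python
from itertools import combinations


def motif_clustering_cost(labels, positive, negative):
    """Count clustering errors under an assumed, unweighted error model.

    labels:   dict mapping vertex -> cluster id
    positive: vertex tuples (edges or motifs) that should stay together
    negative: vertex tuples that are allowed to be split
    """
    def inside_one_cluster(group):
        return len({labels[v] for v in group}) == 1

    # Assumption: a positive group is an error when it is split.
    split_positive = sum(1 for g in positive if not inside_one_cluster(g))
    # Assumption: a negative group is an error when it is fully merged.
    merged_negative = sum(1 for g in negative if inside_one_cluster(g))
    return split_positive + merged_negative


# Hypothetical toy instance on the complete graph with vertices 0..3.
vertices = range(4)
edges = list(combinations(vertices, 2))
triangles = list(combinations(vertices, 3))

# Edges and the triangle among {0, 1, 2} are 'important'; the rest are not.
positive = [(0, 1), (0, 2), (1, 2), (0, 1, 2)]
negative = [g for g in edges + triangles if g not in positive]

grouped = {0: "a", 1: "a", 2: "a", 3: "b"}     # {0, 1, 2} together, 3 alone
one_cluster = {v: "a" for v in vertices}        # everything merged
singletons = {v: v for v in vertices}           # everything split

cost_grouped = motif_clustering_cost(grouped, positive, negative)
cost_one_cluster = motif_clustering_cost(one_cluster, positive, negative)
cost_singletons = motif_clustering_cost(singletons, positive, negative)
```

In this toy instance, the partition that keeps vertices 0, 1 and 2 together incurs fewer errors than either merging everything or splitting everything. Evaluating a single partition this way is cheap; the NP-hardness mentioned in the abstract concerns finding the best partition.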

Original language: English (US)
Article number: 8830473
Pages (from-to): 3065-3078
Number of pages: 14
Journal: IEEE Transactions on Information Theory
Volume: 66
Issue number: 5
State: Published - May 2020

Keywords

  • Correlation clustering
  • graph clustering
  • hypergraph
  • network motif

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Library and Information Sciences
