Spectral graph clustering via the expectation-solution algorithm

Zachary M. Pisano, Joshua S. Agterberg, Carey E. Priebe, Daniel Q. Naiman

Research output: Contribution to journalArticlepeer-review

Abstract

The stochastic blockmodel (SBM) models the connectivity within and between disjoint subsets of nodes in networks. Prior work demonstrated that the rows of an SBM’s adjacency spectral embedding (ASE) and Lapla-cian spectral embedding (LSE) both converge in law to Gaussian mixtures where the components are curved exponential families. Maximum likelihood estimation via the Expectation-Maximization (EM) algorithm for a full Gaussian mixture model (GMM) can then perform the task of clustering graph nodes, albeit without appealing to the components’ curvature. Noting that EM is a special case of the Expectation-Solution (ES) algo-rithm, we propose two ES algorithms that allow us to take full advantage of these curved structures. After presenting the ES algorithm for the gen-eral curved-Gaussian mixture, we develop those corresponding to the ASE and LSE limiting distributions. Simulating from artificial SBMs and a brain connectome SBM reveals that clustering graph nodes via our ES algorithms can improve upon that of EM for a full GMM for a wide range of settings.

Original languageEnglish (US)
Pages (from-to)3135-3175
Number of pages41
JournalElectronic Journal of Statistics
Volume16
Issue number1
DOIs
StatePublished - 2022
Externally publishedYes

Keywords

  • curved exponential family
  • EM algorithm
  • estimating equations
  • mixture model
  • random graph

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Spectral graph clustering via the expectation-solution algorithm'. Together they form a unique fingerprint.

Cite this