Bayesian model selection via mean-field variational approximation

Yangfan Zhang, Yun Yang

Research output: Contribution to journalArticlepeer-review

Abstract

This article considers Bayesian model selection via mean-field (MF) variational approximation. Towards this goal, we study the non-Asymptotic properties of MF inference that allows latent variables and model misspecification. Concretely, we show a Bernstein-von Mises (BvM) theorem for the variational distribution from MF under possible model misspecification, which implies the distributional convergence of MF variational approximation to a normal distribution centring at the maximal likelihood estimator. Motivated by the BvM theorem, we propose a model selection criterion using the evidence lower bound (ELBO), and demonstrate that the model selected by ELBO tends to asymptotically agree with the one selected by the commonly used Bayesian information criterion (BIC) as the sample size tends to infinity. Compared to BIC, ELBO tends to incur smaller approximation error to the log-marginal likelihood (a.k.a. model evidence) due to a better dimension dependence and full incorporation of the prior information. Moreover, we show the geometric convergence of the coordinate ascent variational inference algorithm, which provides a practical guidance on how many iterations one typically needs to run when approximating the ELBO. These findings demonstrate that variational inference is capable of providing a computationally efficient alternative to conventional approaches in tasks beyond obtaining point estimates.

Original languageEnglish (US)
Pages (from-to)742-770
Number of pages29
JournalJournal of the Royal Statistical Society. Series B: Statistical Methodology
Volume86
Issue number3
Early online dateApr 9 2024
DOIs
StatePublished - Jul 2024

Keywords

  • Bayesian inference
  • coordinate ascent
  • mean-field inference
  • oracle inequality

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Bayesian model selection via mean-field variational approximation'. Together they form a unique fingerprint.

Cite this