Exact dimensionality selection for Bayesian PCA

Research output: Contribution to journalArticlepeer-review

Abstract

We present a Bayesian model selection approach to estimate the intrinsic dimensionality of a high-dimensional dataset. To this end, we introduce a novel formulation of the probabilisitic principal component analysis model based on a normal-gamma prior distribution. In this context, we exhibit a closed-form expression of the marginal likelihood which allows to infer an optimal number of components. We also propose a heuristic based on the expected shape of the marginal likelihood curve in order to choose the hyperparameters. In nonasymptotic frameworks, we show on simulated data that this exact dimensionality selection approach is competitive with both Bayesian and frequentist state-of-the-art methods.

Original languageEnglish
Pages (from-to)196-211
Number of pages16
JournalScandinavian Journal of Statistics
Volume47
Issue number1
DOIs
Publication statusPublished - 1 Mar 2020
Externally publishedYes

Keywords

  • Singular value decomposition
  • bayesian model selection
  • dimension reduction
  • marginal likelihood
  • principal Component Analysis

Fingerprint

Dive into the research topics of 'Exact dimensionality selection for Bayesian PCA'. Together they form a unique fingerprint.

Cite this