Abstract
We present a Bayesian model selection approach to estimate the intrinsic dimensionality of a high-dimensional dataset. To this end, we introduce a novel formulation of the probabilisitic principal component analysis model based on a normal-gamma prior distribution. In this context, we exhibit a closed-form expression of the marginal likelihood which allows to infer an optimal number of components. We also propose a heuristic based on the expected shape of the marginal likelihood curve in order to choose the hyperparameters. In nonasymptotic frameworks, we show on simulated data that this exact dimensionality selection approach is competitive with both Bayesian and frequentist state-of-the-art methods.
| Original language | English |
|---|---|
| Pages (from-to) | 196-211 |
| Number of pages | 16 |
| Journal | Scandinavian Journal of Statistics |
| Volume | 47 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 1 Mar 2020 |
| Externally published | Yes |
Keywords
- Singular value decomposition
- bayesian model selection
- dimension reduction
- marginal likelihood
- principal Component Analysis
Fingerprint
Dive into the research topics of 'Exact dimensionality selection for Bayesian PCA'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver