Combining a relaxed EM algorithm with Occam's razor for Bayesian variable selection in high-dimensional regression

Pierre Latouche, Pierre Alexandre Mattei, Charles Bouveyron, Julien Chiquet

Research output: Contribution to journalArticlepeer-review

Abstract

We address the problem of Bayesian variable selection for high-dimensional linear regression. We consider a generative model that uses a spike-and-slab-like prior distribution obtained by multiplying a deterministic binary vector, which traduces the sparsity of the problem, with a random Gaussian parameter vector. The originality of the work is to consider inference through relaxing the model and using a type-II log-likelihood maximization based on an EM algorithm. Model selection is performed afterwards relying on Occam's razor and on a path of models found by the EM algorithm. Numerical comparisons between our method, called spinyReg, and state-of-the-art high-dimensional variable selection algorithms (such as lasso, adaptive lasso, stability selection or spike-and-slab procedures) are reported. Competitive variable selection results and predictive performances are achieved on both simulated and real benchmark data sets. An original regression data set involving the prediction of the number of visitors of the Orsay museum in Paris using bike-sharing system data is also introduced, illustrating the efficiency of the proposed approach. The R package spinyReg implementing the method proposed in this paper is available on CRAN.

Original languageEnglish
Pages (from-to)177-190
Number of pages14
JournalJournal of Multivariate Analysis
Volume146
DOIs
Publication statusPublished - 1 Apr 2016
Externally publishedYes

Keywords

  • EM algorithm
  • High-dimensional data
  • Linear regression
  • Occam's razor
  • Spike-and-slab
  • Variable selection

Fingerprint

Dive into the research topics of 'Combining a relaxed EM algorithm with Occam's razor for Bayesian variable selection in high-dimensional regression'. Together they form a unique fingerprint.

Cite this