Differentiable samplers for deep latent variable models

Arnaud Doucet, Eric Moulines, Achille Thin

Research output: Contribution to journal › Review article › peer-review

Abstract

Latent variable models are a popular class of models in statistics. Combined with neural networks to improve their expressivity, the resulting deep latent variable models have also found numerous applications in machine learning. A drawback of these models is that their likelihood function is intractable, so approximations have to be used to perform inference. A standard approach consists of instead maximizing an evidence lower bound (ELBO) derived from a variational approximation of the posterior distribution of the latent variables. The standard ELBO can, however, be a very loose bound if the variational family is not rich enough. A generic strategy to tighten such bounds is to rely on an unbiased, low-variance Monte Carlo estimate of the evidence. We review here some recent importance sampling, Markov chain Monte Carlo and sequential Monte Carlo strategies that have been proposed to achieve this. This article is part of the theme issue 'Bayesian inference: challenges, perspectives, and prospects'.
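To illustrate the bound-tightening strategy the abstract refers to, the sketch below contrasts the standard ELBO with an importance-weighted bound (in the spirit of the IWAE bound of Burda et al.) on a toy conjugate Gaussian model where log p(x) is available in closed form. This is a minimal illustration, not code from the paper; the model p(z) = N(0, 1), p(x | z) = N(z, 1) and the proposal q(z | x) = N(mu, sigma^2) are assumptions chosen so that the true evidence can be checked exactly.

```python
# Minimal sketch (not from the paper): standard ELBO vs. an
# importance-weighted bound on log p(x) for a toy Gaussian model.
# Assumed model: p(z) = N(0, 1), p(x | z) = N(z, 1),
# proposal q(z | x) = N(mu, sigma^2) (deliberately mismatched).
import numpy as np

rng = np.random.default_rng(0)

def log_normal(x, mean, std):
    # Log-density of N(mean, std^2) evaluated at x.
    return -0.5 * np.log(2 * np.pi) - np.log(std) - 0.5 * ((x - mean) / std) ** 2

def log_joint(x, z):
    # log p(x, z) = log p(z) + log p(x | z)
    return log_normal(z, 0.0, 1.0) + log_normal(x, z, 1.0)

def bounds(x, mu, sigma, K=64, n_mc=10_000):
    z = rng.normal(mu, sigma, size=(n_mc, K))           # z ~ q(z | x)
    log_w = log_joint(x, z) - log_normal(z, mu, sigma)  # log importance weights
    elbo = log_w[:, 0].mean()                           # standard ELBO (K = 1)
    # Importance-weighted bound: E[log (1/K) sum_k w_k], where the average
    # of the weights is an unbiased estimate of the evidence p(x).
    iwae = (np.logaddexp.reduce(log_w, axis=1) - np.log(K)).mean()
    return elbo, iwae

x = 1.5
# True evidence in this conjugate model: p(x) = N(x; 0, 2).
print("log p(x)        ", log_normal(x, 0.0, np.sqrt(2.0)))
print("ELBO, IW bound  ", bounds(x, mu=0.0, sigma=1.0))
```

With K = 1 the two bounds coincide; increasing K tightens the importance-weighted bound towards log p(x), since the weight average is an unbiased estimate of the evidence and Jensen's inequality loses less as that estimator's variance decreases. This is the simplest instance of the generic strategy reviewed in the article.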

Original language: English
Article number: 20220147
Journal: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
Volume: 381
Issue number: 2247
DOI: 10.1098/rsta.2022.0147
Publication status: Published - 1 Jan 2023

Keywords

  • Bayesian inference
  • Monte Carlo methods
  • importance sampling
  • variational inference
