Passer à la navigation principale Passer à la recherche Passer au contenu principal

Central limit theorem for bayesian neural network trained with variational inference

  • Arnaud Descours
  • , Tom Huix
  • , Eric Moulines
  • , Arnaud Guillin
  • , Manon Michel
  • , Boris Nectoux
  • INRIA Institut National de Recherche en Informatique et en Automatique
  • Ecole polytechnique
  • Centre CIS

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

In this paper, we rigorously derive Central Limit Theorems (CLT) for Bayesian two-layer neural networks in the infinite-width limit and trained by variational inference on a regression task. The different networks are trained via different maximization schemes of the regularized evidence lower bound: (i) the idealized case with exact estimation of a multiple Gaussian integral from the reparametrization trick, (ii) a minibatch scheme using Monte Carlo sampling, commonly known as Bayes-by-Backprop, and (iii) a computationally cheaper algorithm named Minimal VI. The latter was recently introduced by leveraging the information obtained at the level of the mean-field limit. Laws of large numbers are already rigorously proven for the three schemes that admits the same asymptotic limit. By deriving CLT, this work shows that the idealized and Bayes-by-Backprop schemes have similar fluctuation behavior, that is different from the Minimal VI one. Numerical experiments then illustrate that the Minimal VI scheme is still more efficient, in spite of bigger variances, thanks to its important gain in computational complexity.

langue originaleAnglais
journalStochastics and Partial Differential Equations: Analysis and Computations
Les DOIs
étatAccepté/En presse - 1 janv. 2026

Empreinte digitale

Examiner les sujets de recherche de « Central limit theorem for bayesian neural network trained with variational inference ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation