Law of Large Numbers for Bayesian two-layer Neural Network trained with Variational Inference

  • Arnaud Descours
  • Tom Huix
  • Arnaud Guillin
  • Manon Michel
  • Éric Moulines
  • Boris Nectoux
  • Centre CIS
  • Ecole polytechnique

Research output: Contribution to journal, conference article, peer-reviewed

Abstract

We provide a rigorous analysis of training by variational inference (VI) of Bayesian neural networks in the two-layer and infinite-width case. We consider a regression problem with a regularized evidence lower bound (ELBO) which is decomposed into the expected log-likelihood of the data and the Kullback-Leibler (KL) divergence between the a priori distribution and the variational posterior. With an appropriate weighting of the KL, we prove a law of large numbers for three different training schemes: (i) the idealized case with exact estimation of a multiple Gaussian integral from the reparametrization trick, (ii) a minibatch scheme using Monte Carlo sampling, commonly known as Bayes by Backprop, and (iii) a new and computationally cheaper algorithm which we introduce as Minimal VI. An important result is that all methods converge to the same mean-field limit. Finally, we illustrate our results numerically and discuss the need for the derivation of a central limit theorem.

Original language: English
Pages (from-to): 4657-4695
Number of pages: 39
Journal: Proceedings of Machine Learning Research
Volume: 195
Publication status: Published - 1 Jan 2023
Event: 36th Annual Conference on Learning Theory, COLT 2023 - Bangalore, India
Duration: 12 Jul 2023 to 15 Jul 2023
