Skip to main navigation Skip to search Skip to main content

Learning Probabilities From Random Observables in High Dimensions: The Maximum Entropy Distribution and Others

  • Tokyo Institute of Technology
  • Sorbonne Université

Research output: Contribution to journalArticlepeer-review

Abstract

We consider the problem of learning a target probability distribution over a set of N binary variables from the knowledge of the expectation values (with this target distribution) of M observables, drawn uniformly at random. The space of all probability distributions compatible with these M expectation values within some fixed accuracy, called version space, is studied. We introduce a biased measure over the version space, which gives a boost increasing exponentially with the entropy of the distributions and with an arbitrary inverse ‘temperature’ Γ. The choice of Γ allows us to interpolate smoothly between the unbiased measure over all distributions in the version space (Γ=0) and the pointwise measure concentrated at the maximum entropy distribution (Γ→∞). Using the replica method we compute the volume of the version space and other quantities of interest, such as the distance R between the target distribution and the center-of-mass distribution over the version space, as functions of α=(logM)/N and Γ for large N. Phase transitions at critical values of α are found, corresponding to qualitative improvements in the learning of the target distribution and to the decrease of the distance R. However, for fixed α, the distance R does not vary with Γ, which means that the maximum entropy distribution is not closer to the target distribution than any other distribution compatible with the observable values. Our results are confirmed by Monte Carlo sampling of the version space for small system sizes (N≤10).

Original languageEnglish
Pages (from-to)598-632
Number of pages35
JournalJournal of Statistical Physics
Volume161
Issue number3
DOIs
Publication statusPublished - 1 Nov 2015

Keywords

  • Maximum entropy principle
  • Probabilistic inference
  • Replica method

Fingerprint

Dive into the research topics of 'Learning Probabilities From Random Observables in High Dimensions: The Maximum Entropy Distribution and Others'. Together they form a unique fingerprint.

Cite this