In Bayesian inference, the Bernstein–von Mises theorem provides the basis for the important result that the posterior distribution for unknown quantities in any problem is effectively independent of the prior distribution (assuming it obeys Cromwell's rule) once the amount of information supplied by a sample of data is large enough.
The theorem is named after Richard von Mises and S. N. Bernstein even though the first proper proof was given by Joseph L. Doob in 1949 for random variables with finite probability space. Later Lucien Le Cam, his PhD student Lorraine Schwartz, David A. Freedman and Persi Diaconis extended the proof under more general assumptions.
A remarkable result was found by Freedman in 1965: the Bernstein–von Mises theorem does not hold almost surely if the random variable has an infinite countable probability space; however this depends on allowing a very broad range of possible priors. In practice, the priors used typically in research do have the desirable property even with an infinite countable probability space.
Also, it is important to draw a distinction between the posterior mode and other summaries of the posterior, such as its mean. Under Freedman's examples, the posterior density (including its mean, for example) can converge on the wrong result, but it should be noted that the posterior mode is consistent and will converge on the correct result.
The statistician A. W. F. Edwards has remarked, "It is sometimes said, in defence of the Bayesian concept, that the choice of prior distribution is unimportant in practice, because it hardly influences the posterior distribution at all when there are moderate amounts of data. The less said about this 'defence' the better." This criticism does not apply to the posterior mode.