In Bayesian statistics, the posterior predictive distribution is the distribution of possible unobserved values conditional on the observed values.
Given a set of N i.i.d. observations $\mathbf{X} = \{x_1, \dots, x_N\}$, a new value $\tilde{x}$ will be drawn from a distribution that depends on a parameter $\theta \in \Theta$, where $\Theta$ is the parameter space:

$$p(\tilde{x} \mid \mathbf{X}) = \int_{\Theta} p(\tilde{x} \mid \theta) \, p(\theta \mid \mathbf{X}) \, d\theta$$
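As an illustration (a standard conjugate example, not part of the text above), in the Beta–Bernoulli case the integral has a closed form. With prior $\theta \sim \mathrm{Beta}(\alpha, \beta)$ and $s$ successes observed in $N$ Bernoulli trials, the posterior is $\mathrm{Beta}(\alpha + s, \beta + N - s)$, and the posterior predictive probability that the next observation is a success is its mean:

$$p(\tilde{x} = 1 \mid \mathbf{X}) = \int_0^1 \theta \, p(\theta \mid \mathbf{X}) \, d\theta = \frac{\alpha + s}{\alpha + \beta + N}$$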
It may seem tempting to plug in a single best estimate $\hat{\theta}$ for $\theta$, but this ignores uncertainty about $\theta$, and because a source of uncertainty is ignored, the resulting predictive distribution will be too narrow: extreme values of $\tilde{x}$ will occur more often than such a plug-in distribution suggests.
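The effect can be made concrete with a minimal numerical sketch (assuming, purely for illustration, a Normal model with known variance and a conjugate Normal prior on the mean; none of these choices come from the text above). The full posterior predictive variance exceeds the plug-in variance by exactly the posterior variance of the mean, so the plug-in distribution is always too narrow:

```python
# Sketch: plug-in predictive vs. full posterior predictive
# for a Normal likelihood with known variance sigma^2 and a
# conjugate Normal prior on the unknown mean.
import numpy as np

rng = np.random.default_rng(0)

sigma = 1.0              # known observation standard deviation
mu0, tau0 = 0.0, 10.0    # prior mean and standard deviation of the unknown mean
x = rng.normal(loc=2.0, scale=sigma, size=5)  # a small sample of observations
n = len(x)

# Conjugate update: posterior for the mean is Normal(mu_n, tau_n^2)
tau_n2 = 1.0 / (1.0 / tau0**2 + n / sigma**2)
mu_n = tau_n2 * (mu0 / tau0**2 + x.sum() / sigma**2)

# Plug-in predictive:      Normal(mu_n, sigma^2)           -- ignores uncertainty in the mean
# Posterior predictive:    Normal(mu_n, sigma^2 + tau_n^2) -- integrates it out
plug_in_sd = sigma
posterior_predictive_sd = np.sqrt(sigma**2 + tau_n2)

print(f"plug-in predictive sd:   {plug_in_sd:.3f}")
print(f"posterior predictive sd: {posterior_predictive_sd:.3f}")
# The posterior predictive is wider, so the plug-in distribution
# understates the probability of extreme new observations.
```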