*** Welcome to piglix ***

Bhattacharyya distance


In statistics, the Bhattacharyya distance measures the similarity of two discrete or continuous probability distributions. It is closely related to the Bhattacharyya coefficient which is a measure of the amount of overlap between two statistical samples or populations. Both measures are named after Anil Kumar Bhattacharya, a statistician who worked in the 1930s at the Indian Statistical Institute.

The coefficient can be used to determine the relative closeness of the two samples being considered. It is used to measure the separability of classes in classification and it is considered to be more reliable than the Mahalanobis distance, as the Mahalanobis distance is a particular case of the Bhattacharyya distance when the standard deviations of the two classes are the same. Therefore, when two classes have similar means but different standard deviations, the Mahalanobis distance would tend to zero, however, the Bhattacharyya distance would grow depending on the difference between the standard deviations.

For probability distributions p and q over the same domain X, the Bhattacharyya distance is defined as:

where:

is the Bhattacharyya coefficient for discrete probability distributions.

For continuous probability distributions, the Bhattacharyya coefficient is defined as:

In either case, and . does not obey the triangle inequality, but the Hellinger distance does obey the triangle inequality.


...
Wikipedia

...