A loss function used in classification. In general the cross entropy beetween discrete probability distribution is defined as

Recallin the deifnition of Kullback-Leibler divergence, one finds that

where is the entropy of .

For the simple case where the suppor of and is a two values set , we get