Minimum entropy regularization can be applied to any model of the posterior distribution.
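The learning set is denoted $\mathcal{L}_n = \{(x_i, z_i)\}_{i=1}^{n}$,
where $z_i \in \{0,1\}^K$ encodes the available label information:
if $x_i$ is labeled as $\omega_k$, then $z_{ik} = 1$
and $z_{i\ell} = 0$ for $\ell \neq k$; if $x_i$ is unlabeled,
then $z_{i\ell} = 1$ for $\ell = 1, \dots, K$.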
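The conditional entropy of the class labels given the observed variables is
$$H(Y \mid X, Z) = -E_{XYZ}\left[\log P(Y \mid X, Z)\right].$$
Assuming that labels are missing at random, we have that
$$P(z \mid x, \omega_k) = P(z \mid x, \omega_\ell) \quad \text{for all } (k, \ell) \text{ such that } z_k = z_\ell = 1.$$
The posterior distribution is then defined as
$$P(\omega_k \mid x, z) = \frac{z_k \, P(\omega_k \mid x)}{\sum_{\ell=1}^{K} z_\ell \, P(\omega_\ell \mid x)}.$$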
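
The label coding and posterior above can be illustrated with a minimal NumPy sketch. It assumes the posterior takes the form $P(\omega_k \mid x, z) = z_k P(\omega_k \mid x) / \sum_\ell z_\ell P(\omega_\ell \mid x)$, and it estimates the conditional entropy with a plug-in average over the learning set, which is an assumption of this example rather than something stated in the text. The function names `masked_posterior` and `empirical_conditional_entropy` and the toy probabilities are hypothetical.

```python
import numpy as np


def masked_posterior(probs, z):
    """Row-wise P(w_k | x, z) = z_k P(w_k | x) / sum_l z_l P(w_l | x).

    probs: (n, K) array of model outputs P(w_k | x_i), each row summing to 1.
    z:     (n, K) 0/1 array; one-hot for labeled points, all ones if unlabeled.
    Assumes every row has at least one allowed class with nonzero probability.
    """
    probs = np.asarray(probs, dtype=float)
    z = np.asarray(z, dtype=float)
    weighted = z * probs
    return weighted / weighted.sum(axis=1, keepdims=True)


def empirical_conditional_entropy(probs, z, eps=1e-12):
    """Plug-in estimate: -(1/n) sum_i sum_k g_ik log g_ik, with g = masked posterior."""
    g = masked_posterior(probs, z)
    # Clip only inside the log so that 0 * log(0) contributes 0.
    per_example = -np.sum(g * np.log(np.clip(g, eps, 1.0)), axis=1)
    return float(per_example.mean())


# Toy data: K = 3 classes, first point labeled as class 0, second unlabeled.
probs = np.array([[0.70, 0.20, 0.10],
                  [0.50, 0.25, 0.25]])
z = np.array([[1, 0, 0],
              [1, 1, 1]])

g = masked_posterior(probs, z)
h = empirical_conditional_entropy(probs, z)
print(g)
print(h)
```

For a labeled point the masked posterior collapses onto the observed class and contributes zero entropy, while for an unlabeled point it equals the model output, so only unlabeled points contribute to the entropy term.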