Minimum entropy regularization can be applied to any model of the posterior distribution.
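The learning set is denoted $\mathcal{L}_n = \{x_i, z_i\}_{i=1}^{n}$, where $z_i \in \{0,1\}^K$: if $x_i$ is labeled as $\omega_k$, then $z_{ik} = 1$ and $z_{i\ell} = 0$ for $\ell \neq k$; if $x_i$ is unlabeled, then $z_{i\ell} = 1$ for $\ell = 1, \dots, K$.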
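The label encoding just described can be illustrated with a short sketch. The function name `encode_labels`, the use of 0-based class indices, and the use of `None` to mark an unlabeled example are assumptions made for this illustration only; they do not come from the original text.

```python
from typing import List, Optional


def encode_labels(labels: List[Optional[int]], num_classes: int) -> List[List[int]]:
    """Build one indicator vector z_i per example (hypothetical helper).

    labels[i] is a 0-based class index for a labeled example,
    or None for an unlabeled example (an assumption of this sketch).
    """
    z = []
    for label in labels:
        if label is None:
            # Unlabeled: every class remains possible, so all entries are 1.
            z.append([1] * num_classes)
        else:
            if not 0 <= label < num_classes:
                raise ValueError(f"label {label} outside 0..{num_classes - 1}")
            # Labeled: 1 for the observed class, 0 for all others.
            row = [0] * num_classes
            row[label] = 1
            z.append(row)
    return z


z = encode_labels([0, None, 2], num_classes=3)
```

Here the first and third examples are labeled and receive one-hot vectors, while the second is unlabeled and receives a vector of ones.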
The conditional entropy of class labels conditioned on the observed variables is $H(Y \mid X, Z) = -\mathbb{E}_{XYZ}\left[\log P(Y \mid X, Z)\right]$.
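As a rough illustration, this conditional entropy could be estimated by averaging, over the examples, the entropy of each example's posterior class distribution. The sketch below assumes the posteriors are already available as rows of probabilities summing to one; the function name and the plug-in averaging are assumptions of this example, not the source's stated estimator.

```python
import math
from typing import Sequence


def empirical_conditional_entropy(posteriors: Sequence[Sequence[float]]) -> float:
    """Average entropy (in nats) of per-example posterior distributions.

    posteriors[i][k] is assumed to hold the posterior probability of
    class k for example i. Terms with zero probability contribute
    nothing, following the convention 0 * log 0 = 0.
    """
    n = len(posteriors)
    if n == 0:
        raise ValueError("at least one example is required")
    total = 0.0
    for row in posteriors:
        for p in row:
            if p > 0.0:
                total -= p * math.log(p)
    return total / n


# A labeled example (one-hot posterior) contributes zero entropy;
# a maximally uncertain two-class example contributes log 2.
h = empirical_conditional_entropy([[1.0, 0.0], [0.5, 0.5]])
```

Under these assumptions only examples with uncertain class assignments contribute to the estimate, so minimizing it pushes the model toward confident predictions on unlabeled data.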
The posterior distribution is defined as