Difference between revisions of "Entropy Minimization for Semi-supervised Learning"

Revision as of 20:33, 8 October 2010

Minimum entropy regularization can be applied to any model of posterior distribution.

The learning set is denoted $L_{n}=\{x_{i},z_{i}\}_{i=1}^{n}$ , where $z_{i}\in \{0,1\}^{K}$ : If $x_{i}$ is labeled as $w_{i}$ , then $z_{ik}=1$ and $z_{il}=0$ for $l\not =k$ ; if $x_{i}$ is unlabeled, then $z_{il}=1$ for $l=1\dots K$ .

The conditional entropy of class labels conditioned on the observed variables:

$H(Y|X,Z;L_{n})=-{\frac {1}{n}}\sum _{i=1}^{n}\sum _{k=1}^{K}P(Y=w_{k}|x_{i},z_{i}){\text{log}}P(Y=w_{k}|x_{i},z_{i})$

The posterior distribution is defined as

${\begin{alignedat}{2}C(\theta ,\lambda ;L_{n})&=L(\theta ;{\mathcal {L}}_{n})-\lambda H(Y|X,Z;{\mathcal {L}}_{n})\\&=\sum _{i=1}^{n}{\text{log}}(\sum _{k=1}^{K}z_{ik}P(Y^{i}=w_{k}|X^{i}))\end{alignedat}}$

@@ Line 17: / Line 17: @@
 <math>
 \begin{alignat}{2}
-C(\theta, \lambda; L_{n}) = L(\theta; \mathcal{L}_{n}) - \lambda H(Y|X,Z; \mathcal{L}_{n}) \\
+C(\theta, \lambda; L_{n}) & = L(\theta; \mathcal{L}_{n}) - \lambda H(Y|X,Z; \mathcal{L}_{n}) \\
-= \sum^{n}_{i=1} \text{log}(\sum^{K}_{k=1} z_{ik}P(Y^{i}=w_{k}|X^{i}))
+& = \sum^{n}_{i=1} \text{log}(\sum^{K}_{k=1} z_{ik}P(Y^{i}=w_{k}|X^{i}))
 \end{alignat}
 </math>

Difference between revisions of "Entropy Minimization for Semi-supervised Learning"

Revision as of 20:33, 8 October 2010

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools