Difference between revisions of "Expectation Regularization"

Revision as of 17:46, 30 November 2010

This is a method introduced in G.S Mann and A. McCallum, ICML 2007. It is often served as a regularized term with the likelihood function. In practice human often have an insight of label prior distribution. This method introduced a way to take advantage of this prior knowledge.

Let's denote human-provided prior as ${\tilde {p}}$ . We minimizes the distance between ${\tilde {p}}$ and ${\hat {p}}$ . KL-distance is used here so the regularization becomes $D({\tilde {p}}||{\hat {p}})=\sum _{y}{\tilde {p}}(y){\text{log}}{\frac {{\tilde {p}}(y)}{{\hat {p}}(y)}}$

@@ Line 3: / Line 3: @@
 This method introduced a way to take advantage of this prior knowledge.
-Let's denote human-provided prior <math> \tilde{p} </math>.
+Let's denote human-provided prior as <math> \tilde{p} </math>.
+We minimizes the distance between <math> \tilde{p} </math> and <math> \hat{p} </math>.
+KL-distance is used here so the regularization becomes
+<math>
+D(\tilde{p}||\hat{p})=\sum_{y} \tilde{p}(y) \text{log} \frac{\tilde{p}(y)}{\hat{p}(y)}
+</math>

Difference between revisions of "Expectation Regularization"

Revision as of 17:46, 30 November 2010

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools