Difference between revisions of "Gimpel and Smith, NAACL 2010"

Revision as of 18:56, 25 September 2011

Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions

The paper can be found at: http://www.cs.cmu.edu/~kgimpel/papers/gimpel+smith.naacl10.pdf

Citation

Kevin Gimpel and Noah A. Smith. Softmax-margin CRFs: Training log-linear models with cost functions. In Proceedings of the Human Language Technologies Conference of the North American Chapter of the Association for Computational Linguistics, pages 733-736, Los Angeles, California, USA, June 2010.

Summary

The authors want to incorporate a task-specific cost function, as used in structured SVMs, into standard conditional log-likelihood training. They introduce the softmax-margin objective, which keeps the log-sum-exp form of conditional log-likelihood while adding the cost term used in max-margin training. On a named-entity recognition (NER) task, softmax-margin performs significantly better than a standard conditional log-likelihood model, a max-margin model, and the perceptron, but is not significantly different from MIRA, risk, and JRB (the Jensen risk bound, defined in the paper).

Brief Description of the Softmax-Margin objective function

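Consider the objective functions of the following four methods. In each, ${\boldsymbol {\theta }}$ is the weight vector, ${\boldsymbol {f}}(x,y)$ is the feature vector for input $x$ and output $y$, ${\mathcal {Y}}(x^{(i)})$ is the set of candidate outputs for the $i$-th training input, and $cost(y^{(i)},y)$ is the cost of predicting $y$ when the correct output is $y^{(i)}$.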

Conditional log likelihood: $\min _{\theta }\sum _{i=1}^{n}-{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y^{(i)})+\log \sum _{y\in {\mathcal {Y}}(x^{(i)})}\exp\{{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y)\}$

Max-margin: $\min _{\theta }\sum _{i=1}^{n}-{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y^{(i)})+\max _{y\in {\mathcal {Y}}(x^{(i)})}({\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y)+cost(y^{(i)},y))$

Risk: $\min _{\theta }\sum _{i=1}^{n}\sum _{y\in {\mathcal {Y}}(x^{(i)})}cost(y^{(i)},y){\dfrac {\exp\{{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y)\}}{\sum _{y'\in {\mathcal {Y}}(x^{(i)})}\exp\{{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y')\}}}$

Softmax-margin: $\min _{\theta }\sum _{i=1}^{n}-{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y^{(i)})+\log \sum _{y\in {\mathcal {Y}}(x^{(i)})}\exp\{{\boldsymbol {\theta }}^{T}{\boldsymbol {f}}(x^{(i)},y)+cost(y^{(i)},y)\}$
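As an illustrative sketch (not taken from the paper), the per-example term of each of the four objectives above can be evaluated directly when the output space is small enough to enumerate. The function name `objectives`, the toy feature vectors, the weights, and the cost values are all assumptions made up for this example; a real CRF would compute the sums with dynamic programming instead of enumeration.

```python
import math


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def dot(theta, feats):
    """Inner product theta^T f for two equal-length lists."""
    return sum(t * f for t, f in zip(theta, feats))


def objectives(theta, feats, gold, cost):
    """Per-example value of each objective for one training pair.

    feats[y] is the feature vector f(x, y) for candidate output y,
    gold is the index of the correct output y^(i), and
    cost[y] is cost(y^(i), y). All inputs are hypothetical toy data.
    """
    scores = [dot(theta, f) for f in feats]
    gold_score = scores[gold]
    log_z = log_sum_exp(scores)
    augmented = [s + c for s, c in zip(scores, cost)]
    return {
        # -theta^T f(x, y_gold) + log sum_y exp(theta^T f(x, y))
        "cll": -gold_score + log_z,
        # -theta^T f(x, y_gold) + max_y (theta^T f(x, y) + cost)
        "max_margin": -gold_score + max(augmented),
        # sum_y cost * p(y | x)
        "risk": sum(c * math.exp(s - log_z) for s, c in zip(scores, cost)),
        # -theta^T f(x, y_gold) + log sum_y exp(theta^T f(x, y) + cost)
        "softmax_margin": -gold_score + log_sum_exp(augmented),
    }


# Toy problem: three candidate outputs, two features, candidate 0 is correct.
theta = [1.0, -0.5]
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
gold = 0
cost = [0.0, 1.0, 2.0]

result = objectives(theta, feats, gold, cost)
for name, value in result.items():
    print(f"{name}: {value:.4f}")
```

Because log-sum-exp is never smaller than the maximum of its arguments, the softmax-margin value is at least the max-margin value, and because the costs are non-negative it is also at least the conditional log-likelihood value. With all costs set to zero, the softmax-margin term reduces exactly to the conditional log-likelihood term.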

Experimental Results

Related Work

@@ Line 1: / Line 1: @@
 '''Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions'''
-Online: [http://www.cs.cmu.edu/~kgimpel/papers/gimpel+smith.naacl10.pdf]
+This [[Category:Paper|paper]] can be found at: [http://www.cs.cmu.edu/~kgimpel/papers/gimpel+smith.naacl10.pdf]
 ==Citation==
