Difference between revisions of "Martins et al 2010"

Latest revision as of 02:08, 11 October 2011

This paper generalizes the loss functions of CRFs, structured SVMs, the structured perceptron, and Softmax-margin CRFs into a single parameterized family of loss functions, and then derives an online learning algorithm that can be used to learn with any member of that family. For the hinge loss, the learning algorithm reduces to MIRA.

Method

The general loss function is:

Different choices of $\beta$ and $\gamma$ correspond to various well-known loss functions. They are:
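As a rough illustration of how the two parameters select a loss, the sketch below evaluates a log-sum-exp family over a small, explicitly enumerated output set. The formula in the docstring is an assumption recalled from the paper, not taken from this page, and should be checked against the original. The names `general_loss`, `scores`, and `costs` are hypothetical. A large finite $\beta$ stands in for the limit $\beta \to \infty$.

```python
import math

def general_loss(scores, costs, gold, beta, gamma):
    """Assumed form of the loss family:
    (1/beta) * log sum_y' exp(beta * (score[y'] - score[gold] + gamma * cost[y']))
    """
    terms = [beta * (s - scores[gold] + gamma * c) for s, c in zip(scores, costs)]
    top = max(terms)  # subtract the max before exponentiating, for stability
    return (top + math.log(sum(math.exp(t - top) for t in terms))) / beta

scores = [1.0, 2.5, 0.5]  # hypothetical model scores for three candidate outputs
costs = [0.0, 1.0, 1.0]   # hypothetical cost of each candidate; the gold output costs 0
gold = 0

crf = general_loss(scores, costs, gold, beta=1.0, gamma=0.0)             # CRF log loss
softmax_margin = general_loss(scores, costs, gold, beta=1.0, gamma=1.0)  # softmax-margin
svm_like = general_loss(scores, costs, gold, beta=1e3, gamma=1.0)        # close to structured hinge
perceptron_like = general_loss(scores, costs, gold, beta=1e3, gamma=0.0) # close to perceptron loss
```

Under this assumed form, $\gamma$ switches the cost term on or off, while $\beta$ interpolates between a soft maximum ($\beta = 1$) and a hard maximum (large $\beta$) over candidate outputs.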

The function minimized is the regularized empirical risk, i.e. the loss averaged over the training set plus a regularizer:
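One plausible way to write this objective is given here as a sketch. The symbols $\lambda$ (regularization coefficient), $m$ (number of training examples), $R$ (regularizer), and $L_{\beta,\gamma}$ (the general loss) are assumptions that follow common convention, and the choice of a squared $\ell_2$ regularizer is likewise assumed rather than taken from this page:

```latex
\min_{\theta} \; \lambda R(\theta) + \frac{1}{m} \sum_{i=1}^{m} L_{\beta,\gamma}(\theta; x_i, y_i),
\qquad R(\theta) = \tfrac{1}{2} \lVert \theta \rVert^2
```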

The online learning algorithm proposed to minimize this function is called Dual Coordinate Ascent (DCA):

The parameters can be updated using algorithm 2:
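A single online update of this kind might look like the sketch below. The step-size rule used here, $\eta = \min\{1/(\lambda m),\ L / \lVert \nabla L \rVert^2\}$, is an assumption recalled from the paper and should be verified against its Algorithm 2. The names `loss_and_grad` and `dca_step` are hypothetical, and the output set is assumed small enough to enumerate.

```python
import math

def loss_and_grad(theta, feats, costs, gold, beta, gamma):
    """Loss value and gradient for one example with an enumerable output set."""
    dot = lambda u, v: sum(a * b for a, b in zip(u, v))
    gold_score = dot(theta, feats[gold])
    terms = [beta * (dot(theta, f) - gold_score + gamma * c)
             for f, c in zip(feats, costs)]
    top = max(terms)
    exps = [math.exp(t - top) for t in terms]
    z = sum(exps)
    loss = (top + math.log(z)) / beta
    probs = [e / z for e in exps]
    # Gradient: expected feature vector under probs minus the gold feature vector.
    grad = [sum(p * f[k] for p, f in zip(probs, feats)) - feats[gold][k]
            for k in range(len(theta))]
    return loss, grad

def dca_step(theta, feats, costs, gold, beta, gamma, lam, num_examples):
    """One update with the assumed step size min(1/(lam*m), loss/||grad||^2)."""
    loss, grad = loss_and_grad(theta, feats, costs, gold, beta, gamma)
    sq_norm = sum(g * g for g in grad)
    if sq_norm == 0.0:
        return list(theta)  # zero gradient: leave the parameters unchanged
    eta = min(1.0 / (lam * num_examples), loss / sq_norm)
    return [t - eta * g for t, g in zip(theta, grad)]

# Hypothetical two-output example; large beta with gamma = 1 approximates the hinge case.
feats = [[1.0, 0.0], [0.0, 1.0]]
costs = [0.0, 1.0]
theta = dca_step([0.0, 0.0], feats, costs, gold=0,
                 beta=1e3, gamma=1.0, lam=0.1, num_examples=10)
```

With a large $\beta$ and $\gamma = 1$, the step moves the parameters just far enough to satisfy the margin, capped at $1/(\lambda m)$, which is the passive-aggressive behaviour expected when the algorithm reduces to MIRA.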

Experimental Result

Related Papers

MIRA, CRF, Softmax-margin CRFs

In progress by User:Jmflanig

@@ Line 5: / Line 5: @@
 === Summary ===
-This paper generalizes the loss function of CRFs, structured SVMs, structured perceptron, and Softmax-margin CRFs into a single loss function, and then derives an online learning algorithm that can be used to learn with that more general loss function.  For the hinge loss, the learning algorithm reduces to MIRA.
+This [[Category::paper]] generalizes the loss function of CRFs, structured SVMs, structured perceptron, and Softmax-margin CRFs into a single loss function, and then derives an online learning algorithm that can be used to learn with that more general loss function.  For the hinge loss, the learning algorithm reduces to MIRA.
 === Method ===
-The general loss function they use is:
+The general loss function is:
 [[file:Martins et al 2010 Loss Function.png]]
@@ Line 17: / Line 17: @@
 [[file:Martins et al 2010 Parameter Choices.png]]
-The function they minimize is the empirical risk with a regularizer:
+The function minimized is the loss function with a regularizer:
 [[file:Martins et al 2010 Learning Problem.png]]  [[file:Martins et al Relarizer.png]]
 [[file:Martins et al Regularize Coeff.png]]
+The online learning algorithm proposed to minimize this function is called Dual Coordinate Ascent (DCA):
+[[file:Martins et al 2010 DCA.png]]
+The parameters can be updated using algorithm 2:
+[[file:Martins et al 2010 Alg2.png]]
 === Experimental Result ===
