Difference between revisions of "Structured Output Learning with Indirect Supervision"
(Created page with '== Citation == {{MyCiteconference | booktitle = Proceedings of the 27th International Conference on Machine Learning | coauthors = V.Srikumar, D.Goldwasser, D.Roth | date = 2010|…') |
|||
Line 14: | Line 14: | ||
The authors present their approach, which they call "Joint Learning with Indirect Supervision", or J-LIS, as a generalization of the structured output [[Support_Vector_Machines || SVMs]]. The basic idea is to learn from a small number of fully or directly supervised training examples, and augment training with indirectly supervised (binary labels) training data. | The authors present their approach, which they call "Joint Learning with Indirect Supervision", or J-LIS, as a generalization of the structured output [[Support_Vector_Machines || SVMs]]. The basic idea is to learn from a small number of fully or directly supervised training examples, and augment training with indirectly supervised (binary labels) training data. | ||
− | The goal of learning in standard structured output prediction is to find a weight vector | + | The goal of learning in standard structured output prediction is to find a weight vector '''w''' such that <math>\mathbf{h}_i = \arg\max_{\mathbf{h} \in \mathcal{H}(\mathbf{x})} \mathbf{w}^T \Phi (\mathbf{x}_i, \mathbf{h})</math>. |
== Baseline & Results == | == Baseline & Results == | ||
== Related Work == | == Related Work == |
Revision as of 17:34, 29 September 2011
Citation
Structured Output Learning with Indirect Supervision, by M.W. Chang, V.Srikumar, D.Goldwasser, D.Roth. In Proceedings of the 27th International Conference on Machine Learning, 2010.
This Paper is available online [1].
Summary
The problem with a lot of structured output problems is that is often time-consuming and difficult to obtain labelings and annotated structures for training. For example, one may need to obtain a parse tree, a POS tag sequence, or a translation for every given sentence in your training data. Naturally, this becomes more onerous the larger your training set. The key idea of this paper is that often, structured output problems have a companion learning problem which is to determine whether a given structure is legitimate or not. For example, for POS tagging, the binary companion problem is to determine whether the POS tag sequence for a given input sentence is valid or not. The basic assumption (which seems quite realistic to me) is that "direct supervision", which in this case would mean the ground truth tag sequence, is difficult and cumbersome to obtain, obtaining binary annotations (good/bad, legitimate/illegitimate) is far easier.
This paper presents a large margin-based framework that learns on both fully supervised (i.e., traditional supervised training) data as well as binary "indirectly supervised" training data. Experiments show that binary data helps in improving performance on three different NLP structure learning problems.
Main Approach
The authors present their approach, which they call "Joint Learning with Indirect Supervision", or J-LIS, as a generalization of the structured output | SVMs. The basic idea is to learn from a small number of fully or directly supervised training examples, and augment training with indirectly supervised (binary labels) training data.
The goal of learning in standard structured output prediction is to find a weight vector w such that .