Difference between revisions of "R. K. Ando and T. Zhang. ACL 2005"
PastStudents (talk | contribs) |
PastStudents (talk | contribs) |
||
Line 10: | Line 10: | ||
This [[Category::paper]] investigates a new semi-supervised learning method that addresses the problem of | This [[Category::paper]] investigates a new semi-supervised learning method that addresses the problem of | ||
[[AddressesProblem::NE chunking]] and [[AddressesProblem::syntactic chunking]]. | [[AddressesProblem::NE chunking]] and [[AddressesProblem::syntactic chunking]]. | ||
+ | |||
+ | To utilize the unlabeled data, they created numerous auxiliary problems related to the target task and | ||
+ | train classifiers for each of those. | ||
+ | Then they learn the common predictive structure shared by those problems. The authors argued that | ||
+ | such common structure can be used to improve the result of target task. | ||
+ | One example of such auxiliary problem is: predict whether a word is "IBM" or not from its context. This | ||
+ | is related to NE chunking since knowing a word is "IBM" helps to predict whether its part of a name. |
Revision as of 16:50, 31 October 2010
Citation
R. K. Ando & T. Zhang, A High-Performance Semi-Supervised Learning Method for Text Chunking, in ACL 2005
Online version
Summary
This paper investigates a new semi-supervised learning method that addresses the problem of NE chunking and syntactic chunking.
To utilize the unlabeled data, they created numerous auxiliary problems related to the target task and train classifiers for each of those. Then they learn the common predictive structure shared by those problems. The authors argued that such common structure can be used to improve the result of target task. One example of such auxiliary problem is: predict whether a word is "IBM" or not from its context. This is related to NE chunking since knowing a word is "IBM" helps to predict whether its part of a name.