Difference between revisions of "User talk:Xxiong"

@@ Line 1: / Line 1: @@
-== Citation ==
-Einat Minkov, Richard C. Wang & William W. Cohen, Extracting Personal Names from Emails:
-Applying Named Entity Recognition to Informal Text, in HLT/EMNLP 2005
-== Online version ==
-[http://www.cs.cmu.edu/~einat/email.pdf Extracting Personal Names from Emails]
-== Summary ==
-This unpublished [[Category::Paper|manuscript]] describes how [[UsesMethod::SEARN]] can be used for three Natural Language Processing related tasks: [[AddressesProblem::Sequence Labeling]], [[AddressesProblem::Parsing]], and [[AddressesProblem::Machine Translation]]
-The key points of the paper are:
-* Authors state that [[UsesMethod::SEARN]] is efficient, widely applicable, theoretically justified, and simple.
-* [[UsesMethod::SEARN]] looks at problems a search problems, and learns classifiers that walk through the search space in a good way.
-* Authors looked at 3 sample problems: [[AddressesProblem::Sequence Labeling]], [[AddressesProblem::Parsing]], and [[AddressesProblem::Machine Translation]]
-* Efficacy of [[UsesMethod::SEARN]] hinges on ability to compute an optimal/near-optimal policy. When an optimal policy is not available, authors suggest performing explicit search as an approximation. For segmentaiton and parsing, optimal policy is closed form; for summarization and machine translation, the optimal policy is not available.
-== Example SEARN Usage ==
-'''Sequence Labeling'''
-* Discussed SEARN's application to [[AddressesProblem::POS tagging]] and [[AddressesProblem::NP chunking]]
-''Tagging''
-* Task is to produce a label sequence from an input sequence.
-* Search framed as left-to-right greedy search.
-* ''Loss function'': Hamming loss
-* Optimal Policy:
-[[File:op-tagging.png]]
-''NP Chunking''
-* Chunking is a joint segmentation and labeling problem.
-* ''Loss function'': F1 measure
-* Optimal Policy:
-[[File:op-chunking.png]]
-'''Parsing'''
-* Looked at dependency parsing with a shift-reduce framework.
-* ''Loss funtion'': Hamming loss over dependencies.
-* ''Decisions'': shift/reduce
-* ''Optimal Policy'':
-[[File:op-parsing.png]]
-'''Machine Translation'''
-* Framed task as a left-to-right translation problem.
-* Search space over prefixes of translations.
-* Actions are adding a word (or phrase to end of existing translation.
-* ''Loss function'': 1 - BLEU or 1 - NIST
-* ''Optimal policy'': given set of reference translations R, English translation prefix e_1, ... e_i-1, what word (or phrase) should be produced next / are we finished.
-== Related papers ==
-* '''Search-based Structured Prediction''': This is the journal version of the paper that introduces the [[UsesMethod::SEARN]] algorithm - [[RelatedPaper::Daume_et_al,_ML_2009]].

Difference between revisions of "User talk:Xxiong"

Latest revision as of 16:53, 8 October 2010

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools