Structured Linear Classifier
From Cohen Courses
Latest revision as of 05:08, 6 November 2012
Online
Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms, Collins, 2002, in Proc. EMNLP. Weblink
Abstract
We describe new algorithms for training tagging models, as an alternative to maximum-entropy models or conditional random fields (CRFs). The algorithms rely on Viterbi decoding of training examples, combined with simple additive updates. We describe theory justifying the algorithms through a modification of the proof of convergence of the perceptron algorithm for classification problems. We give experimental results on part-of-speech tagging and base noun phrase chunking, in both cases showing improvements over results for a maximum-entropy tagger.
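The core of the method in the abstract is the structured perceptron: Viterbi-decode each training sentence under the current weights, and whenever the decoded tag sequence differs from the gold one, apply a simple additive update (+1 on gold features, -1 on predicted features). The following is a minimal sketch of that training loop; the feature templates (word/tag emissions and tag-bigram transitions) and the toy data are illustrative assumptions, not taken from the paper.

```python
from collections import defaultdict

def features(words, tags):
    """Emission and transition indicator features for a tagged sentence (assumed templates)."""
    feats = defaultdict(int)
    prev = "<s>"
    for word, tag in zip(words, tags):
        feats[("emit", word, tag)] += 1
        feats[("trans", prev, tag)] += 1
        prev = tag
    return feats

def viterbi(words, tagset, w):
    """Highest-scoring tag sequence under the linear weight vector w."""
    # delta[t] = best score of any partial sequence ending in tag t
    delta = {t: w[("emit", words[0], t)] + w[("trans", "<s>", t)] for t in tagset}
    back = []
    for word in words[1:]:
        new, ptr = {}, {}
        for t in tagset:
            best = max(delta, key=lambda p: delta[p] + w[("trans", p, t)])
            new[t] = delta[best] + w[("trans", best, t)] + w[("emit", word, t)]
            ptr[t] = best
        back.append(ptr)
        delta = new
    # follow back-pointers from the best final tag
    tag = max(delta, key=delta.get)
    seq = [tag]
    for ptr in reversed(back):
        tag = ptr[tag]
        seq.append(tag)
    return list(reversed(seq))

def train(data, tagset, epochs=5):
    """Structured perceptron: decode, then additively update on mistakes."""
    w = defaultdict(float)
    for _ in range(epochs):
        for words, gold in data:
            guess = viterbi(words, tagset, w)
            if guess != gold:
                # additive update: promote gold features, demote predicted ones
                for f, c in features(words, gold).items():
                    w[f] += c
                for f, c in features(words, guess).items():
                    w[f] -= c
    return w

# toy training set (hypothetical)
data = [
    (["the", "dog", "barks"], ["D", "N", "V"]),
    (["a", "cat", "sleeps"], ["D", "N", "V"]),
]
w = train(data, tagset=["D", "N", "V"])
print(viterbi(["the", "cat", "barks"], ["D", "N", "V"], w))
```

The convergence proof sketched in the abstract follows the classical perceptron mistake bound: if the training data is separable with some margin under the feature map, the number of updates is bounded, even though decoding searches over exponentially many tag sequences. (The paper also recommends averaging the weight vectors over updates; that refinement is omitted here for brevity.)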