Elworthy, 1994 Does Baum-Welch re-estimation help taggers

From Cohen Courses
Revision as of 19:25, 2 November 2011 by Dkulkarn (talk | contribs)
Jump to navigationJump to search

Citation

David Elworthy, Does Baum-Welch Re-estimation :Help Taggers?. In Proceedings of the Fourth Conference on Applied Natural Language Processing, 53--58

Online version

ACL anthology

Summary

This Paper originated during mid-nineties when there was increasing interest in hidden markov models for the determining POS Tagging. It built on the work done at Xerox on supervised learning. It analysed the effect of the Baum-Welch over the quality of annotations of the data.

The paper discusses two experiments -

  1. Effect of initial conditions on Baum-Welch re-estimation
  2. Study the behavior of Baum-Welch on a given data.

Dataset

For the first experiment, the author constructs four corpora from the LOB corpus

  1. LOB-B from part B
  2. LOB-L from part L
  3. LOB-B-G from parts B to G inclusive
  4. LOB-B-J from parts B to J inclusive

The last part is used to train the model and the other three were used as untagged data.

The second experiment used Penn Treebank.


Under construction by User:dkulkarn