Klein et al, CONLL 2003
From Cohen Courses
Revision as of 21:54, 30 November 2010 by PastStudents (talk | contribs)
Citation
Dan Klein, Joseph Smarr, Huy Nguyen and Christopher D. Manning. 2003. Named Entity Recognition with Character-Level Model. In Proceedings of CoNLL-2003.
Online version
Summary
In this paper, the authors propose using character representations instead of word representations in the Named Entity Recognition task. In word model,
Conditional Random Fields approach to the Arabic Named Entity Recognition problem. Arabic is a highly inflectional language in which words can take both prefixes and suffixes. In addition to the complex morphology of Arabic, there is also the absence of capital letters which makes NER task even harder.
A previous paper that uses character-level approach was the Cucerzan and Yarowsky, SIGDAT 1999. In that paper the authors used the prefix and suffix tries but in this paper all the characters are used.