Difference between revisions of "Apappu writeup on Krishnan and Manning"

From Cohen Courses
Jump to navigationJump to search
 
m (1 revision)
 
(No difference)

Latest revision as of 11:42, 3 September 2010

This is a review of krishnan_2006_an_effective_two_stage_model_for_exploiting_non_local_dependencies_in_named_entity_recognition by user:Apappu.

  • Authors propose a two-stage CRF model to capture non-local features to improve corpus wide named entity recognition.
  • An initial baseline with standard feature classes has been trained using CRFs and on top of it a second-stage CRF is incorporated to address document and corpus level features.
  • Second-stage features are motivated by corpus statistics provided in Table 1 and 2, where off-diagonal numbers are less dense.


  • I like their two-stage approach since they are doing fine with time taken for inference compared to other approaches on a similar task.
  • But, they never describe what do they actually do to enforce soft or hard constraint especially in case of LOC vs ORG labeling.
  • They have a very good baseline but what if they have a mediocre baseline and they had to improve on top of it. Also, I don't see corpus level features contributing to the performance when compared to document level features.