Apappu writeup on Poon and Domingos

From Cohen Courses
Jump to navigationJump to search

This is a review of the paper poon_2007_joint_inference_in_information_extraction by user:Apappu.

  • My first impression of this paper is that it is trying to imitate PCFGs through predicates and rules.
  • Solving segmentation and entity-extraction problem (in citation realm) jointly with the help of Markov Logic Network (set of weighted first oder clauses).
  • Authors show that individualy systems comparatively easy to assemble show it does indeed improve extraction accuracy
  • In this work, MC-SAT is used for inference, discriminative weight learning using voted perceptron for learning.
  • Authors have tried standalone segmentation and entity extraction treating them as baseline.
  • Segmentation model is a HMM with enhanced ability to detect field boundaries, whereas Entity resolution model is an MLN described in previous work (Singla and Domingos '06)
  • Predicates and rules are employed to transfer information from one stage to another.
  • To perform joint segmentation (as opposed to isloated one) a predicate has been defined over a trigram with additional constraints over field(like title, author etc.) boundary.
  • Essentially, this is a combination of appropriate linguistic cues and statistical model which performs better than isolated inference methods.
  • I like this paper !