Apappu writeup on Poon and Domingos

From Cohen Courses

Jump to navigation Jump to search

This is a review of the paper poon_2007_joint_inference_in_information_extraction by user:Apappu.

My first impression of this paper is that it is trying to imitate PCFGs through predicates and rules.

Solving segmentation and entity-extraction problem (in citation realm) jointly with the help of Markov Logic Network (set of weighted first oder clauses).

Authors show that individualy systems comparatively easy to assemble show it does indeed improve extraction accuracy

In this work, MC-SAT is used for inference, discriminative weight learning using voted perceptron for learning.

Authors have tried standalone segmentation and entity extraction treating them as baseline.

Segmentation model is a HMM with enhanced ability to detect field boundaries, whereas Entity resolution model is an MLN described in previous work (Singla and Domingos '06)

Predicates and rules are employed to transfer information from one stage to another.

To perform joint segmentation (as opposed to isloated one) a predicate has been defined over a trigram with additional constraints over field(like title, author etc.) boundary.

Essentially, this is a combination of appropriate linguistic cues and statistical model which performs better than isolated inference methods.

I like this paper !

Retrieved from "http://curtis.ml.cmu.edu/w/courses/index.php?title=Apappu_writeup_on_Poon_and_Domingos&oldid=505"

Navigation menu