Difference between revisions of "Bikel et al MLJ 1999"

From Cohen Courses
Jump to navigationJump to search
Line 7: Line 7:
 
== Summary ==
 
== Summary ==
  
In this [[Category::paper]] the authors present IdentiFinder, an [[UsesMethod::HMM|Hidden Markov Model]] approach to the [[AddressesProblem::Named Entity Recognition]] problem.
+
In this [[Category::paper]] the authors present IdentiFinder, an [[UsesMethod::HMM|Hidden Markov Model]] approach to the [[AddressesProblem::Named Entity Recognition]] problem. Most techniques used in  [[AddressesProblem::Named Entity Recognition]] until the time of the paper, were mainly based on handcrafted patterns that are completely language dependent, and not flexible to different inputs (speech input, upper case texts, etc).
 +
 
 +
This was the first [[Category::paper]] that addressed [[AddressesProblem::Named Entity Recognition]] with [[UsesMethod::HMM]]'s, recognizing a structure in the identification of named entities, formulating it as a classification problem where a word is either part of some class or not.
 +
 
 +
 
 +
 
 +
== Brief Description of the Method ==
 +
 
 +
Their solution had a model for each name-class and a model for the not-a-name text. Additionally, there are tow special states, the START-OF-SENTENCE and END-OF-SENTENCE. The figure below provides a graphical representation of the model (the dashed edges assure the completion of the graph).
 +
 
 +
[Fig:BikelHmmGraph.png]
  
 
== Results ==
 
== Results ==
 
* 100k words of training = 90% performance
 
* 100k words of training = 90% performance

Revision as of 22:03, 27 September 2011

Being edited by Rui Correia

Citation

D. M. Bikel, R. L. Schwartz, and R. M. Weischedel. An algorithm that learns what's in a name. Machine Learning Journal, 34: 211–-231, 1999.

Summary

In this paper the authors present IdentiFinder, an Hidden Markov Model approach to the Named Entity Recognition problem. Most techniques used in Named Entity Recognition until the time of the paper, were mainly based on handcrafted patterns that are completely language dependent, and not flexible to different inputs (speech input, upper case texts, etc).

This was the first paper that addressed Named Entity Recognition with HMM's, recognizing a structure in the identification of named entities, formulating it as a classification problem where a word is either part of some class or not.


Brief Description of the Method

Their solution had a model for each name-class and a model for the not-a-name text. Additionally, there are tow special states, the START-OF-SENTENCE and END-OF-SENTENCE. The figure below provides a graphical representation of the model (the dashed edges assure the completion of the graph).

[Fig:BikelHmmGraph.png]

Results

  • 100k words of training = 90% performance