Rbalasub writeup of Etzioni et al.

From Cohen Courses
Revision as of 11:42, 3 September 2010 by WikiAdmin (talk | contribs) (1 revision)

A review of Etzioni_2004_methods_for_domain_independent_information_extraction_from_the_web_an_experimental_comparison by user:rbalasub

This work addresses the task of collecting sets of entities from the web by augmenting the domain-independent KnowItAll system with three approaches:

  • Rule Learning - learning extraction rules from the contexts around seed entities (similar to SEAL)
  • Subclass Extraction
  • List Extraction - inducing wrappers for sites with lists of relevant entities
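
To make the rule-learning idea concrete, here is a minimal sketch in Python. It is not the KnowItAll implementation: the function names `learn_context_rules` and `apply_rules`, the two-word left context, the frequency threshold, and the sample sentences are all assumptions made for illustration. It collects the words that precede known seed entities, keeps the contexts that recur, and then uses those contexts to pull out new candidate entities.

```python
import re
from collections import Counter


def learn_context_rules(sentences, seeds, width=2, min_count=2):
    """Return left contexts (word sequences) that precede seeds at least min_count times.

    Hypothetical helper for illustration; real systems use richer contexts and scoring.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        for i, token in enumerate(tokens):
            # Ignore trailing punctuation when comparing a token to the seed set.
            if token.strip(".,") in seeds and i >= width:
                counts[" ".join(tokens[i - width:i])] += 1
    return [context for context, count in counts.items() if count >= min_count]


def apply_rules(sentences, rules):
    """Extract capitalized words that directly follow any learned context."""
    found = set()
    for sentence in sentences:
        for rule in rules:
            pattern = re.escape(rule) + r"\s+([A-Z][a-z]+)"
            for match in re.finditer(pattern, sentence):
                found.add(match.group(1))
    return found


# Toy corpus and seeds (made up for this example).
sentences = [
    "We flew to cities like Paris last year.",
    "Hotels in cities like London are expensive.",
    "Tourists love cities like Berlin in summer.",
]
seeds = {"Paris", "London"}

rules = learn_context_rules(sentences, seeds)
new_entities = apply_rules(sentences, rules) - seeds
```

With this toy corpus, the context "cities like" precedes both seeds, so it is kept as a rule, and applying it surfaces "Berlin" as a new candidate. A capitalization check is a crude stand-in for the noun-phrase analysis and candidate validation a real extractor would need.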

Together, these enhancements increase the recall of the fact collection process.