Selen writeup of Brin 1999
From Cohen Courses
Latest revision as of 11:42, 3 September 2010
This is a review of Brin_1999_extracting_patterns_and_relations_from_the_world_wide_web by user:Selen
This is a very early paper on bootstrapping. The authors start with 5 author-title pairs, search the web for occurrences of those pairs, and extract patterns from the results. Using these patterns, they find new occurrences and so new pairs, and the cycle repeats.
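The loop described above can be sketched roughly as follows. This is only a toy illustration, not the paper's actual algorithm: it assumes a tiny in-memory list of strings instead of the web, and it reduces a "pattern" to the text found between author and title. The names find_middles, apply_middles and bootstrap, as well as the sample corpus, are made up for this example.

```python
import re

def find_middles(corpus, pairs):
    """Collect the text between author and title wherever a known pair occurs."""
    middles = set()
    for text in corpus:
        for author, title in pairs:
            m = re.search(re.escape(author) + r"(.+?)" + re.escape(title), text)
            if m:
                middles.add(m.group(1))
    return middles

def apply_middles(corpus, middles):
    """Use each collected middle as a pattern to extract (author, title) pairs."""
    pairs = set()
    for text in corpus:
        for mid in middles:
            m = re.fullmatch(r"(.+?)" + re.escape(mid) + r"(.+)", text)
            if m:
                pairs.add((m.group(1), m.group(2)))
    return pairs

def bootstrap(corpus, seeds, max_rounds=5):
    """Alternate between learning patterns and extracting pairs until nothing new appears."""
    pairs = set(seeds)
    for _ in range(max_rounds):
        middles = find_middles(corpus, pairs)
        new_pairs = pairs | apply_middles(corpus, middles)
        if new_pairs == pairs:
            break
        pairs = new_pairs
    return pairs

# Hypothetical toy corpus: one "document" per string.
corpus = [
    "Isaac Asimov, author of Foundation",
    "Frank Herbert, author of Dune",
    "Frank Herbert wrote Dune",
    "Jane Austen wrote Emma",
]
seeds = {("Isaac Asimov", "Foundation")}
result = bootstrap(corpus, seeds)
```

With this toy input, the single seed yields the ", author of " pattern, which finds the Frank Herbert pair; that pair in turn yields the " wrote " pattern, which finds the Jane Austen pair. It also shows why the seed list matters: everything that is found is reachable only through patterns that co-occur with the seeds.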
My criticisms are:
-- The usage of regular expressions: matching author names with the proposed regex will only find English names
-- The results are skewed toward sci-fi; they could have expanded their seed list
-- There is still manual intervention
-- It is slow
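To illustrate the first point, here is a small sketch. The ASCII-only pattern is an assumption written in the spirit of the kind of author regex used in the paper, not a quotation from it, and the Unicode-aware variant is just one possible alternative (it relies on Python's `\w` matching Unicode letters and drops the requirement of an initial capital for simplicity).

```python
import re

# Assumed ASCII-only author pattern (illustrative, not quoted from the paper).
ascii_author = re.compile(r"[A-Z][A-Za-z .,&]{5,30}[A-Za-z.]")

# One possible Unicode-aware variant: [^\W\d_] matches any letter in any script.
unicode_author = re.compile(r"[^\W\d_](?:[^\W\d_]|[ .,&]){5,30}(?:[^\W\d_]|\.)")

names = ["Isaac Asimov", "Gabriel García Márquez", "Stanisław Lem"]

# Which names does each pattern accept as a whole?
ascii_hits = [n for n in names if ascii_author.fullmatch(n)]
unicode_hits = [n for n in names if unicode_author.fullmatch(n)]
```

Here ascii_hits contains only "Isaac Asimov", while unicode_hits contains all three names, since the accented and non-English letters fall outside the A-Z and a-z ranges.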