Liuliu writeup of Cohen 2003

From Cohen Courses
Revision as of 10:42, 3 September 2010 by WikiAdmin (talk | contribs) (1 revision)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This is a review of Cohen_2003_a_comparison_of_string_distance_metrics_for_name_matching_tasks by user:Liuliu.

This paper compares several string distance metrics: token-based distance metrics, edit distance based methods and their proposed hybrid distance method(SoftTFIDF) which combines the above two schemes. Results show that the hybrid approach achieves a better performance on both matching and clustering tasks. Further, a hybrid approach which combines distance metrics by using SVM is evaluated and achieves a better performance.