Difference between revisions of "J. Artiles et al. EMNLP 2009"
From Cohen Courses
Jump to navigationJump to searchPastStudents (talk | contribs) |
PastStudents (talk | contribs) |
||
Line 16: | Line 16: | ||
For experiments, they used two standard datasets for Web People Search Systems: [[UsesDataset::WePS-1]] and [[UsesDataset::WePS-2]]. | For experiments, they used two standard datasets for Web People Search Systems: [[UsesDataset::WePS-1]] and [[UsesDataset::WePS-2]]. | ||
+ | They concluded | ||
+ | |||
+ | # NEs do not improve the clustering when compared with a combination of simpler features | ||
+ | such as local, global and snippet tokens, n-grams, etc. | ||
+ | # results are sensitive to the NER system used. |
Revision as of 00:52, 31 October 2010
Citation
Javier Artiles, Enrique Amigó & Julio Gonzalo, The role of named entities in web people search, in EMNLP 2009
Online version
The role of named entities in web people search
Summary
This paper tries to determine the role of a number of features on solving Web People Search clustering problem. In order to compare different features, they reformulated this clustering problem into a classification problem such that each pair of documents will be classified as coreferent if they share the same cluster or not coreferent, Otherwise.
For experiments, they used two standard datasets for Web People Search Systems: WePS-1 and WePS-2. They concluded
- NEs do not improve the clustering when compared with a combination of simpler features
such as local, global and snippet tokens, n-grams, etc.
- results are sensitive to the NER system used.