Difference between revisions of "Class Meeting for 10-707 11/1/2010"
From Cohen Courses
Jump to navigationJump to search
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
+ | |||
+ | |||
This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | ||
− | === | + | === IE and Reasoning 1 - WHIRL === |
− | * [http://www.cs.cmu.edu/~wcohen/10-707/11-01- | + | * [http://www.cs.cmu.edu/~wcohen/10-707/11-01-whirl.ppt Slides]. |
=== Required Readings === | === Required Readings === | ||
− | * [[ | + | * [[required::cohen_2000_whirl_a_word_based_information_representation_language | {{MyCitejournal| date = 2000| doi = http://dx.doi.org/10.1016/S0004-3702(99)00102-2| first = William W| issn = 0004-3702| issue = 1-2| journal = Artif. Intell.| last = Cohen| pages = 163–196| title = WHIRL: a word-based information representation language| volume = 118}}]] |
+ | * [[required::cohen_2000_hardening_soft_information_sources | {{MyCiteconference| booktitle = KDD '00: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining| coauthors = Henry Kautz, David McAllester| date = 2000| doi = http://doi.acm.org/10.1145/347090.347141| first = William W| isbn = 1-58113-233-6| last = Cohen| location = New York, NY, USA| pages = 255–259| publisher = ACM| title = Hardening soft information sources}}]] | ||
+ | * [[required::cohen_2003_a_comparison_of_string_distance_metrics_for_name_matching_tasks | {{MyCiteconference| booktitle = Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web (IIWeb-03)| coauthors = P. Ravikumar, S. E Fienberg| date = 2003| first = W. W| last = Cohen| title = A comparison of string distance metrics for name-matching tasks}}]] | ||
=== Optional Readings === | === Optional Readings === | ||
− | * [[ | + | * [[artiles_2009_the_role_of_named_entities_in_web_people_search | {{MyCiteconference | booktitle = Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing| coauthors = S. Madrid, E. Amigó, J. Gonzalo| date = 2009| first = J.| last = Artiles| pages = 534-542| title = The role of named entities in Web People Search }}]] |
− | + | * [[bhattacharya_2006_a_latent_dirichlet_model_for_unsupervised_entity_resolution | {{MyCiteconference | booktitle = SIAM International Conference on Data Mining| coauthors = L. Getoor| date = 2006| first = I.| last = Bhattacharya| pages = 47-58| title = A latent dirichlet model for unsupervised entity resolution }}]] | |
− | + | * [[gravano_2003_text_joins_in_an_rdbms_for_web_data_integration | {{MyCiteconference | accessdate = 2009-08-03| booktitle = Proceedings of the 12th international conference on World Wide Web| coauthors = Panagiotis G. Ipeirotis, Nick Koudas, Divesh Srivastava| date = 2003| doi = 10.1145/775152.775166| first = Luis| isbn = 1-58113-680-3| last = Gravano| location = Budapest, Hungary| pages = 90-101| publisher = ACM| title = Text joins in an RDBMS for web data integration| url = http://portal.acm.org/citation.cfm?id=775166 }}]] | |
− | * [[ | + | * [[li_2004_robust_reading_identification_and_tracing_of_ambiguous_names | {{MyCiteconference | booktitle = Proc. of NAACL| coauthors = P. Morie, D. Roth| date = 2004| first = X.| last = Li| pages = 17-24| title = Robust reading: Identification and tracing of ambiguous names }}]] |
− | * [[ | + | * [[moreau_2008_robust_similarity_measures_for_named_entities_matching | {{MyCiteconference | booktitle = Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)| coauthors = F. Yvon, O. Cappe| date = 2008| first = E.| last = Moreau| title = Robust Similarity Measures for Named Entities Matching }}]] |
− | |||
− | |||
− | * [[ | ||
− | * [[ | ||
− |
Latest revision as of 14:24, 29 October 2010
This is one of the class meetings on the schedule for the course Information Extraction 10-707 in Fall 2010.
IE and Reasoning 1 - WHIRL
Required Readings
- WHIRL: a word-based information representation language. By William W Cohen, {{{coauthors}}}. In Artif. Intell., vol. 118 (1-2), 2000.
- Hardening soft information sources, by William W Cohen, Henry Kautz, David McAllester. In KDD '00: Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000.
- A comparison of string distance metrics for name-matching tasks, by W. W Cohen, P. Ravikumar, S. E Fienberg. In Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web (IIWeb-03), 2003.
Optional Readings
- The role of named entities in Web People Search, by J. Artiles, S. Madrid, E. Amigó, J. Gonzalo. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009.
- A latent dirichlet model for unsupervised entity resolution, by I. Bhattacharya, L. Getoor. In SIAM International Conference on Data Mining, 2006.
- Text joins in an RDBMS for web data integration, by Luis Gravano, Panagiotis G. Ipeirotis, Nick Koudas, Divesh Srivastava. In Proceedings of the 12th international conference on World Wide Web, 2003.
- Robust reading: Identification and tracing of ambiguous names, by X. Li, P. Morie, D. Roth. In Proc. of NAACL, 2004.
- Robust Similarity Measures for Named Entities Matching, by E. Moreau, F. Yvon, O. Cappe. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), 2008.