Syllabus for Information Extraction 10-707 in Fall 2009
This is the syllabus for Information Extraction 10-707 in Fall 2009. The slides available may be out of date (from the last version of the class) but will be updated after each lecture.
September
- Wed 9/9: Organizational meeting and overview of IE
- Mon 9/14: Overview of Relation Extraction and Some Case Studies
- Wed 9/16: NER via classification
- Mon 9/21: HMMs, CMMs, and MEMMs
- Wed 9/23: Linear-chain CRFs
- Mon 9/28: Yom Kippur - no lecture
- Wed 9/30: Meta-learning: Stacking and Sequential Models
- Student presentation 1
- Student presentation 2
October
- Mon 10/5: Perceptrons as margin learners
- Student presentation 1
- Student presentation 2
- One or two page team proposal for project due
- Wed 10/7: Ranking perceptrons and NER
- Student presentation 1
- Student presentation 2
- Mon 10/12: Long-range dependencies and general CRFs
- Student presentation 1 - Ramnath Balasubramanyan - Integer Linear Programming Inference for Conditional Random Fields, Roth and Yih, ICML 2005 ---Rbalasub
- Student presentation 2
- Wed 10/14: Relation extraction with kernels
- Student presentation 1
- Student presentation 2
- Mon 10/19: More relation and fact extraction
- Student presentation 1
- Student presentation 2 - Yandong Liu - Joint Extraction of Entities and Relations for Opinion Recognition, Choi, Breck, Cardie, EMNLP 2006. ---Yandongl
- Wed 10/21: IE and reasoning 1
- Student presentation 1 - Bhavana Dalvi - Hierarchical Hidden Markov Models for Information Extraction, Skounakis, Craven and Ray, IJCAI 2003 ---Bbd
- Student presentation 2
- Mon 10/26: IE and reasoning 2
- Student presentation 1: Aasish Pappu - Improving out-of-vocabulary name resolution, Palmer and Ostendorf, Computer Speech & Language, Volume 19, Issue 1, January 2005, Pages 107-128 ---Apappu
- Student presentation 2: Keisuke Kamataki - Jointly Identifying Temporal Relations with Markov Logic, by K. Yoshikawa, S. Riedel, M. Asahara, Y. Matsumoto. In Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, 2009.
- Wed 10/28: IE and Reasoning 3
- Student presentation 1: Nathan Schneider - Sameer Singh, Karl Schultz, and Andrew McCallum (2009). Bi-directional joint inference for entity resolution and segmentation using imperatively-defined factor graphs. In Machine Learning and Knowledge Discovery in Databases (pp. 414-429).
November
- Student presentation 1 - Graph-based Analysis of Semantic Drift in Espresso-like bootstrapping Algorithms, Proceedings of the Conference on Empirical Methods in Natural Language Processing 2008 ---Reza Bosagh Zadeh
- Student presentation 2 - Liu Yang - Kernel Conditional Random Fields: ..., Lafferty et al, ICML 2004. ---Liuy
- Wed 11/4: The TextRunner system
- Student presentation 1: Banko_2008 - Ni Lao
- Student presentation 2: Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies, by M. Pasca. In Proceedings of the 12th Conference of the European Chapter of the ACL, 2009. --- Selen
- Two page status update on project due
- Mon 11/9: Guest lecture: Tom Mitchell on the Read the Web project
- No student presentations since William is out of town
- Wed 11/11: Catchup and student presentations
- Student presentation 1: Minh Duong - A latent dirichlet model for unsupervised entity resolution, by I. Bhattacharya, L. Getoor. In SIAM International Conference on Data Mining, 2006.
- Student presentation 2: Liu Liu - Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web, by Y. Yan, N. Okazaki, Y. Matsuo, Z. Yang, M. Ishizuka. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, 2009.
- Student presentation 3: Sgardine - A Unified Model of Phrasal and Sentential Evidence for Information Extraction, by S. Patwardhan, E. Riloff. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009.
- Student presentation 4
- Mon 11/16: Graph-based methods for set expansion 1
- Wed 11/18: Graph-based methods for set expansion 2
- Student presentation: Unsupervised Coreference Resolution in a Nonparametric Bayesian Model, Aria Haghighi and Dan Klein, in Proceedings of ACL 2007 ---Mehrbod Sharifi
- Student presentation, Philgoo Han: Lin_2009_phrase_clustering_for_discriminative_learning
- Student presentation, Weam AbuZaki: Co-EM Support Vector Learning, U. Brefeld and T. Scheffer, Proceedings of the International Conference on Machine Learning, 2004.
- Mon 11/23: Invited Lecture by Hagit Shatkay
- No student presentations today
- Wed 11/25: Thanksgiving--no class
- Mon 11/30: Project presentations
- Lao, Liu and Liu
- Dalvi and Balasubramanyan
- Uguroglu and Gopal
December
- Wed 12/2: Project presentations
- Pappu and Duong
- Reza B
- Keisuke K
- Mon 12/7: Project presentations. Note there is a different location this week - we're meeting in Hillman 6501.
- Nathan S and Liu
- Harshit Surunah
- Philgoo Han
- Wed 12/9: Project presentations. Note there is a different location this week - we're meeting in Hillman 6501.
- Steve Gardiner
- Weam AbuZaki
- Fri 12/11: Final project writeup is due.
- Your writeup should be submitted in the format used by the ICML 2009 Conference, which is 8 pages, double-column. You should send your paper to me via email by midnight Samoa time on 12/11.