Difference between revisions of "Class Meeting for 10-707 10/20/2010"
(Created page with 'This is one of the class meetings on the schedule for the course Information Extraction 10-707 in Fall 2010. === …') |
(5 intermediate revisions by 3 users not shown) | |||
This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | ||
− | === | + | === Overview of Bootstrapping and KnowItAll === |
− | * [http://www.cs.cmu.edu/~wcohen/10-707/10-20- | + | * [http://www.cs.cmu.edu/~wcohen/10-707/10-20-semisupervised.ppt Slides] |
=== Required Readings === | === Required Readings === | ||
− | * [[ | + | * [[etzioni_2004_methods_for_domain_independent_information_extraction_from_the_web_an_experimental_comparison | {{MyCiteconference| booktitle = Proceedings of the national conference on artificial intelligence| coauthors = M. Cafarella, D. Downey, A. M Popescu, T. Shaked, S. Soderland, D. S Weld, A. Yates| date = 2004| first = O.| last = Etzioni| pages = 391–398| title = Methods for domain-independent information extraction from the web: An experimental comparison}}]]. About KnowItAll. |
− | |||
− | |||
− | + | === Optional Readings === | |
− | === | + | * [[tomita_2006_expanding_the_recall_of_relation_extraction_by_bootstrapping | {{MyCiteconference| booktitle = Adaptive Text Extraction and Mining (ATEM 2006)| coauthors = S. Soderland, O. Etzioni| date = 2006| first = J.| last = Tomita| pages = 56| title = Expanding the recall of relation extraction by bootstrapping}}]] |
+ | * [[pasca_2007_weakly_supervised_discovery_of_named_entities_using_web_search_queries | {{MyCiteconference| booktitle = CIKM '07| date = 2007| first = M.| last = Pasca| title = Weakly-supervised discovery of named entities using web search queries}}]] | ||
+ | * [[schoenmackers_2008_scaling_textual_inference_to_the_web | {{MyCiteconference| booktitle = Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing| coauthors = O. Etzioni, D. S Weld| date = 2008| first = S.| last = Schoenmackers| pages = 79–88| title = Scaling textual inference to the Web}}]] | ||
+ | * [[hovy_2009_toward_completeness_in_concept_extraction_and_classification | {{MyCiteconference| booktitle = Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing| coauthors = Z. Kozareva, E. Riloff| date = 2009| first = E.| last = Hovy| title = Toward Completeness in Concept Extraction and Classification}}]] | ||
+ | * [[mota_2009_updating_a_name_tagger_using_contemporary_unlabeled_data | {{MyCiteconference| booktitle = Proceedings of the ACL-IJCNLP 2009 Conference Short Papers| coauthors = R. Grishman| date = 2009| first = C.| last = Mota| pages = 353–356| title = Updating a Name Tagger Using Contemporary Unlabeled Data}}]] | ||
+ | * [[pantel_2009_web_scale_distributional_similarity_and_entity_set_expansion | {{MyCitejournal| coauthors = E. Crestan, A. Borkovsky, A. M Popescu, V. Vyas| date = 2009| first = P.| journal = Proceedings of EMNLP-09, Singapore| last = Pantel| title = Web-scale distributional similarity and entity set expansion}}]] | ||
+ | * [[tomanek_noyear_semi_supervised_active_learning_for_sequence_labeling | {{MyCiteconference| booktitle = Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP| coauthors = U. Hahn| first = K.| last = Tomanek| pages = 1039–1047| date = 2009 | title = Semi-Supervised Active Learning for Sequence Labeling}}]] | ||
+ | * [[yan_noyear_unsupervised_relation_extraction_by_mining_wikipedia_texts_using_information_from_the_web | {{MyCiteconference| booktitle = Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP| date = 2009 | coauthors = N. Okazaki, Y. Matsuo, Z. Yang, M. Ishizuka| first = Y.| last = Yan| pages = 1021-1029| title = Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web}}]]. Combines dependency patterns in parsed Wikipedia text with bootstrapping-style surface patterns on Web text. | ||
+ | * [[talukdar_noyear_a_context_pattern_induction_method_for_named_entity_extraction | {{MyCiteconference | booktitle = Tenth Conference on Computational Natural Language Learning| coauthors = T. Brants, M. Liberman, F. Pereira| first = P. P| last = Talukdar| title = A context pattern induction method for named entity extraction }}]]. | ||
+ | * Talukdar, P. P, and F. Pereira. 2010. Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition. In 48th Annual Meeting of the Association for Computational Linguistics (ACL). Vol. 45. Comparison of different graph-based semi-supervised learning methods for information extraction tasks. | ||
+ | * Druck, G., and A. McCallum. 2010. High-Performance Semi-Supervised Learning using Discriminatively Constrained Generative Models. In ICML 2010. Constrain a generative HMM training procedure to also satisfy the feature expectations associated with a CRF model. | ||
+ | * Bollegala, D. T, Y. Matsuo, and M. Ishizuka. 2010. Relational duality: unsupervised extraction of semantic relations between entities on the web. In Proceedings of the 19th international conference on World wide web, 151–160. A fast and effective method for simultaneously clustering entity-pairs into relations and entities into classes. | ||
+ | * Yin, X., W. Tan, X. Li, and Y. C Tu. 2010. Automatic extraction of clickable structured web contents for name entity queries. In Proceedings of the 19th international conference on World wide web, 991–1000. Finds seeds to use for site-specific wrappers by analyzing query logs. | ||
− | + | == Student Presentation == | |
− | + | [http://malt.ml.cmu.edu/mw/index.php/User:Rnshah Rushin Shah] | |
− | |||
− | |||
− |
Latest revision as of 11:14, 20 October 2010
This is one of the class meetings on the schedule for the course Information Extraction 10-707 in Fall 2010.
Overview of Bootstrapping and KnowItAll
- Slides: http://www.cs.cmu.edu/~wcohen/10-707/10-20-semisupervised.ppt
Required Readings
- Methods for domain-independent information extraction from the web: An experimental comparison, by O. Etzioni, M. Cafarella, D. Downey, A. M Popescu, T. Shaked, S. Soderland, D. S Weld, A. Yates. In Proceedings of the national conference on artificial intelligence, 2004. About KnowItAll.
Optional Readings
- Expanding the recall of relation extraction by bootstrapping, by J. Tomita, S. Soderland, O. Etzioni. In Adaptive Text Extraction and Mining (ATEM 2006), 2006.
- Weakly-supervised discovery of named entities using web search queries, by M. Pasca. In CIKM '07, 2007.
- Scaling textual inference to the Web, by S. Schoenmackers, O. Etzioni, D. S Weld. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008.
- Toward Completeness in Concept Extraction and Classification, by E. Hovy, Z. Kozareva, E. Riloff. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009.
- Updating a Name Tagger Using Contemporary Unlabeled Data, by C. Mota, R. Grishman. In Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, 2009.
- Web-scale distributional similarity and entity set expansion, by P. Pantel, E. Crestan, A. Borkovsky, A. M Popescu, V. Vyas. In Proceedings of EMNLP-09, Singapore, 2009.
- Semi-Supervised Active Learning for Sequence Labeling, by K. Tomanek, U. Hahn. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, 2009.
- Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web, by Y. Yan, N. Okazaki, Y. Matsuo, Z. Yang, M. Ishizuka. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, 2009. Combines dependency patterns in parsed Wikipedia text with bootstrapping-style surface patterns on Web text.
- A context pattern induction method for named entity extraction, by P. P Talukdar, T. Brants, M. Liberman, F. Pereira. In Tenth Conference on Computational Natural Language Learning.
- Talukdar, P. P, and F. Pereira. 2010. Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition. In 48th Annual Meeting of the Association for Computational Linguistics (ACL). Vol. 45. Comparison of different graph-based semi-supervised learning methods for information extraction tasks.
- Druck, G., and A. McCallum. 2010. High-Performance Semi-Supervised Learning using Discriminatively Constrained Generative Models. In ICML 2010. Constrain a generative HMM training procedure to also satisfy the feature expectations associated with a CRF model.
- Bollegala, D. T, Y. Matsuo, and M. Ishizuka. 2010. Relational duality: unsupervised extraction of semantic relations between entities on the web. In Proceedings of the 19th international conference on World wide web, 151–160. A fast and effective method for simultaneously clustering entity-pairs into relations and entities into classes.
- Yin, X., W. Tan, X. Li, and Y. C Tu. 2010. Automatic extraction of clickable structured web contents for name entity queries. In Proceedings of the 19th international conference on World wide web, 991–1000. Finds seeds to use for site-specific wrappers by analyzing query logs.
Student Presentation
- Rushin Shah (http://malt.ml.cmu.edu/mw/index.php/User:Rnshah)