Difference between revisions of "Class Meeting for 10-707 10/25/2010"
From Cohen Courses
Jump to navigationJump to search
m (1 revision) |
|||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]]. | ||
− | === | + | === The TextRunner system === |
− | * [http://www.cs.cmu.edu/~wcohen/10-707/10-25- | + | |
+ | * [http://www.cs.cmu.edu/~wcohen/10-707/10-25-knowitall-textrunner.ppt Slides - Know It All and TextRunner] | ||
=== Required Readings === | === Required Readings === | ||
− | * [[ | + | * [[banko_2007_open_information_extraction_from_the_web | {{MyCiteconference| booktitle = Procs. of IJCAI| coauthors = M. J Cafarella, S. Soderland, M. Broadhead, O. Etzioni| date = 2007| first = M.| last = Banko| title = Open information extraction from the web}}]]. About TextRunner. |
− | |||
− | |||
=== Optional Readings === | === Optional Readings === | ||
− | * [[ | + | * [[pasca_2009_outclassing_wikipedia_in_open_domain_information_extraction_weakly_supervised_acquisition_of_attributes_over_conceptual_hierarchies | {{MyCiteconference | booktitle = Proceedings of the 12th Conference of the European Chapter of the ACL| conference = EACL| date = 2009| first = M.| last = Pasca| pages = 639-647| title = Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies }}]] |
+ | * [[patwardhan_2009_a_unified_model_of_phrasal_and_sentential_evidence_for_information_extraction | {{MyCiteconference | booktitle = Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,| coauthors = E. Riloff| date = 2009| first = S.| last = Patwardhan| pages = 151-160| title = A Unified Model of Phrasal and Sentential Evidence for Information Extraction }}]] | ||
+ | * [[ravi_2008_using_structured_text_for_large_scale_attribute_extraction | {{MyCiteconference | booktitle = Proc CIKM 2008| coauthors = M. PaÅŸca| date = 2008| first = S.| last = Ravi| title = Using structured text for large-scale attribute extraction }}]] | ||
+ | * [[zhao_2007_corroborate_and_learn_facts_from_the_web | {{MyCiteconference | booktitle = KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining| coauthors = Jonathan Betz| date = 2007| doi = http://doi.acm.org/10.1145/1281192.1281299| first = Shubin| isbn = 978-1-59593-609-7| last = Zhao| location = New York, NY, USA| pages = 995-1003| publisher = ACM| title = Corroborate and learn facts from the web }}]] | ||
+ | * Li, S., C. Y Lin, Y. I Song, and Z. Li. n.d. Comparable Entity Mining from Comparative Questions. ACL 2010 | ||
+ | * Wu, F., and D. S Weld. 2010. Open information extraction using wikipedia. In The Annual Meeting of the Association for Computational Linguistics (ACL-2010). A text-runner like system trained on dependence-parsed wikipedia pages. | ||
+ | * Ittoo, A., and G. Bouma. n.d. On Learning Subtypes of the Part-Whole Relation: Do Not Mix your Seeds. ACL-2010. Some interesting analysis of experiments in bootstrapping to learn part-whole relations. | ||
+ | * Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Learning (simple) first-order relationships using data produced by TextRunner. | ||
+ | * Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Predicts the "functionality" of relations extracted from web text. |
Latest revision as of 10:03, 25 October 2010
This is one of the class meetings on the schedule for the course Information Extraction 10-707 in Fall 2010.
The TextRunner system
Required Readings
- Open information extraction from the web, by M. Banko, M. J Cafarella, S. Soderland, M. Broadhead, O. Etzioni. In Procs. of IJCAI, 2007.. About TextRunner.
Optional Readings
- Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies, by M. Pasca, {{{coauthors}}}. In Proceedings of the 12th Conference of the European Chapter of the ACL, 2009.
- A Unified Model of Phrasal and Sentential Evidence for Information Extraction, by S. Patwardhan, E. Riloff. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,, 2009.
- Using structured text for large-scale attribute extraction, by S. Ravi, M. PaÅŸca. In Proc CIKM 2008, 2008.
- Corroborate and learn facts from the web, by Shubin Zhao, Jonathan Betz. In KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, 2007.
- Li, S., C. Y Lin, Y. I Song, and Z. Li. n.d. Comparable Entity Mining from Comparative Questions. ACL 2010
- Wu, F., and D. S Weld. 2010. Open information extraction using wikipedia. In The Annual Meeting of the Association for Computational Linguistics (ACL-2010). A text-runner like system trained on dependence-parsed wikipedia pages.
- Ittoo, A., and G. Bouma. n.d. On Learning Subtypes of the Part-Whole Relation: Do Not Mix your Seeds. ACL-2010. Some interesting analysis of experiments in bootstrapping to learn part-whole relations.
- Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Learning (simple) first-order relationships using data produced by TextRunner.
- Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Predicts the "functionality" of relations extracted from web text.