Difference between revisions of "Class Meeting for 10-707 10/25/2010"

Latest revision as of 10:03, 25 October 2010

This is one of the class meetings on the schedule for the course Information Extraction 10-707 in Fall 2010.

The TextRunner system

Slides - Know It All and TextRunner

Required Readings

Open information extraction from the web, by M. Banko, M. J Cafarella, S. Soderland, M. Broadhead, O. Etzioni. In Procs. of IJCAI, 2007.. About TextRunner.

Optional Readings

Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies, by M. Pasca, {{{coauthors}}}. In Proceedings of the 12th Conference of the European Chapter of the ACL, 2009.
A Unified Model of Phrasal and Sentential Evidence for Information Extraction, by S. Patwardhan, E. Riloff. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,, 2009.
Using structured text for large-scale attribute extraction, by S. Ravi, M. PaÅŸca. In Proc CIKM 2008, 2008.
Corroborate and learn facts from the web, by Shubin Zhao, Jonathan Betz. In KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, 2007.
Li, S., C. Y Lin, Y. I Song, and Z. Li. n.d. Comparable Entity Mining from Comparative Questions. ACL 2010
Wu, F., and D. S Weld. 2010. Open information extraction using wikipedia. In The Annual Meeting of the Association for Computational Linguistics (ACL-2010). A text-runner like system trained on dependence-parsed wikipedia pages.
Ittoo, A., and G. Bouma. n.d. On Learning Subtypes of the Part-Whole Relation: Do Not Mix your Seeds. ACL-2010. Some interesting analysis of experiments in bootstrapping to learn part-whole relations.
Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Learning (simple) first-order relationships using data produced by TextRunner.
Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP). Predicts the "functionality" of relations extracted from web text.

@@ Line 1: / Line 1: @@
 This is one of the class meetings on the [[Syllabus for Information Extraction 10-707 in Fall 2010|schedule]] for the course [[Information Extraction 10-707 in Fall 2010]].
-=== IE and Reasoning 2: Joint Inference with MLNs ===
+=== The TextRunner system ===
-* [http://www.cs.cmu.edu/~wcohen/10-707/10-25-poon-domingos.ppt Slides]
+* [http://www.cs.cmu.edu/~wcohen/10-707/10-25-knowitall-textrunner.ppt Slides - Know It All and TextRunner]
 === Required Readings ===
-* [[required::poon_2007_joint_inference_in_information_extraction | {{MyCiteconference| booktitle = PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE| coauthors = P. Domingos| date = 2007| first = H.| last = Poon| pages = 913| title = Joint inference in information extraction| volume = 22}}]]
+* [[banko_2007_open_information_extraction_from_the_web | {{MyCiteconference| booktitle = Procs. of IJCAI| coauthors = M. J Cafarella, S. Soderland, M. Broadhead, O. Etzioni| date = 2007| first = M.| last = Banko| title = Open information extraction from the web}}]].  About TextRunner.
-'''Update''': we will ''not'' cover the Bellare & McCallum paper Monday -- i.e.,  [[bellare_2009_generalized_expectation_criteria_for_bootstrapping_extractors_using_record_text_alignment | {{MyCiteconference| booktitle = Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing| coauthors = A. McCallum| date = 2009| first = K.| last = Bellare| pages = 131â€“140| title = Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment}}]]
 === Optional Readings ===
-* [[yoshikawa_2009_jointly_identifying_temporal_relations_with_markov_logic | {{MyCiteconference| booktitle = Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP| coauthors = J. NAIST, S. Riedel, M. Asahara, Y. Matsumoto| conference = ACL| date = 2009| first = K.| last = Yoshikawa| pages = 405-413| title = Jointly Identifying Temporal Relations with Markov Logic}}]]
+* [[pasca_2009_outclassing_wikipedia_in_open_domain_information_extraction_weakly_supervised_acquisition_of_attributes_over_conceptual_hierarchies | {{MyCiteconference | booktitle = Proceedings of the 12th Conference of the European Chapter of the ACL| conference = EACL| date = 2009| first = M.| last = Pasca| pages = 639-647| title = Outclassing Wikipedia in Open-Domain Information Extraction: Weakly-Supervised Acquisition of Attributes over Conceptual Hierarchies }}]]
+* [[patwardhan_2009_a_unified_model_of_phrasal_and_sentential_evidence_for_information_extraction | {{MyCiteconference | booktitle = Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,| coauthors = E. Riloff| date = 2009| first = S.| last = Patwardhan| pages = 151-160| title = A Unified Model of Phrasal and Sentential Evidence for Information Extraction }}]]
+* [[ravi_2008_using_structured_text_for_large_scale_attribute_extraction | {{MyCiteconference | booktitle = Proc CIKM 2008| coauthors = M. PaÅŸca| date = 2008| first = S.| last = Ravi| title = Using structured text for large-scale attribute extraction }}]]
+* [[zhao_2007_corroborate_and_learn_facts_from_the_web | {{MyCiteconference | booktitle = KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining| coauthors = Jonathan Betz| date = 2007| doi = http://doi.acm.org/10.1145/1281192.1281299| first = Shubin| isbn = 978-1-59593-609-7| last = Zhao| location = New York, NY, USA| pages = 995-1003| publisher = ACM| title = Corroborate and learn facts from the web }}]]
+* Li, S., C. Y Lin, Y. I Song, and Z. Li. n.d. Comparable Entity Mining from Comparative Questions. ACL 2010
+* Wu, F., and D. S Weld. 2010. Open information extraction using wikipedia. In The Annual Meeting of the Association for Computational Linguistics (ACL-2010).  A text-runner like system trained on dependence-parsed wikipedia pages.
+* Ittoo, A., and G. Bouma. n.d. On Learning Subtypes of the Part-Whole Relation: Do Not Mix your Seeds. ACL-2010.  Some interesting analysis of experiments in bootstrapping to learn part-whole relations.
+* Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP).  Learning (simple) first-order relationships using data produced by TextRunner.
+* Schoenmackers, S., J. Davis, O. Etzioni, and D. S Weld. 2010. Learning first-order horn clauses from web text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP).  Predicts the "functionality" of relations extracted from web text.

Difference between revisions of "Class Meeting for 10-707 10/25/2010"

Latest revision as of 10:03, 25 October 2010

The TextRunner system

Required Readings

Optional Readings

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools