Difference between revisions of "Class meeting for 10-605 Phrase Finding"
From Cohen Courses
(9 intermediate revisions by the same user not shown)
Line 1: | Line 1:
− This is one of the class meetings on the [[Syllabus for Machine Learning with Large Datasets 10-605 in Fall
+ This is one of the class meetings on the [[Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2016|schedule]] for the course [[Machine Learning with Large Datasets 10-605 in Fall 2016]].
  === Slides ===
− * [http://www.cs.cmu.edu/~wcohen/10-605/
+ * [http://www.cs.cmu.edu/~wcohen/10-605/2016/workflow-4-and-phrases.pptx Slides in Powerpoint], [http://www.cs.cmu.edu/~wcohen/10-605/2016/workflow-4-and-phrases.pdf Slides in Powerpoint]. The slides also cover Spark and other map-reduce languages.
+
+ === Quiz ===
+
+ * [https://qna.cs.cmu.edu/#/pages/view/42 Quiz]
+
+ === Sample Code ===
+
+ * A [http://www.cs.cmu.edu/~wcohen/10-605/code/pig-examples.tgz tarball with code for phrase-finding in PIG].
+ * Some sample data for this is on HDFS on opencloud under /user/wcohen/phrases/data
  === Readings for the Class ===
Line 14: | Line 23:
  * [http://arxiv.org/pdf/cs.LG/0212032 Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews], Turney, ACL 2002
  * [http://arxiv.org/pdf/cs.CL/0212015 Answering subcognitive Turing Test questions: A reply to French], Turney, Journal of Experimental and Theoretical Artificial Intelligence, 13 (4), 409-419.
+
+ === Things to Remember ===
+
+ * The request-and-answer pattern
+ * BLRT and pointwise KL-divergence statistics
Latest revision as of 16:38, 1 August 2017
This is one of the class meetings on the schedule for the course Machine Learning with Large Datasets 10-605 in Fall 2016.
Slides
- Slides in Powerpoint, Slides in PDF. The slides also cover Spark and other map-reduce languages.
Quiz
- Quiz
Sample Code
- A tarball with code for phrase-finding in PIG.
- Some sample data for this is on HDFS on opencloud under /user/wcohen/phrases/data
Readings for the Class
- A Language Model Approach to Keyphrase Extraction, Takashi Tomokiyo and Matthew Hurst, MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment.
Also discussed
- Locating complex named entities in web text, Downey et al., IJCAI 2007.
- Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews, Turney, ACL 2002
- Answering subcognitive Turing Test questions: A reply to French, Turney, Journal of Experimental and Theoretical Artificial Intelligence, 13 (4), 409-419.
Things to Remember
- The request-and-answer pattern
- BLRT and pointwise KL-divergence statistics
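
The pointwise KL-divergence statistic named above can be illustrated with a minimal sketch. It assumes the usual phrase-finding setup from the Tomokiyo and Hurst reading: "phraseness" compares a bigram's foreground probability with the product of its words' unigram probabilities, and "informativeness" compares its foreground probability with its background probability. The function names, the count dictionaries, and the add-alpha smoothing are hypothetical choices made for illustration; they are not taken from the course's Pig code.

```python
import math


def pointwise_kl(p, q):
    """Contribution of one event to KL(p || q): p * log(p / q)."""
    if p == 0.0:
        return 0.0
    return p * math.log(p / q)


def smoothed_prob(counts, key, total, vocab_size, alpha=1.0):
    """Add-alpha smoothed estimate (an assumption, chosen so q is never 0)."""
    return (counts.get(key, 0) + alpha) / (total + alpha * vocab_size)


def score_phrase(phrase, fg_bigrams, fg_unigrams, bg_bigrams):
    """Return (phraseness, informativeness) for a two-word phrase.

    fg_* are counts from the foreground corpus, bg_bigrams from the
    background corpus; all three are plain dicts (hypothetical inputs).
    """
    x, y = phrase
    n_fg_bi = sum(fg_bigrams.values())
    n_bg_bi = sum(bg_bigrams.values())
    n_fg_uni = sum(fg_unigrams.values())
    bigram_vocab = len(set(fg_bigrams) | set(bg_bigrams))
    unigram_vocab = len(fg_unigrams)

    p_fg_xy = smoothed_prob(fg_bigrams, phrase, n_fg_bi, bigram_vocab)
    p_bg_xy = smoothed_prob(bg_bigrams, phrase, n_bg_bi, bigram_vocab)
    p_fg_x = smoothed_prob(fg_unigrams, x, n_fg_uni, unigram_vocab)
    p_fg_y = smoothed_prob(fg_unigrams, y, n_fg_uni, unigram_vocab)

    # Phraseness: do the words co-occur more often than independence predicts?
    phraseness = pointwise_kl(p_fg_xy, p_fg_x * p_fg_y)
    # Informativeness: is the phrase more frequent in the foreground corpus?
    informativeness = pointwise_kl(p_fg_xy, p_bg_xy)
    return phraseness, informativeness
```

Both scores are positive when the foreground bigram probability exceeds the quantity it is compared against, so candidate phrases can be ranked by either score or by a combination of the two.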