Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015"

From Cohen Courses
Jump to navigationJump to search
Line 8: Line 8:
 
* Thus Sep 3. [[Class meeting for 10-605 Probability Review|Review of probabilities, joint distributions and naive Bayes]]
 
* Thus Sep 3. [[Class meeting for 10-605 Probability Review|Review of probabilities, joint distributions and naive Bayes]]
 
* Tues Sep 8.  [[Class meeting for 10-605 Streaming Naive Bayes|Streaming algorithms and Naive Bayes; The stream-and-sort design pattern; Naive Bayes for large feature sets.]]
 
* Tues Sep 8.  [[Class meeting for 10-605 Streaming Naive Bayes|Streaming algorithms and Naive Bayes; The stream-and-sort design pattern; Naive Bayes for large feature sets.]]
 +
** HW1 out: streaming naive Bayes in Java.
 
* Thus Sep 10. [[Class meeting for 10-605 Phase Finding|Messages, records and workflows; Phrase finding.]]
 
* Thus Sep 10. [[Class meeting for 10-605 Phase Finding|Messages, records and workflows; Phrase finding.]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]
 +
** HW2 out: naive Bayes training on Hadoop in Java.
 
* Thus Sep 17. [[Class meeting for 10-605 PIG|PIG and Other Workflow Systems for Hadoop]]
 
* Thus Sep 17. [[Class meeting for 10-605 PIG|PIG and Other Workflow Systems for Hadoop]]
 
* Tues Sep 22. [[Class_meeting_for_10-605_Rocchio_and_On-line_Learning|Rocchio and TFIDF]]
 
* Tues Sep 22. [[Class_meeting_for_10-605_Rocchio_and_On-line_Learning|Rocchio and TFIDF]]
 +
** HW3 out: applying a large linear classifier to a large test set in Hadoop.
 
* Thus Sep 24.  [[Class meeting for 10-605 Similarity Joins|Fast KNN and similarity joins]]
 
* Thus Sep 24.  [[Class meeting for 10-605 Similarity Joins|Fast KNN and similarity joins]]
  

Revision as of 17:02, 23 August 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015.

Notes:

  • The assignments posted are drafts based on the assignments from spring 2015, and will be modified over the course of the semester - some may be changed substantially.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.



Topics covered in previous years but not in 2015