Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015"

From Cohen Courses
Jump to navigationJump to search
Line 8: Line 8:
 
* Thus Sep 3. [[Class meeting for 10-605 Probability Review|Review of probabilities, joint distributions and naive Bayes]]
 
* Thus Sep 3. [[Class meeting for 10-605 Probability Review|Review of probabilities, joint distributions and naive Bayes]]
 
* Tues Sep 8.  [[Class meeting for 10-605 Streaming Naive Bayes|Streaming algorithms and Naive Bayes; The stream-and-sort design pattern; Naive Bayes for large feature sets.]]
 
* Tues Sep 8.  [[Class meeting for 10-605 Streaming Naive Bayes|Streaming algorithms and Naive Bayes; The stream-and-sort design pattern; Naive Bayes for large feature sets.]]
** HW1 out: streaming naive Bayes in Java.
+
** HW1 out: streaming naive Bayes in Java. [https://s3.amazonaws.com/vincy/10605-15Fall/HW1_StreamingNB.pdf PDF Handout]
 
* Thus Sep 10. [[Class meeting for 10-605 Phase Finding|Messages, records and workflows; Phrase finding.]]
 
* Thus Sep 10. [[Class meeting for 10-605 Phase Finding|Messages, records and workflows; Phrase finding.]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]

Revision as of 16:29, 4 September 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015.

Notes:

  • Homeworks, unless otherwise posted, will be due when the next HW comes out.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

  • Tues Sep 29. Scalable SGD and Hash Kernels
    • HW3 out: applying a large linear classifier to a large test set in Hadoop.
  • Thus Oct 1. TBA
    • For 805 students: an initial project proposal is due. You will get feedback on it from the instructors, and it will also be posted to the class - mainly for 605 students that are interested in collaborating, but also for general interest.
  • Tues Oct 6. Parallel Perceptrons 1.
  • Thus Oct 8. Parallel Perceptrons 2.
  • Tues Oct 13. Parameter servers and AllReduce
    • HW4 out: streaming logistic regression classifier
  • Thus Oct 15. Matrix Factorization and SGD
    • For 805 students: the final project proposal is due.
  • Tues Oct 20. guest lecture from Mark Torrance of RocketFuel
  • Thus Oct 22. midterm exam


Topics covered in previous years but not in 2015