Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015"

From Cohen Courses
Jump to navigationJump to search
Line 11: Line 11:
 
* Thus Sep 10. [[Class meeting for 10-605 Scalable Testing|More on stream-and-sort]]
 
* Thus Sep 10. [[Class meeting for 10-605 Scalable Testing|More on stream-and-sort]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]
 
* Tues Sep 15. [[Class meeting for 10-605 Hadoop 1|Hadoop and Map-Reduce]]
 +
** Also: Guest lecture from Manik Varma, MSR.
 
* Thus Sep 17. [[Class meeting for 10-605 PIG|PIG and Other Workflow Systems for Hadoop]]
 
* Thus Sep 17. [[Class meeting for 10-605 PIG|PIG and Other Workflow Systems for Hadoop]]
 
** HW2 out: naive Bayes training on Hadoop in Java.
 
** HW2 out: naive Bayes training on Hadoop in Java.

Revision as of 16:00, 10 September 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015.

Notes:

  • Homeworks, unless otherwise posted, will be due when the next HW comes out.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

  • Tues Sep 29. Scalable SGD and Hash Kernels
    • HW3 out: applying a large linear classifier to a large test set in Hadoop.
  • Thus Oct 1. TBA
    • For 805 students: an initial project proposal is due. You will get feedback on it from the instructors, and it will also be posted to the class - mainly for 605 students that are interested in collaborating, but also for general interest.
  • Tues Oct 6. Parallel Perceptrons 1.
  • Thus Oct 8. Parallel Perceptrons 2.
  • Tues Oct 13. Parameter servers and AllReduce
    • HW4 out: streaming logistic regression classifier
  • Thus Oct 15. Matrix Factorization and SGD
    • For 805 students: the final project proposal is due.
  • Tues Oct 20. guest lecture from Mark Torrance of RocketFuel
  • Thus Oct 22. midterm exam


Topics covered in previous years but not in 2015