Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015"

From Cohen Courses
Jump to navigationJump to search
Line 26: Line 26:
 
* Tues Oct 13. [[Class meeting for 10-605 Randomized|Randomized Algorithms 2]]
 
* Tues Oct 13. [[Class meeting for 10-605 Randomized|Randomized Algorithms 2]]
 
* Thus Oct 15. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD]]
 
* Thus Oct 15. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD]]
 +
* Tues Oct 20. TBA
 +
* Thus Oct 22. TBA
 +
* Tues Oct 27. TBA
 +
* Thus Oct 29. TBA
  
* Tues Oct 13. TBA
+
== November ==
* Thus Oct 15. ''student presentations''
+
 
** Quiz: [https://qna-app.appspot.com/view.html?aglzfnFuYS1hcHByGQsSDFF1ZXN0aW9uTGlzdBiAgIDAvI2ZCAw]
+
* Tues Nov 3. [[Class meeting for 10-605 Subsample A Graph|Scalable PageRank]] [http://curtis.ml.cmu.edu/w/courses/images/e/eb/ApproxPageRank.pdf PDF handout]
** Matt Gardner (mg1 at cs): Large-scale extensions of the path ranking algorithm [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/matt-805-presentation.pdf]
+
* Thus Nov 5. [[Class meeting for 10-605 Subsampling Graphs|Subsampling a graph with RWR]]
** Jesse Dodge (jessed at andrew): large-scale lasso regularization [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/jesse.pdf]
+
* Tues Nov 10. TBA
** Ishan Misra (imisra at andrew): LSH for object detection [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/ishan.pdf]
+
* Thus Nov 12. TBA
** ''HW5: memory-efficient SGD'' [http://www.andrew.cmu.edu/user/amaurya/docs/10605/homework5.pdf PDF handout]
+
* Tues Nov 17. [[Class meeting for 10-605 LDA 1|Sparse sampling and parallelization for LDA]]
** ''For 10/11-805 students:'' '''project proposal is due.'''  This must contain a complete description of the data you will use.
 
* Sat Mar 7 ('''extended from Friday'''):
 
** '''HW4 due: Phrase-finding with Hadoop'''
 
* Tues Mar 10. ''no class - spring break.''
 
* Thus Mar 12. ''no class - spring break.''
 
* Tues Mar 17. [[Class meeting for 10-605 Subsample A Graph|Scalable PageRank]] [http://curtis.ml.cmu.edu/w/courses/images/e/eb/ApproxPageRank.pdf PDF handout]
 
* Thus Mar 19. [[Class meeting for 10-605 Subsampling Graphs|Subsampling a graph with RWR]]
 
** '''HW5 due: memory-efficient SGD'''
 
** ''HW6: Subsampling and visualizing a graph.  [http://bit.ly/605_hw6 PDF handout]
 
* Tues Mar 24.
 
** Student presentation: Rohan Ramanath, Bayesian Optimization
 
** Guest lecture: Dai Wei, CMU, Parameter servers.  ('''Note''': This will be very relevant for one of the later HWs) [https://dl.dropboxusercontent.com/u/65353654/daiwei01_release.pdf PDF] and [https://dl.dropboxusercontent.com/u/65353654/daiwei01_release.pptx ppt].
 
* Thus Mar 26. Guest lecture: D. Sculley, Google, TBA
 
* Tues Mar 31. [[Class meeting for 10-605 LDA 1|Sparse sampling and parallelization for LDA]]
 
  
 
== April  and May ==
 
== April  and May ==

Revision as of 17:26, 7 July 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015.

Notes:

  • The assignments posted are drafts based on the assignments from spring 2015, and will be modified over the course of the semester - some may be changed substantially.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

September

need to revise

October

November

April and May

  • Tues May 5.
    • For 10/11-805 students: project reports are due

Topics covered in previous years but not in 2015