Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015"

From Cohen Courses
Line 27: Line 27:
 
* Thurs Oct 15. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD]]
 
* Thurs Oct 15. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD]]
  
== March  ==
+
* Tues Oct 13. TBA
 
+
* Thurs Oct 15. ''student presentations''
* Sun Mar 1.
 
** '''HW3 due: Naive Bayes with Hadoop MapReduce'''
 
** HW4: [http://www.andrew.cmu.edu/user/amaurya/docs/10605/homework4.pdf PDF writeup]
 
* Tues Mar 3. ''student presentations''
 
** Adams Wei Yu (weiyu at andrew): fast PPR on Map-Reduce [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/ppr_mapreduce.pdf]
 
** Jakub Pachocki: factorization machines (and hash kernels?)  [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/FM.pdf]
 
** <strike>Wanli Ma (wanlim at andrew): coresets for k-segmentation of streams</strike>
 
* Thurs Mar 5. ''student presentations''
 
 
** Quiz: [https://qna-app.appspot.com/view.html?aglzfnFuYS1hcHByGQsSDFF1ZXN0aW9uTGlzdBiAgIDAvI2ZCAw]
 
** Quiz: [https://qna-app.appspot.com/view.html?aglzfnFuYS1hcHByGQsSDFF1ZXN0aW9uTGlzdBiAgIDAvI2ZCAw]
 
** Matt Gardner (mg1 at cs): Large-scale extensions of the path ranking algorithm [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/matt-805-presentation.pdf]
 
** Matt Gardner (mg1 at cs): Large-scale extensions of the path ranking algorithm [http://www.cs.cmu.edu/~wcohen/10-605/2015-guest-lecture/matt-805-presentation.pdf]

Revision as of 17:23, 7 July 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2015.

Notes:

  • The assignments posted are drafts based on the assignments from Spring 2015 and will be modified over the course of the semester; some may change substantially.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

September

need to revise

October

  • Tues Oct 13. TBA
  • Thurs Oct 15. student presentations
    • Quiz: [1]
    • Matt Gardner (mg1 at cs): Large-scale extensions of the path ranking algorithm [2]
    • Jesse Dodge (jessed at andrew): large-scale lasso regularization [3]
    • Ishan Misra (imisra at andrew): LSH for object detection [4]
    • HW5: memory-efficient SGD PDF handout
    • For 10/11-805 students: project proposal is due. This must contain a complete description of the data you will use.
  • Sat Mar 7 (extended from Friday):
    • HW4 due: Phrase-finding with Hadoop
  • Tues Mar 10. no class - spring break.
  • Thurs Mar 12. no class - spring break.
  • Tues Mar 17. Scalable PageRank PDF handout
  • Thurs Mar 19. Subsampling a graph with RWR
    • HW5 due: memory-efficient SGD
    • HW6: Subsampling and visualizing a graph. PDF handout
  • Tues Mar 24.
    • Student presentation: Rohan Ramanath, Bayesian Optimization
    • Guest lecture: Dai Wei, CMU, Parameter servers. (Note: This will be very relevant for one of the later HWs) PDF and ppt.
  • Thurs Mar 26. Guest lecture: D. Sculley, Google, TBA
  • Tues Mar 31. Sparse sampling and parallelization for LDA

April and May

  • Tues May 5.
    • For 10/11-805 students: project reports are due

Topics covered in previous years but not in 2015