Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Spring 2015"

From Cohen Courses
Line 51:
  ** ''HW5: memory-efficient SGD'' [http://curtis.ml.cmu.edu/w/courses/images/0/08/Sgd.pdf PDF handout]
  ** ''For 10/11-805 students:'' '''project proposal is due.''' This must contain a complete description of the data you will use.
+ * Fri Mar 6.
+ ** '''HW4 due: Phrase-finding with Hadoop'''
  * Tues Mar 10. ''no class - spring break.''
- ** '''HW4 due: Phrase-finding with Hadoop'''
  * Thurs Mar 12. ''no class - spring break.''
  * Tues Mar 17. [[Class meeting for 10-605 Subsample A Graph|Scalable PageRank]]
+ [http://curtis.ml.cmu.edu/w/courses/images/e/eb/ApproxPageRank.pdf PDF handout]
+ * Thurs Mar 19. [[Class meeting for 10-605 Subsampling Graphs|Subsampling a graph with RWR]]
  ** '''HW5 due: memory-efficient SGD'''
- ** ''HW6: Subsampling and visualizing a graph.'' [http://curtis.ml.cmu.edu/w/courses/images/e/eb/ApproxPageRank.pdf PDF handout]
+ ** ''HW6: Subsampling and visualizing a graph.''
- * Thurs Mar 19. [[Class meeting for 10-605 Subsampling Graphs|Subsampling a graph with RWR]]
  * Tues Mar 24. [[Class meeting for 10-605 SSL on Graphs|Subsampling continued and SSL on Graphs]] '''AAAI Spring Symposium week'''
  * Thurs Mar 26. Guest lecture: D. Sculley, Google, TBA

Revision as of 17:52, 23 February 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Spring 2015.

Notes:

  • The assignments posted are drafts based on the assignments from 2014, and will be modified over the course of the semester - some may be changed substantially.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

January

February

March

  • Sun Mar 1.
    • HW3 due: Naive Bayes with Hadoop MapReduce
  • Tues Mar 3. student presentations
    • Adams Wei Yu (weiyu at andrew): fast PPR on Map-Reduce
    • Jakub Pachocki: factorization machines (and hash kernels?)
    • Wanli Ma (wanlim at andrew): coresets for k-segmentation of streams
  • Thurs Mar 5. student presentations
    • Matt Gardner (mg1 at cs): Large-scale extensions of the path ranking algorithm
    • Jesse Dodge (jessed at andrew): large-scale lasso regularization
    • Ishan Misra (imisra at andrew): LSH for object detection
    • HW5: memory-efficient SGD PDF handout
    • For 10/11-805 students: project proposal is due. This must contain a complete description of the data you will use.
  • Fri Mar 6.
    • HW4 due: Phrase-finding with Hadoop
  • Tues Mar 10. no class - spring break.
  • Thurs Mar 12. no class - spring break.
  • Tues Mar 17. Scalable PageRank

PDF handout

April and May

  • Tues May 5.
    • For 10/11-805 students: project reports are due

Topics covered in previous years but not in 2015