Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Spring 2012"

From Cohen Courses
Jump to navigationJump to search
Line 26: Line 26:
 
* Thus Feb 23. ''Guest lecture'': Ron Bekkerman, LinkedIn, Scaling up Machine Learning
 
* Thus Feb 23. ''Guest lecture'': Ron Bekkerman, LinkedIn, Scaling up Machine Learning
 
* Tues Feb 28. Bloom Filters and Locality sensitive hashing 1.
 
* Tues Feb 28. Bloom Filters and Locality sensitive hashing 1.
** '''Hadoop assignments due'''
+
 
** ''New Assignment: memory-efficient SGD''
 
  
 
== March ==
 
== March ==
Line 33: Line 32:
 
* Thus Mar 1.  ''Guest Lecture'': Ben van Durme, JHU, Randomized Algorithms for Large-Scale Learning
 
* Thus Mar 1.  ''Guest Lecture'': Ben van Durme, JHU, Randomized Algorithms for Large-Scale Learning
 
* Tues Mar 6. Learning on graphs. PageRank, Harmonic field, RWR; tools and design patterns for graphs (Pregel, GraphLab, Schimmy, ...)
 
* Tues Mar 6. Learning on graphs. PageRank, Harmonic field, RWR; tools and design patterns for graphs (Pregel, GraphLab, Schimmy, ...)
** '''Assignment due: memory-efficient SGD'''
+
** '''Hadoop assignments due'''
 +
** ''New Assignment: memory-efficient SGD''
 
** ''New assignment: mini-project proposals (first draft).''
 
** ''New assignment: mini-project proposals (first draft).''
 
* Thus Mar 8. ''Guest Lecture'': Joey Gonzales, CMU, GraphLab and Dynamic Asynchronous Computation
 
* Thus Mar 8. ''Guest Lecture'': Joey Gonzales, CMU, GraphLab and Dynamic Asynchronous Computation
Line 40: Line 40:
 
* Tues Mar 20. Spectral clustering and PIC.
 
* Tues Mar 20. Spectral clustering and PIC.
 
** '''Assignment due: mini-project proposals (first draft).'''
 
** '''Assignment due: mini-project proposals (first draft).'''
 +
** '''Assignment due: memory-efficient SGD'''
 
** ''New Assignment: Subsampling and visualizing a graph.''
 
** ''New Assignment: Subsampling and visualizing a graph.''
 
* Thus Mar 22. Tentative: Guest lecture by U Kang, CMU.
 
* Thus Mar 22. Tentative: Guest lecture by U Kang, CMU.

Revision as of 18:06, 23 February 2012

This is the syllabus for Machine Learning with Large Datasets 10-605 in Spring 2012.

January

February


March

  • Thus Mar 1. Guest Lecture: Ben van Durme, JHU, Randomized Algorithms for Large-Scale Learning
  • Tues Mar 6. Learning on graphs. PageRank, Harmonic field, RWR; tools and design patterns for graphs (Pregel, GraphLab, Schimmy, ...)
    • Hadoop assignments due
    • New Assignment: memory-efficient SGD
    • New assignment: mini-project proposals (first draft).
  • Thus Mar 8. Guest Lecture: Joey Gonzales, CMU, GraphLab and Dynamic Asynchronous Computation
  • Tues Mar 13. no class - spring break.
  • Thus Mar 15. no class - spring break.
  • Tues Mar 20. Spectral clustering and PIC.
    • Assignment due: mini-project proposals (first draft).
    • Assignment due: memory-efficient SGD
    • New Assignment: Subsampling and visualizing a graph.
  • Thus Mar 22. Tentative: Guest lecture by U Kang, CMU.
  • Tues Mar 27. Gibbs sampling and LDA.
    • Assignment due: Subsampling and visualizing a graph.
    • New Assignment: mini-project proposals (final version)
  • Thus Mar 29. KNN classification and inverted indices.
    • Assignment due: mini-project proposals (final version).

April

  • Tues Apr 3. Decision trees and random forests 1.
  • Thus Apr 5. Decision trees and random forests 2.
  • Tues Apr 10. Soft joins with KNN and inverted indices 1.
  • Thus Apr 12. Soft joins with KNN and inverted indices 1.
  • Tues Apr 17. Structured prediction 1.
  • Thus Apr 19. no class - Carnival
  • Tues Apr 24. Structured prediction 2.
  • Thus Apr 26. Additional topics.

May

  • Tues May 1. Project reports.
  • Thus May 3. Project reports.