Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Spring 2015"

From Cohen Courses
Jump to navigationJump to search
Line 29: Line 29:
 
* Thus Feb 12. '''student presentations'''
 
* Thus Feb 12. '''student presentations'''
 
* Tues Feb 17. [[Class meeting for 10-605 SGD and Hash Kernels|Scalable SGD and Hash Kernels]]
 
* Tues Feb 17. [[Class meeting for 10-605 SGD and Hash Kernels|Scalable SGD and Hash Kernels]]
** '''HW3 due. (Naive Bayes with Hadoop)'''
+
** '''HW3 due: Naive Bayes with Hadoop'''
 
* Thus Feb 19. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD, plus another Hadoop demo]]
 
* Thus Feb 19. [[Class meeting for 10-605 SGD for MF|Matrix Factorization and SGD, plus another Hadoop demo]]
 
* Tues Feb 24. [[Class meeting for 10-605 SGD for MF 2 and Randomized Algorithms|SGD for Matrix Factorization, and Randomized Algorithms 1 (Bloom Filters)]]
 
* Tues Feb 24. [[Class meeting for 10-605 SGD for MF 2 and Randomized Algorithms|SGD for Matrix Factorization, and Randomized Algorithms 1 (Bloom Filters)]]
Line 39: Line 39:
 
* Tues Mar 3. '''student presentations'''
 
* Tues Mar 3. '''student presentations'''
 
* Thus Mar 5. '''student presentations'''
 
* Thus Mar 5. '''student presentations'''
** '''HW4 due. (Phrase-finding with Hadoop)'''
+
** '''HW4 due: Phrase-finding with Hadoop'''
 
** ''HW5: memory-efficient SGD'' [http://curtis.ml.cmu.edu/w/courses/images/0/08/Sgd.pdf PDF handout]
 
** ''HW5: memory-efficient SGD'' [http://curtis.ml.cmu.edu/w/courses/images/0/08/Sgd.pdf PDF handout]
 
* Tues Mar 10. ''no class - spring break.''
 
* Tues Mar 10. ''no class - spring break.''
Line 49: Line 49:
 
* Tues Mar 24. [[Class meeting for 10-605 SSL on Graphs|Subsamping continued and SSL on Graphs]]  '''AAAI Spring Symposium week'''
 
* Tues Mar 24. [[Class meeting for 10-605 SSL on Graphs|Subsamping continued and SSL on Graphs]]  '''AAAI Spring Symposium week'''
 
* Thus Mar 26. [[Class meeting for 10-605 Spectral Clustering|Scalable spectral clustering techniques.]] '''AAAI Spring Symposium week'''
 
* Thus Mar 26. [[Class meeting for 10-605 Spectral Clustering|Scalable spectral clustering techniques.]] '''AAAI Spring Symposium week'''
 
 
* Tues Mar 31. [[Class meeting for 10-605 LDA 1|Sparse sampling and parallelization for LDA]]
 
* Tues Mar 31. [[Class meeting for 10-605 LDA 1|Sparse sampling and parallelization for LDA]]
 
+
** '''HW6 due: Subsampling and visualizing a graph.'''
  
 
== April  ==
 
== April  ==
Line 59: Line 58:
 
* Thus Apr 9. [[Class meeting for 10-605 Similarity Joins|Fast KNN and similarity joins]]
 
* Thus Apr 9. [[Class meeting for 10-605 Similarity Joins|Fast KNN and similarity joins]]
 
* Tues Apr 14.  [[Class meeting for 10-605 Parallel Similarity Joins|Parallel/Scalable Similarity Joins]]
 
* Tues Apr 14.  [[Class meeting for 10-605 Parallel Similarity Joins|Parallel/Scalable Similarity Joins]]
** '''Assignment due: Subsampling and visualizing a graph.'''
+
 
 
** ''New Assignment: Workflows with Pig'' [http://curtis.ml.cmu.edu/w/courses/images/4/46/Nb_pig.pdf PDF handout]
 
** ''New Assignment: Workflows with Pig'' [http://curtis.ml.cmu.edu/w/courses/images/4/46/Nb_pig.pdf PDF handout]
 
* Thus Apr 16. ''no class : carnival''
 
* Thus Apr 16. ''no class : carnival''

Revision as of 16:48, 5 January 2015

This is the syllabus for Machine Learning with Large Datasets 10-605 in Spring 2015.

Notes:

  • The assignments are from 2014, and will be modified over the course of the semester - some may be changed substantially.
  • Lecture notes and/or slides will be posted around the time of the lectures.

January

February

PDF Handout (4b) HW4 PDF Handout (4c) HW5


March

April

Topics covered in previous years but not in 2015