Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Spring 2013"

From Cohen Courses
Line 20: Line 20:
 
* Wed Feb 6. [[Class meeting for 10-605 2013 02 06|Map-reduce and Hadoop 1]].
 
* Wed Feb 6. [[Class meeting for 10-605 2013 02 06|Map-reduce and Hadoop 1]].
 
* Mon Feb 11.  [[Class meeting for 10-605 2013 02 11|Map-reduce and Hadoop 2]].
 
* Mon Feb 11.  [[Class meeting for 10-605 2013 02 11|Map-reduce and Hadoop 2]].
 +
* Wed Feb 13. [[Class meeting for 10-605 2013 02 13|Hadoop helpers and Scalable SGD]]
 
** '''Assignment due: phrase finding with stream-and-sort'''
 
** '''Assignment due: phrase finding with stream-and-sort'''
** ''New Assignments: Naive Bayes with Hadoop & Phrase-finding with Hadoop'' [http://www.cs.cmu.edu/~afyshe/Assignment4.pdf PDF Handout]
+
** ''New Assignments: Naive Bayes with Hadoop & Phrase-finding with Hadoop''  
* Wed Feb 13. [[Class meeting for 10-605 2013 02 13|Hadoop helpers and Scalable SGD]]
 
 
* Mon Feb 18. [[Class meeting for 10-605 2013 02 18|Scalable SGD and Hash Kernels]]
 
* Mon Feb 18. [[Class meeting for 10-605 2013 02 18|Scalable SGD and Hash Kernels]]
 +
* Wed Feb 20. '' Guest lecture: Chris Dyer.  Scalable feature selection with Map-Reduce.''
 
** '''Streaming run on Hadoop of Naive Bayes due''' - checkpoint
 
** '''Streaming run on Hadoop of Naive Bayes due''' - checkpoint
* Wed Feb 20. '' Guest lecture: Chris Dyer.  Scalable feature selection with Map-Reduce.''
 
 
* Mon Feb 25. [[Class meeting for 10-605 2013 02 25|Background on randomized algorithms; Graph computations 1.]]
 
* Mon Feb 25. [[Class meeting for 10-605 2013 02 25|Background on randomized algorithms; Graph computations 1.]]
 +
* Wed Feb 27.  ''Guest Lecture: Aapo Kyrola - GraphLab and GraphChi''
 
** '''Hadoop assignment (Naive Bayes) due'''
 
** '''Hadoop assignment (Naive Bayes) due'''
* Wed Feb 27.  ''Guest Lecture: Aapo Kyrola - GraphLab and GraphChi''
 
  
 
== March ==
 
== March ==
  
 
* Mon Mar 4. [[Class meeting for 10-605 2013 03 04|Learning on graphs 2]].  
 
* Mon Mar 4. [[Class meeting for 10-605 2013 03 04|Learning on graphs 2]].  
 +
* Wed Mar 6. ''Guest lecture: John Wong (Google)''
 
** '''Hadoop assignment (phrase-finding) due'''
 
** '''Hadoop assignment (phrase-finding) due'''
 
** ''New Assignment: memory-efficient SGD'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/sgd.pdf PDF writeup]
 
** ''New Assignment: memory-efficient SGD'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/sgd.pdf PDF writeup]
 
** ''New assignment: initial project proposals.'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/initial-project-proposal.pdf PDF writeup]
 
** ''New assignment: initial project proposals.'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/initial-project-proposal.pdf PDF writeup]
* Wed Mar 6. ''Guest lecture: John Wong (Google)''
 
 
* Mon Mar 11. ''no class - spring break.''
 
* Mon Mar 11. ''no class - spring break.''
 
* Wed Mar 13. ''no class - spring break.''
 
* Wed Mar 13. ''no class - spring break.''
 
* Mon Mar 18. [[Class meeting for 10-605 2013 03 18|Subsampling a graph with RWR]]
 
* Mon Mar 18. [[Class meeting for 10-605 2013 03 18|Subsampling a graph with RWR]]
 +
* Wed Mar 20. [[Class meeting for 10-605 2013 03 20|Semi-supervised learning via label propagation on graphs]]
 
** '''Assignment due: initial mini-project proposals.'''  
 
** '''Assignment due: initial mini-project proposals.'''  
 
** '''Assignment due: memory-efficient SGD'''
 
** '''Assignment due: memory-efficient SGD'''
 
** ''New Assignment: Subsampling and visualizing a graph.'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/snowball.pdf PDF writeup]
 
** ''New Assignment: Subsampling and visualizing a graph.'' [http://www.cs.cmu.edu/~wcohen/10-605/assignments/snowball.pdf PDF writeup]
* Wed Mar 20. [[Class meeting for 10-605 2013 03 20|Semi-supervised learning via label propagation on graphs]]
 
 
* Mon Mar 25. [[Class meeting for 10-605 2013 03 25|Label propagation 2: Unsupervised label propagation, label propagation as optimization, bipartite graphs]]
 
* Mon Mar 25. [[Class meeting for 10-605 2013 03 25|Label propagation 2: Unsupervised label propagation, label propagation as optimization, bipartite graphs]]
 +
* Wed Mar 27. [[Class meeting for 10-605 2013 03 27|Understanding spectral clustering techniques.]]
 
** '''Assignment due: Subsampling and visualizing a graph.'''
 
** '''Assignment due: Subsampling and visualizing a graph.'''
** ''New Assignment: mini-project proposals (final version)''
 
* Wed Mar 27. [[Class meeting for 10-605 2013 03 27|Understanding spectral clustering techniques.]]
 
 
** '''Assignment due: mini-project proposals (final version).'''
 
** '''Assignment due: mini-project proposals (final version).'''
  

Revision as of 14:31, 8 February 2013

This is the syllabus for Machine Learning with Large Datasets 10-605 in Spring 2013.

January

February

March

April and May

May

  • Fri May 3.
    • Project writeups due at 5:00pm. Submit a paper to Blackbook as a PDF in the ICML 2013 format (up to 8 pages, double column), but do not anonymize it.