Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2017"

From Cohen Courses
Line 29:
 
* Tues Sep 5, 2017 [[Class meeting for 10-605 Streaming Naive Bayes|Streaming Naive Bayes]].  Notes on scalable naive Bayes, Alternatives to stream and sort, Local counting in stream and sort, Stream and sort examples
 
* Thurs Sep 7, 2017 [[Class meeting for 10-605 Hadoop Overview|Hadoop Overview]].  Intro to Hadoop, Hadoop Streaming, Debugging Hadoop, Combiners
 
** '''Start work on''' Assignment 1b: Streaming NB on Hadoop; Draft at http://www.cs.cmu.edu/~wcohen/10-605/assignments/2016-fall/hw-1-naivebayes-streaming/main-b.pdf
+
** '''Start work on''' Assignment 1b: Streaming NB on Hadoop; writeup [http://www.cs.cmu.edu/~wcohen/10-605/assignments/2017-fall/bigml-assignment-1b.pdf here]
 
* Tues Sep 12, 2017 [[Class meeting for 10-605 Workflows For Hadoop|Workflows For Hadoop 1]].  Scalable classification, Abstracts for map-reduce algorithms, Joins in Hadoop
 
* Thurs Sep 14, 2017 [[Class meeting for 10-605 Workflows For Hadoop|Workflows For Hadoop 2]].  Guinea Pig intro, Similarity joins, Similarity joins with TFIDF
 

Revision as of 20:03, 7 September 2017

This is the syllabus for Machine Learning with Large Datasets 10-605 in Fall 2017.


Notes:

  • Homeworks, unless otherwise posted, will be due when the next HW comes out.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.
  • Classes are cancelled on Sept 21 (Rosh Hashana).
  • No classes will be held on Nov 23 (Thanksgiving).

Schedule for 805 projects:



Tentative schedule for lectures and 605 assignments: