Syllabus for Machine Learning with Large Datasets 10-605 in Spring 2014

From Cohen Courses
Revision as of 12:53, 20 February 2014 by Yww

This is the syllabus for Machine Learning with Large Datasets 10-605 in Spring 2014.

Notes:

  • The assignments are from 2013 and will be modified over the course of the semester; some may change substantially.
  • Lecture notes will be posted around the time of the lectures.

January

February

  • Mon Feb 3. Rocchio and Parallel Perceptrons
  • Wed Feb 5. Perceptrons/Map-reduce and Hadoop.
    • Assignment due: streaming Naive Bayes 2 (with feature counts on disk) with stream-and-sort
    • New Assignment: phrase finding with stream-and-sort. PDF Handout
  • Mon Feb 10. Parallel Perceptrons.
  • Wed Feb 12. Guest lecture: Matt Hurst, Microsoft/Bing: Local Search at Bing. One-on-one meetings with Matt can be scheduled for Thursday 2/13: morning meetings 9-12 in Gates-Hillman 6501, afternoon meetings 12:30-1:30pm in Gates-Hillman 6002.
  • Mon Feb 17. Scalable SGD and Hash Kernels
    • Assignment due: phrase finding with stream-and-sort
    • New Assignments: Naive Bayes with Streaming Hadoop, Naive Bayes with Streaming Hadoop & Phrase-finding with Hadoop. PDF Handout (4a), PDF Handout (4b)

March

April and May