Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-405 in Spring 2018"

From Cohen Courses
Jump to navigationJump to search
Line 14: Line 14:
 
* Wed Jan 31, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 1]].  Scalable classification, Abstracts for map-reduce algorithms, Joins in Hadoop
 
* Wed Jan 31, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 1]].  Scalable classification, Abstracts for map-reduce algorithms, Joins in Hadoop
 
* Mon Feb 5, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 2]].  Guinea Pig intro, Similarity joins, Similarity joins with TFIDF, Parallel simjoins
 
* Mon Feb 5, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 2]].  Guinea Pig intro, Similarity joins, Similarity joins with TFIDF, Parallel simjoins
** '''Start work on''' Assignment 2: Naive bayes testing in Guinea Pig; Draft at https://autolab.andrew.cmu.edu/courses/10405-s18/assessments/hw2anbwithguineapig/writeup
+
** '''Start work on''' Assignment 2a: Naive bayes training in Guinea Pig
 
* Wed Feb 7, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 3]].  PageRank, PageRank in Pig and Guinea Pig, K-means in Pig, Spark, Systems built on top of Hadoop
 
* Wed Feb 7, 2018 [[Class meeting for 10-405 Workflows For Hadoop|Workflows For Hadoop 3]].  PageRank, PageRank in Pig and Guinea Pig, K-means in Pig, Spark, Systems built on top of Hadoop
 
* Mon Feb 12, 2018 [[Class meeting for 10-405 SGD and Hash Kernels|SGD and Hash Kernels]].  Learning as optimization, Logistic regression with SGD, Regularized SGD, Efficient regularized SGD, Hash kernels for logistic regression
 
* Mon Feb 12, 2018 [[Class meeting for 10-405 SGD and Hash Kernels|SGD and Hash Kernels]].  Learning as optimization, Logistic regression with SGD, Regularized SGD, Efficient regularized SGD, Hash kernels for logistic regression
 +
** '''Start work on''' Assignment 2b: Draft at https://autolab.andrew.cmu.edu/courses/10405-s18/assessments/hw2anbwithguineapig/writeup
 
* Wed Feb 14, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 1]].  The "delta trick", Averaged perceptrons, Debugging ML algorithms
 
* Wed Feb 14, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 1]].  The "delta trick", Averaged perceptrons, Debugging ML algorithms
 +
* Mon Feb 19, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 2]].  Hash kernels, Ranking perceptrons, Structured perceptrons
 
** '''Start work on''' Assignment 3: scalable SGD; Draft at http://www.cs.cmu.edu/~wcohen/10-405/assignments/2016-fall/hw-3-sga-logreg/main.pdf
 
** '''Start work on''' Assignment 3: scalable SGD; Draft at http://www.cs.cmu.edu/~wcohen/10-405/assignments/2016-fall/hw-3-sga-logreg/main.pdf
* Mon Feb 19, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 2]].  Hash kernels, Ranking perceptrons, Structured perceptrons
 
 
* Wed Feb 21, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 3]].  Iterative parameter mixing paper, Parallel SGD via Param Mixing
 
* Wed Feb 21, 2018 [[Class meeting for 10-405 Parallel Perceptrons|Parallel Perceptrons 3]].  Iterative parameter mixing paper, Parallel SGD via Param Mixing
 
* Mon Feb 26, 2018 [[Class meeting for 10-405 SGD for MF|SGD for MF]].  Matrix factorization, Matrix factorization with SGD, distributed matrix factorization with SGD
 
* Mon Feb 26, 2018 [[Class meeting for 10-405 SGD for MF|SGD for MF]].  Matrix factorization, Matrix factorization with SGD, distributed matrix factorization with SGD
 
* Wed Feb 28, 2018 [[Class meeting for 10-405 Guest lecture - tentative|Guest lecture - tentative]].   
 
* Wed Feb 28, 2018 [[Class meeting for 10-405 Guest lecture - tentative|Guest lecture - tentative]].   
** '''Last assignment due'''
 
 
* Mon Mar 5, 2018 [[Class meeting for 10-405 Midterm review and catchup|Midterm review and catchup]].  Midterm review
 
* Mon Mar 5, 2018 [[Class meeting for 10-405 Midterm review and catchup|Midterm review and catchup]].  Midterm review
 +
** '''Previous assignment due'''
 
* Wed Mar 7, 2018 [[Class meeting for 10-405 Midterm|Midterm]].   
 
* Wed Mar 7, 2018 [[Class meeting for 10-405 Midterm|Midterm]].   
 
* Mon Mar 19, 2018 [[Class meeting for 10-405 Computing with GPUs|Computing with GPUs]].  Introduction to GPUs, CUDA, Vectorization
 
* Mon Mar 19, 2018 [[Class meeting for 10-405 Computing with GPUs|Computing with GPUs]].  Introduction to GPUs, CUDA, Vectorization

Revision as of 17:11, 9 February 2018

This is the syllabus for Machine Learning with Large Datasets 10-405 in Spring 2018.

Notes

  • Homeworks, unless otherwise posted, will be due when the next HW comes out.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.