Difference between revisions of "Syllabus for Machine Learning with Large Datasets 10-405 in Spring 2018"

From Cohen Courses
Line 51:
 
* Mon Mar 26, 2018 [[Class meeting for 10-405 Deep Learning|Deep Learning 3]].  Expressiveness of MLPs, Exploding and vanishing gradients, Modern deep learning models, Word2vec and GloVe, Recursive ANNs, Convolutional ANNs, Architectures using RNNs
 
* Wed Mar 28, 2018 [[Class meeting for 10-405 Computing with GPUs|Computing with GPUs]].  Introduction to GPUs, CUDA
 +
* Fri Mar 30, 2018
 +
** '''Start work on''' Assignment 5: Autodiff with IPM part 2/2
 
* Mon Apr 2, 2018 [[Class meeting for 10-405 Randomized Algorithms|Randomized Algorithms 1]].  Bloom filters, The countmin sketch, CM Sketches in Deep Learning
 +
** '''HW 4 is due'''
 
* Wed Apr 4, 2018 [[Class meeting for 10-405 Randomized Algorithms|Randomized Algorithms 2]].  Review of Bloom filters, Locality sensitive hashing, Online LSH
** '''Start work on''' Assignment 5: Autodiff with IPM part 2/2
 
 
* Mon Apr 9, 2018 [[Class meeting for 10-405 Graph Architectures for ML|Graph Architectures for ML]].  Graph-based ML architectures, Pregel, Signal-collect, GraphLab, PowerGraph, GraphChi, GraphX
 
* Wed Apr 11, 2018 [[Class meeting for 10-405 SSL on Graphs|SSL on Graphs]].  Semi-supervised learning intro, Multirank-walk SSL method, Harmonic fields, Modified Adsorption SSL method, MAD with countmin sketches

Revision as of 15:16, 20 March 2018

This is the syllabus for Machine Learning with Large Datasets 10-405 in Spring 2018.

Ideas for open-ended extensions to the HW assignments

This is not a complete list! You can use any of these as a starting point, but feel free to think up your own extensions.

HW2 (NB in GuineaPig):

  • The assignment proposes one particular scheme for parallelizing the training/testing algorithm. Consider another parallelization algorithm.
  • Implement a similarly scalable Rocchio algorithm and compare it with NB.
  • Reimplement the same algorithm in Spark (or some other dataflow language) and compare.
  • One of the extensions to GuineaPig not discussed in class is an in-memory map-reduce system. Design an experiment that makes constructive use of it.

HW3 (Logistic regression and SGD):

  • Evaluate the hash trick for Naive Bayes systematically on a series of datasets.
  • Implement a parameter-mixing version of logistic regression and evaluate it.
  • A recent paper proposes (roughly) using SVM with NB-transformed features. Implement this and compare.
  • The personalization method described in class is based on a transfer learning method which works similarly. Many Wikipedia pages are available in multiple languages, and words in related languages tend to be lexically similar (e.g., "astrónomo" is Spanish for "astronomer"). Suppose features were character n-grams (e.g., "astr", "stro", "tron", ...). Does domain transfer work for the task of classifying Wikipedia pages? Construct a dataset and an experiment to test this hypothesis.
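
As a starting point for the character n-gram and hash trick ideas above, here is a minimal Python sketch. It is not part of the assignment handouts: the function names (char_ngrams, hashed_features), the choice of 4-grams, the bucket count, and the use of CRC32 as the hash function are all assumptions made for illustration.

```python
import unicodedata
import zlib
from collections import Counter


def char_ngrams(word, n=4):
    """Return all contiguous character n-grams of a word (hypothetical helper)."""
    # Normalize so that accented letters such as "ó" count as one character.
    word = unicodedata.normalize("NFC", word)
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def hashed_features(text, n=4, num_buckets=2 ** 18):
    """Map a text to a sparse {bucket index: count} vector via the hash trick.

    num_buckets is an arbitrary illustrative size, not a recommended value.
    """
    counts = Counter()
    for word in text.lower().split():
        for gram in char_ngrams(word, n):
            # crc32 is stable across runs, unlike Python's built-in hash() on str.
            bucket = zlib.crc32(gram.encode("utf-8")) % num_buckets
            counts[bucket] += 1
    return counts


english = char_ngrams("astronomer")
spanish = char_ngrams("astrónomo")
print(english[:3])                          # ['astr', 'stro', 'tron']
print(sorted(set(english) & set(spanish)))  # ['astr']
```

The two words share only the 4-gram "astr", which hints at how much (or how little) lexical overlap a cross-language classifier could exploit. Varying n and the bucket count would be a natural part of any experiment.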

Notes

  • Homeworks, unless otherwise posted, will be due when the next HW comes out.
  • Lecture notes and/or slides will be (re)posted around the time of the lectures.

Schedule