Difference between revisions of "10-601 Logistic Regression"

From Cohen Courses
Jump to navigationJump to search
Line 1: Line 1:
This a lecture used in the [[Syllabus for Machine Learning 10-601 in Fall 2014]]
+
This a lecture used in the [[Syllabus for Machine Learning 10-601B in Spring 2016]]
  
 
=== Slides ===
 
=== Slides ===
* Ziv's lecture: [http://www.cs.cmu.edu/~zivbj/classF14/LR.pdf Slides in pdf].
+
* William's lecture: [http://www.cs.cmu.edu/~wcohen/10-601/logreg.pptx in Powerpoint]
* William's lecture: [http://www.cs.cmu.edu/~wcohen/10-601/logreg.pptx in Powerpoint],  [http://www.cs.cmu.edu/~wcohen/10-601/logreg.pdf in PDF].
 
 
 
 
=== Readings ===
 
=== Readings ===
  
 
* Optional:
 
* Optional:
** Bishop 4.2-4.3
+
** Murphy 8.1-3, 8.6
 
** [http://www.cs.cmu.edu/~wcohen/10-605/notes/sgd-notes.pdf William's notes on SGD (for 10605)]
 
** [http://www.cs.cmu.edu/~wcohen/10-605/notes/sgd-notes.pdf William's notes on SGD (for 10605)]
 
** [http://cseweb.ucsd.edu/~elkan/250B/logreg.pdf Charles Elkan's notes on SGD]
 
** [http://cseweb.ucsd.edu/~elkan/250B/logreg.pdf Charles Elkan's notes on SGD]
** [http://lingpipe.files.wordpress.com/2008/04/lazysgdregression.pdf Lazy Sparse Stochastic Gradient Descent for Regularized Multinomial Logistic Regression], Carpenter, Bob. 2008. See also [http://alias-i.com/lingpipe/demos/tutorial/logistic-regression/read-me.html his blog post] on logistic regression.
 
  
 
=== What You Should Know Afterward ===
 
=== What You Should Know Afterward ===

Revision as of 15:14, 6 January 2016

This a lecture used in the Syllabus for Machine Learning 10-601B in Spring 2016

Slides

Readings

What You Should Know Afterward

  • How to implement logistic regression.
  • How to determine the best parameters for logistic regression models
  • Why regularization matters for logistic regression.
  • How logistic regression and naive Bayes are similar and different.
  • The difference between a discriminative and a generative classifier.
  • What "overfitting" is, and why optimizing performance on a training set does not necessarily lead to good performance on a test set.