10-601 Clustering

From Cohen Courses
Revision as of 20:25, 25 October 2014 by Bbd (talk | contribs)
Jump to navigationJump to search

This a pair of lectures used in the Syllabus for Machine Learning 10-601 in Fall 2014.

There will be a guest lecturer, Bhavana Dalvi, lecturing on k-means on Monday and Thursday, and Ziv will lecture on agglomerative clustering and spectral clustering on Tuesday and Wednesday. (So usually, the lectures will be in different orders to the two sections.)

Slides

 (Only the slides on advanced topics vary across lectures)

Readings

Bishop's Chapter 9

Mitchell 6.12 also has a nice description of EM and k-means.

What You Should Know Afterward

You should know how to implement these methods, and what their relative advantages and disadvantages are.

  • Overview of clustering
  • Distance functions and similarity measures and their impact
  • K-means algorithms
  • How to chose k and what is the impact of large and small k's
  • EM
  • Differences between GM and K-means