10-601 Deep Learning 2

From Cohen Courses
Revision as of 10:17, 13 April 2016 by Wcohen (talk | contribs) (→‎Slides)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This a lecture used in the Syllabus for Machine Learning 10-601B in Spring 2016

Slides

Wrapup from next lecture:

Readings

This area is moving very fast and the textbooks are not up-to-date. Some recommended readings:

I also used some on-line visualizations in the materials for the lecture, especially the part on ConvNets.

The LSTM figures and examples I used are mostly from

Things to remember

  • How backprop can be generalized to a sequence of assignment operations
  • Convolutional networks
    • 2-d convolution
    • How to construct a convolution layer
    • Architecture of CNN: convolution/downsampling pairs
  • Recurrent neural networks
    • When they are useful
    • Why they are hard to train (if trained naively)
    • The basic ideas used in an LSTM: forget, insert, and output gates