Difference between revisions of "Class meeting for 10-605 Deep Learning"

Latest revision as of 13:38, 31 October 2017

More general neural networks:
- Neural Networks and Deep Learning An online book by Michael Nielsen, pitched at an appropriate level for 10-601, which has a bunch of exercises and on-line sample programs in Python.
- For much much more detail, look at the MIT Press book (in preparation) from Bengio - it's very complete but also fairly technical.

The underlying reasons deep networks are hard to train
Exploding/vanishing gradients
Saturation
The importance of key recent advances in neural networks:
Matrix operations and GPU training
ReLU, cross-entropy, softmax
How backprop can be generalized to a sequence of assignment operations (autodiff)
- Wengert lists
- How to evaluate and differentiate a Wengert list
Common architectures
- Multi-layer perceptron
- Recursive NNs (RNNS) and Long/short term memory networks (LSTMs)
- Convolutional Networks (CNNs)

@@ Line 5: / Line 5: @@
 * Lecture 1: [http://www.cs.cmu.edu/~wcohen/10-605/deep-1.pptx Powerpoint], [http://www.cs.cmu.edu/~wcohen/10-605/deep-1.pdf PDF].
-* Lecture 2: [http://www.cs.cmu.edu/~wcohen/10-605/2016/deep-2.pptx Powerpoint], [http://www.cs.cmu.edu/~wcohen/10-605/2016/deep-2.pdf PDF].
+* Lecture 2: [http://www.cs.cmu.edu/~wcohen/10-605/deep-2.pptx Powerpoint], [http://www.cs.cmu.edu/~wcohen/10-605/deep-2.pdf PDF].
+* Lecture 3: [http://www.cs.cmu.edu/~wcohen/10-605/deep-3.pptx Powerpoint], [http://www.cs.cmu.edu/~wcohen/10-605/deep-3.pdf PDF].
 === Quizzes ===
@@ Line 11: / Line 13: @@
 * [https://qna.cs.cmu.edu/#/pages/view/75 Quiz for lecture 1]
 * [https://qna.cs.cmu.edu/#/pages/view/79 Quiz for lecture 2]
+* [https://qna.cs.cmu.edu/#/pages/view/212 Quiz for lecture 3]
 === Sample code ===