Difference between revisions of "Class meeting for 10-605 Deep Learning"

Revision as of 13:54, 23 October 2017

More general neural networks:
- Neural Networks and Deep Learning: an online book by Michael Nielsen, pitched at an appropriate level for 10-601, with a number of exercises and online sample programs in Python.
- For much more detail, see the MIT Press book (in preparation) from Bengio. It is very complete but also fairly technical.

The underlying reasons deep networks are hard to train
- Exploding/vanishing gradients
- Saturation
The importance of key recent advances in neural networks:
- Matrix operations and GPU training
- ReLU, cross-entropy, softmax
How backprop can be generalized to a sequence of assignment operations (autodiff)
- Wengert lists
- How to evaluate and differentiate a Wengert list
Common architectures
- Multi-layer perceptron
- Recurrent NNs (RNNs) and long short-term memory networks (LSTMs)
- Convolutional Networks (CNNs)
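
The Wengert-list item above can be illustrated with a small sketch. This is not the code used in the lecture; the list format, the names `WENGERT_LIST`, `OPS`, `GRADS`, `evaluate` and `differentiate`, and the example function f(x1, x2) = sigmoid(x1 * x2 + x1) are all assumptions made for illustration. The idea is that a function is written as a sequence of assignments, evaluated front to back, and differentiated back to front with the chain rule.

```python
import math

# Hypothetical Wengert list: each entry is (target, operation, argument names).
# It encodes f(x1, x2) = sigmoid(x1 * x2 + x1) as a sequence of assignments.
WENGERT_LIST = [
    ("z1", "mul", ("x1", "x2")),
    ("z2", "add", ("z1", "x1")),
    ("f", "sigmoid", ("z2",)),
]

# Forward definition of each primitive operation.
OPS = {
    "add": lambda a, b: a + b,
    "mul": lambda a, b: a * b,
    "sigmoid": lambda a: 1.0 / (1.0 + math.exp(-a)),
}

# Partial derivatives of each operation's output with respect to each argument,
# given the output value followed by the argument values.
GRADS = {
    "add": lambda out, a, b: (1.0, 1.0),
    "mul": lambda out, a, b: (b, a),
    "sigmoid": lambda out, a: (out * (1.0 - out),),
}


def evaluate(wengert_list, inputs):
    """Run the assignments in order and return every variable's value."""
    values = dict(inputs)
    for target, op, args in wengert_list:
        values[target] = OPS[op](*(values[a] for a in args))
    return values


def differentiate(wengert_list, values, output):
    """Walk the list in reverse, accumulating d(output)/d(variable)."""
    deltas = {name: 0.0 for name in values}
    deltas[output] = 1.0
    for target, op, args in reversed(wengert_list):
        partials = GRADS[op](values[target], *(values[a] for a in args))
        for arg, partial in zip(args, partials):
            # Chain rule: a variable used in several places sums its contributions.
            deltas[arg] += deltas[target] * partial
    return deltas


values = evaluate(WENGERT_LIST, {"x1": 2.0, "x2": 3.0})
deltas = differentiate(WENGERT_LIST, values, "f")
print(values["f"], deltas["x1"], deltas["x2"])
```

Note that `x1` appears in two assignments, so its gradient is the sum of two chain-rule terms; the `+=` in `differentiate` handles this case.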

Revision as of 16:45, 1 August 2017 by Wcohen (→ Slides)		Revision as of 13:54, 23 October 2017 by Wcohen
Line 1:		Line 1:
−	This is one of the class meetings on the [[Syllabus for Machine Learning with Large Datasets 10-605 in Fall ~~2016~~\|schedule]] for the course [[Machine Learning with Large Datasets 10-605 in ~~Fall_2016~~]].	+	This is one of the class meetings on the [[Syllabus for Machine Learning with Large Datasets 10-605 in Fall 2017\|schedule]] for the course [[Machine Learning with Large Datasets 10-605 in Fall_2017]].

	=== Slides ===		=== Slides ===