Chapter 4.11: ADAM and friends

In this subchapter we introduce a couple of advanced algorithms all building on SGD such as ADAM and AdaGrad.