Discussion of the paper "Adam: A Method for Stochastic Optimization" by D. P. Kingma and J. Ba