"I initially got backprop wrong both times, comparison with numerical differentiation was critical! It is interesting that things still train even when various parts are pretty wrong — as long as the sign is right most of the time, progress is often made."
That is the bane of writing probabilistic code. Errors show up not as clear-cut wrong values or crashes but as subtle biases. You are always wondering, even when it is kinda working: is it REALLY working, or did I miss a crucial variable initialization somewhere?
There might be something deeper there. I am thinking of the line of research, associated with Bengio, on biologically plausible backprop: it turns out you can send the error signal back through fixed random weights and the network still learns. Which matters because it's not very plausible that the brain computes exact derivatives and routes them to each neuron individually, but it could more easily send a coarser error signal backwards.
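A minimal sketch of that idea, usually called feedback alignment, where the backward pass pushes the error through a fixed random matrix instead of the transposed forward weights (the toy data, network sizes, and hyperparameters here are invented for illustration):

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy regression data: a linear target the network should be able to fit
    X = rng.standard_normal((256, 10))
    Y = X @ rng.standard_normal((10, 1))

    # Two-layer tanh network
    W1 = rng.standard_normal((10, 32)) * 0.1
    W2 = rng.standard_normal((32, 1)) * 0.1
    B  = rng.standard_normal((32, 1)) * 0.1  # fixed random feedback weights

    lr = 0.01
    for step in range(2000):
        # Forward pass
        h = np.tanh(X @ W1)
        y_hat = h @ W2
        err = y_hat - Y  # gradient of squared error w.r.t. y_hat (up to a constant)

        # Backward pass: the error goes back through the fixed random B, not W2.T
        dW2 = h.T @ err
        dh  = err @ B.T                      # feedback-alignment step
        dW1 = X.T @ (dh * (1 - h**2))        # tanh'(a) = 1 - tanh(a)**2

        W1 -= lr * dW1 / len(X)
        W2 -= lr * dW2 / len(X)

    print("final MSE:", float(np.mean(err**2)))  # should still drop despite the random feedback

The "as long as the sign is right most of the time" observation from the quote is roughly why this can work: the forward weights tend to align with the random feedback matrix over training, so the pseudo-gradient points in a useful direction often enough.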
It's actually not very different from graphics programming, where a simple rounding error can cause all kinds of trouble, from very small (a surface or ray not reflecting exactly where it should) to very big (completely messing up your entire render).
Both activities very much resemble chaotic systems, and both are very challenging to debug.