44.
The convergence disadvantage RNNs have over the all-seeing Sauron eye is actually a feature. It is a cost you pay…
The convergence disadvantage RNNs have over the all-seeing Sauron eye is actually a feature. It is a cost you pay at training time to save big on inference. Learning to compress is almost always worth it.