Stars
An optimizer that trains as fast as Adam and as well as SGD.
On the Variance of the Adaptive Learning Rate and Beyond