Details
-
New Feature
-
Status: To Do
-
Major
-
Resolution: Unresolved
-
None
Description
Implementation of RAdam optimizer based on the new paperĀ [ON THE VARIANCE OF THE ADAPTIVE LEARNING RATE AND BEYOND|https://arxiv.org/pdf/1908.03265v1.pdf]