美文网首页
Recitation 3 | Deep Learning Opt

Recitation 3 | Deep Learning Opt

作者: Ysgc | 来源:发表于2019-10-25 09:58 被阅读0次

grad in y axis decreasing

LR is the same for different param

to refine this process, Adagrad is introduced here

sparse data -> only a few params are frequently updated
automatically decaying LR -> pro or con?

RMSprop = adadelta



Batch norm


相关文章

网友评论

      本文标题:Recitation 3 | Deep Learning Opt

      本文链接:https://www.haomeiwen.com/subject/tbajvctx.html