In 2014, before batch normalization was invented, training deep neural networks was hard.
For example, VGG was first trained with 11 layers; extra layers, randomly initialized, were then inserted into the pretrained network so that the deeper model could still converge.
Another example: GoogLeNet attached auxiliary classifiers ("early outputs") at intermediate layers to inject gradient into the lower layers.
Truncated backpropagation through time: only backpropagate within one chunk of the sequence (similar in spirit to mini-batch training). Downside: gradients cannot flow across chunk boundaries.
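The chunked training loop above can be sketched as follows. This is a minimal illustration, not the lecture's code: the tiny RNN (`Wxh`, `Whh`, `step`) and the chunk length are assumed for the example.

```python
import numpy as np

# Hypothetical tiny RNN; the weights and step function are illustrative only.
rng = np.random.default_rng(0)
D, H = 4, 8                               # input dim, hidden dim
Wxh = rng.standard_normal((H, D)) * 0.1
Whh = rng.standard_normal((H, H)) * 0.1

def step(h, x):
    """One vanilla RNN step: h' = tanh(Wxh x + Whh h)."""
    return np.tanh(Wxh @ x + Whh @ h)

seq = rng.standard_normal((100, D))       # one long input sequence
chunk_len = 25
h = np.zeros(H)
chunks_processed = 0

for start in range(0, len(seq), chunk_len):
    chunk = seq[start:start + chunk_len]
    # Forward through the chunk, carrying the hidden state along.
    for x in chunk:
        h = step(h, x)
    # Truncated BPTT: the backward pass would run through this chunk only;
    # the carried-in hidden state is treated as a constant (in a framework
    # such as PyTorch you would call h.detach() here before the next chunk).
    chunks_processed += 1
```

The hidden state is carried forward so the model still sees long-range context at the forward pass; only the gradient is cut at chunk boundaries, which keeps memory cost bounded by the chunk length.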
A char-RNN trained on source code learned to recite the GNU license, including the FSF's old address, "675 Mass Ave" (near Central Square in Cambridge).
The results are not perfect, though.
Soft attention -> take a weighted combination of features from all image locations.
Hard attention -> force the model to select exactly one location to look at -> trickier: not differentiable -> needs reinforcement learning (covered later in the RL lecture).
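Soft attention, the differentiable case, can be sketched in a few lines. This is a minimal example under assumed names: `features` is a grid of image features flattened to one row per location, and `query` stands in for the decoder hidden state; the dot-product scoring is one common choice, not necessarily the one from the lecture.

```python
import numpy as np

def soft_attention(features, query):
    """Soft attention: a weighted combination of all image locations.

    features: (L, D) array, one D-dim feature vector per image location.
    query:    (D,) vector, e.g. the decoder hidden state (assumed setup).
    Returns the context vector and the attention weights.
    """
    scores = features @ query                         # (L,) alignment scores
    scores = scores - scores.max()                    # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()   # softmax over locations
    context = weights @ features                      # (D,) weighted combination
    return context, weights

rng = np.random.default_rng(0)
feats = rng.standard_normal((49, 16))   # e.g. a 7x7 grid of 16-dim features
q = rng.standard_normal(16)
ctx, w = soft_attention(feats, q)
```

Because every location contributes with a nonzero weight, the whole operation is differentiable and can be trained with plain backpropagation; hard attention replaces the softmax-weighted sum with sampling a single location, which is why it needs RL-style training instead.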