美文网首页
Lecture 14 | Deep Reinforcement

Lecture 14 | Deep Reinforcement

作者: Ysgc | 来源:发表于2019-11-04 09:49 被阅读0次
value iteration

https://math.stackexchange.com/questions/2639577/why-is-the-gradient-of-this-expectation-intractable

turn a integration in high dim to a expectation problem???

computational efficiency -> low resolution to high resolution

this hard attention -> a lot applications!!! -> improve efficiency

but still need RNN -> may be slow

efficiency depends on the case

high resolution input -> fast by this method

Q learning may be harder to tune

相关文章

网友评论

      本文标题:Lecture 14 | Deep Reinforcement

      本文链接:https://www.haomeiwen.com/subject/cplrbctx.html