美文网首页
Lecture 14 | Deep Reinforcement

Lecture 14 | Deep Reinforcement

作者: Ysgc | 来源:发表于2019-11-04 09:49 被阅读0次
    value iteration

    https://math.stackexchange.com/questions/2639577/why-is-the-gradient-of-this-expectation-intractable

    turn a integration in high dim to a expectation problem???

    computational efficiency -> low resolution to high resolution

    this hard attention -> a lot applications!!! -> improve efficiency

    but still need RNN -> may be slow

    efficiency depends on the case

    high resolution input -> fast by this method

    Q learning may be harder to tune

    相关文章

      网友评论

          本文标题:Lecture 14 | Deep Reinforcement

          本文链接:https://www.haomeiwen.com/subject/cplrbctx.html