RL L1
Author: NoneLand | Published: 2017-01-09 12:03
- Markov decision process
- Bellman equation
- value iteration

3 Ways of Learning

Markov Decision Process

On Rewards

Two way is Infinite

Discount Factor

Policies

Finding Policies

Finding Policies Quiz

Finding Policies Again

V Function & Q Function

C Function

Relation of Bellman Equations (Q Func is Cool!)

What We've Learned
Article link: https://www.haomeiwen.com/subject/wrthbttx.html