Lecture 3 Part A: Follow up

Author: Ysgc | Published: 2020-01-20 09:19


1. When DAgger works fully: $p_{train}(s) = p_{\theta}(s)$

Best case: $E[\sum_t c(s_t, a_t)] \le \epsilon T$
(Once the agent moves into the red region it can move back, because DAgger has collected expert labels for the states the policy actually visits.)

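2. Distribution mismatch: $p_{train}(s) \ne p_{\theta}(s)$

Probability of no mistake in the first $t$ steps: $(1-\epsilon)^t$
Probability of at least one mistake: the remaining $1-(1-\epsilon)^t$, in which case the state distribution is $p_{mistake}(s_t)$
Total variation divergence: $\sum_{s_t} |p_{\theta}(s_t) - p_{train}(s_t)|$, measured between $p_{train}(s)$ and $p_{\theta}(s)$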
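
The two cases above can be written as a mixture, which makes the divergence easy to bound. This is a sketch of the standard argument, assuming the policy errs with probability at most $\epsilon$ on states drawn from $p_{train}$:

$$p_{\theta}(s_t) = (1-\epsilon)^t \, p_{train}(s_t) + \left(1-(1-\epsilon)^t\right) p_{mistake}(s_t)$$

$$\sum_{s_t} \left| p_{\theta}(s_t) - p_{train}(s_t) \right| = \left(1-(1-\epsilon)^t\right) \sum_{s_t} \left| p_{mistake}(s_t) - p_{train}(s_t) \right| \le 2\left(1-(1-\epsilon)^t\right) \le 2 \epsilon t$$

The last step uses $(1-\epsilon)^t \ge 1 - \epsilon t$ for $\epsilon \in [0, 1]$.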

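What is the worst case of this divergence?

A factor of 2: when $p_{mistake}$ and $p_{train}$ have disjoint support, $\sum_{s_t} |p_{mistake}(s_t) - p_{train}(s_t)| = 2$.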
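
A small numeric check of where the 2 comes from. The state names and probabilities below are made up for illustration; the only point is that the sum of absolute differences between two distributions reaches 2 exactly when they share no states.

```python
def l1_distance(p, q):
    """Sum of absolute differences between two distributions given as dicts."""
    states = set(p) | set(q)
    return sum(abs(p.get(s, 0.0) - q.get(s, 0.0)) for s in states)


# Hypothetical distributions: the training data never visits "s2",
# and after a mistake the agent is only ever in "s2".
p_train = {"s0": 0.5, "s1": 0.5}
p_mistake = {"s2": 1.0}

print(l1_distance(p_train, p_mistake))  # disjoint support: worst case
print(l1_distance(p_train, p_train))    # identical distributions
```

With disjoint support every unit of probability mass is counted twice, once for each distribution, which is why the bound carries a factor of 2 rather than 1.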

$c_{max} = 1$ (the cost is 1 for a mistake and 0 otherwise)
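
To see how the two cases compare, the sketch below sums the per-step bound $\epsilon + 2(1-(1-\epsilon)^t)\,c_{max}$ over $t = 1, \dots, T$ and sets it against the linear $\epsilon T$ bound from case 1. The function names and the values of `eps` and `T` are illustrative assumptions, not part of the lecture.

```python
def dagger_cost_bound(eps, T):
    """Case 1: p_train = p_theta, so each step costs at most eps."""
    return eps * T


def mismatch_cost_bound(eps, T, c_max=1.0):
    """Case 2: per-step cost under p_train plus the divergence penalty."""
    total = 0.0
    for t in range(1, T + 1):
        # Bound on sum_s |p_theta(s_t) - p_train(s_t)| at step t.
        divergence = 2.0 * (1.0 - (1.0 - eps) ** t)
        total += eps + divergence * c_max
    return total


eps, T = 0.01, 100  # hypothetical error rate and horizon
print(dagger_cost_bound(eps, T))
print(mismatch_cost_bound(eps, T))
```

Because $1-(1-\epsilon)^t \le \epsilon t$, the second bound is at most $\epsilon T + \epsilon T (T+1)$, so it grows quadratically in $T$ while the first grows linearly.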

thoughts:

