2018-12-20 PPO debug experience

2018-12-20 PPO debug experience

作者: 云雨惊袭明月夜 | 来源:发表于2018-12-20 23:16 被阅读0次

2018-12-20 PPO debug experience
Android Studio Flyme 7.x Debug的坑
PPO
(GitHub+Hexo搭建个人博客系列)01、搭建本地的hex
VPG && TRPO && PPO
PPO算法解析
武汉市锦绣三江传媒有限公司
商务英语 Level 3 Unit 1 -4 Product F
深度强化学习从入门到大师：以刺猬索尼克游戏为例讲解PPO（第六部
在 Shearwater Teric OC 里，Deco PPO

PPO Debug Experience

Recently, I need to perform PPO in a complex env. I refer to some code in GitHub, however, I can't grasp their meaning...

After reading PPO paper, I decided to code by myself.

I already have some experience writing RL code. After several minutes, I finished the first version with gym-cart-pole-v0. However, that didn't work...

Then I started to check the core algorithm again and again...It's very sad, the code still did not work.

So I suspect whether the agent's interacting with env is right or not...
Then I started to debug the interaction between agent and env.
Luckily, I found that the reward(or Gt/advantage) went wrong. So I refer to some papers about advantage such as GAE, TRPO and so on...

Then I changed the way reward is calculated. The code work.
You can click here to ref my code.

相关文章

2018-12-20 PPO debug experience
PPO Debug Experience Recently, I need to perform PPO in a...
Android Studio Flyme 7.x Debug的坑
Debug设备以前我debug的机器是刷了pixel experience的渣又卡渣2(ZUK Z2)(andr...
PPO
On-policy VS Off-policy On-policy: The agent learned and ...
(GitHub+Hexo搭建个人博客系列)01、搭建本地的hex
文丨liyuhong2019发布时间:2018-12-20 (周四广州/晴)最后更新时间:2018-12-20 ...
VPG && TRPO && PPO
PPO（Proximal Policy Optimization）是一种解决 PG 算法中学习率不好确定的问题的...
PPO算法解析
在2017年的时候，无论是openai或者是deepmind，在深度强化学习领域都取得了重大突破，而能带来这个突破...
武汉市锦绣三江传媒有限公司
2018-12-20简书作者王军 2018-12-20☞22:24☞打开App 王军日精进打卡第25天】【知～学...
商务英语 Level 3 Unit 1 -4 Product F
user experience用户体验 User experience describes how a user ...
深度强化学习从入门到大师：以刺猬索尼克游戏为例讲解PPO（第六部
本文为 AI 研习社编译的技术博客，原标题： Proximal Policy Optimization (PPO...
在 Shearwater Teric OC 里，Deco PPO
在 Shearwater Teric OC 里，Deco PPO2 limit = 1.61 ata是怎么得到的？...

网友评论

本文标题：2018-12-20 PPO debug experience

本文链接：https://www.haomeiwen.com/subject/xecpkqtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

关于我们|服务条款|联系我们|2018-12-20 PPO debug experience|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！