Multiagent cooperation and compe

Multiagent cooperation and compe

作者: 空空格格 | 来源:发表于2018-05-10 22:12 被阅读0次

Multiagent cooperation and compe
Vans Custom: Everyone Can Be An
cooperation
Self-Attention Meta-Learner for
TRANSLATE
Cultivate customers loyalty
MARL 笔记
3.25的记录
2021-01-31 Unhealthy Competition
Buddha Supply Agent-Sales

论文复现 :

tensorflow_2player_pong

论文详述

Multiagent cooperation and competition with deep reinforcement learning

pong game-two agents

基础模型：pong game, two agents
算法结构：dqn
- reward：scoring:(-1,1) conceding(-1)
  未击中球得-1，击中球得分between (-1,1)
  双方均击中球得分0，游戏继续

reward

训练参数
- 50 epochs, 250000 time steps each.
- exploration rate: 1.0 to 0.05(in the 1000000 time steps) and stays fixed at that value

parameters.png

结果分析
- 是否收敛:monitor average maximal Q-values of 500 randomly selected game situations, set aside before training begins
  
  Q values
- 训练效果反馈:
  - Average paddle-bounces per point 在一方得分前球在players间来回的次数
  - Average wall-bounces per paddle-bounce 球在到达一方前撞墙的次数
  - Average serving time per point 球丢了以后players restart game的反应时间(一些rewarding scheme下players不希望重启游戏，serving time很长，如p = -1)

结果分析

scoring = -1时，双方为合作状态（均不希望球掉落）
最终双方均升至页面最上方，球水平传来传去
合作模式video-youtube
1.png
scoring = 1时，双方为竞争模式(希望自己多得分)
竞争模式video-youtube
2.png
p range from -1 to 1

3.png

multiplayer dqn vs single-player
(score表示a胜b的得分)

4

本文遵守知识共享协议：署名-非商业性使用-相同方式共享 (BY-NC-SA)及简书协议
转载请注明：作者空空格格，首发简书 Jianshu.com

相关文章

Multiagent cooperation and compe
论文复现 : tensorflow_2player_pong 论文详述 Multiagent cooperatio...
Vans Custom: Everyone Can Be An
In the fashion world, the design is one of the most compe...
cooperation
Today is Saturday, December 19, 2020 I get up early today...
Self-Attention Meta-Learner for
被aamas2021(Autonomous Agents and Multiagent Systems) CCF ...
TRANSLATE
翻译 Fluence in a foreign language can translate into compe...
Cultivate customers loyalty
Attract new customers and maintain existing clients compe...
MARL 笔记
16年的MARL概览: A comprehensive survey of multiagent reinforc...
3.25的记录
So, thinking of my two math homework, Python class, compe...
2021-01-31 Unhealthy Competition
Cooperation is the glue that keeps society together, but ...
Buddha Supply Agent-Sales
The agency cooperation platform is a platform for profess...

网友评论

强化学习

本文标题：Multiagent cooperation and compe

本文链接：https://www.haomeiwen.com/subject/hdbtdftx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

强化学习

Multiagent cooperation and compe

关于我们|服务条款|联系我们|Multiagent cooperation and compe|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！