EMERGENT COORDINATION THROUGH CO

Author: 朱小虎XiaohuZhu | Published: 2019-02-22 15:10 | Views: 27

EMERGENT COORDINATION THROUGH COMPETITION

Authors: Siqi Liu, Guy Lever, Josh Merel, Saran Tunyasuvunakool, Nicolas Heess, Thore Graepel
Affiliation: DeepMind

ABSTRACT

We study the emergence of cooperative behaviors in reinforcement learning agents by introducing a challenging competitive multi-agent soccer environment with continuous simulated physics. We demonstrate that decentralized, population-based training with co-play can lead to a progression in agents’ behaviors: from random, to simple ball chasing, and finally showing evidence of cooperation. Our study highlights several of the challenges encountered in large-scale multi-agent training in continuous control. In particular, we demonstrate that the automatic optimization of simple shaping rewards, not themselves conducive to cooperative behavior, can lead to long-horizon team behavior. We further apply an evaluation scheme, grounded by game-theoretic principles, that can assess agent performance in the absence of pre-defined evaluation tasks or human baselines.

Link to this article: https://www.haomeiwen.com/subject/xcseyqtx.html
