Inverse Reward Design

Inverse Reward Design

作者: 朱小虎XiaohuZhu | 来源:发表于2017-11-10 20:12 被阅读78次

Inverse Reward Design
UC Berkeley Machine Learning 189
Spring闲谈
Inverse RL
CS294 Lecture 6-Actor Critic
Challenge and Reward
oracel
亚瑟·克拉克~刘慈欣
阿里巴巴强化学习rank
Staking Reward Maximization Stra

Dylan Hadfield-Menell Smitha Milli Pieter Abbeel∗ Stuart Russell Anca Dragan
Department of Electrical Engineering and Computer Science
University of California, Berkeley
Berkeley, CA 94709
{dhm, smilli, pabbeel, russell, anca}@cs.berkeley.edu
Abstract
Autonomous agents optimize the reward function we give them. What they don’t
know is how hard it is for us to design a reward function that actually captures
what we want. When designing the reward, we might think of some specific
training scenarios, and make sure that the reward will lead to the right behavior
in those scenarios. Inevitably, agents encounter new scenarios (e.g., new types of
terrain) where optimizing that same reward may lead to undesired behavior. Our
insight is that reward functions are merely observations about what the designer
actually wants, and that they should be interpreted in the context in which they were
designed. We introduce inverse reward design (IRD) as the problem of inferring the
true objective based on the designed reward and the training MDP. We introduce
approximate methods for solving IRD problems, and use their solution to plan
risk-averse behavior in test MDPs. Empirical results suggest that this approach can
help alleviate negative side effects of misspecified reward functions and mitigate
reward hacking.

相关文章

Inverse Reward Design
Dylan Hadfield-Menell Smitha Milli Pieter Abbeel∗ Stuart ...
UC Berkeley Machine Learning 189
---- Sept.2: -Inverse Covariance Matrix Inverse Covarianc...
Spring闲谈
依赖注入(Inverse of Control) Spring 实现IoC(Inverse of Control)...
Inverse RL
CS294 Lecture 6-Actor Critic
从 "reward to go" 到 Actor Critic 回顾一下REINFORCE算法其中reward t...
Challenge and Reward
挑战与奖赏理解提高思维所必需的步骤是一回事，有效地使用它们是另一回事。后者的任务是一个艰巨的挑战，将需要持续的努...
oracel
统计百分比： select sum(t1.reward_money) as reward_money, t...
亚瑟·克拉克~刘慈欣
This award is a reward for imagination. Imagination is a ...
阿里巴巴强化学习rank
期望的成交价作为reward 折扣因子为1 如果用户购买了物品，得到reward。状态变为terminal sta...
Staking Reward Maximization Stra
The introduction of reward maximization strategy The majo...

网友评论

本文标题：Inverse Reward Design

本文链接：https://www.haomeiwen.com/subject/ygjimxtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

栏目导航

热点阅读

关于我们|服务条款|联系我们|Inverse Reward Design|投稿指南|网站地图|RSS订阅|排版工具|手机版

提供经典美文摘抄,优美散文欣赏,现代诗歌精选,短篇小说,心情随笔,表白情书范文,故事会在线阅读欣赏

Copyright © 2014-2023 Haomeiwen.com All Rights Reserved. 好美文阅读网版权所有

备案信息：桂公网安备 45052102000051号 · 桂ICP备13007215号-3

本站所收录作品、热点评论等信息部分来源互联网，目的只是为了系统归纳学习和传递资讯

所有作品版权归原创作者所有，与本站立场无关，如不慎侵犯了你的权益，请联系我们告知，我们将做删除处理！