gym step报错：IndexError: invalid i

作者: fujindemi | 来源:发表于2019-04-17 10:54 被阅读0次

IndexError: invalid index to scalar variable.

在网上很难找到gym较好的参考，很多api的使用建议直接去看gym代码'''\\

def act(self, state):
        if np.random.rand() <= self.epsilon:
            return random.randrange(self.action_size)
        act_values = self.model.predict(state)
        return np.argmax(act_values[0])  # returns action

返回值是一个标量

action = agent.act(state)
next_state, reward, done, _ = env.step(action)

智能体调用act返回一个动作，环境根据动作返回新的状态，
但由于step函数中

newthdot = np.clip(newthdot, -self.max_speed, self.max_speed) #pylint: disable=E1111

使用的是np.clip，需要传入一个np.ndarray
参考spaces.box函数中

            self.low = np.full(self.shape, low)
            self.high = np.full(self.shape, high)

这里直接套用就行，

    def act(self, state):
        if np.random.rand() <= self.epsilon:
            return np.full((1,),random.randrange(self.action_size))
        act_values = self.model.predict(state)
        return np.full((1,),np.argmax(act_values[0]))  # returns action

根据web安全之强化学习和gan书中qlearn改。

网友评论

本文标题：gym step报错：IndexError: invalid i

本文链接：https://www.haomeiwen.com/subject/xvegwqtx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

gym step报错：IndexError: invalid i

相关文章

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读