keras实现REINFORCE算法强化学习

上传：long_biti 浏览： 79 推荐： 0 文件：大小：6.48MB 上传时间：2018-12-28 23:54:09 版权申诉

keras实现REINFORCE算法强化学习： # Policy Gradient Minimal implementation of Stochastic Policy Gradient Algorithm in Keras ## Pong Agent ![pg](./assets/pg.gif) This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.