keras实现REINFORCE算法强化学习

上传:long_biti 浏览: 79 推荐: 0 文件: 大小:6.48MB 上传时间:2018-12-28 23:54:09 版权申诉
keras实现REINFORCE算法强化学习: # Policy Gradient Minimal implementation of Stochastic Policy Gradient Algorithm in Keras ## Pong Agent ![pg](./assets/pg.gif) This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.
上传资源
用户评论