REINFORCE Algorithm

预备知识

核心思想

参考文献

  1. Williams Ronald J. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Machine Learning, 1992. [PDF] [Code]