REINFORCE Algorithm

Research Background

Core Idea

References

  1. Williams, Ronald J. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Machine Learning, 1992.