Proximal Policy Optimization Algorithms

Preliminaries

Core Idea


References

  1. Schulman J., Wolski F., Dhariwal P., Radford A. and Klimov O. Proximal Policy Optimization Algorithms. arXiv, 2017.