June 10, 2025Let’s Verify Step by Step来自 OpenAI 的 process supervisionNoteReward ModelProcess SupervisionOpenAIEmpiricalICLR2024