2021-12-11 19:57 I now know Trust Region Policy Optimization (paper) well enough. Next, let's look more into Proximal Policy Optimization.
2022-06-28 21:54
No longer on Do next. I'm at OpenAI, so I'm probably OK not needing this for now.