Finish reading Spinning up in Deep RL

Spinning up in Deep RL

2021-12-11 19:57 I now know Trust Region Policy Optimization (paper) well enough. Next, let's look more into Proximal Policy Optimization.

2022-06-28 21:54

No longer on Do next. I'm at OpenAI and am probably OK not needing this for now.