https://papers.nips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf
Policy Gradient Methods for Reinforcement Learning with Function Approximation