Policy Gradient Methods for Reinforcement Learning with Function Approximation
NIPS-1999-policy-gradient-methods-for-reinforcement-learning-with-function-approximation-Paper.pdf