Asynchronous Methods for Deep Reinforcement Learning

Paper introducing the A3C (Asynchronous Advantage Actor-Critic) method; A2C is the synchronous variant that was popularized later

Evaluated on Atari 2600 games (Arcade Learning Environment), among other domains
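
Sketch of the advantage computation behind the actor-critic method named above: each worker collects a short rollout, computes n-step discounted returns backward from a bootstrap value, and uses return minus value estimate as the advantage. This is a minimal illustration, not the paper's code. The function name `n_step_advantages`, its argument names, and the default `gamma` are assumptions made for this note.

```python
def n_step_advantages(rewards, values, bootstrap_value, gamma=0.99):
    """Compute n-step returns and advantages for one rollout segment.

    rewards[i] and values[i] are the reward and critic estimate V(s_i)
    at step i. bootstrap_value is the critic estimate for the state
    after the last step, or 0.0 if the episode ended there.
    (All names here are hypothetical, chosen for illustration.)
    """
    returns = []
    running_return = bootstrap_value
    # Walk the rollout backward: R <- r_i + gamma * R
    for reward in reversed(rewards):
        running_return = reward + gamma * running_return
        returns.append(running_return)
    returns.reverse()
    # Advantage = n-step return minus the critic's value estimate
    advantages = [ret - val for ret, val in zip(returns, values)]
    return returns, advantages


if __name__ == "__main__":
    rets, advs = n_step_advantages(
        rewards=[1.0, 0.0, 2.0],
        values=[0.5, 0.5, 0.5],
        bootstrap_value=0.0,  # episode terminated after the last step
        gamma=0.5,
    )
    print(rets, advs)
```

The advantages would weight the policy-gradient term, and the returns would serve as regression targets for the critic. In the asynchronous setting each worker would compute these on its own rollout before applying gradients to the shared parameters.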