IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
1802.01561.pdf