Some classic: Proximal Policy Optimization Algorithms

https://arxiv.org/pdf/1707.06347.pdf