QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning https://arxiv.org/abs/1803.11485.