Агенты ИИ (AI Agents) | AGI_and_RL
Policy Optimization for Markov Games: Unified Framework and Faster Convergence
https://arxiv.org/abs/2206.02640