ТГХаб
Каналы
Агенты ИИ | AGI_and_RL
[2004.00530] Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations
https://arxiv.org/abs/2004.00530