ТГХаб

Каналы

Агенты ИИ | AGI_and_RL

[2004.00530] Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

https://arxiv.org/abs/2004.00530