ТГХаб
Каналы
Агенты ИИ | AGI_and_RL
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
https://arxiv.org/abs/2106.04895