Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization https://arxiv.org/abs/2011.08541