Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization