Probabilistic inverse reinforcement learning in unknown environments