Learning Reward Machines for Partially Observable Reinforcement Learning