Contextual Bilevel Reinforcement Learning for Incentive Alignment