Learning Efficiently Function Approximation for Contextual MDP