Play to Grade: Testing Coding Games as Classifying Markov Decision Process