Play to Grade: Testing Coding Games as Classifying Markov Decision Process

Oct-9-2024, 11:55:01 GMT–Neural Information Processing Systems

Contemporary coding education often presents students with the task of developing programs that have user interaction and complex dynamic systems, such as mouse based games. While pedagogically compelling, there are no contemporary autonomous methods for providing feedback. Notably, interactive programs are impossible to grade by traditional unit tests. Each student's program fully specifies an MDP where the agent needs to operate and decide, under reasonable generalization, if the dynamics and reward model of the input MDP should be categorized as correct or broken. We demonstrate that by designing a cooperative objective between an agent and an autoregressive model, we can use the agent to sample differential trajectories from the input MDP that allows a classifier to determine membership: Play to Grade.

classifying markov decision process, decision support system, machine learning, (7 more...)

Neural Information Processing Systems

Oct-9-2024, 11:55:01 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Decision Support Systems (0.45)
  - Artificial Intelligence > Machine Learning
    - Learning Graphical Models > Undirected Networks > Markov Models (0.45)