State Aware Imitation Learning

Yannick Schroecker, Charles L. Isbell

Neural Information Processing Systems 

Formally, we define the problem domain as a Markov decision process, i.e. by its states, actions and unknown Markovian transition probabilities

Similar Docs  Excel Report  more

TitleSimilaritySource
None found