Reinforcement Learning with Feedback Graphs

Neural Information Processing Systems 

We study RL in the tabular MDP setting where the agent receives additional observations per step in the form of transitions samples.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found