main

Zhuoran Yang

Neural Information Processing Systems 

The classical theory of reinforcement learning (RL) has focused on tabular and linear representations of value functions.