Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Neural Information Processing Systems 

Despite these successes, applying RL techniques to complex control problems remains a daunting undertaking, where initial attempts often result in underwhelming performance.