Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis

Neural Information Processing Systems 

One of the most effective continuous deep reinforcement learning algorithms is normalized advantage functions (NAF).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found