Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis
–Neural Information Processing Systems
One of the most effective continuous deep reinforcement learning algorithms is normalized advantage functions (NAF).
Neural Information Processing Systems
Aug-17-2025, 00:02:01 GMT
- Country:
- Asia > Russia (0.14)
- Europe > Russia (0.14)
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Technology: