UnderstandingDeepNeuralFunctionApproximation inReinforcementLearningviaϵ-GreedyExploration
–Neural Information Processing Systems
This problem setting is motivated by the successful deep Q-networks (DQN) framework that falls in this regime.
Neural Information Processing Systems
Feb-7-2026, 21:03:30 GMT