RUDDER: Return Decomposition for Delayed Rewards

Oct-2-2025, 05:30:42 GMT–Neural Information Processing Systems

reinforcement learning; delayed reward; reward redistribution; return decomposition; bias-variance; credit assignment; LSTM

artificial intelligence, machine learning, reward redistribution, (16 more...)

Neural Information Processing Systems

Oct-2-2025, 05:30:42 GMT

Conferences PDF

Industry:
- Leisure & Entertainment > Games (0.35)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Duplicate Docs Excel Report

Title
RUDDER: Return Decomposition for Delayed Rewards

Similar Docs Excel Report more

Title	Similarity	Source
None found