Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
–Neural Information Processing Systems
However, Asynchronous VI still requires a maximization over the entire action space, making it impractical for domains with large action space.
Neural Information Processing Systems
Oct-3-2025, 00:22:11 GMT