Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs
–Neural Information Processing Systems
Policy gradient methods can solve complex tasks but often fail when the dimensionality of the action-space or objective multiplicity grow very large.
Neural Information Processing Systems
Nov-13-2025, 17:31:17 GMT
- Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Industry:
- Banking & Finance (0.68)
- Technology: