Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits
–Neural Information Processing Systems
Prior work that either ignores potential shifts in the context, or considers them jointly, can lead to performance that is too conservative, especially under certain forms of reward feedback.
Neural Information Processing Systems
Aug-14-2025, 05:48:03 GMT
- Country:
- North America > United States
- Massachusetts (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Genre:
- Research Report > Experimental Study (0.68)
- Industry:
- Health & Medicine (1.00)
- Government > Voting & Elections (0.46)
- Technology: