High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization
–Neural Information Processing Systems
Here we consider contextual policies that are a map from a discrete context to a set of continuous parameters . For example, video streaming and real-time conferencing systems use adaptive bitrate (ABR) algorithms to balance between video quality and uninterrupted playback.
Neural Information Processing Systems
Aug-17-2025, 09:18:04 GMT
- Country:
- North America
- Canada (0.04)
- United States
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Ohio (0.04)
- Massachusetts > Middlesex County
- North America
- Genre:
- Research Report > Experimental Study (0.68)
- Industry:
- Information Technology (0.46)
- Technology: