No-Regret M-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting
Taihei Oki, Hokkaido University, Hokkaido, Japan, oki@icredd.hokudai.ac.jp
Shinsaku Sakaue, The University of Tokyo and RIKEN AIP, Tokyo, Japan, sakaue@mist.i.u-tokyo.ac.jp
Neural Information Processing Systems
While we obtain those positive results for the stochastic setting, another main result of our work is an impossibility in the adversarial setting.
Mar-21-2025, 21:17:04 GMT
- Country:
- Asia > Japan (0.28)
- North America > United States (0.28)
- Genre:
- Research Report > Experimental Study (0.93)
- Technology: