Online Multi-Armed Bandits with Adaptive Inference

Apr-24-2026, 17:29:38 GMT–Neural Information Processing Systems

During online decision making in multi-armed bandits, one needs to conduct inference on the true mean reward of each arm based on data collected so far at each step. However, since the arms are adaptively selected-thereby yielding non-i.i.d.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Apr-24-2026, 17:29:38 GMT

Conferences PDF

Add feedback

Genre:
- Research Report > New Finding (0.48)

Industry:
- Health & Medicine (0.33)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (1.00)

Duplicate Docs Excel Report

Title
OnlineMulti-ArmedBanditswithAdaptiveInference

Similar Docs Excel Report more

Title	Similarity	Source
None found