Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

Mar-27-2025, 13:18:31 GMT–Neural Information Processing Systems

We propose a novel piecewise stationary linear bandit (PSLB) model, where the environment randomly samples a context from an unknown probability distribution at each changepoint, and the quality of an arm is measured by its return averaged over all contexts. The contexts and their distribution, as well as the changepoints are unknown to the agent.

artificial intelligence, data mining, machine learning, (20 more...)

Neural Information Processing Systems

Mar-27-2025, 13:18:31 GMT

Conferences PDF

Add feedback

Country:
- Asia > Singapore (0.27)

Genre:
- Research Report > Experimental Study (0.92)

Industry:
- Banking & Finance
  - Economy (0.45)
  - Trading (0.45)
- Food & Agriculture > Agriculture (0.45)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Search (0.40)
  - Data Science > Data Mining
    - Big Data (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found