On the Minimax Regret for Linear Bandits in a Wide Variety of Action Spaces

Jan-9-2023–arXiv.org Artificial Intelligence

As noted by \cite{lattimore2020bandit}, characterizing the minimax regret of linear bandits in a wide variety of action spaces is an open problem. In this article, we present an optimal regret lower bound for a wide class of convex action spaces.



