Discovering, Learning and Exploiting Relevance

Feb-9-2025, 14:38:27 GMT–Neural Information Processing Systems

In this paper we consider the problem of learning online what is the information to consider when making sequential decisions. We formalize this as a contextual multi-armed bandit problem where a high dimensional (D-dimensional) context vector arrives to a learner which needs to select an action to maximize its expected reward at each time step. Each dimension of the context vector is called a type. We assume that there exists an unknown relation between actions and types, called the relevance relation, such that the reward of an action only depends on the contexts of the relevant types. When the relation is a function, i.e., the reward of an action only depends on the context of a single type, and the expected reward of an action is Lipschitz continuous in the context of its relevant type, we propose an algorithm that achieves Õ(T

artificial intelligence, data mining, machine learning, (16 more...)

Neural Information Processing Systems

Feb-9-2025, 14:38:27 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - California > Los Angeles County > Los Angeles (0.14)
- Asia > Vietnam
  - Long An Province (0.04)

Industry:
- Health & Medicine (0.46)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (1.00)
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (0.93)

Duplicate Docs Excel Report

Title
Discovering, Learning and Exploiting Relevance
Discovering, Learning and Exploiting Relevance

Similar Docs Excel Report more

Title	Similarity	Source
None found