Contextual Bandits and Imitation Learning with Preference-Based Active Queries

Feb-9-2026, 01:45:40 GMT – Neural Information Processing Systems

We consider the problem of contextual bandits and imitation learning, where the learner lacks direct knowledge of the executed action's reward.

machine learning, natural language, reinforcement learning

Country:
- North America > United States
  - Washington > King County
    - Seattle (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Natural Language (1.00)
  - Machine Learning
    - Reinforcement Learning (0.94)
    - Statistical Learning (0.92)
