Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning

Mar-20-2025, 08:43:50 GMT–Neural Information Processing Systems

Recently formalized as the value equivalence principle, this algorithmic technique is perhaps unavoidable as real-world reinforcement learning demands consideration of a simple, computationally-bounded agent interacting with an overwhelmingly complex environment, whose underlying dynamics likely exceed the agent's capacity for representation. In this work, we consider the scenario where agent limitations may entirely preclude identifying an exactly value-equivalent model, immediately giving rise to a trade-off between identifying a model that is simple enough to learn while only incurring bounded sub-optimality.

artificial intelligence, machine learning, reinforcement learning, (9 more...)

Neural Information Processing Systems

Mar-20-2025, 08:43:50 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > Massachusetts (0.46)

Genre:
- Instructional Material > Course Syllabus & Notes (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.68)
    - Reinforcement Learning (1.00)
  - Representation & Reasoning (1.00)