Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes

Feb-12-2026, 15:41:04 GMT–Neural Information Processing Systems

Reinforcement learning (RL) [1] studies the problem of learning in sequential decision-making problems where the dynamics of the environment are unknown but can be learnt by performing actions and observing their outcomes in an online fashion. A sample-efficient RL agent must trade off the exploration needed to collect information about the environment and the exploitation of the experience gathered so far to gain as much reward as possible.

artificial intelligence, machine learning, reinforcement learning


