Identifiability in inverse reinforcement learning

Feb-9-2026, 03:12:37 GMT – Neural Information Processing Systems

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem from observations of an agent's actions. As already observed by Russell [1998], the problem is ill-posed: the reward function is not identifiable, even given perfect information about optimal behavior. We provide a resolution to this non-identifiability for problems with entropy regularization.
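
The non-identifiability can be illustrated numerically. The sketch below is not taken from the paper. It assumes a small random tabular MDP and uses potential-based reward shaping, one well-known source of ambiguity, to build two different reward functions that yield the same entropy-regularized optimal policy. The names `soft_optimal_policy`, `transition`, `potential`, and the values of `gamma` and `temperature` are hypothetical choices made for illustration.

```python
import numpy as np


def soft_optimal_policy(reward, transition, gamma=0.9, temperature=1.0, iters=2000):
    """Entropy-regularized (soft) value iteration on a tabular MDP.

    reward:     array of shape (S, A)
    transition: array of shape (S, A, S), rows sum to 1 over the last axis
    Returns the soft-optimal policy as an array of shape (S, A).
    """
    value = np.zeros(reward.shape[0])
    for _ in range(iters):
        q = reward + gamma * transition @ value  # (S, A)
        q_max = q.max(axis=1)
        # Numerically stable soft maximum: V(s) = T * log sum_a exp(Q(s, a) / T)
        value = q_max + temperature * np.log(
            np.exp((q - q_max[:, None]) / temperature).sum(axis=1)
        )
    q = reward + gamma * transition @ value
    logits = q / temperature
    logits -= logits.max(axis=1, keepdims=True)
    policy = np.exp(logits)
    return policy / policy.sum(axis=1, keepdims=True)


# Assumed toy problem: 4 states, 3 actions, random dynamics and rewards.
rng = np.random.default_rng(0)
n_states, n_actions, gamma = 4, 3, 0.9
transition = rng.random((n_states, n_actions, n_states))
transition /= transition.sum(axis=2, keepdims=True)
reward = rng.normal(size=(n_states, n_actions))

# Shaped reward: r'(s, a) = r(s, a) + gamma * E[phi(s')] - phi(s)
potential = rng.normal(size=n_states)
shaped_reward = reward + gamma * transition @ potential - potential[:, None]

policy_original = soft_optimal_policy(reward, transition, gamma=gamma)
policy_shaped = soft_optimal_policy(shaped_reward, transition, gamma=gamma)

print("rewards differ:", not np.allclose(reward, shaped_reward))
print("policies match:", np.allclose(policy_original, policy_shaped))
```

Under these assumptions the two rewards differ while the resulting policies agree, so observing the policy alone cannot distinguish between them.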

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Country:
- North America > United States
  - Wisconsin > Dane County
    - Madison (0.04)
  - New York > New York County
    - New York City (0.04)
  - Illinois > Cook County
    - Chicago (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.90)
