Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch

Oct-10-2025, 06:04:55 GMT–Neural Information Processing Systems

Detecting and handling misspecified objectives, such as reward functions, is widely recognized as a central challenge in Artificial Intelligence (AI) safety research.

occupancy frequency, optimal policy, reward function

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America
  - United States
    - Colorado (0.04)
    - Arizona (0.04)
  - Canada > Alberta
    - Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
- Europe > Switzerland
  - Vaud > Lausanne (0.04)
- Asia
  - Macao (0.04)
  - China (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning > Agents (0.67)
  - Machine Learning
    - Reinforcement Learning (0.68)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.93)
