Safety through feedback in Constrained RL

Feb-18-2026, 19:30:34 GMT–Neural Information Processing Systems

This feedback can be system generated or elicited from a human observing the training process. Previous approaches have not been able to scale to complex environments and are constrained to receiving feedback at the state level which can be expensive to collect. To this end, we introduce an approach that scales to more complex domains and extends beyond state-level feedback, thus, reducing the burden on the evaluator.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Feb-18-2026, 19:30:34 GMT

Conferences PDF

Add feedback

Country:
- Asia
  - Singapore (0.04)
  - Afghanistan > Parwan Province
    - Charikar (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning > Reinforcement Learning (0.94)
  - Representation & Reasoning > Agents (0.67)

Duplicate Docs Excel Report

Title
Safety through feedback in Constrained RL

Similar Docs Excel Report more

Title	Similarity	Source
None found