Learning from Active Human Involvement through Proxy Value Propagation

Neural Information Processing Systems 

Learning from active human involvement enables the human subject to actively intervene and demonstrate to the AI agent during training. The interaction and corrective feedback from human brings safety and AI alignment to the learning process.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found