Learning Human-like Representations to Enable Learning Human Values

Andrea H. Wynn

Neural Information Processing Systems 

How can we build AI systems that learn any set of individual human values both quickly and safely, without causing harm or violating societal standards for acceptable behavior during the learning process? We explore the effects of representational alignment between humans and AI agents on learning human values.
