Learning Human-like Representations to Enable Learning Human Values

Andrea H. Wynn

Feb-11-2026, 01:30:49 GMT–Neural Information Processing Systems

How can we build AI systems that learn any set of individual human values both quickly and safely, without causing harm or violating societal standards for acceptable behavior during the learning process? We explore the effects of representational alignment between humans and AI agents on learning human values.

large language model, machine learning, natural language

Country:
- North America > United States
  - Maryland > Baltimore (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - California > Alameda County
    - Berkeley (0.04)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Education (1.00)
- Transportation
  - Passenger (0.46)
  - Ground > Road (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Cognitive Science (0.95)
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.68)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks > Deep Learning (1.00)
