Imitation Learning from Vague Feedback
Neural Information Processing Systems
Imitation learning from human feedback studies how to train well-performing imitation agents from an annotator's relative comparison of two demonstrations.
Feb-16-2026, 00:22:21 GMT
- Country:
- Africa > Rwanda
- Asia
- China (0.04)
- Japan > Honshū
- Kansai > Osaka Prefecture
- Osaka (0.04)
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.14)
- Europe
- Finland > Uusimaa
- Helsinki (0.04)
- Portugal (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.14)
- North America
- Canada
- British Columbia > Vancouver (0.04)
- Quebec > Montreal (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Oceania > New Zealand
- North Island > Auckland Region > Auckland (0.04)
- Genre:
- Research Report (0.46)
- Technology: