Learning Social Navigation from Positive and Negative Demonstrations and Rule-Based Specifications

Kim, Chanwoo, Yoon, Jihwan, Kim, Hyeonseong, Jeong, Taemoon, Yoo, Changwoo, Lee, Seungbeen, Byeon, Soohwan, Chung, Hoon, Pan, Matthew, Oh, Jean, Lee, Kyungjae, Choi, Sungjoon

Oct-15-2025–arXiv.org Artificial Intelligence

Abstract: Mobile robot navigation in dynamic human environments requires policies that balance adaptability to diverse behaviors with compliance to safety constraints. We hypothesize that integrating data-driven rewards with rule-based objectives enables navigation policies to achieve a more effective balance of adaptability and safety. To this end, we develop a framework that learns a density-based reward from positive and negative demonstrations and augments it with rule-based objectives for obstacle avoidance and goal reaching. A sampling-based lookahead controller produces supervisory actions that are both safe and adaptive, which are subsequently distilled into a compact student policy suitable for real-time operation with uncertainty estimates. Experiments in synthetic and elevator co-boarding simulations show consistent gains in success rate and time efficiency over baselines, and real-world demonstrations with human participants confirm the practicality of deployment. Mobile robot navigation in crowded, human-shared environments is inherently safety-critical and requires policies that remain reliable while adapting to diverse human behaviors.
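The abstract gives no implementation details, so the following is only an illustrative sketch of how a density-based reward might be combined with rule-based terms inside a sampling-based lookahead controller. Everything in it is an assumption made for illustration: the Gaussian kernel density estimate, the 2D point-robot state, the constant-velocity rollouts, the penalty weights, and all function names (`kde`, `density_reward`, `rule_reward`, `lookahead_action`). It is not the authors' method, and the distillation into a student policy is omitted.

```python
import math
import random


def kde(point, samples, bandwidth=0.5):
    """Unnormalized Gaussian kernel density of `point` under `samples` (assumed estimator)."""
    if not samples:
        return 0.0
    total = 0.0
    for s in samples:
        d2 = sum((p - q) ** 2 for p, q in zip(point, s))
        total += math.exp(-d2 / (2.0 * bandwidth ** 2))
    return total / len(samples)


def density_reward(state, positives, negatives, bandwidth=0.5):
    """Reward states near positive demonstrations, penalize states near negative ones."""
    return kde(state, positives, bandwidth) - kde(state, negatives, bandwidth)


def rule_reward(state, goal, obstacles, safe_dist=0.5, collision_penalty=10.0):
    """Hypothetical rule-based terms: goal-distance cost plus an obstacle-proximity penalty."""
    goal_term = -math.dist(state, goal)
    collision_term = sum(
        -collision_penalty for o in obstacles if math.dist(state, o) < safe_dist
    )
    return goal_term + collision_term


def lookahead_action(state, goal, obstacles, positives, negatives,
                     n_samples=64, horizon=5, step=0.2, seed=0):
    """Sample constant-velocity actions, roll each out, and return the best-scoring one."""
    rng = random.Random(seed)
    best_score, best_action = -math.inf, (0.0, 0.0)
    for _ in range(n_samples):
        angle = rng.uniform(0.0, 2.0 * math.pi)
        action = (step * math.cos(angle), step * math.sin(angle))
        pos, score = state, 0.0
        for _ in range(horizon):
            pos = (pos[0] + action[0], pos[1] + action[1])
            score += density_reward(pos, positives, negatives)
            score += rule_reward(pos, goal, obstacles)
        if score > best_score:
            best_score, best_action = score, action
    return best_action
```

In this sketch, calling `lookahead_action((0.0, 0.0), (5.0, 0.0), [], [], [])` returns a displacement that points toward the goal, and supplying negative demonstrations along the straight-line path lowers the score of rollouts that pass through them.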

artificial intelligence, demonstration, navigation


Country:
- North America > United States (0.28)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning > Rule-Based Reasoning (0.84)
