Robot Learning with Super-Linear Scaling

Torne, Marcel, Jain, Arhan, Yuan, Jiayi, Macha, Vidaaranya, Ankile, Lars, Simeonov, Anthony, Agrawal, Pulkit, Gupta, Abhishek

Dec-6-2024–arXiv.org Artificial Intelligence

Scaling robot learning requires data collection pipelines that scale favorably with human effort. In this work, we propose Crowdsourcing and Amortizing Human Effort for Real-to-Sim-to-Real(CASHER), a pipeline for scaling up data collection and learning in simulation where the performance scales superlinearly with human effort. The key idea is to crowdsource digital twins of real-world scenes using 3D reconstruction and collect large-scale data in simulation, rather than the real-world. Data collection in simulation is initially driven by RL, bootstrapped with human demonstrations. As the training of a generalist policy progresses across environments, its generalization capabilities can be used to replace human effort with model generated demonstrations. This results in a pipeline where behavioral data is collected in simulation with continually reducing human effort. We show that CASHER demonstrates zero-shot and few-shot scaling laws on three real-world tasks across diverse scenarios. We show that CASHER enables fine-tuning of pre-trained policies to a target scenario using a video scan without any additional human effort. See our project website: https://casher-robot-learning.github.io/CASHER/

demonstration, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

Dec-6-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Netherlands > South Holland > Delft (0.04)

Genre:
- Research Report (0.82)

Industry:
- Government (0.46)

Technology:
- Information Technology
  - Communications > Social Media
    - Crowdsourcing (0.70)
  - Artificial Intelligence
    - Robots (1.00)
    - Natural Language > Large Language Model (0.89)
    - Machine Learning > Neural Networks (0.67)